Data Analysis with Python | Prerequisites

Data Analysis with Python refers to manipulating, processing, cleaning, and crunching data. It is used in scientific computing for data-intensive applications.

The following are the libraries available in the Python ecosystem that are used by data analysts and data scientists: NumPy, SciPy, Pandas, Matplotlib, Ipython, and Jupiter Notebook. Of course, there are many others but these are enough to get started. You have to install them using pip.

Note:
I assume you have Python 3.X and pip3 installed locally. If you don’t then go ahead and install it.

First you have to create a virtual environment:

python3 -m venv data_analysis

Activate the virtual environment:

source data_analysis/bin/activate

Note:
You have to be in the folder where your virtual environment folder is (in my case, in the folder where the data_analysis folder is).

Install the libraries:

pip3 install NumPy
pip3 install ScyPi
pip3 install Pandas
pip3 install Matplotlib
pip3 install ipython
pip3 install jupyter

Leave a Reply