Data analysis with Python means manipulating, processing, cleaning, and crunching data. Python is widely used in scientific computing for data-intensive applications.
The Python ecosystem offers several libraries that data analysts and data scientists rely on: NumPy, SciPy, Pandas, Matplotlib, IPython, and Jupyter Notebook. Of course, there are many others, but these are enough to get started. You have to install them using pip.
I assume you have Python 3.X and pip3 installed locally. If you don’t, go ahead and install them first.
First you have to create a virtual environment:
python3 -m venv data_analysis
Activate the virtual environment:
You have to be in the folder that contains your virtual environment folder (in my case, the folder that contains the data_analysis folder).
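The activation command itself is not shown here. For an environment created with venv, it is typically "source data_analysis/bin/activate" on Linux and macOS, or "data_analysis\Scripts\activate" on Windows; treat both as assumptions about your setup. To confirm that activation worked, here is a minimal sketch you can run with the environment's interpreter. The function name in_virtualenv is made up for this illustration.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a virtual environment."""
    # Inside a venv, sys.prefix points at the environment folder, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    if in_virtualenv():
        print(f"Virtual environment active: {sys.prefix}")
    else:
        print("No virtual environment is active.")
```

If the script reports that no environment is active, the install step would put the libraries into your system Python instead of data_analysis.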
Install the libraries:
pip3 install numpy
pip3 install scipy
pip3 install pandas
pip3 install matplotlib
pip3 install ipython
pip3 install jupyter
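Once the installs finish, it can be worth checking that Python can actually find each library. The sketch below uses only the standard library. The names LIBRARIES and check_installed are made up for this illustration, and the import names in the list are my assumptions about how each package is imported (for example, IPython is imported with capital letters, and jupyter_core is one of the packages that jupyter pulls in).

```python
from importlib.util import find_spec

# Assumed top-level import names for the libraries installed above.
LIBRARIES = ["numpy", "scipy", "pandas", "matplotlib", "IPython", "jupyter_core"]


def check_installed(names):
    """Map each top-level import name to True if Python can locate it, else False."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, ok in check_installed(LIBRARIES).items():
        print(f"{name:<14} {'OK' if ok else 'MISSING'}")
```

Run it with the virtual environment activated. Anything reported as MISSING most likely means the matching pip3 command failed or ran outside the environment.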