For this workshop, we will be using Python via Jupyter
You can think of Python like a car’s engine, while Jupyter is like a car’s dashboard
Let's open JupyterLab and create our first Jupyter notebook! Two options:
Illustration by Allison Horst
It's good practice to keep all the files for a project in one folder, and use sub-folders to keep things organized.
python-beginner
data
figures
python-beginner
folderA notebook consists of a series of "cells":
By default, a new cell is always a code cell.
To run a code cell, click in it and press Shift-Enter
or press the Run button on the toolbar
print('Hello world!')
Hello world!
2 + 2
4
today = 'Friday'
Some handy features:
Shift-Enter
or the Run button on the toolbarFor a great example of how an interactive workflow in Jupyter notebook can progress into automation with libraries/scripts, check out Jake VanderPlas' blog post Reproducible Data Analysis in Jupyter.
The Python libraries for data science are developed and maintained by external "3rd party" development teams
conda
(which comes with Anaconda/Miniconda)Some of the libraries in the Python data science ecosystem:
From The Unexpected Effectiveness of Python in Science (Jake VanderPlas)
In this workshop, we'll be using pandas
to work with tabular data and will give a brief introduction to data visualization with the seaborn
and plotly
libraries.
Go to: next lesson