P01: Our environment¶
Jupyter Notebooks¶
This is a jupyter notebook. A notebook is file that combines explanatory text (written in markdown), computer code, and the output of the computer code.
A notebook is comprised of “cells”. A given cell can contain different types of things. We will concern ourselves with cells that contain either markdown (like this cell), or python code.
Markdown Cells¶
Markdown is a way of specifying text formatting in plain text.
For instance, this cell starts with the text ### Markdown
which tells the computer that it should display a level 3 heading with the text “Markdown”. The cell above starts with the text # CSS 1:...
which specifies a level 1 heading, which you can see is rendered in larger text.
Some other common markdown formatting syntax:
paragraphs and linebreaks¶
Markdown ignores single line breaks. For instance, the following markdown is rendered on one line:
The text here
is
formatted
like so.
The text here is formatted like so.
To create linebreaks you can either add two linebreaks between each line, like so:
The text here
is
formatted
like so.
The text here
is formatted like so.
Or you could add two spaces to the end of each line:
The text here
is
formatted
like so.
The text here
is
formatted
like so.
text formatting¶
*text*
yields italics: text
**text**
yields bold: text
`text`
yields a code formatting: text
links¶
[text](URI)
yields a link. For instance [pydocs](https://docs.python.org/3/)
yields a link to python docs: pydocs
lists¶
- item 1
- item 2
yields a bulleted list:
item 1
item 2
1. item 1
2. item 2
yields an automatically numbered list:
item 1
item 2
Code cells¶
The virtue of jupyter notebooks is that you can mix explanatory text with computer code, and the results of running that code. We will be using these notebooks with Python code. Python code will run in special kinds of cells called code cells.
A given code cell has some content, which is sent to a Python interpreter. Upon executing that code, the cell prints some output. The output may be more or less complicated. Without any particular instructions about what to output, the code cell will output whatever the last line “returns”
34
34
Magics¶
We will spend most of our time worrying about how to write code that goes inside code cells, but it is worth noting that code cells can also contain special commands called “line magics” or “cell magics” which do something, but are not part of the python language itself. These commands occur on lines starting with %
or %%
. We will cover them as needed, but for now, here is an example:
%lsmagic
will list all the line and cell magics that are available.
%lsmagic
Available line magics:
%alias %alias_magic %autoawait %autocall %automagic %autosave %bookmark %cat %cd %clear %colors %conda %config %connect_info %cp %debug %dhist %dirs %doctest_mode %ed %edit %env %gui %hist %history %killbgscripts %ldir %less %lf %lk %ll %load %load_ext %loadpy %logoff %logon %logstart %logstate %logstop %ls %lsmagic %lx %macro %magic %man %matplotlib %mkdir %more %mv %notebook %page %pastebin %pdb %pdef %pdoc %pfile %pinfo %pinfo2 %pip %popd %pprint %precision %prun %psearch %psource %pushd %pwd %pycat %pylab %qtconsole %quickref %recall %rehashx %reload_ext %rep %rerun %reset %reset_selective %rm %rmdir %run %save %sc %set_env %store %sx %system %tb %time %timeit %unalias %unload_ext %who %who_ls %whos %xdel %xmode
Available cell magics:
%%! %%HTML %%SVG %%bash %%capture %%debug %%file %%html %%javascript %%js %%latex %%markdown %%perl %%prun %%pypy %%python %%python2 %%python3 %%ruby %%script %%sh %%svg %%sx %%system %%time %%timeit %%writefile
Automagic is ON, % prefix IS NOT needed for line magics.
Kernel¶
A notebook is edited in a browser, but it “talks” to a “kernel” that is being run on some server. The Kernel is a running session of (in our case) Python. When we run a code cell in the notebook, the code from that cell is sent to the running Python kernel, which executes that code, and then sends the output back to the notebook. The notebook then embeds that output below the code cell.
This arrangement is rather complicated. It has a number of benefits and a number of costs.
The benefits: The output has to be run once, and is then embedded in the notebook, and can be viewed without rerunning the code. The code can be embedded in a document such as this one, with text, explanations, output, etc. The kernel can be run on various remote servers, while you access the notebook through your own browser, so you do not have to set up your own python environment on your own computer, and so you can easily plug into a kernel running on a server more powerful than your own computer.
There are costs however: Code cells can be executed out of order, and the state of the kernel is changed by each cell that is run. This can cause quite a bit of confusion. A notebook is a complicated file, and you can’t just look at it in a text editor, as you could would just code on its own.