Summary and Schedule
In this lesson, you will learn the basics of data analysis and computational humanities using Python.
Setup Instructions | Download files required for the lesson | |
Duration: 00h 00m | 1. Introduction |
What is the purpose of quantitative data analysis in the
humanities? When is it meaningful to use quantitative data analysis in humanities research? What kinds of operations can be performed in quantitative data analysis? How can these operations be performed using Python programming? |
Duration: 00h 30m | 2. Analyzing Tabular Data |
What quantitative analysis operations can be performed on tabular
data? How can these operations be translated into Python code? |
Duration: 02h 38m | 3. Analyzing Text Data |
What quantitative analysis operations can be performed on data composed
of literary texts? How can these operations be translated into Python code? |
Duration: 04h 46m | Finish |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
What background knowledge do you need for this lesson?
- Basic mathematical background
- Familiarity with quantitative research methods
- Curiosity to learn about quantitative data analysis and Python programming
Datasets
In this lesson, we will work with multiple datasets. Each dataset represents a specific type of data that is suited to particular methods of quantitative analysis.
Links to be updated.
Data for Episode 2 (Analyzing Tabular Data): Artworks from MoMA
The metadata for artworks in the Museum of Modern Art (MoMA) collection can be downloaded from the museum’s GitHub Repository.
Since the data in this GitHub repository is continuously updated, we will be working with a version previously downloaded on April 9, 2025. This version is stored in the GitHub repository for this lesson, ensuring that the results you see during live coding match exactly what is demonstrated in the lesson.
To load the data from this lesson’s repository into your code, use the following URLs:
https://github.com/HERMES-DKZ/python_101_humanities/blob/main/episodes/data/moma_artworks.csv
Data for Episode 3 (Analyzing Text Data): Plays by Shakespeare and Marlowe
This dataset consists of the full texts of four plays by Christopher
Marlowe and nine plays by William Shakespeare as follows. The Marlowe
texts were downloaded from the website of Project
Gutenberg. The Shakespeare texts stem from Folger
Shakespeare Library. Each play has been saved to a .txt
file.
Plays by Christopher Marlowe:
- Doctor Faustus
- Edward II
- The Jew of Malta
- The Massacre at Paris
Plays by William Shakespeare:
- All’s Well That Ends Well
- The Comedy of Errors
- Hamlet
- Julius Caesar
- King Lear
- Macbeth
- Othello
- Romeo and Juliet
- The Winter’s Tale
You can download each set of works from the folders provided below:
https://github.com/HERMES-DKZ/python_101_humanities/blob/main/episodes/data/shakespeare
https://github.com/HERMES-DKZ/python_101_humanities/blob/main/episodes/data/marlowe
Software Setup
Python and Jupyter Notebook/Google Colab
To do the exercises in this lesson, you need an IDE (Integrated Development Environment). We recommend you use Jupyter Notebook or cloud-based equivalent such as Google Colab.
If you’re using Google Colab, you don’t need any installation. Just create a Google account - if you don’t have one already -, create a new Colab Notebook by clicking on New Notebook in the above link, and start coding.
Otherwise, to install Jupyter Notebook and Python on your computer together, we recommend using Anaconda. To do so, click on your operating system from the list below and follow the instructions.
- Open https://www.anaconda.com/download/success with your web browser.
- Download the Anaconda for Windows installer with Python 3. (If you are not sure which version to choose, you probably want the 64-bit Graphical Installer Anaconda3-…-Windows-x86_64.exe.)
- Install Python 3 by running the Anaconda Installer, using all of the defaults for installation except make sure to check Add Anaconda to my PATH environment variable.
This video tutorial can help you with the installation.
- Open https://www.anaconda.com/download/success with your web browser.
- Download the Anaconda Installer with Python 3 for macOS (you can either use the Graphical or the Command Line Installer).
- Install Python 3 by running the Anaconda Installer using all of the defaults for installation.
This video tutorial can help you with the installation.
- Open https://www.anaconda.com/download/success with your web browser.
- Download the Anaconda Installer with Python 3 for Linux. (The installation requires using the shell. If you aren’t comfortable doing the installation yourself stop here and request help at the workshop.)
- Open a terminal window and navigate to the directory where the
executable is downloaded (e.g.,
cd ~/Downloads
). - Type
bash Anaconda3-
and then press Tab to autocomplete the full file name. The name of file you just downloaded should appear. - Press Enter (or Return depending on your
keyboard). You will follow the text-only prompts. To move through the
text, press Spacebar. Type
yes
and press Enter (or Return) to approve the license. Press Enter (or Return) to approve the default location for the files. Typeyes
and press Enter (or Return) to prepend Anaconda to yourPATH
(this makes the Anaconda distribution the default Python). - Close the terminal window.
After installing Python and Anaconda, make sure to install the Python libraries that we are using in this lesson. If you haven’t installed Python libraries on your computer before, ask the instructors for support.