> For the complete documentation index, see [llms.txt](https://jiemakel.gitbook.io/cl4hss/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://jiemakel.gitbook.io/cl4hss/master.md).

# Computational literacy for the humanities and social sciences

By [Eetu Mäkelä](http://iki.fi/eetu.makela), professor in Digital Humanities ([Human Sciences–Computing Interaction](http://heldig.fi/hsci)) at the [University of Helsinki](https://www.helsinki.fi/).

{% hint style="warning" %}
This content is not yet complete, in the sense that some sections have not yet been converted from their original lecture slide format into self-contained texts for self-study. Each such section has a header similar to this at the top noting its draft status, as well as a :construction\_site: mark in the [table of contents below](/cl4hss/master.md#course-contents).
{% endhint %}

## Target audience

People of all levels in the humanities and interpretive social sciences (henceforth abbreviated as human sciences) interested in whether computational methods might help them in their own work.&#x20;

**Prerequisites:** Absolutely none.

**Aside:** Why should you be interested in computational methods? Two reasons:

1. they may allow you yourself to do your work more efficiently, and
2. they may lead to completely new and powerful ways of addressing questions in your field

The probability of either of these happening very much depends on what you are interested in, but not in any way that can be shortly enumerated. Instead, that is what this course aims at enabling you to discover yourself.&#x20;

## Course concept and learning goals

This course is an introductory course on **applying modern data processing to complex social and historical data**. As a signposting course, the course describes the landscape of computational human sciences. The main learning goals of the course are that after completing it, a student will be able to:&#x20;

1. make informed decisions on which computational approaches will be of use to themself, and
2. understand, follow and discuss the development of computational approaches within their field in general&#x20;

They will also have the necessary background to avail of more specific courses and learning resources to further their understanding in these directions. With regard to subfields of the humanities or social sciences, the course makes no delineations, on the contrary arguing that by taking examples from different fields, a deeper understanding of the possibilities afforded by computation can be attained. For more details, see the [introduction](/cl4hss/introduction-three-approaches-to-methods-for-digital-humanists.md).

In terms of smaller objectives, as part of the above, after the course:

* The student understands the multiple ways in which computational approaches benefit work within the human sciences.&#x20;
* They are able to use [ready tools](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/easy-tools-for-processing-and-exploring-data.md) to work with data.&#x20;
* In addition, they have attained knowledge of the [fundamental concepts of programming](/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists.md), through which they can start to expand their capabilities, should they so choose.&#x20;
* The student also gains a basic understanding of the central [fundamental concepts of statistics](/cl4hss/data-analysis-fundamental-concepts-of-statistics.md), which both 1) act as a general framework with regard to which many statistical approaches encountered later can be positioned, and 2) act as a practical foundation from which to pursue further understanding.
* Further, the student gains a general literacy on advanced [statistical and computer science methods](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/computational-data-analysis-method-literacy.md) applicable to computational human sciences, and when to apply them (as well as crucially, when and how **not** to apply them).&#x20;
* They also learn how [open, reproducible research and publishing](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/open-reproducible-research-and-publishing.md) is done in practice.&#x20;
* Finally, the student learns to apply all of the above in practice in a [small concrete computational human sciences project](/cl4hss/final-project.md).&#x20;

## Format

This course is meant for both independent self-study (reading up on only certain sections of the course), as well as for completing as either a contact learning or MOOC course with a group of like-minded students. For material relating to particular instances of this latter mode of study, see [here](https://studies.helsinki.fi/courses/course-unit/otm-7a0d4651-9800-474d-9c32-668f753bd638).

Workload-wise, the full course is rated at 5 ECTS, which officially translates to \~135 hours of study. However, ECTS workload ratings have always diverged both from reality, as well as student expectations. In practice, I expect the load to be some 60-70 hours, or about ½ to ⅔ of the official norm. Generally, courses at this workload-level seem to be evaluated by students as "moderate to heavyish" in workload (because sometimes you can get 5 ECTS even for something like 25h or ⅕ of the official norm, for example from just sitting in lectures 14 x 1½ hours, and then doing a couple of hours of work on top of that!)

## Course contents

( :construction\_site: marks parts of the course not yet fully converted out of lecture slide format)

1. [Introduction: three approaches to methods for digital humanists](/cl4hss/introduction-three-approaches-to-methods-for-digital-humanists.md)
   * Easy, ready-made tools for data acquisition, cleanup, visualisation and exploration
   * Fundamentals of programming for data processing
   * Data analysis method literacy
2. [Data](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/different-types-of-data-data-quality-available-open-datasets.md) :construction\_site:&#x20;
3. [Easy tools for acquiring, processing and exploring data](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/easy-tools-for-processing-and-exploring-data.md) :construction\_site:&#x20;
4. [Data processing: fundamental concepts of programming for humanists](/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists.md)
5. [Data processing: regular expressions](/cl4hss/regular-expressions.md)
6. [Data analysis: fundamental concepts of statistics](/cl4hss/data-analysis-fundamental-concepts-of-statistics.md)
7. [Computational data analysis method literacy](https://docs.google.com/presentation/d/e/2PACX-1vTEAtbzLYJXn2Pp8ozrSfxmzQOxo6SfVOXpscLbgCXkeXtqpzlwlU37dmQTWEAjIUAPedbT_BG1x0Ll/pub?start=false\&loop=false\&delayms=3000) :construction\_site:&#x20;
8. [Open, reproducible research and publishing](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/open-reproducible-research-and-publishing.md) :construction\_site:&#x20;
9. [Digital humanities project](/cl4hss/final-project.md)

### **General note**

*"At times the course felt like being hit by a bus, the way we were forced to figure out many things on our own. It did at times result in an awful lot of stress, but it actually was the best way to learn how to do these things and more importantly, how to find info on how different things work and should be done."*  *-* course feedback

There's a lot to take in during the course, and much of it may be unfamiliar and at first confusing. A major principle of the course is that you should not try to wholly understand everything in the first instance. While an effort has been made to keep the language and concepts as simple as I could make them, as well as order them sensibly with regard to each other, often there was no way I could order everything neatly into a linear learning progression.&#x20;

For example, to really understand [easy to use end-user tools](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/easy-tools-for-processing-and-exploring-data.md), one needs to know how they relate to the [possibilities of computational analyses in general](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/computational-data-analysis-method-literacy.md), as well as [different types of data](/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/different-types-of-data-data-quality-available-open-datasets.md) and different types of [preprocessing ](/cl4hss/regular-expressions.md)of that data. Further, to properly contextualise them, one also needs to understand how their affordances differ from those available to users of [programmatic](/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists.md) analysis libraries. However, ready to use tools are still presented before programming, data transformations and computational analyses, because I feel having tried them in practice provides a good springboard for understanding these more abstract and complex topics.

Thus, when going through the course and doing the assignments, try not to be bothered by not understanding everything in the first go. Instead, it is enough at each point to just have even a vague general notion or gist of things, and trust that it will all make sense in the end, once you've gone through all the subtopics. &#x20;

## Practical matters

* The course has Slack channels at [dhintros.slack.com](https://dhintros.slack.com/) used for both returning some assignments as well as peer and teacher support. Please [join](https://join.slack.com/t/dhintros/shared_invite/zt-2q92fd2b4-kwyIXUaAi_98J6rbv2qzTg) the Slack as well as the channel for the instance of the course you're on (e.g. #cl4hss2024).
* For linking to quotes in their original context, the course uses [hypothes.is](http://hypothes.is). To be able to use this, you must [join](https://hypothes.is/groups/W6MAkGe8/clit4hss) the CL4HSS group (as well as register in general if you don't already have an account). You also naturally need access to the sources (most commonly through accessing them from a university network / VPN. For example for Helsinki, see [this guide](https://helpdesk.it.helsinki.fi/en/logging-and-connections/networks/connections-outside-university)).
* If you use the material for self-study and it ends up being useful for you, I'd appreciate a note about this. Feel free to send that either through Slack, e-mail, Twitter or wherever you [find me](http://iki.fi/eetu.makela).

## Licensing

The text of this course is licensed under a [Creative Commons Attribution 4.0 International License](http://creativecommons.org/licenses/by/4.0/). This means that you are free to use, embed, remix and further develop any part of this course for use in your own course or other material. The only requirement is that you give appropriate credit for this material, provide a link to the license, and indicate if changes were made (see the [license](https://creativecommons.org/licenses/by/4.0/) for more details).&#x20;

If you do make use of this material, I'd naturally also appreciate a ping, as well as the possibility to merge any improvements to this version, even if neither of those is actually required by the license.

For access to the source code of this GitBook, please see [this](https://github.com/jiemakel/METH4DH) GitHub repository.

<div align="left"><img src="/files/-LRI9_vVSQfWYJKBIAlD" alt=""></div>


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://jiemakel.gitbook.io/cl4hss/master.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.