# Helsinki fall 2020

Zoom meeting for the course: <https://helsinki.zoom.us/j/68027991035>. To get into the meeting, use the code 065380.

The course relies heavily on blended learning and flipped classroom techniques. Therefore, much work will happen outside of class, in interactive assignments, reading of literature, testing tools or creating presentations. Meetings will be used to give presentations, discuss, share knowledge and ensure understanding. Accordingly, after the first week, we will fall into a schedule where meetings happen on each Thursday, while each Tuesday is reserved for doing the assignments for that week.

![Meetings and assignments at a glance](https://1892588370-files.gitbook.io/~/files/v0/b/gitbook-legacy-files/o/assets%2F-LPB0-GYwxbJd7xOS0FD%2F-MMzk1nk83S2lsYhmsm4%2F-MMzkLqueJp4pVsIBNvq%2Fimage.png?alt=media\&token=46e1fc25-f05d-48f9-94c1-dea4199933a4)

### Tue **27.10.** — [Introduction](https://jiemakel.gitbook.io/cl4hss/introduction-three-approaches-to-methods-for-digital-humanists)

{% tabs %}
{% tab title="Meeting contents" %}

* Activatory pair discussion
* [Introduction, practicalities -lecture](https://docs.google.com/presentation/d/e/2PACX-1vT9OQtyE2pb8I3_TPDOTO5fmF8G4Mx2cE7WpP4ebORcjg5o6b55y96CHo-0CLLgX05l0V4x2nUnvu50/pub?start=false\&loop=false\&delayms=3000)
* [Flinga questionnaire on background and interests](https://edu.flinga.fi/s/E39GQBX)
* Briefing of assignments
  {% endtab %}

{% tab title="Assignments given" %}
For Thu 29.10. (in two days):&#x20;

1. Join the course Slack and optionally the hypothes.is group
2. Answer the course background [questionnaire](https://goo.gl/forms/gQpLPyOVV4ZvtL1x1) (\~5min)
3. Look over the [final projects from previous years](https://jiemakel.gitbook.io/cl4hss/final-project#submissions-from-previous-years). Select the project that interests you the most. Be prepared to discuss why you chose those that project in class. (\~20-40min)

For Thu 5.11. (in 1½ weeks, but more assignments will be given on Thursday, so start already):

1. Read up on [the history of humanities computing](https://jiemakel.gitbook.io/cl4hss/history-of-humanities-computing#history-of-humanities-computing) and do the assignment mentioned there (\~1-2h)
   {% endtab %}
   {% endtabs %}

### Thu 29.10. — Debriefing of assignments, data

{% tabs %}
{% tab title="Assignments due" %}

1. Answer the course background [questionnaire](https://goo.gl/forms/gQpLPyOVV4ZvtL1x1)
2. Look over the [final projects from previous years](https://jiemakel.gitbook.io/cl4hss/final-project#submissions-from-previous-years). Select the project that interests you the most. Be prepared to discuss why you chose those that project in class.

{% endtab %}

{% tab title="Meeting contents" %}

* Group discussion of projects from previous years
* [Lecture on data](https://docs.google.com/presentation/d/e/2PACX-1vSv9s1sY5NMfjnCKPU7NJZyB6zY3B7BMSNMXuWSDBi71uDkn6tq_u53qYbpnhJN3etf9n_oJgJrU7U8/pub?start=false\&loop=false\&delayms=3000)
* Briefing of assignments on data and tools
  {% endtab %}

{% tab title="Assignments given" %}
For Thu 5.11. (in one week):

1. Read up on [the history of humanities computing](https://jiemakel.gitbook.io/cl4hss/history-of-humanities-computing#history-of-humanities-computing) and do the assignment mentioned there (\~1-2h).
2. Find a [dataset](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/different-types-of-data-data-quality-available-open-datasets) that could be of interest to you in your final project. Be prepared to present in class (max one slide, 3 minutes):
   1. why you chose those that dataset,
   2. what types of information does it contain,
   3. what the structure, technical format and way of accessing the data is, and
   4. what potential sources of problems or biases does it have. (\~15-45min)
3. Data cleanup: complete the [OpenRefine tutorial](https://programminghistorian.org/lessons/cleaning-data-with-openrefine). (\~30-60min)
4. Experiment with at least one of the following tools (\~30-60min + \~15-30min):

   1. Visualization:
      * tabular data → chart visualisations: [RAW](http://rawgraphs.io/)​
      * tabular data → chart visualisations: [Voyager](http://vega.github.io/voyager/)
      * tabular data → chart visualisations: ​[Tableau](https://www.tableau.com/)​
      * tabular data → ​interactive map/network/timeline/list/facet visualisations: [Palladio](https://moodle.helsinki.fi/hdlab.stanford.edu/palladio/)​
        * Palladio has [help pages](http://hdlab.stanford.edu/palladio/help/). There are also multiple tutorials on using Palladio, for example [this one](http://miriamposner.com/blog/getting-started-with-palladio/), or [this one](https://programminghistorian.org/en/lessons/creating-network-diagrams-from-historical-sources) which is particularly on network analysis.
      * tabular data → map(+timeline) visualisations: ​[Carto](https://carto.com/)​
      * ​text →​ interactive explorative interface for linguistic study: [Voyant tools](https://voyant-tools.org/)​
      * ​big, preselected collections of text → interface for linguistic study: [Korp](https://moodle.helsinki.fi/korp.csc.fi) / [corpus.byu.edu](http://corpus.byu.edu/)​
      * If you're feeling explorative, feel free to also dig for more tools in [TAPoR](http://tapor.ca/home).
   2. Data acquisition:
      1. Hand-written text transcription: [Transkribus](https://transkribus.eu/)
      2. Layout and text transcription: [OCR4all](https://github.com/OCR4all/getting_started)
      3. Keyword generation from text: [Annif](http://annif.org/)
      4. An [automated sound transcription tool](https://www.google.com/search?q=automated+sound+transcription)
      5. An automated image/video description tool
      6. Twitter archiving: [TAGS](https://tags.hawksey.info/)

   If you're short on inspiration, feel free to go through [this](https://docs.google.com/document/d/13I7svLlqrg7i0iisw2E_v48Gae5tnXVFWxmeHyGAKFU/edit) hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration (5-10 minutes) of the tool for class, describing:

   1. What is the tool good for?
   2. What kind of data do you need for the tool to be useful?&#x20;
      1. What information does the data need to contain?
      2. What format does it have to be in?
   3. Your experience with the tool.
   4. For groups studying visualization tools, also read [Perception deception](https://trinachi.github.io/data-design-builds/ch17.html) & [Common visualization mistakes](https://trinachi.github.io/data-design-builds/ch18.html), and reflect on how likely it is that you could use the visualizations to deceive yourself
      {% endtab %}
      {% endtabs %}

### Thu 5.11. — Debriefing of assignments, Programming

{% tabs %}
{% tab title="Assignments due" %}

1. Read up on [the history of humanities computing](https://jiemakel.gitbook.io/cl4hss/history-of-humanities-computing#history-of-humanities-computing) and do the assignment mentioned there (\~1-2h).
2. Find a [dataset](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/different-types-of-data-data-quality-available-open-datasets) that could be of interest to you in your final project. Be prepared to present in class (max one slide, 3 minutes):
   1. why you chose those that dataset,
   2. what types of information does it contain,
   3. what the structure, technical format and way of accessing the data is, and
   4. what potential sources of problems or biases does it have. (\~15-45min)
3. Data cleanup: complete the [OpenRefine tutorial](https://programminghistorian.org/lessons/cleaning-data-with-openrefine). (\~30-60min)
4. Experiment with at least one of the following tools (\~30-60min + \~15-30min):

   1. Visualization:
      * tabular data → chart visualisations: [RAW](http://rawgraphs.io/)​
      * tabular data → chart visualisations: [Voyager](http://vega.github.io/voyager/)
      * tabular data → chart visualisations: ​[Tableau](https://www.tableau.com/)​
      * tabular data → ​interactive map/network/timeline/list/facet visualisations: [Palladio](https://moodle.helsinki.fi/hdlab.stanford.edu/palladio/)​
        * Palladio has [help pages](http://hdlab.stanford.edu/palladio/help/). There are also multiple tutorials on using Palladio, for example [this one](http://miriamposner.com/blog/getting-started-with-palladio/), or [this one](https://programminghistorian.org/en/lessons/creating-network-diagrams-from-historical-sources) which is particularly on network analysis.
      * tabular data → map(+timeline) visualisations: ​[Carto](https://carto.com/)​
      * ​text →​ interactive explorative interface for linguistic study: [Voyant tools](https://voyant-tools.org/)​
      * ​big, preselected collections of text → interface for linguistic study: [Korp](https://moodle.helsinki.fi/korp.csc.fi) / [corpus.byu.edu](http://corpus.byu.edu/)​
      * If you're feeling explorative, feel free to also dig for more tools in [TAPoR](http://tapor.ca/home).
   2. Data acquisition:
      1. Hand-written text transcription: [Transkribus](https://transkribus.eu/)
      2. Layout and text transcription: [OCR4all](https://github.com/OCR4all/getting_started)
      3. Keyword generation from text: [Annif](http://annif.org/)
      4. An [automated sound transcription tool](https://www.google.com/search?q=automated+sound+transcription)
      5. An automated image/video description tool
      6. Twitter archiving: [TAGS](https://tags.hawksey.info/)

   If you're short on inspiration, feel free to go through [this](https://docs.google.com/document/d/13I7svLlqrg7i0iisw2E_v48Gae5tnXVFWxmeHyGAKFU/edit) hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration (5-10 minutes) of the tool for class, describing:

   1. What is the tool good for?
   2. What kind of data do you need for the tool to be useful?&#x20;
      1. What information does the data need to contain?
      2. What format does it have to be in?
   3. Your experience with the tool.
   4. For groups studying visualization tools, also read [Perception deception](https://trinachi.github.io/data-design-builds/ch17.html) & [Common visualization mistakes](https://trinachi.github.io/data-design-builds/ch18.html), and reflect on how likely it is that you could use the visualizations to deceive yourself
      {% endtab %}

{% tab title="Meeting contents" %}

* Group discussion on the history of humanities computing
* Debriefing of the OpenRefine assignment
* Group presentations of tools
* Briefing of assignments on programming and research

[slides](https://docs.google.com/presentation/d/e/2PACX-1vTmMCdQHDsAp35X5Nv7VDl6O8fhxqirYhJC40wAQ5ix0C-1XVPZR9HLPnzLNLhzKilwoUOILsqdVLlV/pub?start=false\&loop=false\&delayms=3000)
{% endtab %}

{% tab title="Assignments given" %}
For Thu 12.11. (in one week):

1. Programming: Go through the [fundamental concepts of programming for humanists](https://jiemakel.gitbook.io/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists) and complete the assignments there.
2. Regular expressions: Read the section on [regular expressions](https://jiemakel.gitbook.io/cl4hss/regular-expressions) and go through the assignments there. For the second assignment, add your solutions to the Flinga here: <https://edu.flinga.fi/s/ESFEH28>
3. Research on visualization tool development:&#x20;

   Read the following two articles on developing  tools for particular text-based humanities research questions:

   * [Visualizing Mouvance: Toward a visual analysis of variant medieval text traditions](https://doi.org/10.1093/llc/fqx033)
   * [Rule‐based Visual Mappings – with a Case Study on Poetry Visualization](https://onlinelibrary.wiley.com/doi/full/10.1111/cgf.12125)

   Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.
4. Research 2: read [this research article](https://doi.org/10.1073/pnas.1405984111). Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.
   {% endtab %}
   {% endtabs %}

### Thu 12.11. — Debriefing, Recap

{% tabs %}
{% tab title="Assignments due" %}

1. Programming: Go through the [fundamental concepts of programming for humanists](https://jiemakel.gitbook.io/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists) and complete the assignments there.
2. Regular expressions: Read the section on [regular expressions](https://jiemakel.gitbook.io/cl4hss/regular-expressions) and go through the assignments there. For the second assignment, add your solutions to the Flinga here: <https://edu.flinga.fi/s/ESFEH28>
3. Research on visualization tool development:&#x20;

   Read the following two articles on developing  tools for particular text-based humanities research questions:

   * [Visualizing Mouvance: Toward a visual analysis of variant medieval text traditions](https://doi.org/10.1093/llc/fqx033)
   * [Rule‐based Visual Mappings – with a Case Study on Poetry Visualization](https://onlinelibrary.wiley.com/doi/full/10.1111/cgf.12125)

   Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.
4. Research 2: read [this research article](https://doi.org/10.1073/pnas.1405984111). Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.
   {% endtab %}

{% tab title="Meeting contents" %}

* Debriefing of programming and regular expression assignments
* Group discussion on the visualization research
* Group work on the Old Bailey research
* Recap of the course thus far
* Briefing of assignments

[slides](https://docs.google.com/presentation/d/e/2PACX-1vSfX3eqEwbjYibm3AD6KvEad55QOD1aBLA9NUjwTjrYLuanjoVOzEngXHr8RY424T6qDYnFDl0ADMAQ/pub?start=false\&loop=false\&delayms=3000)
{% endtab %}

{% tab title="Assignments given" %}

1. Select (at least) one of the following sets of paired articles based on your own interests:

   * language change, simulation: [Social networks and intraspeaker variation during periods of language change](http://repository.upenn.edu/cgi/viewcontent.cgi?article=1041\&context=pwpl) + [Utterance selection model of language change](http://journals.aps.org/pre/abstract/10.1103/PhysRevE.73.046118). Also note that you can experiment yourself with the model described in the first paper [here](http://www.netlogoweb.org/launch#http://www.netlogoweb.org/assets/modelslib/Sample%20Models/Social%20Science/Language%20Change.nlogo).
   * twitter, sentiment analysis: [What a Nasty day: Exploring Mood-Weather Relationship from Twitter](https://arxiv.org/abs/1410.8749) + [A Biased Review of Biases in Twitter Studies on Political Collective Action](https://doi.org/10.3389/fphy.2016.00034)
   * dynamics of modern-day media: [Using the president’s tweets to understand political diversion in the age of social media](https://doi.org/10.1038/s41467-020-19644-6) + [Emotive, evaluative, epistemic: a linguistic analysis of affectivity in news journalism](https://researchportal.helsinki.fi/en/publications/emotive-evaluative-epistemic-a-linguistic-analysis-of-affectivity)
   * simulation, archaeology: [Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley](https://doi.org/10.1073/pnas.092080799) + [Understanding Artificial Anasazi](http://jasss.soc.surrey.ac.uk/12/4/13.html)
   * geographic information, network analysis, archaeology: [Exploring the dynamics of transport in the Dutch limes](http://journal.topoi.org/index.php/etopoi/article/view/203) + [Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks](http://dx.doi.org/10.3389/fdigh.2016.00006)
   * history, text reuse detection: [Plundering Philosophers:Identifying Sources of the Encyclopédie](http://hdl.handle.net/2027/spo.3310410.0013.107) + [The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie](http://www.tandfonline.com/doi/pdf/10.1080/01916599.2013.774115?needAccess=true) (Interestingly, first article doesn't have affiliations. Digging thrhough, most people seem to be from this project)
   * network analysis: [Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach](https://muse.jhu.edu/journals/elh/v082/82.1.ahnert.html) + [Automated analysis of the US presidential elections using Big Data and network analysis](https://doi.org/10.1177%2F2053951715572916)
   * 3D/spatial analysis, archaeology: [A Survey of Geometric Analysis in Cultural Heritage](https://doi.org/10.1111/cgf.12668) + [A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?](https://doi.org/10.15184/aqy.2016.144)
   * Image recognition of woodcut prints: [Image-matching technology applied to Fifteenth-century printed book illustration](https://doi.org/10.1007/s40329-017-0201-5) / [Wormholes record species history in space and time](https://doi.org/10.1098/rsbl.2012.0926)

   Post which article you've selected on #research in Slack by Tuesday the 17th at the latest. Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

   1. How do the two articles relate to each other?&#x20;
   2. Research questions - What are the human research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and human research questions?
   3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?
   4. Methods - What methods do the projects apply? How do the methods support answering the research questions?
   5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
2. Explore [this topic model](http://aulis.sange.fi/~humis/tmp/ceecvis/) of [CEEC](https://www.helsinki.fi/en/researchgroups/varieng/corpus-of-early-english-correspondence) and read the [explanation](https://jiemakel.gitbook.io/cl4hss/digging-into-a-method-topic-modeling) on topic modelling
   {% endtab %}
   {% endtabs %}

### Thu 19.11. — Debriefing, Computational analysis

{% tabs %}
{% tab title="Assignments due" %}

1. Select (at least) one of the following sets of paired articles based on your own interests:

   * Language change, simulation: [Social networks and intraspeaker variation during periods of language change](http://repository.upenn.edu/cgi/viewcontent.cgi?article=1041\&context=pwpl) + [Utterance selection model of language change](http://journals.aps.org/pre/abstract/10.1103/PhysRevE.73.046118). Also note that you can experiment yourself with the model described in the first paper [here](http://www.netlogoweb.org/launch#http://www.netlogoweb.org/assets/modelslib/Sample%20Models/Social%20Science/Language%20Change.nlogo).
   * Twitter, linguistic analysis, geographical analysis: [What a Nasty day: Exploring Mood-Weather Relationship from Twitter](https://arxiv.org/abs/1410.8749) + [Mapping Lexical Innovation on American Social Media](https://doi.org/10.1177%2F0075424218793191)

     &#x20;[A Biased Review of Biases in Twitter Studies on Political Collective Action](https://doi.org/10.3389/fphy.2016.00034)
   * dynamics of modern-day media: [Using the president’s tweets to understand political diversion in the age of social media](https://doi.org/10.1038/s41467-020-19644-6) + [Emotive, evaluative, epistemic: a linguistic analysis of affectivity in news journalism](https://researchportal.helsinki.fi/en/publications/emotive-evaluative-epistemic-a-linguistic-analysis-of-affectivity)
   * Simulation, archaeology: [Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley](https://doi.org/10.1073/pnas.092080799) + [Understanding Artificial Anasazi](http://jasss.soc.surrey.ac.uk/12/4/13.html)
   * Geographic information, network analysis, archaeology: [Community structure of copper supply networks in the prehistoric Balkans: An independent evaluation of the archaeological record from the 7th to the 4th millennium BC](https://doi.org/10.1093/comnet/cnx013) + [Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks](http://dx.doi.org/10.3389/fdigh.2016.00006)
   * History, text reuse detection: [Plundering Philosophers:Identifying Sources of the Encyclopédie](http://hdl.handle.net/2027/spo.3310410.0013.107) + [The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie](http://www.tandfonline.com/doi/pdf/10.1080/01916599.2013.774115?needAccess=true) (Interestingly, first article doesn't have affiliations. Digging through, most people seem to be from [this project](https://artfl-project.uchicago.edu/content/papers-and-presentations))
   * Network analysis: [Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach](https://muse.jhu.edu/journals/elh/v082/82.1.ahnert.html) + [Automated analysis of the US presidential elections using Big Data and network analysis](https://doi.org/10.1177%2F2053951715572916)
   * 3D/spatial analysis, archaeology: [A Survey of Geometric Analysis in Cultural Heritage](https://doi.org/10.1111/cgf.12668) + [A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?](https://doi.org/10.15184/aqy.2016.144)
   * Image recognition of woodcut prints: [Image-matching technology applied to Fifteenth-century printed book illustration](https://doi.org/10.1007/s40329-017-0201-5) / [Wormholes record species history in space and time](https://doi.org/10.1098/rsbl.2012.0926)
   * Visual analysis, art history: [A Quantitative Approach to Beauty. Perceived Attractiveness of Human Faces in World Painting](https://doi.org/10.11588/dah.2015.1.21640) / [Against Digital Art History](https://humanitiesfutures.org/papers/digital-art-history/)

   Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

   1. How do the two articles relate to each other?&#x20;
   2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?
   3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?
   4. Methods - What methods do the projects apply? How do the methods support answering the research questions?
   5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
2. Explore [this topic model](http://aulis.sange.fi/~humis/tmp/ceecvis/) of [CEEC](https://www.helsinki.fi/en/researchgroups/varieng/corpus-of-early-english-correspondence) and read the [explanation](https://jiemakel.gitbook.io/cl4hss/digging-into-a-method-topic-modeling) on topic modelling
   {% endtab %}

{% tab title="Meeting contents" %}

* Group presentations on research articles
* [Lecturer presentation on research using computational analysis](https://docs.google.com/presentation/d/e/2PACX-1vQh0aFghShAvO3BWk5MTQC4wLBkKYCWF1GTqZZkZ9Y7UZYyegDJmGzc7-T9QGGtUE_FV4S-EVUXuaDS/pub?start=false\&loop=false\&delayms=3000)
* Briefing of assignments
  {% endtab %}

{% tab title="Assignments given" %}
For Thu 26.11. (in one week):

1. Form a group with people from your own or nearby fields. Find a computational research paper from your field. For class, prepare a presentation on the article, detailing:
   1. Research questions - What are the human research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and human research questions?
   2. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?
   3. Methods - What methods do the projects apply? How do the methods support answering the research questions?
   4. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
2. Write a one to two page plan of what you'll do for your [final project](https://jiemakel.gitbook.io/cl4hss/final-project). Discuss the following:

   1. What are your humanities research questions?
   2. Which data will you use?
   3. How do you plan to process, clean up and transform your data?
   4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?
   5. Critically analyze your data and pipeline for potential bias and problems.&#x20;

   These will be peer reviewed. Return the plan through Moodle at <https://moodle.helsinki.fi/course/view.php?id=36622>.
   {% endtab %}
   {% endtabs %}

### Thu 26.11. — Debriefing, Computational analysis

{% tabs %}
{% tab title="Assignments due" %}

1. Form a group with people from your own or nearby fields. Find a computational research paper from your field. For class, prepare a presentation on the article, detailing:
   1. Research questions - What are the human research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and human research questions?
   2. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?
   3. Methods - What methods do the projects apply? How do the methods support answering the research questions?
   4. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
2. Write a one to two -page plan of what you'll do for your [final project](https://jiemakel.gitbook.io/cl4hss/final-project). Discuss the following:

   1. What are your humanities research questions?
   2. Which data will you use?
   3. How do you plan to process, clean up and transform your data?
   4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?
   5. Critically analyze your data and pipeline for potential bias and problems.&#x20;

   These will be peer-reviewed. Return the plan through Moodle at <https://moodle.helsinki.fi/course/view.php?id=36622>.

   .
   {% endtab %}

{% tab title="Meeting contents" %}

* Group presentations on research
* Group assignment on topic modelling
* [Lecture on computational analysis](https://docs.google.com/presentation/d/e/2PACX-1vQfa8arUdmrbczoj75R806mwTBtzQsjVacMvHbmL-3sA_rAzPClc0WgTFSNnRK9ZphZ37c2oLolNhj-/pub?start=false\&loop=false\&delayms=3000)
* Briefing of assignments
  {% endtab %}

{% tab title="Assignments given" %}
For Tue 1.12. (in five days):

1. Peer review two project plans of your fellow students

For Thu 3.12. (in one week):

1. Be prepared to shortly (1-2 minutes max) present your project plan to the others
   {% endtab %}
   {% endtabs %}

### Thu 3.12. — Debriefing, Open, reproducible research

{% tabs %}
{% tab title="Assignments due" %}

1. (Tue 1.12.) Peer review two project plans of your fellow students
2. Be prepared to shortly (1-2 minutes max) present your project plan to the others
   {% endtab %}

{% tab title="Meeting contents" %}

* Project plan presentations
* [Lecture on open, reproducible research](https://docs.google.com/presentation/d/e/2PACX-1vSBysmBut7oJX5DrsiN-5MNlsDYK8n2B5P3EUzc1nDtFujouuBW-FWWgcYH7QKxrwP7mFWDix__nNtC/pub?start=false\&loop=false\&delayms=3000)
  {% endtab %}

{% tab title="Assignments given" %}
For Thu 17.12. (in two weeks):

1. [Final project](https://jiemakel.gitbook.io/cl4hss/final-project)
   {% endtab %}
   {% endtabs %}

### Thu 17.12. — Deadline for returning final project
