# Helsinki fall 2018

## Timetable for fall 2018

### Wednesday 31.10. — [Introduction](https://jiemakel.gitbook.io/cl4hss/introduction-three-approaches-to-methods-for-digital-humanists)

{% tabs %}
{% tab title="Assignments given" %}
For 2.11.:&#x20;

1. Answer the course background [questionnaire](https://goo.gl/forms/gQpLPyOVV4ZvtL1x1)
2. Look over the [final projects from last year](https://jiemakel.gitbook.io/cl4hss/final-project#submissions-from-previous-years). Select the project that interests you the most. Post a short message on the [#introductions](https://slack.com/app_redirect?channel=introductions\&team=T276JCMEU) channel on the course Slack to introduce yourself and to describe why you chose those that project.
3. Read up on [the history of humanities computing](https://jiemakel.gitbook.io/cl4hss/history-of-humanities-computing#history-of-humanities-computing). Be ready to discuss in groups on the next lecture.
   {% endtab %}
   {% endtabs %}

### Friday 2.11. — [Different types of data, data quality, available open datasets](https://docs.google.com/presentation/d/e/2PACX-1vQBZw3s36wdPtPf3gTcz-PW5eRND-59TIrmZki67lR341ycKuQcO1TjvB0Vn6wYJsNySN8QOSzuEDyq/pub?start=false\&loop=false\&delayms=3000)&#x20;

([pdf](https://docs.google.com/presentation/d/1_DI2ynbeQMigaQYbSryGUsTLGH-TKLF0dpzfs3_39Sk/export/pdf), [gd](https://docs.google.com/presentation/d/1_DI2ynbeQMigaQYbSryGUsTLGH-TKLF0dpzfs3_39Sk/edit?usp=sharing))

{% tabs %}
{% tab title="Assignments due" %}

1. Answer the course background [questionnaire](https://goo.gl/forms/gQpLPyOVV4ZvtL1x1)
2. Look over the [final projects from last year](https://jiemakel.gitbook.io/cl4hss/final-project#submissions-from-previous-years). Select the project that interests you the most. Post a short message on the [#introductions](https://slack.com/app_redirect?channel=introductions\&team=T276JCMEU) channel on the course Slack to introduce yourself and to describe why you chose those that project.
3. Read up on [the history of humanities computing](https://jiemakel.gitbook.io/cl4hss/history-of-humanities-computing#history-of-humanities-computing). Be ready to discuss in groups on the next lecture.
   {% endtab %}

{% tab title="Assignments given" %}
Assignments for 7.11.:

1. Find a dataset that could be of interest to you in your final project. Post a message on [#datasets](https://slack.com/app_redirect?channel=datasets\&team=T276JCMEU) on Slack giving a link to the dataset and a note on why you selected it.
2. Read [Perception deception](https://infoactive.co/data-design/ch17.html) & [Common visualization mistakes](https://infoactive.co/data-design/ch18.html) as preparation for next week, learning to not trust visualisations blind.
   {% endtab %}
   {% endtabs %}

### Wednesday 7.11. — [Easy tools for processing and exploring data](https://docs.google.com/presentation/d/e/2PACX-1vQ0GNUtEwkYQ4NyRki6SohJ2DLS0wt4MKF3cVzuU7UlLq9yUij5Qd2ZgFltEb8KcPp7aYOXrSLFMdYa/pub?start=false\&loop=false\&delayms=3000)

([pdf](https://docs.google.com/presentation/d/1RF4s0AJuoVUAQIdw3c4Sf5ozLIb2Kd_vNN1SXp8IqFg/export/pdf), [gd](https://docs.google.com/presentation/d/1RF4s0AJuoVUAQIdw3c4Sf5ozLIb2Kd_vNN1SXp8IqFg/edit?usp=sharing))

{% tabs %}
{% tab title="Assignments due" %}

1. Find a dataset that could be of interest to you in your final project. Post a message on [#datasets](https://slack.com/app_redirect?channel=datasets\&team=T276JCMEU) on Slack giving a link to the dataset and a note on why you selected it.
2. Read [Perception deception](https://infoactive.co/data-design/ch17.html) & [Common visualization mistakes](https://infoactive.co/data-design/ch18.html) as preparation for next week, learning to not trust visualisations blind.
   {% endtab %}

{% tab title="Assignments given" %}
Two full weeks to do these, need to be done only by 21.11.:

1. Data cleanup: complete the [OpenRefine tutorial](https://programminghistorian.org/lessons/cleaning-data-with-openrefine).
2. Visualisation: Experiment with at least one of the following tools:

   * tabular data → chart visualisations: [RAW](http://rawgraphs.io/)​
   * tabular data → chart visualisations: [Voyager](http://vega.github.io/voyager/)
   * tabular data → chart visualisations: ​[Tableau](https://www.tableau.com/)​
   * tabular data → ​interactive map/network/timeline/list/facet visualisations: [Palladio](https://moodle.helsinki.fi/hdlab.stanford.edu/palladio/)​
     * Palladio has [help pages](http://hdlab.stanford.edu/palladio/help/). There are also multiple tutorials on using Palladio, for example [this one](http://miriamposner.com/blog/getting-started-with-palladio/), or [this one](https://programminghistorian.org/en/lessons/creating-network-diagrams-from-historical-sources) which is particularly on network analysis.
   * tabular data → map(+timeline) visualisations: ​[Carto](https://carto.com/)​
   * ​text →​ interactive explorative interface for linguistic study: [Voyant tools](https://voyant-tools.org/)​
   * ​big, preselected collections of text → interface for linguistic study: [Korp](https://moodle.helsinki.fi/korp.csc.fi) / [corpus.byu.edu](http://corpus.byu.edu/)​
   * If you're feeling explorative, feel free to also dig for more tools in  [TAPoR](http://tapor.ca/home).

   If you're short on inspiration, feel free to go through [this](https://docs.google.com/document/d/13I7svLlqrg7i0iisw2E_v48Gae5tnXVFWxmeHyGAKFU/edit) hands-on tutorial covering OpenRefine, RAW and Palladio.

   Afterwards, post a message on [#tools](https://slack.com/app_redirect?channel=tools\&team=T276JCMEU) on Slack detailing:

   1. What is the tool good for?
   2. What kind of data do you need for the tool to be useful?&#x20;
      1. What information does the data need to contain?
      2. What format does it have to be in?
   3. Your experience with the tool.

   If someone has already posted on the tool you tested, don't repeat them. Instead, add to what they've said in a thread.\
   (also be prepared to discuss the tools in class)
3. Programming: Go through the [fundamental concepts of programming for humanists](https://jiemakel.gitbook.io/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists) and complete the assignments there.
4. Regular expressions: Read the section on [regular expressions](https://jiemakel.gitbook.io/cl4hss/regular-expressions) and go through the assignments there.
5. In preparation for the lecture on 21.11., read [this research article](https://doi.org/10.1073/pnas.1405984111).
   {% endtab %}
   {% endtabs %}

### Friday 9.11. — **No lecture**

### Wednesday 14.11. — Clinic for support in the assignments

### Friday 16.11. — **No lecture**

### Wednesday 21.11. — [Fundamental concepts of statistics](https://jiemakel.gitbook.io/cl4hss/data-analysis-fundamental-concepts-of-statistics)&#x20;

{% tabs %}
{% tab title="Assignments due" %}

1. Data cleanup: complete the [OpenRefine tutorial](https://programminghistorian.org/lessons/cleaning-data-with-openrefine).
2. Visualisation: Experiment with at least one of the following tools:

   * tabular data → chart visualisations: [RAW](http://rawgraphs.io/)​
   * tabular data → chart visualisations: [Voyager](http://vega.github.io/voyager/)
   * tabular data → chart visualisations: ​[Tableau](https://www.tableau.com/)​
   * tabular data → ​interactive map/network/timeline/list/facet visualisations: [Palladio](https://moodle.helsinki.fi/hdlab.stanford.edu/palladio/)​
     * Palladio has [help pages](http://hdlab.stanford.edu/palladio/help/). There are also multiple tutorials on using Palladio, for example [this one](http://miriamposner.com/blog/getting-started-with-palladio/), or [this one](https://programminghistorian.org/en/lessons/creating-network-diagrams-from-historical-sources) which is particularly on network analysis.
   * tabular data → map(+timeline) visualisations: ​[Carto](https://carto.com/)​
   * ​text →​ interactive explorative interface for linguistic study: [Voyant tools](https://voyant-tools.org/)​
   * ​big, preselected collections of text → interface for linguistic study: [Korp](https://moodle.helsinki.fi/korp.csc.fi) / [corpus.byu.edu](http://corpus.byu.edu/)​
   * If you're feeling explorative, feel free to also dig for more tools in  [TAPoR](http://tapor.ca/home).

   If you're short on inspiration, feel free to go through [this](https://docs.google.com/document/d/13I7svLlqrg7i0iisw2E_v48Gae5tnXVFWxmeHyGAKFU/edit) hands-on tutorial covering OpenRefine, RAW and Palladio.

   Afterwards, post a message on [#tools](https://slack.com/app_redirect?channel=tools\&team=T276JCMEU) on Slack detailing:

   1. What is the tool good for?
   2. What kind of data do you need for the tool to be useful?&#x20;
      1. What information does the data need to contain?
      2. What format does it have to be in?
   3. Your experience with the tool.

   If someone has already posted on the tool you tested, don't repeat them. Instead, add to what they've said in a thread.\
   (also be prepared to discuss the tools in class)
3. Programming: Go through the [fundamental concepts of programming for humanists](https://jiemakel.gitbook.io/cl4hss/data-processing-fundamental-concepts-of-programming-for-humanists) and complete the assignments there.
4. Regular expressions: Read the section on [regular expressions](https://jiemakel.gitbook.io/cl4hss/regular-expressions) and go through the assignments there.
5. In preparation for the lecture on 21.11., read [this research article](https://doi.org/10.1073/pnas.1405984111).
   {% endtab %}

{% tab title="Assignments given" %}
In preparation for next Wednesday's (28.11.) lecture, select (at least) one of the following sets of paired articles based on your own interests:&#x20;

* language change, simulation: [Social networks and intraspeaker variation during periods of language change](http://repository.upenn.edu/cgi/viewcontent.cgi?article=1041\&context=pwpl) + [Utterance selection model of language change](http://journals.aps.org/pre/abstract/10.1103/PhysRevE.73.046118). Also note that you can experiment yourself with the model described in the first paper [here](http://www.netlogoweb.org/launch#http://www.netlogoweb.org/assets/modelslib/Sample%20Models/Social%20Science/Language%20Change.nlogo).
* twitter, sentiment analysis: [What a Nasty day: Exploring Mood-Weather Relationship from Twitter](https://arxiv.org/abs/1410.8749) + [A Biased Review of Biases in Twitter Studies on Political Collective Action](https://doi.org/10.3389/fphy.2016.00034)
* simulation, archaeology: [Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley](https://doi.org/10.1073/pnas.092080799) + [Understanding Artificial Anasazi](http://jasss.soc.surrey.ac.uk/12/4/13.html)
* geographic information, network analysis, archaeology: [Exploring the dynamics of transport in the Dutch limes](http://journal.topoi.org/index.php/etopoi/article/view/203) + [Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks](http://dx.doi.org/10.3389/fdigh.2016.00006)
* history, text reuse detection: [Plundering Philosophers:Identifying Sources of the Encyclopédie](http://hdl.handle.net/2027/spo.3310410.0013.107) + [The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie](http://www.tandfonline.com/doi/pdf/10.1080/01916599.2013.774115?needAccess=true) (Interestingly, first article doesn't have affiliations. Digging thrhough, most people seem to be from this project)
* network analysis: [Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach](https://muse.jhu.edu/journals/elh/v082/82.1.ahnert.html) + [Automated analysis of the US presidential elections using Big Data and network analysis](https://doi.org/10.1177%2F2053951715572916)
* 3D/spatial analysis, archaeology: [A Survey of Geometric Analysis in Cultural Heritage](https://doi.org/10.1111/cgf.12668) + [A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?](https://doi.org/10.15184/aqy.2016.144)
* Image recognition of woodcut prints: [Image-matching technology applied to Fifteenth-century printed book illustration](https://doi.org/10.1007/s40329-017-0201-5) / [Wormholes record species history in space and time](https://doi.org/10.1098/rsbl.2012.0926)

Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

1. How do the two articles relate to each other?&#x20;
2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?
3. Data - How has the data used been gathered? What are the data sources used? How has the data been processed? Is the data available for others to use?
4. Methods - What methods do the projects apply? How do the methods support answering the research questions?
5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
   {% endtab %}
   {% endtabs %}

### Friday 23.11. — [Fundamental concepts of statistics](https://jiemakel.gitbook.io/cl4hss/data-analysis-fundamental-concepts-of-statistics) / [Computational data analysis literacy](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/computational-data-analysis-method-literacy)

{% tabs %}
{% tab title="Assignments given" %}

1. [Explore bootstrapping](http://www.lock5stat.com/StatKey/bootstrap_1_quant/bootstrap_1_quant.html)
2. Check out the [Explained Visually](http://setosa.io/ev/) site, and especially [PCA explained visually](http://setosa.io/ev/principal-component-analysis/)
3. Read on some small, actual work:
   1. The [presentation](https://docs.google.com/presentation/d/1eTj9Vu4WoatnutrjtzpfBk6EwgRwC9YlCAzW7GvoaeA/edit) of the DHH15 key concepts of socialism group
   2. The [presentation](https://prezi.com/g-0b_r604myv/dhh15-multimodality/?utm_campaign=share\&utm_medium=copy) of the DHH15 Finnair Blue Wings multimodality group
   3. If you understand Finnish, the [election questionnaire analysis and visualisation](https://jiemakel.github.io/vaalikonedatavisu/)
4. For 5.12., explore [this topic model](http://aulis.sange.fi/~humis/tmp/ceecvis/) of [CEEC](https://www.helsinki.fi/en/researchgroups/varieng/corpus-of-early-english-correspondence) and read the [explanation](https://jiemakel.gitbook.io/cl4hss/digging-into-a-method-topic-modeling) on topic modelling
   {% endtab %}
   {% endtabs %}

### Wednesday 28.11. — [Computational data analysis literacy](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/computational-data-analysis-method-literacy), part 2

{% tabs %}
{% tab title="Assignments due" %}

1. [Explore bootstrapping](http://www.lock5stat.com/StatKey/bootstrap_1_quant/bootstrap_1_quant.html)
2. Check out the [Explained Visually](http://setosa.io/ev/) site, and especially [PCA explained visually](http://setosa.io/ev/principal-component-analysis/)
3. Read on some small, actual work:
   1. The [presentation](https://docs.google.com/presentation/d/1eTj9Vu4WoatnutrjtzpfBk6EwgRwC9YlCAzW7GvoaeA/edit) of the DHH15 key concepts of socialism group
   2. The [presentation](https://prezi.com/g-0b_r604myv/dhh15-multimodality/?utm_campaign=share\&utm_medium=copy) of the DHH15 Finnair Blue Wings multimodality group
   3. If you understand Finnish, the [election questionnaire analysis and visualisation](https://jiemakel.github.io/vaalikonedatavisu/)

Select (at least) one of the following sets of paired articles based on your own interests:&#x20;

* language change, simulation: [Social networks and intraspeaker variation during periods of language change](http://repository.upenn.edu/cgi/viewcontent.cgi?article=1041\&context=pwpl) + [Utterance selection model of language change](http://journals.aps.org/pre/abstract/10.1103/PhysRevE.73.046118). Also note that you can experiment yourself with the model described in the first paper [here](http://www.netlogoweb.org/launch#http://www.netlogoweb.org/assets/modelslib/Sample%20Models/Social%20Science/Language%20Change.nlogo).
* twitter, sentiment analysis: [What a Nasty day: Exploring Mood-Weather Relationship from Twitter](https://arxiv.org/abs/1410.8749) + [A Biased Review of Biases in Twitter Studies on Political Collective Action](https://doi.org/10.3389/fphy.2016.00034)
* simulation, archaeology: [Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley](https://doi.org/10.1073/pnas.092080799) + [Understanding Artificial Anasazi](http://jasss.soc.surrey.ac.uk/12/4/13.html)
* geographic information, network analysis, archaeology: [Exploring the dynamics of transport in the Dutch limes](http://journal.topoi.org/index.php/etopoi/article/view/203) + [Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks](http://dx.doi.org/10.3389/fdigh.2016.00006)
* history, text reuse detection: [Plundering Philosophers:Identifying Sources of the Encyclopédie](http://hdl.handle.net/2027/spo.3310410.0013.107) + [The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie](http://www.tandfonline.com/doi/pdf/10.1080/01916599.2013.774115?needAccess=true) (Interestingly, first article doesn't have affiliations. Digging thrhough, most people seem to be from this project)
* network analysis: [Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach](https://muse.jhu.edu/journals/elh/v082/82.1.ahnert.html) + [Automated analysis of the US presidential elections using Big Data and network analysis](https://doi.org/10.1177%2F2053951715572916)
* 3D/spatial analysis, archaeology: [A Survey of Geometric Analysis in Cultural Heritage](https://doi.org/10.1111/cgf.12668) + [A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?](https://doi.org/10.15184/aqy.2016.144)
* Image recognition of woodcut prints: [Image-matching technology applied to Fifteenth-century printed book illustration](https://doi.org/10.1007/s40329-017-0201-5) / [Wormholes record species history in space and time](https://doi.org/10.1098/rsbl.2012.0926)

Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

1. How do the two articles relate to each other?&#x20;
2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?
3. Data - How has the data used been gathered? What are the data sources used? How has the data been processed? Is the data available for others to use?
4. Methods - What methods do the projects apply? How do the methods support answering the research questions?
5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?
   {% endtab %}

{% tab title="Presentations" %}

* [Plundering philosophers](https://docs.google.com/presentation/d/1l6Yl03WLc_Bcv7oEd2IqSRJ1gM1-EISQ9k7t6pcWQwg/edit#slide=id.gc6f8954bc_0_53)
* [Twitter](https://docs.google.com/presentation/d/1zs0d4LbGG-IcFguJlWag-FbHw0CCB4b9JQLujshXhqA/edit#slide=id.p)
* [3D archaeology](https://docs.google.com/presentation/d/1Hg3prb9lq5ywrw3yzHqN6-kqDYFGH-pl6SGNjZLKKY8/edit)
  {% endtab %}
  {% endtabs %}

### Friday 30.11. — **No lecture**

### Wednesday 5.12. — [Computational data analysis literacy](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/computational-data-analysis-method-literacy), part 3

{% tabs %}
{% tab title="Assignments due" %}

* Explore [this topic model](http://aulis.sange.fi/~humis/tmp/ceecvis/) of [CEEC](https://www.helsinki.fi/en/researchgroups/varieng/corpus-of-early-english-correspondence) and read the [explanation](https://jiemakel.gitbook.io/cl4hss/digging-into-a-method-topic-modeling) on topic modelling
  {% endtab %}

{% tab title="Assignments given" %}

1. Find a computational humanities research paper that interests you. Post a message on [#research](https://slack.com/app_redirect?channel=research\&team=T276JCMEU) on Slack shortly describing why you picked the paper.&#x20;
2. Prepare to shortly (max 2 minutes) present your current idea for your final project on the lecture on&#x20;
   {% endtab %}
   {% endtabs %}

### Friday 7.12. — [Open, reproducible research and publishing](https://jiemakel.gitbook.io/cl4hss/three-approaches-to-methods-for-digital-humanities-work-area/open-reproducible-research-and-publishing) / [Final project](https://jiemakel.gitbook.io/cl4hss/final-project)

### Wednesday 12.12. — **No lecture, remote support for final project**

### Friday 14.12. — **No lecture, remote support for final project**

### **Friday 21.12.** — **Deadline for returning**[ **final project**](https://jiemakel.gitbook.io/cl4hss/final-project)
