Computational literacy
  • Computational literacy for the humanities and social sciences
  • Three approaches to computational methods
  • History of humanities computing
  • Data processing: fundamental concepts of programming for humanists and social scientists
  • Data processing: regular expressions
  • Data analysis: fundamental concepts of statistics
    • Understanding and describing groups
    • What is average?
    • Uncertainty in describing groups
    • What is a sensible group to describe?
    • Comparing groups
    • Understanding relationships
  • Digging into a method: topic modeling
  • Final project
  • Where to continue?
  • Course instances
    • Helsinki fall 2021
    • Helsinki fall 2020
    • Helsinki fall 2019
    • Helsinki fall 2018
  • Holding area for unfinished content
    • Data
    • Easy tools for acquiring, processing and exploring data
    • Computational data analysis method literacy
    • Open, reproducible research and publishing
Powered by GitBook
On this page
  • Mon 28.10. — Introduction
  • Wed 30.10. — Debriefing of assignments, Different types of data, data quality, available open datasets
  • Mon 4.11. — No meeting, time for out-of-class work
  • Wed 6.11. — No meeting, time for out-of-class work
  • Mon 11.11. — Debriefing of assignments, Programming
  • Between — Programming, Research
  • Wed 13.11. Continuation of tool and dataset presentations, support clinic for programming
  • Between — Programming, Research
  • Mon 18.11. — Class cancelled due to illness
  • Between — Programming, Research
  • Wed 20.11. — Debriefing, Statistics
  • Between — Statistics, Computational analysis
  • Mon 25.11. — Debriefing, Computational analysis
  • Between — Research
  • Wed 27.11. — Debriefing, Computational analysis
  • Between — Research, Final project planning
  • Mon 2.12. — Debriefing, Open, reproducible research
  • Between — Final project
  • Wed 4.12. — No meeting
  • Mon 9.12. — No meeting
  • Wed 11.12. — Optional support clinic for final project
  • Sun 5.1. — Deadline for returning final project

Was this helpful?

  1. Course instances

Helsinki fall 2019

Timetable for fall 2019

PreviousHelsinki fall 2020NextHelsinki fall 2018

Last updated 4 years ago

Was this helpful?

Mon 28.10. —

  • Activatory pair discussion

  • Flinga questionnaire on background and interests

  • Briefing of assignments

For 30.10. (in two days):

  1. Answer the course background (~5min)

  2. Look over the . Select the project that interests you the most. Be prepared to discuss why you chose those that project in class. (~20-40min)

For 6.11. (in 2 weeks, but more assignments will be given on Wednesday, so start already):

  1. Read up on and do the assignment mentioned there (~1-2h)

Wed 30.10. — Debriefing of assignments, Different types of data, data quality, available open datasets

  1. Answer the course background

  2. Look over the . Select the project that interests you the most. Be prepared to discuss why you chose those that project in class.

  • Group discussion of projects from previous years

  • Briefing of assignments on data and tools

For 11.11. (in 1½ week):

  1. Read up on and do the assignment mentioned there (~1-2h).

  2. Find a that could be of interest to you in your final project. Be prepared to discuss in class:

    1. why you chose those that dataset,

    2. what types of information does it contain,

    3. what the structure, technical format and way of accessing the data is, and

    4. what potential sources of problems or biases does it have. (~15-45min)

  3. Tools:

    1. Data cleanup: complete the . (~30-60min)

    2. Experiment with at least one of the following tools (~30-60min + ~15-30min):

      1. Visualization:

        • tabular data → chart visualisations: ​

        • tabular data → chart visualisations:

        • tabular data → chart visualisations: ​​

        • tabular data → ​interactive map/network/timeline/list/facet visualisations: ​

          • Palladio has . There are also multiple tutorials on using Palladio, for example , or which is particularly on network analysis.

        • tabular data → map(+timeline) visualisations: ​​

        • ​text →​ interactive explorative interface for linguistic study: ​

        • ​big, preselected collections of text → interface for linguistic study: / ​

        • If you're feeling explorative, feel free to also dig for more tools in .

      2. Data acquisition:

        1. Hand-written text transcription:

        2. Layout and text transcription:

        3. Keyword generation from text:

        4. An

        5. An automated image/video description tool

        6. Twitter archiving:

      If you're short on inspiration, feel free to go through hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration of the tool for class, describing:

      1. What is the tool good for?

      2. What kind of data do you need for the tool to be useful?

        1. What information does the data need to contain?

        2. What format does it have to be in?

      3. Your experience with the tool.

      4. For groups studying visualization tools, also read & , and reflect on how likely it is that you could use the visualizations to deceive yourself

Mon 4.11. — No meeting, time for out-of-class work

Wed 6.11. — No meeting, time for out-of-class work

Mon 11.11. — Debriefing of assignments, Programming

  1. Read up on and do the assignment mentioned there (~1-2h).

  2. Find a that could be of interest to you in your final project. Be prepared to discuss in class:

    1. why you chose those that dataset,

    2. what types of information does it contain,

    3. what the structure, technical format and way of accessing the data is, and

    4. what potential sources of problems or biases does it have. (~15-45min)

  3. Tools:

    1. Data cleanup: complete the . (~30-60min)

    2. Experiment with at least one of the following tools (~30-60min + ~15-30min):

      1. Visualization:

        • tabular data → chart visualisations: ​

        • tabular data → chart visualisations:

        • tabular data → chart visualisations: ​​

        • tabular data → ​interactive map/network/timeline/list/facet visualisations: ​

          • Palladio has . There are also multiple tutorials on using Palladio, for example , or which is particularly on network analysis.

        • tabular data → map(+timeline) visualisations: ​​

        • ​text →​ interactive explorative interface for linguistic study: ​

        • ​big, preselected collections of text → interface for linguistic study: / ​

        • If you're feeling explorative, feel free to also dig for more tools in .

      2. Data acquisition:

        1. Hand-written text transcription:

        2. Layout and text transcription:

        3. Keyword generation from text:

        4. An

        5. An automated image/video description tool

        6. Twitter archiving:

      If you're short on inspiration, feel free to go through hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration of the tool for class, describing:

      1. What is the tool good for?

      2. What kind of data do you need for the tool to be useful?

        1. What information does the data need to contain?

        2. What format does it have to be in?

      3. Your experience with the tool.

      4. For groups studying visualization tools, also read & , and reflect on how likely it is that you could use the visualizations to deceive yourself

  • Group discussion of the history of humanities computing

  • Group discussion of datasets

  • Group presentations of tools

  • Briefing of assignments on programming and research

For 18.11. (in 1 week):

  1. Research on visualization tool development:

    Read the following two articles on developing tools for particular text-based humanities research questions:

    Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.

Between — Programming, Research

Wed 13.11. Continuation of tool and dataset presentations, support clinic for programming

Due to not being able to go through all tools on the Monday session, about an hour of this session will be used to go through the rest. After that, I'll be available to help you do the programming assignments.

Between — Programming, Research

Mon 18.11. — Class cancelled due to illness

Between — Programming, Research

Wed 20.11. — Debriefing, Statistics

  1. Research on visualization tool development:

    Read the following two articles on developing tools for particular text-based humanities research questions:

    Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.

  • Debriefing of programming and regular expression assignments

  • Group discussion on the visualization research

  • Group work on the Old Bailey research

  • Briefing of assignments

  1. (Do the assignments on statistics (not yet ready, but will contain the following in addition to other stuff):

  2. Select (at least) one of the following sets of paired articles based on your own interests:

    Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

    1. How do the two articles relate to each other?

    2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?

    3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?

    4. Methods - What methods do the projects apply? How do the methods support answering the research questions?

    5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?

Between — Statistics, Computational analysis

Mon 25.11. — Debriefing, Computational analysis

  1. Select (at least) one of the following sets of paired articles based on your own interests:

    Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

    1. How do the two articles relate to each other?

    2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?

    3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?

    4. Methods - What methods do the projects apply? How do the methods support answering the research questions?

    5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?

  • Group presentations on research articles

  • Briefing of assignments

Between — Research

Wed 27.11. — Debriefing, Computational analysis

  • Group assignment on topic modelling

  • Briefing of assignments

  1. Find (in groups if you like) a computational humanities research paper that interests you. Prepare to present it in class.

    1. What are your humanities research questions?

    2. Which data will you use?

    3. How do you plan to process, clean up and transform your data?

    4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?

    5. Critically analyze your data and pipeline for potential bias and problems.

Between — Research, Final project planning

Mon 2.12. — Debriefing, Open, reproducible research

  1. Find (in groups if you like) a computational humanities research paper that interests you. Prepare to present it in class.

    1. What are your humanities research questions?

    2. Which data will you use?

    3. How do you plan to process, clean up and transform your data?

    4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?

    5. Critically analyze your data and pipeline for potential bias and problems.

  • Research article presentations

  • Briefing on evaluation of project plans

Between — Final project

Wed 4.12. — No meeting

Mon 9.12. — No meeting

Wed 11.12. — Optional support clinic for final project

Sun 5.1. — Deadline for returning final project

Programming: Go through the and complete the assignments there.

Regular expressions: Read the section on and go through the assignments there.

Research 2: read which we went through quickly on the first lecture. Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.

Programming: Go through the and complete the assignments there.

Regular expressions: Read the section on and go through the assignments there.

Research 2: read which we went through quickly on the first lecture. Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.

Check out the site, and especially )

language change, simulation: + . Also note that you can experiment yourself with the model described in the first paper .

twitter, sentiment analysis: +

simulation, archaeology: +

geographic information, network analysis, archaeology: +

history, text reuse detection: + (Interestingly, first article doesn't have affiliations. Digging thrhough, most people seem to be from this project)

network analysis: +

3D/spatial analysis, archaeology: +

Image recognition of woodcut prints: /

For 27.11. explore of and read the on topic modelling

Language change, simulation: + . Also note that you can experiment yourself with the model described in the first paper .

Twitter, linguistic analysis, geographical analysis: +

Simulation, archaeology: +

Geographic information, network analysis, archaeology: +

History, text reuse detection: + (Interestingly, first article doesn't have affiliations. Digging through, most people seem to be from )

Network analysis: +

3D/spatial analysis, archaeology: +

Image recognition of woodcut prints: /

Visual analysis, art history: /

Explore of and read the on topic modelling

Explore of and read the on topic modelling

Write a one to two page plan of what you'll do for your . Discuss the following:

These will be peer reviewed. Return the assignments at

Write a one to two page plan of what you'll do for your . Discuss the following:

These will be peer reviewed. Return the assignments at

slides
fundamental concepts of programming for humanists
regular expressions
Visualizing Mouvance: Toward a visual analysis of variant medieval text traditions
Rule‐based Visual Mappings – with a Case Study on Poetry Visualization
this research article
fundamental concepts of programming for humanists
regular expressions
Visualizing Mouvance: Toward a visual analysis of variant medieval text traditions
Rule‐based Visual Mappings – with a Case Study on Poetry Visualization
this research article
Lecture on statistics
Explore bootstrapping
Explained Visually
PCA explained visually
Social networks and intraspeaker variation during periods of language change
Utterance selection model of language change
here
What a Nasty day: Exploring Mood-Weather Relationship from Twitter
A Biased Review of Biases in Twitter Studies on Political Collective Action
Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley
Understanding Artificial Anasazi
Exploring the dynamics of transport in the Dutch limes
Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks
Plundering Philosophers:Identifying Sources of the Encyclopédie
The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie
Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach
Automated analysis of the US presidential elections using Big Data and network analysis
A Survey of Geometric Analysis in Cultural Heritage
A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?
Image-matching technology applied to Fifteenth-century printed book illustration
Wormholes record species history in space and time
this topic model
CEEC
explanation
Social networks and intraspeaker variation during periods of language change
Utterance selection model of language change
here
What a Nasty day: Exploring Mood-Weather Relationship from Twitter
Mapping Lexical Innovation on American Social Media
A Biased Review of Biases in Twitter Studies on Political Collective Action
Population growth and collapse in a multiagent model of the Kayenta Anasazi in Long House Valley
Understanding Artificial Anasazi
Community structure of copper supply networks in the prehistoric Balkans: An independent evaluation of the archaeological record from the 7th to the 4th millennium BC
Testing the Robustness of Local Network Metrics in Research on Archeological Local Transport Networks
Plundering Philosophers:Identifying Sources of the Encyclopédie
The Use and Abuse of the Digital Humanities in the History of Ideas: How to Study the Encyclopédie
this project
Protestant Letter Networks in the Reign of Mary I: A Quantitative Approach
Automated analysis of the US presidential elections using Big Data and network analysis
A Survey of Geometric Analysis in Cultural Heritage
A GIS-based viewshed analysis of Chacoan tower kivas in the US Southwest: were they for seeing or to be seen?
Image-matching technology applied to Fifteenth-century printed book illustration
Wormholes record species history in space and time
A Quantitative Approach to Beauty. Perceived Attractiveness of Human Faces in World Painting
Against Digital Art History
Lecturer presentation on research using computational analysis
this topic model
CEEC
explanation
this topic model
CEEC
explanation
Lecture on computational analysis
final project
https://moodle.helsinki.fi/course/view.php?id=36622
final project
https://moodle.helsinki.fi/course/view.php?id=36622
Lecture on computational analysis
Lecture on open, reproducible research
Introduction
Introduction, practicalities -lecture
questionnaire
questionnaire
Lecture on data
dataset
OpenRefine tutorial
RAW
Voyager
Tableau
Palladio
help pages
this one
this one
Carto
Voyant tools
Korp
corpus.byu.edu
TAPoR
Transkribus
OCR4all
Annif
automated sound transcription tool
TAGS
this
Perception deception
Common visualization mistakes
dataset
OpenRefine tutorial
RAW
Voyager
Tableau
Palladio
help pages
this one
this one
Carto
Voyant tools
Korp
corpus.byu.edu
TAPoR
Transkribus
OCR4all
Annif
automated sound transcription tool
TAGS
this
Perception deception
Common visualization mistakes
the history of humanities computing
the history of humanities computing
the history of humanities computing
final projects from previous years
final projects from previous years