Helsinki fall 2019

Preliminary timetable for fall 2019

Mon 28.10. Introduction

Meeting contents
Assignments given
Meeting contents
Assignments given

For 30.10. (in two days):

  1. Answer the course background questionnaire (~5min)

  2. Look over the final projects from previous years. Select the project that interests you the most. Be prepared to discuss why you chose those that project in class. (~20-40min)

For 6.11. (in 2 weeks, but more assignments will be given on Wednesday, so start already):

  1. Read up on the history of humanities computing and do the assignment mentioned there (~1-2h)

Wed 30.10. — Debriefing of assignments, Different types of data, data quality, available open datasets

Assignments due
Meeting contents
Assignments given
Assignments due
  1. Answer the course background questionnaire

  2. Look over the final projects from previous years. Select the project that interests you the most. Be prepared to discuss why you chose those that project in class.

Meeting contents
  • Group discussion of projects from previous years

  • Briefing of assignments on data and tools

Assignments given

For 11.11. (in 1½ week):

  1. Read up on the history of humanities computing and do the assignment mentioned there (~1-2h).

  2. Find a dataset that could be of interest to you in your final project. Be prepared to discuss in class:

    1. why you chose those that dataset,

    2. what types of information does it contain,

    3. what the structure, technical format and way of accessing the data is, and

    4. what potential sources of problems or biases does it have. (~15-45min)

  3. Tools:

    1. Data cleanup: complete the OpenRefine tutorial. (~30-60min)

    2. Experiment with at least one of the following tools (~30-60min + ~15-30min):

      1. Visualization:

        • tabular data → chart visualisations: RAW

        • tabular data → chart visualisations: Voyager

        • tabular data → chart visualisations: ​Tableau

        • tabular data → ​interactive map/network/timeline/list/facet visualisations: Palladio

          • Palladio has help pages. There are also multiple tutorials on using Palladio, for example this one, or this one which is particularly on network analysis.

        • tabular data → map(+timeline) visualisations: ​Carto

        • ​text →​ interactive explorative interface for linguistic study: Voyant tools

        • ​big, preselected collections of text → interface for linguistic study: Korp / corpus.byu.edu

        • If you're feeling explorative, feel free to also dig for more tools in TAPoR.

      2. Data acquisition:

        1. Hand-written text transcription: Transkribus

        2. Layout and text transcription: OCR4all

        3. Keyword generation from text: Annif

        4. An automated image/video description tool

        5. Twitter archiving: TAGS

      If you're short on inspiration, feel free to go through this hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration of the tool for class, describing:

      1. What is the tool good for?

      2. What kind of data do you need for the tool to be useful?

        1. What information does the data need to contain?

        2. What format does it have to be in?

      3. Your experience with the tool.

      4. For groups studying visualization tools, also read Perception deception & Common visualization mistakes, and reflect on how likely it is that you could use the visualizations to deceive yourself

Mon 4.11. — No meeting, time for out-of-class work

Wed 6.11. — No meeting, time for out-of-class work

Mon 11.11. — Debriefing of assignments, Programming

Assignments due
Meeting contents
Assignments given
Assignments due
  1. Read up on the history of humanities computing and do the assignment mentioned there (~1-2h).

  2. Find a dataset that could be of interest to you in your final project. Be prepared to discuss in class:

    1. why you chose those that dataset,

    2. what types of information does it contain,

    3. what the structure, technical format and way of accessing the data is, and

    4. what potential sources of problems or biases does it have. (~15-45min)

  3. Tools:

    1. Data cleanup: complete the OpenRefine tutorial. (~30-60min)

    2. Experiment with at least one of the following tools (~30-60min + ~15-30min):

      1. Visualization:

        • tabular data → chart visualisations: RAW

        • tabular data → chart visualisations: Voyager

        • tabular data → chart visualisations: ​Tableau

        • tabular data → ​interactive map/network/timeline/list/facet visualisations: Palladio

          • Palladio has help pages. There are also multiple tutorials on using Palladio, for example this one, or this one which is particularly on network analysis.

        • tabular data → map(+timeline) visualisations: ​Carto

        • ​text →​ interactive explorative interface for linguistic study: Voyant tools

        • ​big, preselected collections of text → interface for linguistic study: Korp / corpus.byu.edu

        • If you're feeling explorative, feel free to also dig for more tools in TAPoR.

      2. Data acquisition:

        1. Hand-written text transcription: Transkribus

        2. Layout and text transcription: OCR4all

        3. Keyword generation from text: Annif

        4. An automated image/video description tool

        5. Twitter archiving: TAGS

      If you're short on inspiration, feel free to go through this hands-on tutorial covering OpenRefine, RAW and Palladio. Afterwards, find other people who experimented with the same tool on Slack. Together, prepare a short demonstration of the tool for class, describing:

      1. What is the tool good for?

      2. What kind of data do you need for the tool to be useful?

        1. What information does the data need to contain?

        2. What format does it have to be in?

      3. Your experience with the tool.

      4. For groups studying visualization tools, also read Perception deception & Common visualization mistakes, and reflect on how likely it is that you could use the visualizations to deceive yourself

Meeting contents
  • Group discussion of the history of humanities computing

  • Group discussion of datasets

  • Group presentations of tools

  • Briefing of assignments on programming and research

slides

Assignments given

For 18.11. (in 1 week):

  1. Programming: Go through the fundamental concepts of programming for humanists and complete the assignments there.

  2. Regular expressions: Read the section on regular expressions and go through the assignments there.

  3. Research on visualization tool development:

    Read the following two articles on developing tools for particular text-based humanities research questions:

    Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.

  4. Research 2: read this research article which we went through quickly on the first lecture. Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.

Between — Programming, Research

Wed 13.11. Continuation of tool and dataset presentations, support clinic for programming

Due to not being able to go through all tools on the Monday session, about an hour of this session will be used to go through the rest. After that, I'll be available to help you do the programming assignments.

Between — Programming, Research

Mon 18.11. — Class cancelled due to illness

Between — Programming, Research

Wed 20.11. — Debriefing, Statistics

Assignments due
Meeting contents
Assignments given
Assignments due
  1. Programming: Go through the fundamental concepts of programming for humanists and complete the assignments there.

  2. Regular expressions: Read the section on regular expressions and go through the assignments there.

  3. Research on visualization tool development:

    Read the following two articles on developing tools for particular text-based humanities research questions:

    Now, think of a visualisation that would help you in your field. What information would it visualise? Prepare to discuss in class.

  4. Research 2: read this research article which we went through quickly on the first lecture. Try to understand on a general level what is being done on a methodological level, and how that feeds into the content argument. There will be group work relating to this in the next meeting.

Meeting contents
  • Debriefing of programming and regular expression assignments

  • Group discussion on the visualization research

  • Group work on the Old Bailey research

  • Briefing of assignments

Assignments given
  1. (Do the assignments on statistics (not yet ready, but will contain the following in addition to other stuff):

    1. Check out the Explained Visually site, and especially PCA explained visually)

  2. Select (at least) one of the following sets of paired articles based on your own interests:

    Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

    1. How do the two articles relate to each other?

    2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?

    3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?

    4. Methods - What methods do the projects apply? How do the methods support answering the research questions?

    5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?

  3. For 27.11. explore this topic model of CEEC and read the explanation on topic modelling

Between — Statistics, Computational analysis

Mon 25.11. — Debriefing, Computational analysis

Assignments due
Meeting contents
Assignments given
Assignments due
  1. Select (at least) one of the following sets of paired articles based on your own interests:

    Form a group with all the other people who selected the same articles. For class, prepare a presentation on them, detailing:

    1. How do the two articles relate to each other?

    2. Research questions - What are the humanities research questions? Do the projects also target computer science research questions? If so, what? What is the relationship between the CS and humanities research questions?

    3. Data - How has the data used been gathered? What are the data sources used? Is the data available for others to use?

    4. Methods - What methods do the projects apply? How do the methods support answering the research questions?

    5. Partners - What is the make-up of the projects? Which disciplines are represented by the participants?

Meeting contents
Assignments given
  1. Explore this topic model of CEEC and read the explanation on topic modelling

Between — Research

Wed 27.11. — Debriefing, Computational analysis

Assignments due
Meeting contents
Assignments given
Assignments due
  1. Explore this topic model of CEEC and read the explanation on topic modelling

Meeting contents
Assignments given
  1. Find (in groups if you like) a computational humanities research paper that interests you. Prepare to present it in class.

  2. Write a one to two page plan of what you'll do for your final project. Discuss the following:

    1. What are your humanities research questions?

    2. Which data will you use?

    3. How do you plan to process, clean up and transform your data?

    4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?

    5. Critically analyze your data and pipeline for potential bias and problems.

    These will be peer reviewed. Return the assignments at https://moodle.helsinki.fi/course/view.php?id=36622

Between — Research, Final project planning

Mon 2.12. — Debriefing, Open, reproducible research

Assignments due
Meeting contents
Assignments due
  1. Find (in groups if you like) a computational humanities research paper that interests you. Prepare to present it in class.

  2. Write a one to two page plan of what you'll do for your final project. Discuss the following:

    1. What are your humanities research questions?

    2. Which data will you use?

    3. How do you plan to process, clean up and transform your data?

    4. How do you plan to analyze your data? How will the analysis help answer the humanities research questions?

    5. Critically analyze your data and pipeline for potential bias and problems.

    These will be peer reviewed. Return the assignments at https://moodle.helsinki.fi/course/view.php?id=36622

Meeting contents

Between — Final project

Wed 4.12. — No meeting

Mon 9.12. — No meeting

Wed 11.12. — Optional support clinic for final project

Sun 5.1. — Deadline for returning final project