Computational literacy
  • Computational literacy for the humanities and social sciences
  • Three approaches to computational methods
  • History of humanities computing
  • Data processing: fundamental concepts of programming for humanists and social scientists
  • Data processing: regular expressions
  • Data analysis: fundamental concepts of statistics
    • Understanding and describing groups
    • What is average?
    • Uncertainty in describing groups
    • What is a sensible group to describe?
    • Comparing groups
    • Understanding relationships
  • Digging into a method: topic modeling
  • Final project
  • Where to continue?
  • Course instances
    • Helsinki fall 2021
    • Helsinki fall 2020
    • Helsinki fall 2019
    • Helsinki fall 2018
  • Holding area for unfinished content
    • Data
    • Easy tools for acquiring, processing and exploring data
    • Computational data analysis method literacy
    • Open, reproducible research and publishing
Powered by GitBook
On this page
  • Acquiring data
  • Data processing
  • Data exploration
  • Visualisation
  • Resources

Was this helpful?

  1. Holding area for unfinished content

Easy tools for acquiring, processing and exploring data

PreviousDataNextComputational data analysis method literacy

Last updated 6 months ago

Was this helpful?

This content is not yet complete. In the meantime, see this presentation: (, )

Acquiring data

  • (quite advanced, but contains a section on a user interface tool as well)

  • Hand-written text transcription:

  • Layout and text transcription:

  • Keyword generation from text:

  • An

  • Twitter archiving:

Data processing

Assignment: data processing tools

  • Complete the

Further resources

  • Data Carpentry tutorial, not really for social science but for general cleaning up of data

  • Further tutorials:

    • (includes section on extension)

    • (on reconciliation)

Data exploration

Visualisation

Visualisation is the act of taking data and transforming it into visual shapes and forms. The reasoning behind this is that humans are very good at processing visual information, with a lot of the necessary shape and anomaly detection and comparison processes even happening subconsciously.

Assignment

Assignment: data exploration tools

Experiment with at least one of the following tools:

Afterwards, post a message on Slack detailing:

  1. What is the tool good for?

  2. What kind of data do you need for the tool to be useful?

    1. What information does the data need to contain?

    2. What format does it have to be in?

  3. Your experience with the tool.

If someone has already posted on the tool you tested, don't repeat them. Instead, add to what they've said in a thread.

Assignment: visualisation tool development

Read the following two research articles on developing visualisation tools for particular text-based humanities research questions:

Now, think of a visualisation that would help you in your field. What information would it visualise? Post a message on Slack

Resources

Most visualisation is explanatory.

/

Book: , particularly chapter 5 for pre-attentive processing and 6 for the gestalt laws.

section 1.5

Read & . The articles are primarily oriented around explanatory data visualisation, but most computational humanities data analysis is exploratory. How do the problems transfer into that domain, when the only one you can deceive is yourself?

tabular data → chart visualisations: ​

tabular data → chart visualisations:

tabular data → chart visualisations: ​​

tabular data → ​interactive map/network/timeline/list/facet visualisations: ​

Palladio has . There are also multiple tutorials on using Palladio, for example , or which is particularly on network analysis.

tabular data → map(+timeline) visualisations: ​​

​text →​ interactive explorative interface for linguistic study: ​

​big, preselected collections of text → interface for linguistic study: / ​

If you're feeling explorative, feel free to also dig for more tools in .

If you're short on inspiration, feel free to go through hands-on tutorial covering OpenRefine, RAW and Palladio.

based on what you want to show

of four ways to visualise the same data and how that affects what you can read from it

Easy tools for processing and exploring data
pdf
gd
Introduction to web scraping
Transkribus
OCR4all
Annif
automated sound transcription tool
TAGS
OpenRefine tutorial
OpenRefine for Social Science Data
http://enipedia.tudelft.nl/wiki/OpenRefine_Tutorial
http://j.mp/dhh15ho
http://freeyourmetadata.org/reconciliation/
https://pudding.cool/2017/05/song-repetition/
http://www.datajourneyman.com/2016/03/21/preattentive-processing.html
https://medium.com/@vidya83.kesavan/preattentive-attributes-in-visualization-an-example-cb05ba1c9371
Information visualization : perception for design
http://socviz.co/lookatdata.html
Perception deception
Common visualization mistakes
RAW
Voyager
Tableau
Palladio
help pages
this one
this one
Carto
Voyant tools
Korp
corpus.byu.edu
TAPoR
this
Visualizing Mouvance: Toward a visual analysis of variant medieval text traditions
Rule‐based Visual Mappings – with a Case Study on Poetry Visualization
Flowchart on selecting a good visualization
An example