Skip to content

GriffithUniLibrary/intro-text-mining-analysis

Repository files navigation

{% capture aboutworkshop %}

Topic and aims

A self-paced online workshop developed by staff at Griffith University Library.

An introduction to digital methods and tools in humanities and social sciences (HASS) scholarship focusing on the following in the Digital Humanities workflow: 

  • Build: where to locate and how to gather textual data for your corpora or data set  
  • Prepare: explore useful processes and tools to prepare textual data for analysis, including transcription tools to use and data recognition (OCR) to ensure machine readability 
  • Analyse: identify different types of analysis used to interrogate content and uncover new insights

Audience

This workshop is aimed at researchers and academics in the field of digital humanities and related disciplines.

Prerequisites

To successfully coplete this workshop you will need:

  • A modern browser

Outcomes

At the end of this workshop you should be able to:

  • Implement a basic workflow for researching with digital text 
  • Identify different types of textual data and usage considerations
  • Find textual data for digital analysis 
  • Choose the best tool for your dataset at each stage of the digital research process.

Download and play with software and datasets, do activities and watch videos to guide you through the lessons. The lessons are sequential but can be used as stand alone tool Give yourself around 3 hours to complete all the modules.

Assumed knowledge

It is assumed that you have the following level of understanding:

  • Ability to install software on your own device
  • Foundational data terminology such as tabular data, binary data, csv, tables, fields etc.

License

All materials in these lessons are licensed CC BY-NC.

{% endcapture %}

{% include card.html header="About this workshop" text=aboutworkshop %}

Griffith University - CRICOS Provider Number 00233E