Webinars    Forthcoming workshops    Previous workshops    Office Hours


Our webinar series is a joint initiative with the Language Technology and Data Analysis Laboratory (LADAL), (School of Languages and Cultures, University of Queensland). LADAL sponsored webinars take place in the alternate months.

All webinars take place at 8:00PM Brisbane time which is UTC+10. Zoom links will be available one week prior to the event.

4 April 2022 - Keoni Mahelona: A practical approach to Indigenous data sovereignty

Keoni Mahelona is the Chief Technical Officer of Te Hiku Media where he is a part of the team developing the Kaitiakitanga Licence. This licence seeks to balance the importance of publicly accessible data with the reality that indigenous peoples may not have access to the resources that enable them to benefit from public data. By simply opening access to data and knowledge, indigenous people could be further colonised and taken advantage of in a digital, modern world. Therefore Keoni is committed to devising data governance regimes which enable Indigenous people to reclaim and maintain sovereignty over indigenous data.

June 6 2022 - Barbara McGillivray: The Journal of Open Humanities Data

Barbara McGillivray is a Turing Research Fellow at The Alan Turing Institute, and Editor in Chief of the Journal of Open Humanities Data. Since September 2021 she is also a lecturer in Digital Humanities and Cultural Computation at the Department of Digital Humanities of King's College London. Before joining the Turing, she was language technologist in the Dictionary division of Oxford University Press and data scientist in the Open Research Group of Springer Nature. Her research at the Turing is on how words change meaning over time and how to model this change in computational ways. She works on machine-learning models for the change in meaning of words in historical times (Ancient Greek, Latin, eighteen-century English) and in contemporary texts (Twitter, web archives, emoji). Her interdisciplinary contribution covers Data Science, Natural Language Processing, Historical Linguistics and other humanistic fields, to push the boundaries of what academic disciplines separately have achieved so far on this topic.

Zoom link

August 1 2022 - Václav Cvrček: The Czech national Corpus

Václav Cvrček is a linguist who deals with the description of the Czech language, especially with the use of large electronic corpora and quantitative methods. In 2013-2016 he worked as the director of the Czech National Corpus project, since 2016 he has been the deputy director. Recently, he has been focusing on research on textual variability and corpus-based discourse analysis with a focus on online media.

Zoom link

October 3 2022 - Paweł Kamocki: [topic tba]

Paweł Kamocki is a legal expert in Leibniz-Institut für Deutsche Sprache, Mannheim. He studied linguistics and law, and in 2017 obtained his doctorate in law from the universities of Paris and Münster for a thesis on legal aspects of data-intensive university research, with a focus on Knowledge Commons. He worked as a research and teaching assistant at the Paris Descartes university (now: Université de Paris), then also in the private sector. He is certified to work as an attorney in France. An active member of the CLARIN community since 2012, he currently chairs the CLARIN Legal and Ethical Issues Committee. He also worked with other projects and initiatives in the field of research data policy (RDA, EUDAT) and co-created several LegalTech tools for researchers. One of his main research interests are legal issues in Machine Translation.

Zoom link

Forthcoming workshops

Geolocating Australian Historical Resources

In recent years, there has been a growing interest in using places featured in archival material and other digitised textual resources as the basis for visualisation and spatial analysis. Natural Language Processing (NLP) software, such as Named Entity Recognition (NER), facilitates the automated identification and extraction of geographic locations from text; however, it cannot assign or resolve geographic coordinates. The process of manually identifying coordinates for individual locations can be time consuming and is impractical when working with large corpora. This workshop will introduce participants to the basic principles of geolocation, the process through which geographic coordinates are linked with place names, as well as demonstrating approaches and workflows that can assist in automating this task. Taking the classic Australian text For the Term of his Natural Life (1874) as the object of analysis, this workshop will also provide participants with a critical understanding of the affordances and limitations of these tools. This workshop will provide participants with the following skills and competencies:

  • The ability to identify and extract locations from text through the application of off the shelf Named Entity Recognition Software
  • The ability to identify the appropriate gazetteer (record of place names) for their research, and an understanding of how these records can be employed in a computational approach to automate the task of geo-location.
  • The ability to transform geo-located data into maps through the use of basic mapping software.
  • An understanding of the limitations of NER software, and the ability to recognise and understand how different approaches may impact the accuracy of the data.
  • A practical understanding of how this approach can be automated. An example of this workflow as applied to For the Term of his Natural Life will be made accessible to participants in the form of a freely distributed Jupyter notebook This workshop is of relevance to anyone interested in the practical and theoretical challenges of geolocating Australian historical resources, and welcomes individuals with or without programming experience.

This workshop is part of the Australian Society of Archivists 2022 Conference

Date: October 20 2022
Time: 09:00 - 12:00 Australian Eastern Standard Time
Place: Graneek Room at ANU Chifley Library and online
Details and Registration

Learn how to collect and analyse comments on YouTube videos using the open-source tools Youte and Discursis

This online workshop is offered free to Australian researchers and research students and will cover:

  • Collecting data from YouTube using Youte
  • Analysing comments on YouTube videos using Discursis

Brought to you by the teams at the Australian Digital Observatory (ADO) and the Australian Text Analytics Platform (ATAP) via the Australian Research Data Commons (ARDC).

Date: September 21 2022
Time: 09:00 - 12:00 Australian Eastern Standard Time
Details and Registration

Previous workshops

Finding quotes and speakers in text using the ATAP quotation tools

Date: September 8 2022
Length: 1 hour
Facilitators: Sony Jufri

Advance care planning for your research data

Date: September 7 2022
Event: FAVeR Showcasing Approaches to Digital Humanities for Researchers
Length: 1 hour
Facilitators: Sam Hames, Ben Foley

Computational Thinking in the Humanities

The workshop Computational Thinking in the Humanities was a 3-hour online workshop featuring two plenary talks, lightning presentations, as well as a panel discussion. The workshop was co-organized by the Australian Text Analytics Platform (ATAP), FIN-CLARIAH and its UEF representatives, and the Australian Digital Observatory.

Date: September 1 2022
Length: 3 hours
Further details

Network analysis and Topic Modeling on Twitter data using R

Date: May 18 2022
Event: Joint event ADO and ATAP
Length: 3 hours
Facilitators: Alice Miller, Simon Musgrave

Monotreme Mania! Comparative text analytics on Twitter data

Date: 16 March 2022
Event: Joint event ADO and ATAP
Length: 3 hours
Facilitators: Sam Hames, Simon Musgrave

An introduction to Jupyter notebooks for text analysis: Virtual workshop for absolute beginners

Date: August 24 2022
Event: FAVeR Showcasing Approaches to Digital Humanities for Researchers
Length: 2 hours
Facilitators: Sara King, Simon Musgrave

Date: 27 July 2022
Event: Workshop for Sydney Corpus Lab
Length: 3 hours
Facilitators: Sara King, Simon Musgrave

Date: 24 November 2021
Event: Digital Humanities Australasia 2021 Conference
Length: 3 hours
Facilitators: Sara King, Simon Musgrave

Office Hours

We invite Australian researchers working with linguistics, text analytics, digital and computational methods, social media and web archives, and much more to attend our regular online office hours, jointly hosted with the Digital Observatory. Bring your technical questions, research problems and rough ideas and get advice and feedback from the combined expertise of our ARDC research infrastructure projects. No question is too small, and even if we don’t know the answer we are likely to be able to point you to someone who does.

These sessions run over Zoom from 2-3pm (Australia/Sydney time) every second Tuesday - details.