The Department of the Prime Minister and Cabinet provides transcripts of more than 20,000 speeches, media releases, and interviews by Australian Prime Ministers. These transcripts can be searched online, and the underlying XML files can be downloaded using a simple API.
I've created a repository containing all the XML files, a CSV-formatted index, and aggregated text and zip files for each prime minister.
This repository includes Jupyter notebooks for harvesting, indexing, analysing, and aggregating all the transcripts.
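As a rough illustration of the indexing step, the sketch below parses one transcript XML document and writes a one-row CSV index. It is not the code used in the notebooks. The element names (`transcript-id`, `title`, `prime-minister`, `release-date`, `content`), the sample record, and the helper functions `index_transcript` and `write_index` are all assumptions made for the example, so check them against the actual XML files before reusing any of this.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical transcript record; the real XML files may use
# different element names and structure.
SAMPLE_XML = """<transcript>
  <transcript-id>12345</transcript-id>
  <title>Press conference, Canberra</title>
  <prime-minister>Example, Pat</prime-minister>
  <release-date>01/02/1990</release-date>
  <content>Good morning, everyone.</content>
</transcript>"""

# Metadata fields to copy into the index (assumed names).
FIELDS = ["transcript-id", "title", "prime-minister", "release-date"]


def index_transcript(xml_text):
    """Extract the metadata fields from one transcript as a dict."""
    root = ET.fromstring(xml_text)
    # Missing elements become empty strings rather than raising an error.
    return {field: (root.findtext(field) or "").strip() for field in FIELDS}


def write_index(rows):
    """Serialise a list of metadata dicts as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = [index_transcript(SAMPLE_XML)]
index_csv = write_index(rows)
print(index_csv)
```

To index a whole harvest, the same function could be applied to each downloaded file in turn, with the resulting rows written to a single CSV file.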
You can run the notebooks in this repository live on MyBinder, although the environment might take a little while to load.
See the GLAM Workbench documentation for more details.
If you think this project is worthwhile, you might like to support me on Patreon.