-
In The Tennis Racket Buzzfeed used Python to analyse betting trends on professional tennis matches and found patterns that suggested match-fixing. The code can be found here.
-
In The Ferguson Area Is Even More Segregated Than You Probably Guessed Buzzfeed used Python to analyse segregation in St. Louis county. The code can be found here.
-
In The Dollar-And-Cents Case Against Hollywood’s Exclusion of Women 538 analysed if films that pass the Bechdel Test make more money for their producers? The article was replicated by Brian Keegan using Python and his code can be found here. Keegan's notebook is a must-read for journalists who aspire to do data-driven reporting.
-
Justin Seitz and Jan Cronjé used Python to identify fake websites connected to an Indian IT guru. The code can be found here.
-
Jens Finnäs used Python to analyse the relationship between asylum immigration and crime, and concluded that there was no relationship between the number of asylum seekers in a town and its crime rate. The story was published in Dagens Samhälle and the code can be found here.
- Jupyter Lab
- Download the entire Wayback Machine archive for a given URL via the command-line
- Awesome OSINT list
Scraping
- Scraping Sci-Fi movies from IMDB with Python
- Create your own database of citizens
- How to aumatically find weapons in social media images using APIs
Analysis
-
Investigation of voting patterns demographic correlations in Sweden
-
"Good bye Excel, hello Pandas" workshop at Noda 2018 and Pandas 101 at Noda 2017
Misc
- The Journalism++ newsletter gives bi-weekly tips about stories, tools and much more related to data and journalism.
- DataElixir is a newsletter about all things data.
- Naked Data is a newsletter by data journalist Jason Norwood-Young.
- Data Is Plural is a newsletter by data journalist Jeremy Singer-Vine.
- Banana Data is a newsletter with focus on AI and data science.
- PEP 8 is a style guide for easy to read and beautiful Python code. The section on naming conventions is particularly good to follow.