audio-language-trainer

Caveat

You are welcome to use what is here, but it will not work out of the box (see setup section later)

Motivation

There is often a gap after completing a basic language course (like Section 1 on DuoLingo, or perhaps you've done a Foundation Michel Thomas method) - the advice is typically to start reading magazines and watching TV, but this is far too hard. I was wanting something that exposed me to some longer dialogue, rapidly increased by vocabulary, and also reinforced what I had learnt (or was still learning). A sort of 'stepping stone' towards watching a TV program perhaps, or going abroad and being more comfortable in the middle of a group where people are conversing - they won't pause after 3 or 4 words, they just keep talking amongst each other!

This is designed to fill that gap.

Introduction

Given a vocab list and target language, it will produce practice material for you. This practice comes in two formats: Flash cards and stories. It is not easy; you have to concentrate, and you will get tired - but this is a sign the brain is forming new connections and you are learning (an experience I found was absent in DuoLingo). In particular, I found the first couple of weeks very hard as most of the words were new, but after 200 phrases in, each subsequent phrase became a lot easier as I started to see common patterns (lexical chunks) and was learning fewer new words per phrase.

Flash cards

I've used evidence-based techniques to make engaging flash cards:

✓ Phrases rather than isolated words (So you learn to use these in the right context - Lexical Approach)

✓ Images and Audio for multiple encodings (dual-encoding theory of language learning - retention and recall boost)

✓ Cloze deletion to encourage your brain to complete sentences (rather than giving you the answer as multiple choice)

✓ Additional, word-separated 'slow' audio to help with your pronunciation

✓ Handy hyperlinks and prompts for further study in Wiktionary or an LLM of your choice

Long-form stories

We can pair up a set of flash cards with a story (that uses the same vocab). This lets you consolidate what you have learnt, and gives you practice in long-form listening. I also produce a section of double-speed audio for the whole story, as research shows this can improve language parsing ability (your brain learns to separate words better) - but crucially this double-speed effect only works if you already know the vocab.

Inputs

You need to update or provide your own (english) vocab list (a JSON file). The target language is controlled from a single line in a configuration file - the code here will translate to almost any language without any additional input. There are various notebooks to create material:

create phrases
create images for those phrases
package those up into an Anki flashcard deck
create a story off a selection of flashcards (embedded in both a webpage and a downloadable album with synchronised lyrics)
publish the story to the cloud (so you can access it from the flashcard)

Setup

Use the notebook which guides you through the process, but there is substantial setup required in terms of Google Cloud APIs,and FFMPEG (for audio generation) to configure if you want to run the code yourself. If you want an example lesson, happy to oblige!

Google Cloud:

You will need an account and a project with the ID and Number, as well as know your regions for the LLM.
Enable TTS, Translate, Vertex AI
Apply for access to Anthropic's models in the Model Garden

Azure:

You will need an API key and a speech synthesis service
Azure has some TTS languages that Google doesn't cover, so it's worth having both

Client:

install FFMPEG for audio generation and fonts for the PDF system to work

Road Map:

Finish linking stories with flash cards using the Anki ecosystem - DONE! Feb 2025
Embed a real-time conversation challenge in the web (for more open-ended speech practice) - DONE! Feb 2025
Learning new alphabets - I have some ideas of linking each new letter with an image, using some custom AI services; could be particuarly useful for Chinese.

Acknowledgements

https://www.saysomethingin.com/en/home/ - heavily inspired by the approach of this company, DuoLingo and the Michel Thomas method.

Name		Name	Last commit message	Last commit date
Latest commit History 643 Commits
chat		chat
data		data
docs		docs
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
AUDIO_MODULE_GUIDE.md		AUDIO_MODULE_GUIDE.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
app.log		app.log
firestore.md		firestore.md
flashcard_tutorial_generator.py		flashcard_tutorial_generator.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
test_firestore_connection.py		test_firestore_connection.py
test_phrase_generation.py		test_phrase_generation.py
test_story.json		test_story.json
upload_styles.py		upload_styles.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

audio-language-trainer

Caveat

Motivation

Introduction

Flash cards

Long-form stories

Inputs

Setup

Road Map:

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

audio-language-trainer

Caveat

Motivation

Introduction

Flash cards

Long-form stories

Inputs

Setup

Road Map:

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages