This project scrapes any languages available on Duolingo from the Duome website (https://duome.eu/vocabulary) using Playwright & creates a csv file
-
Download
main.py
&requirements.txt
and put them inside a folder -
Create a virtual environment:
python -m venv VEnv
-
Activate virtual environment:
-
🪟 Windows CMD:
VEnv\Scripts\activate
-
🐧 Linux:
source VEnv/bin/activate
-
-
Install dependencies:
pip install -r requirements.txt
-
Install playwright (
⚠️ code uses Microsoft Edge browser, you can change that to chromium if you don't want to downloadmsedge
):playwright install && playwright install msedge
-
Read the code, you may need to personalize some variables, then run the
main.py
& wait to get the final.csv
file -
Open Anki application...
File ➡️ Import... ➡️ Choose .csv file
- If all word elements didn't load all at once, we should scroll down to retrieve all the words. However, this feature has not been implemented yet, as the