Skip to content

This project scrapes any languages available on Duolingo from the Duome website (https://duome.eu/vocabulary) using Playwright & creates a csv file

License

Notifications You must be signed in to change notification settings

wihal/duolingo-scraper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Duolingo Scraper

This project scrapes any languages available on Duolingo from the Duome website (https://duome.eu/vocabulary) using Playwright & creates a csv file

🔨 How to run

  1. Download main.py & requirements.txt and put them inside a folder

  2. Create a virtual environment: python -m venv VEnv

  3. Activate virtual environment:

    • 🪟 Windows CMD:

        VEnv\Scripts\activate
      
    • 🐧 Linux:

        source VEnv/bin/activate
      
  4. Install dependencies:

      pip install -r requirements.txt
    
  5. Install playwright (⚠️ code uses Microsoft Edge browser, you can change that to chromium if you don't want to download msedge):

      playwright install && playwright install msedge
    
  6. Read the code, you may need to personalize some variables, then run the main.py & wait to get the final .csv file

  7. Open Anki application...
    File ➡️ Import... ➡️ Choose .csv file

⚠️ Known (possible) issues

  • If all word elements didn't load all at once, we should scroll down to retrieve all the words. However, this feature has not been implemented yet, as the

About

This project scrapes any languages available on Duolingo from the Duome website (https://duome.eu/vocabulary) using Playwright & creates a csv file

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%