📚 llm-english-study-audio-sentence-creator

Welcome to the llm-english-study-audio-sentence-creator project! 🎉

This prototype is designed to assist students in learning English by listening to and practicing sentences with vocabulary in the context of tech industry. 💻

🚀 Problem Addressed

The primary challenge we tackle is the lack of English sentences related to programming and technology in our existing audio resources. To bridge this gap, we've developed this project to generate and convert tech-related sentences into audio format.

💡 Project Overview

Here's how we approach the solution:

Transcribe Audio Files: We use the Whisper model from OpenAI on Hugging Face to transcribe existing audio files into text. 📝
Sentence Separation: Properly separate the transcribed sentences for better understanding by language models. 🧩
Generate Tech Vocabulary Sentences: Utilize the LLM to adapt sentences from the original course content to incorporate tech vocabulary, rather than generating new sentences from scratch. 🛠️
Convert Text to Speech: Employ Microsoft's speecht5_tts model to convert the generated sentences into audio speech. 🎙️
Process Audio Files: Convert the generated audio into MP3 format for easy use. 🎵

🔑 Key Points

1. Transcription Accuracy:

Challenge: Ensuring high accuracy when transcribing audio files with complex or varied sentence structures.
Solution: Using the Whisper model for its robustness in handling diverse audio inputs.

2. Sentence Separation:

Challenge: Properly separating sentences from the transcribed text for better comprehension by language models.
Solution: Developing a prompt method to cleanly segment and organize sentences to improve processing.

3. Tech Vocabulary Adaptation:

Challenge: Generating sentences that accurately reflect tech industry vocabulary while maintaining naturalness.
Solution: Adapting existing sentences to include tech terms rather than generating new content from scratch.

4. Speech Conversion:

Challenge: Ensuring the generated text-to-speech audio sounds natural and clear.
Solution: Utilizing Microsoft's speecht5_tts model for high-quality speech synthesis.

5. Audio Processing:

Challenge: Converting audio to MP3 format while maintaining audio quality.
Solution: Implementing a streamlined process for converting and optimizing audio files.

Dependencies

FFMPEG dependency

sudo apt-get update  
sudo apt-get install ffmpeg libavcodec-extra

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.devcontainer		.devcontainer
.github		.github
old		old
tests		tests
.gitignore		.gitignore
1_transcript_audio_batch.ipynb		1_transcript_audio_batch.ipynb
2_separate_senteces_batch.ipynb		2_separate_senteces_batch.ipynb
3_create_new_sentences_batch.ipynb		3_create_new_sentences_batch.ipynb
4_generate_text_to_speech.ipynb		4_generate_text_to_speech.ipynb
5_join_audio_files.ipynb		5_join_audio_files.ipynb
6_convert_to_mp3.ipynb		6_convert_to_mp3.ipynb
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
transcript.csv		transcript.csv
transcript_new_sentences.csv		transcript_new_sentences.csv
transcript_separated.csv		transcript_separated.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📚 llm-english-study-audio-sentence-creator

🚀 Problem Addressed

💡 Project Overview

🔑 Key Points

1. Transcription Accuracy:

2. Sentence Separation:

3. Tech Vocabulary Adaptation:

4. Speech Conversion:

5. Audio Processing:

Dependencies

About

Releases

Packages

Languages

rhuanbarros/llm-english-study-audio-sentece-creator

Folders and files

Latest commit

History

Repository files navigation

📚 llm-english-study-audio-sentence-creator

🚀 Problem Addressed

💡 Project Overview

🔑 Key Points

1. Transcription Accuracy:

2. Sentence Separation:

3. Tech Vocabulary Adaptation:

4. Speech Conversion:

5. Audio Processing:

Dependencies

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages