Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Audio Not Generating from Podcast Transcript despite format #243

Open
e-ratsavong opened this issue Feb 17, 2025 · 4 comments
Open

Audio Not Generating from Podcast Transcript despite format #243

e-ratsavong opened this issue Feb 17, 2025 · 4 comments

Comments

@e-ratsavong
Copy link

e-ratsavong commented Feb 17, 2025

Hi
When I try to generate audio, my mp3 files are all empty. This is the code I'm using to generate my audio files:

audio_file_from_transcript = generate_podcast(
    transcript_file="./data/transcripts/transcript_test.txt",
    is_local=True,
    tts_model="edge"
)

my transcript was at first this:

<Person1> "Welcome back to PODCASTIFY! Today, we're diving deep into the solar wafer market with a fascinating analysis of prices and trends."
</Person1><Person2>  "This is exciting. I'm ready to learn more about what's going on in the global solar energy sector!"  
</Person2><Person1> "Excellent! So, according to this article from pv magazine International, we're seeing a rebound in solar wafer prices. However, it's not quite reaching its limit. Let's break down the details." 
</Person1>
...

I wasn't sure if the quotations were creating errors with generating the audio so I changed it to this:

<Person1> Welcome back to PODCASTIFY! Today, we're diving deep into the solar wafer market with a fascinating analysis of prices and trends. </Person1>
<Person2> This is exciting. I'm ready to learn more about what's going on in the global solar energy sector! </Person2>

But I am still getting the same issue. There are no console errors just blank audio files that are 428 bytes.

This is the output name of mp3
podcast_a401a2fdc500484380fca7593bb34cae.mp3

@souzatharsis
Copy link
Owner

souzatharsis commented Feb 17, 2025 via email

@e-ratsavong
Copy link
Author

@souzatharsis I removed the is_local command and tried with other tts model besides edge. I also ran ffmpeg manually/separately and was able to generate an audio file. But when I try to generate the audio using podcastfy and try to open the audio file it says "We can't open [file name] this may be because the file type is unsupported, the file extension is incorrect, or the file is corrupt". When I inspect the actual .mp3 it says: Error occurred: [mp3 @ 000002546984CB80] Failed to find two consecutive MPEG audio frames.
data/audio/podcast_70eee2bca48f4527a5fc8a73062a4ec9.mp3: Invalid data found when processing input

@souzatharsis
Copy link
Owner

souzatharsis commented Feb 17, 2025 via email

@e-ratsavong
Copy link
Author

Yes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants