Skip to content
Change the repository type filter

All

    Repositories list

    • NeMo

      Public
      A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
      Python
      Apache License 2.0
      2.7k000Updated Sep 21, 2024Sep 21, 2024
    • Port of OpenAI's Whisper model in C/C++
      C
      MIT License
      3.9k000Updated Aug 28, 2024Aug 28, 2024
    • pyannote-audio

      Public archive
      Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
      Jupyter Notebook
      MIT License
      825000Updated Jul 27, 2024Jul 27, 2024