    Repositories list

    • AI powered speech denoising and enhancement
      Python
      MIT License
      1391.4k421 Updated Nov 5, 2024
    • peft

      Public
      🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
      Python
      Apache License 2.0
      1.6k000 Updated Oct 3, 2024
    • mup

      Public
      maximal update parametrization (µP)
      Jupyter Notebook
      MIT License
      95001 Updated Sep 5, 2024
    • Python
      MIT License
      0210 Updated Sep 5, 2024
    • resemble.ai API SDK
      TypeScript
      MIT License
      3921 Updated Jun 5, 2024
    • Python
      0110 Updated May 8, 2024
    • PyTSMod

      Public
      An open-source Python library for audio time-scale modification.
      Python
      GNU General Public License v3.0
      27400 Updated Apr 10, 2024
    • aiortc

      Public
      WebRTC and ORTC implementation for Python using asyncio
      Python
      BSD 3-Clause "New" or "Revised" License
      763000 Updated Mar 27, 2024
    • aioice

      Public
      asyncio-based Interactive Connectivity Establishment (RFC 5245)
      Python
      BSD 3-Clause "New" or "Revised" License
      52000 Updated Feb 15, 2024
    • TypeScript
      2800 Updated Dec 16, 2023
    • Go
      1200 Updated Nov 13, 2023
    • Run OpenAI Whisper as a Cog model
      Python
      Apache License 2.0
      43100 Updated Nov 8, 2023
    • Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
      Python
      Apache License 2.0
      151000 Updated Oct 25, 2023
    • A python package to analyze and compare voices with deep learning
      Python
      Apache License 2.0
      4282.8k402 Updated Oct 12, 2023
    • A Heroku buildpack for ffmpeg that always downloads the latest static build
      Shell
      MIT License
      726000 Updated Aug 21, 2023
    • g2pW

      Public
      Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
      Python
      Apache License 2.0
      38000 Updated Jul 8, 2023
    • univnet

      Public
      Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
      Python
      BSD 3-Clause "New" or "Revised" License
      46000 Updated May 19, 2023
    • NeMo

      Public
      NeMo: a toolkit for conversational AI
      Python
      Apache License 2.0
      2.5k900 Updated Jan 18, 2023
    • espeak-ng

      Public
      eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.
      C
      GNU General Public License v3.0
      897200 Updated Nov 29, 2022
    • Simple text to phonemes converter for multiple languages
      Python
      GNU General Public License v3.0
      1732001 Updated Nov 21, 2022
    • whisper

      Public
      Robust Speech Recognition via Large-Scale Weak Supervision
      Jupyter Notebook
      MIT License
      8.4k100 Updated Oct 4, 2022
    • Monotonic Alignment Search
      Cython
      MIT License
      148610 Updated Sep 6, 2022
    • reLaugh

      Public
      Supplementary materials of Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations
      HTML
      0100 Updated Jun 25, 2022
    • Benchmark Arabic text diacritization dataset
      Python
      MIT License
      18400 Updated Oct 8, 2021
    • Dockerfile
      7000 Updated Sep 1, 2021
    • Automatically deploy your project to GitHub Pages using GitHub Actions. This action can be configured to push your production-ready code into any branch you'd like.
      TypeScript
      MIT License
      362000 Updated Aug 3, 2021
    • This utility allows one to cut multiple clips from a single or multiple audio files.
      Python
      MIT License
      10500 Updated May 17, 2021
    • Deep Learning Examples
      Jupyter Notebook
      3.2k400 Updated Apr 29, 2021
    • Github Action for executing Helm commands on EKS (using aws-iam-authenticator)
      Dockerfile
      MIT License
      60100 Updated Apr 14, 2021
    • Resemble's voice cloning engine within Unity
      C#
      2616310 Updated Feb 28, 2021