-
MODULABS Rubato Lab.
- Seoul, Korea
- https://linktr.ee/SsojuBro
Stars
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Code samples from our Python agents tutorial
LangChain, LangGraph Open Tutorial for everyone!
DeepEP: an efficient expert-parallel communication library
A generative world for general-purpose robotics & embodied AI learning.
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Singing Voice Conversion via diffusion model
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Fine-tune Stable Audio Open with DiT ControlNet.
LangChain 공식 Document, Cookbook, 그 밖의 실용 예제를 바탕으로 작성한 한국어 튜토리얼입니다. 본 튜토리얼을 통해 LangChain을 더 쉽고 효과적으로 사용하는 방법을 배울 수 있습니다.
Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"
🔊 Text-Prompted Generative Audio Model
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
Font Animation, Automatic Speech Recognition and Text to Speech Custom Nodes for ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Open-Sora: Democratizing Efficient Video Production for All
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
The Universe of Data. All about data, data science, and data engineering
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…