ModelScope

All

28 repositories

data-juicer
Public
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！
nlp data-science opendata data-visualization pytorch dataset chinese data-analysis llama gpt
Python
•
Apache License 2.0
•195•3.3k•25•18•Updated Jan 12, 2025Jan 12, 2025
DiffSynth-Studio
Public
Enjoy the magic of Diffusion models!
Python
•
Apache License 2.0
•625•6.7k•120•0•Updated Jan 12, 2025Jan 12, 2025
modelscope
Public
ModelScope: bring the notion of Model-as-a-Service to life.
nlp science cv speech multi-modal python machine-learning deep-learning
Python
•
Apache License 2.0
•749•7.2k•10•5•Updated Jan 10, 2025Jan 10, 2025
ms-swift
Public
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
agent deploy llama lora liger peft multimodal sft dpo pre-training
Python
•
Apache License 2.0
•433•5k•333•13•Updated Jan 10, 2025Jan 10, 2025
evalscope
Public
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
performance evaluation vlm rag llm
Python
•
Apache License 2.0
•40•345•26•1•Updated Jan 10, 2025Jan 10, 2025
ClearerVoice-Studio
Public
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Python
•
Apache License 2.0
•139•2k•15•3•Updated Jan 10, 2025Jan 10, 2025
dash-infer
Public
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
cpu cuda llm llm-inference native-engine guided-decoding
C
•
Apache License 2.0
•18•198•2•0•Updated Jan 10, 2025Jan 10, 2025
FunASR
Public
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model
Python
•
Other
•806•7.7k•227•10•Updated Jan 10, 2025Jan 10, 2025
modelscope-studio
Public
A third-party component library based on Gradio.
python ui gradio antd-design modelscope gradio-custom-component modelscope-studio
Python
•
Apache License 2.0
•8•63•3•0•Updated Jan 9, 2025Jan 9, 2025
agentscope
Public
Start building LLM-empowered multi-agent applications in an easier way.
agent drag-and-drop chatbot multi-agent multi-modal distributed-agents gpt-4 large-language-models llm llm-agent
Python
•
Apache License 2.0
•360•5.8k•32•17•Updated Jan 8, 2025Jan 8, 2025
modelscope-agent
Public
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
agent data-science code chatbot android-application multi-agents rag mobile-agents gpts llm
Python
•
Apache License 2.0
•322•2.8k•69•2•Updated Jan 8, 2025Jan 8, 2025
modelscope-classroom
Public
Jupyter Notebook
•
Apache License 2.0
•72•620•1•0•Updated Dec 31, 2024Dec 31, 2024
langchain-modelscope
Public
Langchain integration for ModelScope
Python
•
Apache License 2.0
•1•4•0•0•Updated Dec 27, 2024Dec 27, 2024
3D-Speaker
Public
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
speaker-verification speaker-diarization language-identification voxceleb modelscope campplus eres2net 3d-speaker cnceleb sdpn
Python
•
Apache License 2.0
•121•1.5k•0•0•Updated Dec 24, 2024Dec 24, 2024
PromptScope
Public
Enjoy easier conversations with LLM
prompt multi-modal gpt-4 in-context-learning large-language-models prompt-engineering llms
Python
•
Apache License 2.0
•1•6•0•0•Updated Dec 12, 2024Dec 12, 2024
facechain
Public
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Jupyter Notebook
•
Apache License 2.0
•864•9.2k•11•2•Updated Dec 10, 2024Dec 10, 2024
scepter
Public
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
generative-model scedit aigc lar-gen stylebooth
Python
•
Apache License 2.0
•26•449•8•2•Updated Dec 7, 2024Dec 7, 2024
MemoryScope
Public
Python
•
Apache License 2.0
•35•362•2•0•Updated Nov 21, 2024Nov 21, 2024
comfyscope
Public
Collection of various Comfy components.
Python
•
Apache License 2.0
•1•4•0•2•Updated Nov 20, 2024Nov 20, 2024
richdreamer
Public
[CVPR2024 (Highlight)] RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D. Live Demo：https://modelscope.cn/studios/Damo_XR_Lab/3D_AIGC
Python
•
Apache License 2.0
•18•429•17•0•Updated Sep 27, 2024Sep 27, 2024
motionagent
Public
MotionAgent is your AI assistent to convert ideas into motion pictures.
Python
•
Apache License 2.0
•35•288•3•1•Updated Sep 2, 2024Sep 2, 2024
FunClip
Public
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm
Python
•
MIT License
•448•4k•27•2•Updated Aug 22, 2024Aug 22, 2024
lite-sora
Public
An initiative to replicate Sora
Python
•
Apache License 2.0
•6•101•3•0•Updated Apr 10, 2024Apr 10, 2024
normal-depth-diffusion
Public
Python
•
Apache License 2.0
•8•126•5•0•Updated Feb 7, 2024Feb 7, 2024
FunCodec
Public
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization
Python
•
MIT License
•31•379•20•1•Updated Jan 25, 2024Jan 25, 2024
KAN-TTS
Public
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
modelscope speech tts speech-synthesis
Python
•
MIT License
•84•499•42•1•Updated Dec 28, 2023Dec 28, 2023
AdaSeq
Public
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
natural-language-processing information-extraction chinese-nlp word-segmentation bert sequence-labeling relation-extraction natural-language-understanding entity-typing token-classification
Python
•
Apache License 2.0
•38•429•31•0•Updated Nov 15, 2023Nov 15, 2023
kws-training-suite
Public
Python
•
MIT License
•19•95•7•0•Updated May 26, 2023May 26, 2023