Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Open-source push notification service with no app required; on iOS 14+ it works by scanning a QR code. Also supports Quick App, iOS and Mac clients, an Android client, and DIY devices.
Effortless data labeling with AI support from Segment Anything and other awesome models.
OpenMMLab Pre-training Toolbox and Benchmark
Chinese NLP solutions (large models, data, models, training, and inference)
Collection of AWESOME vision-language models for vision tasks
Easily compute clip embeddings and build a clip retrieval system with them
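The retrieval step these tools automate can be sketched in a few lines: once image embeddings are precomputed by a CLIP model, search reduces to cosine similarity between a text query embedding and the image embeddings. The toy vectors below are placeholders; in practice they would come from a CLIP encoder.

```python
import numpy as np

# Hypothetical precomputed CLIP image embeddings (one row per image) and a
# text query embedding; real values would come from a CLIP model's encoders.
image_embeddings = np.array([
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.5, 0.5, 0.7],
])
query = np.array([1.0, 0.0, 0.1])

def retrieve(query, embeddings, k=2):
    """Return indices of the top-k embeddings by cosine similarity."""
    q = query / np.linalg.norm(query)
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    scores = e @ q            # cosine similarity of each image to the query
    return np.argsort(-scores)[:k]

print(retrieve(query, image_embeddings))  # indices of the nearest images
```

Normalizing both sides turns the dot product into cosine similarity, which is how CLIP-based retrieval systems typically rank results before any approximate-nearest-neighbor indexing is added for scale.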
Rapid Android UI development that tames the quirks of native widgets
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Search photos on Unsplash using natural language
Search inside YouTube videos using natural language
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"