open-vocabulary-detection

Here are 34 public repositories matching this topic...

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

speech image-editing caption data-generation 3d-whole-body-pose-estimation open-vocabulary-detection open-vocabulary-segmentation automatic-labeling-system

Updated Sep 5, 2024
Jupyter Notebook

roboflow / notebooks

Star

A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.

Updated Aug 26, 2025
Jupyter Notebook

roboflow / awesome-openai-vision-api-experiments

Star

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

computer-vision openai classification clip zero-shot chatgpt segment-anything open-vocabulary-detection open-vocabulary-segmentation grounding-dino

Updated Jan 14, 2025
Python

FoundationVision / GLEE

Star

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

tracking open-world object-detection interactive-segmentation video-object-segmentation referring-expression-segmentation referring-expression-comprehension video-instance-segmentation zero-shot-object-detection referring-video-object-segmentation foundation-model segment-anything open-vocabulary-detection open-vocabulary-segmentation open-vocabulary-video-segmentation

Updated Oct 21, 2024
Python

IDEA-Research / Grounding-DINO-1.5-API

Star

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

open-world object-detection open-set zero-shot-object-detection foundation-model open-vocabulary-detection grounding-dino

Updated Jan 21, 2025
Python

SkalskiP / awesome-foundation-and-multimodal-models

Sponsor

Star

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

nlp computer-vision image-captioning clip blip multimodal zero-shot-detection foundational-models llava segment-anything open-vocabulary-detection open-vocabulary-segmentation grounding-dino

Updated Feb 29, 2024
Python

segments-ai / panoptic-segment-anything

Star

Combining Segment Anything (SAM) with Grounded DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation

segmentation open-vocabulary-detection open-vocabulary-segmentation

Updated May 3, 2024
Jupyter Notebook

wanghao9610 / OV-DINO

Star

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

open-world object-detection zero-shot-object-detection open-vocabulary-detection open-vocabulary-segmentation fundation-models ov-dino

Updated Mar 12, 2025
Python

Charles-Xie / awesome-described-object-detection

Star

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.

awesome awesome-list visual-grounding referring-expression-comprehension open-world-object-detection open-vocabulary-detection

Updated Jul 22, 2025

jaychempan / LAE-DINO

Star

[AAAI'25] Official Code for “Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community"

remote-sensing object-detection open-vocabulary-detection locate-anything-on-earth fudational-detector lae-dino

Updated Aug 21, 2025
Jupyter Notebook

FoundationVision / GenerateU

Star

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

open-world object-detection multimodality open-vocabulary mllm open-vocabulary-detection

Updated Mar 29, 2025
Python

shikras / d-cube

Star

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

dataset object-detection vision-language multi-modal-learning referring-expression-comprehension open-vocabulary-detection

Updated Mar 20, 2024
Python

CVMI-Lab / CoDet

Star

(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

object-detection open-vocabulary open-vocabulary-detection

Updated Apr 26, 2024
Python

naver / shine

Star

[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

vision-language open-vocabulary-detection

Updated Jul 24, 2024
Python

rohit901 / cooperative-foundational-models

Star

[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"

computer-vision deep-learning pytorch object-detection zero-shot-object-detection open-set-object-detection novel-objects open-vocabulary-detection

Updated Mar 8, 2025
Python

hpc203 / GroundingDINO-onnxrun

Star

使用onnxruntime部署GroundingDINO开放世界目标检测，包含C++和Python两个版本的程序

text-prompt zero-shot-object-detection open-world-detection prompt-engineering open-world-object-detection open-vocabulary-detection groundingdino

Updated Feb 2, 2024
Python

lorebianchi98 / FG-OVD

Star

[CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding."

computer-vision deep-learning artificial-intelligence object-detection zero-shot-object-detection open-vocabulary-detection fine-grained-open-vocabulary-object-detection