florence-2

Here are 28 public repositories matching this topic...

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

transformers vqa objectdetection captioning fine-tuning multimodal vision-and-language phi-3-vision paligemma florence-2 qwen2-vl

Updated Feb 18, 2025
Python

jhc13 / taggui

Star

Tag manager and captioner for image datasets

image-captioning image-tagging tag-manager pyside6 stable-diffusion llava cogvlm florence-2

Updated Jan 21, 2025
Python

autodistill / autodistill-grounded-sam-2

Star

Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.

grounded-sam autodistill florence-2 segment-anything-2

Updated Aug 7, 2024
Python

AI-Powered Watermark Remover using Florence-2 and LaMA Models: A Python application leveraging state-of-the-art deep learning models to effectively remove watermarks from images with a user-friendly PyQt6 interface.

dataset-creation inpainting watermark-remover lama-cleaner florence-2

Updated Jan 15, 2025
Python

Ravi-Teja-konda / Surveillance_Video_Summarizer

Star

VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for querying and analyzing video footage.

video ai summarization gradio vlm vision-and-language huggingface surviellance gpt-4 chatgpt gradio-python-llm florence-2

Updated Sep 17, 2024
Python

Damarcreative / rem-wm

Sponsor

Star

Rem-WM, a powerful watermark remover tool that leverages the capabilities of Microsoft Florence and Lama Cleaner models.

watermark lama-cleaner florence-2

Updated Jan 28, 2025
Python

autodistill / autodistill-florence-2

Star

Use Florence 2 to auto-label data for use in training fine-tuned object detection models.

object-detection zero-shot-object-detection autodistill florence-2

Updated Aug 15, 2024
Python

retkowsky / florence-2

Star

Florence-2

azure florence-2

Updated Feb 13, 2025
Jupyter Notebook

fireicewolf / wd-llm-caption-cli

Star

A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.

image-caption wd14 llama3-vision florence-2 qwen2-vl joy-caption

Updated Feb 12, 2025
Python

ANYANTUDRE / Florence-2-Vision-Language-Model

Star

Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

computer-vision deep-learning huggingface vision-language vision-transformer vision-transformer-models vision-language-model florence-2

Updated Jul 3, 2024
Jupyter Notebook

sayedmohamedscu / Vision-language-models-VLM

Star

vision language models finetuning notebooks & use cases (paligemma - florence .....)

computer-vision vlm florence finetuning multimodal colab-notebook finetune-llms paligemma florence-2 visionlanguage florence-finetuning

Updated Sep 26, 2024
Jupyter Notebook

jacobmarks / fiftyone_florence2_plugin

Star

Run SOTA Vision-Language Model Florence-2 on your data!

computer-vision ml transformer datacentric fiftyone-datasets vision-language-model florence-2

Updated Jun 29, 2024
Python

mithunparab / text2segment_video

Star

Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.

raft video-summarization optical-flow segment-anything florence-2 sam2

Updated Dec 31, 2024
Python

Iteranya / AktivaAI

Star

Local LLM Discord Bot

ai chatbot discord-bot roleplay llama florence multimodal koboldcpp florence-2

Updated Feb 18, 2025
Python

nguyennpa412 / simple-multimodal-ai

Star

Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features

docker text-to-speech computer-vision gradio vlm visual-question-answering llm mllm vision-foundation-model image-text-to-text florence-2 xtts-v2 mini-internvl

Updated Aug 16, 2024
Python

sitamgithub-MSIT / TextSnap

Star

TextSnap: Demo for Florence 2 model used in OCR tasks to extract and visualize text from images.

python artificial-intelligence optical-character-recognition gradio ocr-text-reader huggingface-transformers gradio-interface huggingface-spaces vision-language-model florence-2

Updated Nov 20, 2024
Python

Ambruk-chan / DiscordBot

Star

The Ultimate Local LLM Discord Bot!!!

ai discord-bot roleplay llm koboldcpp gbnf florence-2

Updated Dec 6, 2024
Python

regiellis / ecko-cli

Star

ecko-cli is a simple CLI tool that streamlines the process of processing images in a directory, generating captions, and saving them as text files. Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and ONNX

cli ai image-processing image-classification onnxruntime huggingface-transformers generative-ai ecko florence-2 ecko-cli

Updated Nov 12, 2024
Python

Kazuhito00 / Florence-2-Colaboratory-Sample

Star

Microsoft の軽量VLMのFlorence-2のColaboratory上でのサンプル

python vlm colaboratory florence-2

Updated Aug 30, 2024
Jupyter Notebook

Gabriellgpc / computer-vision-dataset-maker

Star

The Power of Florence-2 with OpenVINO & FiftyOne: Real-World Applications in Image Analysis

image computer-vision deep-learning image-recognition image-captioning openvino embedding-vectors fiftyone florence-2

Updated Sep 10, 2024
Python

Improve this page

Add a description, image, and links to the florence-2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the florence-2 topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

florence-2

Here are 28 public repositories matching this topic...

roboflow / maestro

jhc13 / taggui

autodistill / autodistill-grounded-sam-2

D-Ogi / WatermarkRemover-AI

Ravi-Teja-konda / Surveillance_Video_Summarizer

Damarcreative / rem-wm

autodistill / autodistill-florence-2

retkowsky / florence-2

fireicewolf / wd-llm-caption-cli

ANYANTUDRE / Florence-2-Vision-Language-Model

sayedmohamedscu / Vision-language-models-VLM

jacobmarks / fiftyone_florence2_plugin

mithunparab / text2segment_video

Iteranya / AktivaAI

nguyennpa412 / simple-multimodal-ai

sitamgithub-MSIT / TextSnap

Ambruk-chan / DiscordBot

regiellis / ecko-cli

Kazuhito00 / Florence-2-Colaboratory-Sample

Gabriellgpc / computer-vision-dataset-maker

Improve this page

Add this topic to your repo