Use the VLM hosted in the CV_MCP_Server HuggingFace Space to "see" objects through the robot's camera.
CV_Robot_MCP is a Python-based project that enables a robot to visually interpret its environment using a Vision-Language Model (VLM). The system communicates with the CV_MCP_Server HuggingFace Space to analyze images captured by the robot's camera and return meaningful object descriptions.
🎬 Watch the demo video here
- Vision-Language Model Integration: Seamlessly connects your robot's camera feed with a VLM via the HuggingFace Space.
- Python Implementation: Entirely written in Python for flexibility and ease of customization.
- Object Recognition: Identifies and describes objects in the robot's field of view.
- Python 3.10+
- Access to the CV_MCP_Server HuggingFace Space
- Robot hardware with camera support (optional for testing with sample images)
Clone this repository:
```bash
git clone https://github.com/OppaAI/CV_Robot_MCP.git
cd CV_Robot_MCP
```

Install the Python requirements:

```bash
pip install -r requirements.txt
```

Generate a HuggingFace token and enter it into the `.env` file.
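For reference, a `.env`-based setup is usually loaded along the lines of the sketch below. The use of `python-dotenv` and the variable name `HF_TOKEN` are assumptions here; check the repository's `.env` template for the actual key name.

```python
# Minimal sketch: load the HuggingFace token from a local .env file.
# Assumes python-dotenv is installed and the variable is named HF_TOKEN
# (hypothetical -- the project's .env may use a different key).
import os
from dotenv import load_dotenv

load_dotenv()                      # reads .env from the current directory
hf_token = os.getenv("HF_TOKEN")   # hypothetical variable name
if not hf_token:
    raise RuntimeError("HF_TOKEN is missing; add it to your .env file")
```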
- Start your robot and ensure the camera is functional.
- Run the main script to capture images and send them to the CV_MCP_Server (a sketch of this flow follows these steps):
```bash
python cv_robot.py
```
- View the object descriptions returned by the VLM.
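As a rough illustration of what the capture-and-describe loop can look like, the sketch below grabs one frame with OpenCV and sends it to the Space with `gradio_client`. It is not the repository's exact code: the Space ID `OppaAI/CV_MCP_Server`, the `/predict` endpoint name, and the single-image parameter are assumptions; consult the Space's "Use via API" page and `cv_robot.py` for the real names.

```python
# Rough sketch of capturing a frame and asking the VLM Space to describe it.
# Assumptions: OpenCV for the camera, gradio_client for the Space call,
# a Space named "OppaAI/CV_MCP_Server", and an API endpoint "/predict".
import os
import cv2
from gradio_client import Client, handle_file

cap = cv2.VideoCapture(0)              # default camera
ok, frame = cap.read()
cap.release()
if not ok:
    raise RuntimeError("Could not read a frame from the camera")

cv2.imwrite("frame.jpg", frame)        # save the frame so it can be uploaded

client = Client("OppaAI/CV_MCP_Server", hf_token=os.getenv("HF_TOKEN"))
description = client.predict(handle_file("frame.jpg"), api_name="/predict")
print(description)                     # the VLM's description of visible objects
```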
- Server URL: Update the HuggingFace Space URL in your code/config files if the endpoint changes.
- Camera Settings: Modify resolution, frame rate, or source in your Python scripts as needed.
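If the project captures frames with OpenCV (an assumption), camera resolution, frame rate, and source can typically be adjusted through `cv2.VideoCapture` properties, as in this illustrative snippet:

```python
# Example of adjusting camera settings, assuming OpenCV is used for capture.
# Values are illustrative; actual support depends on the camera driver.
import cv2

cap = cv2.VideoCapture(0)                  # change the index to switch sources
cap.set(cv2.CAP_PROP_FRAME_WIDTH, 1280)    # resolution width in pixels
cap.set(cv2.CAP_PROP_FRAME_HEIGHT, 720)    # resolution height in pixels
cap.set(cv2.CAP_PROP_FPS, 15)              # requested frame rate
```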
Contributions are welcome! Please open issues or submit pull requests for feature requests, bugs, or improvements.
This project is licensed under the MIT License. See LICENSE for details.
- CV_MCP_Server HuggingFace Space
- HuggingFace for hosting the VLM deployment
- VLM used in the HF Space: Qwen2.5-VL-7B-Instruct
