Skip to content

A virtual calculator powered by OpenCV, MediaPipe, and Generative AI, enabling users to solve complex math problems by drawing equations on-screen using hand gestures. The system captures the visual input, processes it via Google's Gemini LLM API key, and displays a detailed solution in real time.

Notifications You must be signed in to change notification settings

codewithsanaa/Vision-based-Math-Solver-using-OpenCv-and-Generative-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vision-based-Math-Solver-using-OpenCv-and-Generative-AI

A virtual calculator powered by OpenCV, MediaPipe, and Generative AI, enabling users to solve complex math problems by drawing equations on-screen using hand gestures. The system captures the visual input, processes it via Google's Gemini LLM API key, and displays a detailed solution in real time.

Tech Stack

python,OpenCV,LLM,Flask,Html,CSS,JavaScript

#System Architechture image

How to Run?

To use the AI features, you need a Gemini API key. You can create a key with one click in Google AI Studio. Replace YOUR_API_KEY in app.py with your own API Key. Ensure you have the latest version of Python installed on your system. Quickly clone the repository and change the working directory.

git clone https://github.com/codewithsanaa/Vision-based-Math-Solver-using-OpenCv-and-Generative-AI.git cd Vision-based-Math-Solver-using-OpenCv-and-Generative-AI/

We have to install all the dependencies of the project. Optionally, you can create a Python virtual environment. Run the following commands in your terminal: python -m venv venv pip install -r requirements.txt

Now we can run the flask server with python app.py command.

About

A virtual calculator powered by OpenCV, MediaPipe, and Generative AI, enabling users to solve complex math problems by drawing equations on-screen using hand gestures. The system captures the visual input, processes it via Google's Gemini LLM API key, and displays a detailed solution in real time.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published