A4xPraddy

Hi, I'm Prasad

AI/ML Engineer • GenAI Developer • Computer Vision & Deep Learning Specialist

I design and develop intelligent systems that learn, reason, and solve real-world problems.
My work spans real-time computer vision, multimodal LLM systems, OCR automation, and 3D deep-learning pipelines.
I enjoy converting raw data into deployable, production-ready AI applications.

About Me

Focused on building scalable AI systems with real-time inference capability.
Currently exploring advanced MLOps workflows, multimodal AI, and RAG-based architectures.
Passionate about applying ML, CV, and LLMs to practical use-cases with measurable impact.
Open to collaborations in AI/ML research, computer vision, and GenAI product development.

Currently Working On

Multimodal AI systems (text–image–video pipelines)
3D spatial deep learning models
OCR-based workflow automation
LLM-driven content intelligence and retrieval systems

Currently Learning

MLOps and model lifecycle management
LangChain and RAG pipelines
LLM optimization and quantization
Scalable deployment architectures (cloud + containers)

Ask Me About

GenAI and LLM integration
Computer vision models and pipelines
OCR and document intelligence
Deep learning architectures
ML workflows and data engineering

Tech Stack

Programming Languages

Python
C++
JavaScript

AI, ML & Deep Learning

TensorFlow, PyTorch, Keras
Scikit-Learn, NumPy, Pandas
OpenCV
CNN, RNN, LSTM, GRU
Graph Attention Networks (GAT)

GenAI & LLM Ecosystem

Google Gemini
LLaMA
Ollama
Transformers
FAISS
NLP pipelines

Data Processing & Scraping

PDFMiner
BeautifulSoup
YouTube Transcript API

Applications & Deployment

Streamlit
Babylon.js
HTML / CSS / JavaScript
Linux
AWS (Basics)
Jupyter

Highlight Projects

Real-Time ASL Recognition

A gesture classification system achieving over 92% accuracy with real-time performance using CNN + CV pipelines.

3D Floor Plan Generation (CNN + GAT)

Predicts room centroids and generates editable 3D layouts with real-time Babylon.js rendering.

MultiScraper AI

A multimodal intelligence engine for extracting and analyzing content from YouTube, PDFs, and websites using Gemini, LLaMA, and Ollama.

OCR Form Automation

OCR-driven form extraction, field detection, and automated filling using OpenCV, PyTesseract, and coordinate mapping.

Alzheimer’s MRI Classification

Built a classification pipeline using VGG16 and custom CNN models achieving up to 99% accuracy on ADNI datasets.

Energy Consumption Forecasting

Time-series forecasting with LSTM and GRU models, achieving top performance in MAE evaluation.

Contact

Email: prasad.mitnapure01@gmail.com

Fun Fact

I turn messy, unstructured real-world data into clean, automated AI workflows.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly