Skip to content
View A4xPraddy's full-sized avatar

Block or report A4xPraddy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
A4xPraddy/README.md

Hi, I'm Prasad

AI/ML Engineer • GenAI Developer • Computer Vision & Deep Learning Specialist

I design and develop intelligent systems that learn, reason, and solve real-world problems.
My work spans real-time computer vision, multimodal LLM systems, OCR automation, and 3D deep-learning pipelines.
I enjoy converting raw data into deployable, production-ready AI applications.


About Me

  • Focused on building scalable AI systems with real-time inference capability.
  • Currently exploring advanced MLOps workflows, multimodal AI, and RAG-based architectures.
  • Passionate about applying ML, CV, and LLMs to practical use-cases with measurable impact.
  • Open to collaborations in AI/ML research, computer vision, and GenAI product development.

Currently Working On

  • Multimodal AI systems (text–image–video pipelines)
  • 3D spatial deep learning models
  • OCR-based workflow automation
  • LLM-driven content intelligence and retrieval systems

Currently Learning

  • MLOps and model lifecycle management
  • LangChain and RAG pipelines
  • LLM optimization and quantization
  • Scalable deployment architectures (cloud + containers)

Ask Me About

  • GenAI and LLM integration
  • Computer vision models and pipelines
  • OCR and document intelligence
  • Deep learning architectures
  • ML workflows and data engineering

Tech Stack

Programming Languages

  • Python
  • C++
  • JavaScript

AI, ML & Deep Learning

  • TensorFlow, PyTorch, Keras
  • Scikit-Learn, NumPy, Pandas
  • OpenCV
  • CNN, RNN, LSTM, GRU
  • Graph Attention Networks (GAT)

GenAI & LLM Ecosystem

  • Google Gemini
  • LLaMA
  • Ollama
  • Transformers
  • FAISS
  • NLP pipelines

Data Processing & Scraping

  • PDFMiner
  • BeautifulSoup
  • YouTube Transcript API

Applications & Deployment

  • Streamlit
  • Babylon.js
  • HTML / CSS / JavaScript
  • Linux
  • AWS (Basics)
  • Jupyter

Highlight Projects

Real-Time ASL Recognition

A gesture classification system achieving over 92% accuracy with real-time performance using CNN + CV pipelines.

3D Floor Plan Generation (CNN + GAT)

Predicts room centroids and generates editable 3D layouts with real-time Babylon.js rendering.

MultiScraper AI

A multimodal intelligence engine for extracting and analyzing content from YouTube, PDFs, and websites using Gemini, LLaMA, and Ollama.

OCR Form Automation

OCR-driven form extraction, field detection, and automated filling using OpenCV, PyTesseract, and coordinate mapping.

Alzheimer’s MRI Classification

Built a classification pipeline using VGG16 and custom CNN models achieving up to 99% accuracy on ADNI datasets.

Energy Consumption Forecasting

Time-series forecasting with LSTM and GRU models, achieving top performance in MAE evaluation.


Contact

Email: prasad.mitnapure01@gmail.com


Fun Fact

I turn messy, unstructured real-world data into clean, automated AI workflows.

Popular repositories Loading

  1. AI_interviewer AI_interviewer Public

    Python

  2. E-commerce-customer-prediction E-commerce-customer-prediction Public

    Python

  3. EDA_ML_project EDA_ML_project Public

    Python

  4. AUTO_FEATURE_SELECTOR_TOOL AUTO_FEATURE_SELECTOR_TOOL Public

    Python

  5. COGNITIVE-OCR-FORM-PROCESSING-SYSTEM- COGNITIVE-OCR-FORM-PROCESSING-SYSTEM- Public

    Python

  6. Internship-E-commerce-Website-Prototype Internship-E-commerce-Website-Prototype Public

    JavaScript