Skip to content
View mfh-001's full-sized avatar
  • Islamabad, Pakistan

Block or report mfh-001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mfh-001/README.md

Fahad

AI/ML Engineer · Computer Vision · Robotics & Autonomous Systems · Medical AI

MS in AI & Autonomous Systems  |  Islamabad, Pakistan  |  muhammadfahadhassan01@gmail.com

I'm an engineer who spent more than a year running production network infrastructure for 10,000+ users, then pivoted fully into AI. That background, real systems, real uptime requirements, real debugging under pressure, shapes how I build: I care about things that actually work, not just things that train.

Right now I'm focused on computer vision for healthcare, financial intelligence, and autonomous navigation, with deployed projects in each area.


Deployed Projects

📊 FinSight AI  ·  Live Demo  ·  Repo

Multi-modal financial document intelligence pipeline. ColPali v1.2 indexes PDF pages as image patch embeddings (no OCR) — MaxSim retrieval finds the right page without destroying table structure. Qwen2-VL-7B-Instruct reads tables natively and extracts structured JSON. An agentic risk engine autonomously applies an 8-point financial analyst checklist. Validated on Apple Inc.'s real FY2023 10-K (107 pages, SEC EDGAR): correctly extracted $383B revenue, $97B net income, $6.13 EPS, and flagged the 2.8% revenue decline.

ColPali Qwen2-VL-7B PyTorch BitsAndBytes PyMuPDF SEC EDGAR Dual T4 GPU

🩺 PsoriScan AI  ·  Live Demo  ·  Repo

Automated psoriasis plaque segmentation and PASI-inspired severity scoring from dermoscopic images. U-Net with EfficientNet-B3 encoder achieves Dice 0.9479 on validation. Severity classifier takes both visual features and computed coverage % as inputs, designed to mirror how dermatologists actually assess severity. Deployed on Hugging Face Spaces with input validation, heatmap overlay, and a PASI-proxy scoring breakdown.

PyTorch segmentation-models-pytorch EfficientNet OpenCV Streamlit ISIC 2018 HAM10000

🎵 Aether  ·  Live Demo  ·  Repo

Procedural ambient synthesis engine that maps natural language to DSP parameters. A custom keyword tokenizer scores text against 80 acoustic clusters to drive a 12-dimension synthesis vector. The system generates audio via a 5-layer Web Audio graph featuring a 25-oscillator detuned pad stack and procedural convolution reverb. It runs entirely client-side with no model weights or external APIs. It includes a custom 16-bit PCM encoder for faster than real-time offline WAV export.

Web Audio API JavaScript DSP NLP Tokenization Additive Synthesis OfflineAudioContext HTML5 Canvas

🔬 MediScan AI  ·  Live Demo  ·  Repo

Multimodal medical diagnostic tool combining U-Net skin lesion segmentation (Dice 0.90, Jaccard 0.85) with a RAG-inspired Llama 3.2 3B Instruct conversational layer. Solved catastrophic forgetting during BioGPT fine-tuning by switching to instruction-tuned Llama with a structured medical knowledge base. Deployed with fuzzy query matching for clinical misspelling tolerance.

PyTorch Llama 3.2 PEFT/LoRA BitsAndBytes Streamlit HAM10000

🗺️ Visual SLAM vs GPS  ·  Repo

Monocular SLAM navigation pipeline comparing ORB-SLAM3 trajectory against mobile GPS ground truth. 26.3 cm mean alignment error over full test sequences. Camera calibrated with OpenCV chessboard method; Umeyama similarity transformation applied for SLAM-to-UTM coordinate alignment. Full temporal synchronisation via timestamp interpolation.

ORB-SLAM3 C++ OpenCV NumPy SciPy pyproj Ubuntu

🤖 Autonomous Vacuum & Air Quality Robot  ·  Repo

Hardware robot with ultrasonic sensor navigation (95% accuracy across 100 sqm), real-time DHT22 environmental monitoring, mobile app remote control, and ML-based decision logic for zone-targeted cleaning. Data accuracy within ±2%.

ROS2 Arduino Sensor Fusion ML decision logic


Skills

Area Tools
ML & Computer Vision PyTorch · TensorFlow · U-Net · EfficientNet · LoRA/PEFT · Image segmentation · Sensor fusion
LLMs & Document AI Qwen2-VL · Llama 3.2 · ColPali · BitsAndBytes · RAG pipelines · Multi-modal retrieval
Robotics ROS2 · ORB-SLAM3 · Coordinate transforms · SLAM · Autonomous navigation
Deployment Streamlit · Hugging Face Spaces · Docker · AWS · Terraform · CI/CD
Infrastructure TCP/IP · VLANs · VPNs (L2/L3) · MPLS · GPON/FTTH · Cisco · Juniper · SolarWinds
Languages Python · C++ · MATLAB

Background

Before AI, I spent more than a year as an Operations Engineer at an ISP, managing network operations for 10,000+ subscribers across GPON infrastructure, maintaining 99% uptime SLA, and handling escalated Cisco/Juniper configurations and L2/L3 VPN circuits.

That infrastructure experience is why I'm drawn to problems where the gap between "works in a notebook" and "works reliably" actually matters.


Can be contacted at muhammadfahadhassan01@gmail.com

Pinned Loading

  1. FinSight-AI FinSight-AI Public

    inancial document intelligence — ColPali visual retrieval + Qwen2-VL-7B. Real Apple FY2023 10-K analysis. No OCR, no paid APIs.

    Jupyter Notebook

  2. PsoriScan-AI PsoriScan-AI Public

    Automated psoriasis plaque analysis tool combining U-Net segmentation with an EfficientNet-based severity classifier to produce clinically-motivated outputs.

    Jupyter Notebook

  3. AI-Medical-Assistant AI-Medical-Assistant Public

    Multimodal medical diagnostic tool combining U-Net skin lesion segmentation (Dice 0.90) with Llama 3.2 conversational AI. Built on HAM10000 dataset.

    Jupyter Notebook

  4. ORB-SLAM3-GPS-Comparison ORB-SLAM3-GPS-Comparison Public

    Monocular SLAM trajectory validation against GPS ground truth. Achieved 26.3cm mean alignment error using Umeyama transformation and OpenCV calibration.

    Python