Hi, I'm Animikh! I specialize in building and scaling end-to-end Machine Learning Systems.

- Currently a Computer Vision & ML Engineer at Moultrie, developing next-gen CV algorithms for wildlife monitoring
- MS in AI from Boston University, with research focused on end-to-end Autonomous Driving at the H2X Lab
- Published at IROS '25 (IEEE/RSJ International Conference on Intelligent Robots and Systems)
- Previously CV Engineer & Lead at Wobot.ai: built real-time video analytics from the ground up and scaled it globally
- Core interests: Computer Vision, Multi-Modal AI, and Software Engineering
- Personal website: animikh.me
- Always up for a good chat! Find me on X and LinkedIn

| Cost Savings | Performance | Accuracy | Scale |
|---|---|---|---|
| $1M+/year saved via ML pipeline overhaul | 6× faster inference with NVIDIA Triton | +16.4% mAP on Animal Detection | 20,000+ cameras deployed globally |

| Year | Paper | Venue | Role |
|---|---|---|---|
| 2025 | Scalable Offline Metrics for Autonomous Driving · IEEE Xplore | IROS 2025 (IEEE) | First Author |
| 2024 | Towards Closing the Generalization Gap in Autonomous Driving | MS Thesis, Boston University | First Author |
| 2024 | Generative AI, Human Creativity, and Art | PNAS Nexus | Acknowledgement |
| 2022 | LatentGAN Autoencoder: Learning Disentangled Latent Distribution | arXiv | Co-Author |
| 2019 | Sentiment Analysis of Restaurant Reviews Using Machine Learning Techniques | ICERECT / Springer · Best Paper Award | Co-Author |
| 2019 | Encoding Web-based Data for Efficient Storage in ML Applications | ICINPRO (IEEE) | First Author |
| 2018 | Analysis of Customer Opinion Using ML and NLP Techniques | IJASSR | Co-Author |
| 2018 | Sales-forecasting of Retail Stores Using ML Techniques | CSITSS (IEEE) | Co-Author |

| Project | Stack | Description |
|---|---|---|
| Site2LLM | Chrome Extension | Converts any website to neatly structured Markdown in one click |
| DigitizeMyNotes | LLMs · OCR · RAG | Converts handwritten notes to searchable digital text with AI chat, math recognition & export |
| Video Search | React · Qdrant · VideoPrism | AI-powered semantic video discovery: find videos by describing what you're looking for |
| Image Search | React · Qdrant · CLIP | AI-powered semantic image discovery: find visually similar content from natural language descriptions |
| Wallpaper AI | Stable Diffusion · GenAI | Generates high-quality 4K wallpapers from text prompts with enhancement |
| Autonomous Driving | PyTorch · CARLA | End-to-end CIL in a real-world model city · Video |
| 3D Text2LIVE | NeRF · Gen AI | 3D appearance editing of objects via text prompts |
| Real-Time Face Blur | OpenVINO · Edge AI | CPU-optimized privacy-preserving face anonymization |
| Helmet Detector | YOLOv3 | Helmetless rider + license plate detection with synthetic data |
| RL Racer | Double DQN · RL | Racing agent trained on OpenAI Gym CarRacing-v0 |
| Zero-Code Trainer | Docker · Streamlit | No-code model training toolkit (AutoML) |

- Co-Organizer, Boston Computer Vision AIR (AI, Autonomy & Robotics): monthly meetups for 50-90+ participants from academia and industry; past events include AI-Enabled Robotics, Autonomous Vehicles, Navigation Beyond GPS, and more
- Manuscript Reviewer: Manning Publications · Journal of Open Source Software (JOSS) · GTC 2025 · ICCCT 2025 · ICCI 2024






