Bengaluru, India · AI Engineer

Shubham Baid

Building GenAI + Vision systems that ship.
Edge deployments · Optimized pipelines · Real-world impact.

5+ Years Exp.
1,500+ Sites Deployed
80% Bandwidth Cut
2 Publications
1 Patent
Shubham Baid
shubhambaid99@gmail.com
PyTorch · TensorFlow · OpenCV · TensorRT · CUDA · Python · C++ · YOLO · InternVL · Qwen-VL · LLaVA · CrewAI · GStreamer · DeepStream · OpenVINO · NVIDIA Jetson · ARM / NEON · RAG · Hugging Face · Docker

AI & Computer Vision Engineer with 5+ years of experience building production systems across Multimodal LLMs, Computer Vision, and Edge AI. Progressed from intern to founding-team Lead Engineer, with a track record across safety-critical deployments, agentic automation, and hardware-constrained inference optimization.

Has contributed to an M&A exit, deployed systems across 1,500+ sites (schools, warehouses, and more), won tracks at Europe's largest hackathon and NASA Space Apps, holds 2 publications and a patent, and is currently building autonomous manufacturing planning systems at a pre-seed stealth startup.

Core philosophy: rapid prototyping, rigorous real-world testing, and keeping an eye on how models perform under deployment constraints — not just benchmarks. Founded and led Google Developer Student Club (GDSC) REVA during undergrad.

Computer Vision · Multimodal LLMs · Agentic AI · Edge AI · PyTorch · TensorRT · OpenCV · CUDA / NEON · RAG · Python · C/C++ · Model Optimization · Active Learning
Quick Stats
Experience: 5+ years
Sites Deployed: 1,500+
Publications: 2
Patent: 1
Certifications: 5
Hackathon wins: 2 tracks
Location: Bengaluru, India
Stealth Startup
Applied AI · Pre-Seed
Apr 2025 – Present
Lead AI Engineer — Founding Team
  • Designing and orchestrating large-scale multi-agent systems for complex planning and data-extraction workflows, cutting end-to-end decision and logic-building cycles from hours to minutes.
  • Developed a perceptual compression scheme that decouples human-visual fidelity from machine-interpretable signal retention: 80% lower bandwidth while preserving 99% of task-relevant semantics for downstream VLM inference.
  • Built end-to-end streaming pipeline: Encode → Stream → Decode → VLM forward pass, with robust CPU/GPU fallback for edge devices.
  • Benchmarked LLaVA, Qwen-VL, and custom models for throughput-per-watt on edge hardware.
Multi-agent AI · VLMs · Edge Streaming · CrewAI · LLaVA · Qwen-VL
CAFU
Fuel Delivery Tech · Dubai
Jan 2025 – Apr 2025
AI Engineer
  • Designed LLM-based agentic content generation workflows → +15% CTR, −50% manual copywriting time.
  • Optimized the ML-based ETA prediction system, reducing delayed fulfillment from 13% to 8% and cutting breaches (>5 min) by 42% over 3 months.
  • Built autonomous B2B lead pipeline processing 1,400+ prospects/mo → 90% reduction in manual prospecting hours, improved CAC, 4 high-conversion leads generated.
Agentic AI · LLMs · ML Optimization · Python
Avathon
Formerly SparkCognition
May 2022 – Jan 2025
Senior AI Engineer
  • Led deployment of VAIA School Safety Suite (firearm detection, person Re-ID) across 100+ cameras in US schools — real-time at scale.
  • Integrated Multimodal LLMs into traditional CV pipelines to handle complex edge cases.
  • Engineered CV pipelines for low-power ARM hardware using NEON SIMD instructions and model quantization, cutting CPU usage by 50%.
  • Led development of a gamified auto-image annotation tool built on active learning, which sped up dataset preparation.
  • Managed and mentored a team of AI engineers.
Computer Vision · TensorRT · NEON · Multimodal LLMs · Team Lead
Integration Wizards
Acquired by SparkCognition
Sep 2020 – May 2022
Intern → AI Engineer → Sr. AI Engineer
  • Developed ALPR system — SOTA 2021, hybrid CPU/GPU inference, real-time capable.
  • Migrated legacy CV models to NVIDIA DeepStream + TensorRT GPU pipelines — substantially reduced inference latency.
  • Built custom detection models: PPE detection, fall arrester detection, vehicle body-type classification for industrial safety.
  • Core AI technology contributed to the company's M&A exit — acquired by SparkCognition.
ALPR · DeepStream · TensorRT · PPE Detection · Edge AI

Computer Vision

Object Detection (YOLO, SSD): 95%
OpenCV / Image Processing: 96%
Person Re-ID & Multi-cam Tracking: 88%

Deep Learning

PyTorch: 93%
TensorFlow / Keras: 90%
Model Architecture Design: 88%

LLMs & Generative AI

Agentic Workflows (CrewAI): 90%
Multimodal LLMs (VLMs): 88%
RAG Pipelines: 85%

Edge AI & Optimization

TensorRT / CUDA: 91%
ARM / NEON Optimization: 87%
NVIDIA Jetson / DeepStream: 92%
Production

VAIA School Safety Suite

Led development of a multi-camera safety system with firearm detection and person Re-ID. Deployed across 100+ cameras in US schools, running in real time.

Object Detection · Person Re-ID · Multi-camera
HackZurich 2021 🏆

YetiCoach

Real-time ski coaching from action-camera footage alone: analyzes ski angles, gap, and technique. Won the Sunrise GMBH, Huawei & Swisski track.

Computer Vision · Sports Tech · Python
NASA 2021 🏆

Aegir — Ocean Debris AI

CV & deep learning platform on satellite/UAV imagery to locate, classify, and predict ocean debris trajectory for cleanup coordination.

Deep Learning · Satellite · Edge AI

Argus

Autonomous drone system generating critical disaster-zone data: locality density, population estimates, depth maps for responder coordination.

Deep Learning · UAV · Disaster Tech

AutoSnotBot

Drone with custom YOLO model detecting whales from aerial footage and autonomously collecting snot samples for algal bloom prediction research.

YOLO · UAV · DeepStream

Hailey — AI Writing Assistant

AI writing assistant using Hugging Face Transformers + GPT-2 for contextual text generation and sentence completion.

Transformers · GPT-2 · NLP
Production

Automatic License Plate Recognition

SOTA 2021 ALPR module with minimal resource footprint, hybrid CPU/GPU inference paths, real-time deployment-ready.

OCR · TensorRT · GPU Inference

Cap for Blind

Arduino-based wearable with ultrasonic sensors providing haptic feedback to assist visually impaired individuals in navigation.

Arduino · IoT · Assistive Tech
01
JAZ 2023 · Vol 44, p938 · April 2023

Text Generation Tool for Writing Assistance using Transformer

Transformer-based writing assistance system. Submitted August 2020, published April 2023.

NLP · Transformers · Generative AI
02
TEST Engineering & Management · May 2020

Detection of Different Degrees of Skin Burn using YOLOv3

YOLOv3-based model for accurate detection and classification of skin burn severity to aid medical diagnosis.

Computer Vision · YOLOv3 · Medical AI
Patent · Transformer Writing Assistance System

System and Method for Text Generation Tool for Writing Assistance Using Transformer

Patent covering the transformer writing assistance system. The same work was also published academically, demonstrating both research depth and IP value.

Patent Holder
Education
B.Tech, Computer Science · REVA University · 2017 – 2021 · Best Outgoing Student · Founded GDSC REVA
Certifications
NVIDIA Jetson AI Specialist: Edge AI deployment on Jetson hardware
Intel Edge AI Specialist: AI optimization on Intel hardware
TensorFlow Developer Certificate: Google-certified TF expertise
Convolutional Neural Networks: deeplearning.ai / Coursera
Neural Networks & Deep Learning: deeplearning.ai / Coursera
Awards & Recognition
🏆
Track Winner, HackZurich 2021 · Europe's largest hackathon · YetiCoach (Sunrise GMBH, Huawei & Swisski track)
🚀
Track Winner, NASA Space Apps 2021 · International NASA hackathon · Project Aegir
Best Outgoing Student 2021 · REVA University
🌌
NASA Galactic Problem Solver × 2 · 2019 & 2020, NASA Space Apps
🥈
2nd Place, Hackfest 2019 · National Level · REVA University
🥈
2nd Place, Sentinal Hack 2.0 · State Level · KSIT

I'm interested in senior AI/ML engineering roles, technical collaborations, and research that pushes what's possible at the intersection of vision and language. If you have something worth discussing, reach out.

Response time: usually within 24 hours.
Based in Bengaluru, India — open to remote-first roles globally.
