AI Engineer · Boston, MA

Hemanth Sai M.

MS Artificial Intelligence · Khoury College of Computer Sciences
Northeastern University · Expected May 2027

Building intelligent systems across computer vision, MLOps, and vision-language research. 4+ years of experience turning research into real-world impact, from air force runways to production Kubernetes deployments and VLM reasoning research.

Background

Crafting intelligence
from data & vision

I'm a Master's student in Artificial Intelligence at Northeastern University's Khoury College of Computer Sciences, graduating May 2027. My journey began with building robots that could see and understand the world — and that curiosity has only deepened since.

My work spans the full ML stack. It includes a Foreign Object Debris detection system for the Indian Air Force Academy (93% precision, running on a Jetson Xavier NX); a production MLOps platform that uses reinforcement learning to route inference across YOLOv8 variants on GKE, cutting latency by 58%; and VLM reasoning research that uses counterfactual consistency testing to probe whether LLaVA-1.5 and InstructBLIP truly understand visual content.

I'm drawn to problems where computer vision, MLOps, and language models intersect — building systems that work in the real world, not just on benchmarks. I hold an AWS certification and have filed 5 patents, one of which has been commercialised.

AWS Certified · Computer Vision · Deep Learning · MLOps · Reinforcement Learning · Vision-Language Models · Edge Computing · GANs
4+ Years in AI/ML
Academic research, internships & industry projects
5 Filed Patents
IPR — Intellectual Property India. One commercialised.
93% FOD Precision
Indian Air Force Academy runway safety system
IEEE Published Paper
ICCCI 2024 — RoboVerse interactive robot

Tools of the trade

Core AI/ML
PyTorch · TensorFlow · YOLOv8 · OpenCV · Scikit-learn · Stable Baselines3 · Keras · ONNX · TensorRT · LoRA / PEFT
Computer Vision
Object Detection · Facial Recognition · Lane Detection · Transfer Learning · SVMs · MediaPipe · cvzone · Image Segmentation
Language & VLMs
LLaVA-1.5 · InstructBLIP · HuggingFace Transformers · Visual QA · Counterfactual Reasoning · bitsandbytes (QLoRA)
MLOps & Cloud
AWS · Apache Airflow · Docker · Kubernetes · MLflow · FastAPI · DVC · Evidently AI · Prometheus · Grafana · Amazon S3 · Git
Edge & Hardware
NVIDIA Jetson Xavier NX · Raspberry Pi · Arduino · e-con Cameras · IoT Sensors · Servo Motors
Languages
Python · C++ · SQL · Bash
Career

Where I've built things

Two terms
Aug 2025 – Oct 2025 · Jan 2026 – Feb 2026
Instructional Assistant
Northeastern University · Boston, MA

Providing real-time technical and AV support to professors during classroom instruction at Khoury College of Computer Sciences. Managing ServiceNow ticketing for AV and classroom tech issues, ensuring zero-downtime teaching experiences across the department.

ServiceNow · Ticketing · AV Systems · Technical Support
Jan 2024 – Dec 2024
Project Lead - AI/ML Engineering
Indian Air Force Academy · Hyderabad, India

Led the design and deployment of an autonomous Foreign Object Debris detection vehicle to enhance runway safety. Built a custom YOLOv8 model with hyperparameter tuning, deployed on an NVIDIA Jetson Xavier NX and paired with e-con Systems cameras for real-time video processing. The vehicle uses an electric platform for wobble-free movement and autonomously patrols airstrips, identifying hazards that could damage aircraft.

Precision: 0.930 · Recall: 0.882 · F1: 0.906 · ~21 FPS Live · YOLOv8 · Jetson Xavier NX
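For readers who want to check how the three detection metrics relate, here is a minimal sketch of the standard F1 formula (the harmonic mean of precision and recall). The function name `f1_score` is only illustrative and is not taken from the project code. Plugging in the rounded figures gives roughly 0.905; a last-digit difference from the reported 0.906 is what one would expect if F1 was computed from unrounded precision and recall.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (standard F1 definition)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both metrics are zero
    return 2 * precision * recall / (precision + recall)


# Rounded values as listed for the FOD detector.
print(round(f1_score(0.930, 0.882), 4))
```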
Mar 2023 – Jun 2023
Machine Learning Intern
Sclanet AI · Texas, United States

Designed and executed a full computer vision pipeline for retail shelf product detection. Defined image collection criteria, annotated datasets with LabelImg, and stored data on Amazon S3 while product metadata was ingested into MongoDB. Built a custom TensorFlow Object Detection API model using transfer learning, achieving strong mAP scores for real-time product recognition on retail shelves.

TensorFlow OD API · Amazon S3 · MongoDB · Transfer Learning
Mar 2022 – Jun 2022
Robotics Engineering Intern
HBots · Hyderabad, India

Designed and built a greeting robot combining facial recognition, computer vision, and IoT hardware. Implemented OpenCV for visual processing, reducing processing time by 25%, and used SVMs for facial classification, achieving a 15% accuracy improvement. Integrated Raspberry Pi, Pi cameras, servo motors, and temperature sensors for a fully interactive visitor experience.

25% Faster Processing · 15% Accuracy Gain · OpenCV · SVM · Raspberry Pi
Portfolio

Selected projects

01 — Flagship
Driver Assistance System

A full ADAS suite with six components: drowsiness detection, lane detection, lane departure warning, lane keeping assist, object recognition, and collision warning. All of them run on Jetson hardware with real-time inference.

YOLOv8 · OpenCV · TensorRT · Jetson Orin Nano · B.Tech Final Year
02 — Computer Vision
Greeting Robot (RoboVerse)

IoT-integrated visitor management robot. Detects and identifies faces using SVM classifiers, greets known visitors with a handshake, measures body temperature via sensor integration, and marks attendance automatically. Published at IEEE ICCCI 2024.

OpenCV · SVM · pyttsx3 · IoT · IEEE Published
03 — Gesture Control
Virtual Mouse via Hand Gestures

Real-time hand gesture recognition system that replaces the physical mouse. Index finger controls the cursor, multi-finger combinations trigger left click and drag-drop, while thumb-index distance modulates system volume — no hardware required.

MediaPipe · OpenCV · Python · Real-time
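As a rough illustration of the volume gesture (not the project's actual code), the thumb-index distance can be mapped linearly onto a 0–100 volume level and clamped at both ends. The function name `pinch_to_volume` and the pixel thresholds `min_dist` and `max_dist` are assumptions made for this sketch; in practice the two fingertip coordinates would come from a hand-landmark tracker.

```python
import math


def pinch_to_volume(thumb_tip, index_tip, min_dist=20.0, max_dist=200.0):
    """Map the thumb-index pixel distance to a 0-100 volume level.

    min_dist and max_dist are hypothetical calibration values: distances
    at or below min_dist give 0, at or above max_dist give 100.
    """
    dist = math.hypot(index_tip[0] - thumb_tip[0], index_tip[1] - thumb_tip[1])
    ratio = (dist - min_dist) / (max_dist - min_dist)
    ratio = min(1.0, max(0.0, ratio))  # clamp to [0, 1]
    return round(100 * ratio)


# Fingertips 110 px apart sit halfway between the two thresholds.
print(pinch_to_volume((0, 0), (0, 110)))
```

A linear map keeps the gesture predictable; a real system would likely also smooth the value over a few frames to avoid jitter.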
04 — CV Game
Rock Paper Scissors — No Controller

Webcam-based Rock Paper Scissors game where the player competes against the computer using live hand gestures captured and classified in real-time. Finger configuration detection with score tracking and animated UI.

cvzone · OpenCV · Python
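A minimal sketch of the game logic, assuming the hand tracker reports finger state as a five-element list of 0/1 flags (thumb first). The mapping tables and the function names `classify` and `judge` are illustrative and are not taken from the project itself.

```python
# Finger states (thumb, index, middle, ring, pinky) -> gesture.
GESTURES = {
    (0, 0, 0, 0, 0): "rock",
    (1, 1, 1, 1, 1): "paper",
    (0, 1, 1, 0, 0): "scissors",
}

# Each key beats the gesture it maps to.
BEATS = {"rock": "scissors", "paper": "rock", "scissors": "paper"}


def classify(fingers):
    """Return the gesture name, or None if the pose is not recognised."""
    return GESTURES.get(tuple(fingers))


def judge(player, computer):
    """Return 'player', 'computer', or 'draw' for one round."""
    if player == computer:
        return "draw"
    return "player" if BEATS[player] == computer else "computer"


move = classify([0, 1, 1, 0, 0])
print(move, judge(move, "paper"))
```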
05 — Agriculture AI
Autonomous Crop Surveillance Rover

An intelligent rover that autonomously patrols farmland, captures crop imagery, detects disease at early growth stages using CNNs, alerts farmers with disease names and treatments, and connects them to the nearest testing facilities.

CNN · Transfer Learning · IoT · Patented
06 — FOD Safety
Runway FOD Detection Vehicle

Autonomous electric vehicle for airstrip safety. Custom YOLOv8 with hyperparameter tuning, live inference on NVIDIA Jetson Xavier NX, capturing high-res imagery via e-con cameras. Achieved 93% precision, recognised by the Indian Air Force Academy.

YOLOv8 · Jetson Xavier NX · e-con Cameras · IAF Academy
07 — MLOps
Adaptive ML Inference Platform

Production MLOps system that uses a PPO reinforcement learning agent to route video frames dynamically across YOLOv8 Nano, Small, and Large models. It achieves a 58% latency reduction (48 ms → 20 ms), 2.6× throughput, and 42% cost savings while retaining 95%+ detection accuracy. Deployed on GKE with Airflow, MLflow, and automated drift detection.

YOLOv8 · Reinforcement Learning · Apache Airflow · Kubernetes · FastAPI · MLflow
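The latency figure can be checked with simple arithmetic. This is a small sketch using only the before and after latencies quoted above; the helper name `percent_reduction` is illustrative.

```python
def percent_reduction(before_ms: float, after_ms: float) -> float:
    """Relative reduction from before_ms to after_ms, in percent."""
    return 100.0 * (before_ms - after_ms) / before_ms


# 48 ms down to 20 ms, as quoted for the platform.
print(f"{percent_reduction(48, 20):.1f}%")  # 58.3%
```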
08 — Vision-Language
VLM Counterfactual Consistency

Research project probing whether LLaVA-1.5 and InstructBLIP truly reason about visual content or rely on pattern-matching. Introduces counterfactual question families across four intervention types (negation, attribute swaps, entailment, spatial) and a novel Consistency Score metric. LoRA fine-tuning with a pairwise consistency loss improves VQA accuracy by 5.4%.

LLaVA-1.5 · InstructBLIP · LoRA / PEFT · GQA · HuggingFace
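The description above does not spell out how the Consistency Score is computed, so the sketch below shows one plausible definition purely as an assumption: the fraction of question families in which the model answers every member correctly. The function name and the record layout are hypothetical.

```python
from collections import defaultdict


def consistency_score(records):
    """Fraction of question families answered correctly in full.

    records: iterable of (family_id, is_correct) pairs, one per question.
    This definition is an assumption for illustration, not the project's
    published metric.
    """
    families = defaultdict(list)
    for family_id, is_correct in records:
        families[family_id].append(bool(is_correct))
    if not families:
        return 0.0
    consistent = sum(all(answers) for answers in families.values())
    return consistent / len(families)


# Two families: the model is fully right on f1 but slips once on f2.
records = [("f1", True), ("f1", True), ("f2", True), ("f2", False)]
print(consistency_score(records))  # 0.5
```

Under this reading, per-question accuracy on the toy data is 0.75 while consistency is 0.5, which is the kind of gap a counterfactual test is meant to expose.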
Research & IP

Patents & publications

IEEE Paper
Hello Humans! Welcome to RoboVerse: An IoT Based Interactive Robot
IEEE 14th International Conference on Computer Communication & Informatics (ICCCI)
2024
Patent ✓ Commercialised
Crop Monitoring with AI Based Autonomous Farm Rover
IPR — Intellectual Property India · Commercialised by HEXAIND Technologies
2023
Patent
Driver Assistant System
IPR — Intellectual Property India
2023
Patent
IoT Based Intelligent Vehicle Safety System
IPR — Intellectual Property India · Drunk driving & abuse detection with autonomous disabling
2023
Patent
IoT-Sensor-Based Plant Disease Diagnosis
IPR — Intellectual Property India · CNN-powered rice disease detection with treatment recommendations
2023
Patent
Prediction of Birds and Analysis of Endangered Bird Species
IPR — Intellectual Property India · CNN-based species identification and endangered species classification
2023
Writing

Thoughts on AI & beyond

Linux MLOps Mar 2026
Installing Ubuntu 24.04 on the Asus ROG G14 (RTX 5060 Blackwell)

From BIOS tweaks and nomodeset to compiling asusctl from source — a complete guide to running Linux on Blackwell hardware as a daily driver.

More posts arriving soon
Let's connect

Open to opportunities

Graduating May 2027. Actively seeking AI/ML Engineer roles in Boston and beyond.