AI ML Engineer
Designing intelligent systems that combine AI, infrastructure, and production-scale engineering — from multimodal inference to autonomous agents.
Systems I Build
Production-grade ML workflows, multimodal inference, embedding systems, and automated intelligence pipelines at scale.
GPU-accelerated inference with ONNX Runtime, YOLO, RetinaFace, InsightFace, and large-scale scene analysis systems.
Autonomous reasoning systems using ReAct workflows, LLM orchestration, tool-calling, and intelligent multi-step automation.
AWS deployments, Dockerized services, PostgreSQL at scale, Cloudflare networking, CI/CD, and MLflow tracking.
Engineering Case Studies
Production-grade AI — built, deployed, and running.
Resolved production-critical CUDA/TensorRT runtime conflicts (nvinfer_10.dll) — diagnosed provider incompatibilities and migrated to CUDAExecutionProvider, restoring full GPU inference at 360K+ scene scale.
End-to-end AI pipeline processing 360,000+ movie scenes with automated face detection, recognition, and scene classification. Powers dam-studio-master — a React/Node.js digital asset management platform serving processed media to production.
Designed modular architecture compatible with TensorFlow, PyTorch, and Scikit-learn — ensuring the agent deploys into any existing MLOps stack without changes to the pipeline.
Intelligent monitoring system that detects data drift and concept drift, automatically triggers adaptive retraining, and uses an LLM-powered Decision Agent (GPT / Claude) to reason about root causes and explain every remediation action in plain language.
Infrastructure & Scaling
I care about everything below the model — GPU inference optimization, vector retrieval at scale, deployment pipelines, networking, and cloud-native architecture that holds up in production.
Engineering Philosophy
I'm deeply interested in how modern AI systems work end-to-end — from model behavior and inference optimization to deployment architecture and production reliability.
Communication Channel
Interested in AI systems, infrastructure, automation, or production-scale engineering collaborations?