Available · open to roles

Shivang Singh.

> at Publicis Sapient

I build and scale GenAI systems in production — where latency, token limits, and failure modes matter as much as model quality.

Bengaluru, India
Production GenAI · LLM Infra · Computer Vision · FastAPI · K8s
Bengaluru · 12.97°N 77.59°E
★ Snapshot

look

Currently building

Bodhi Atomize

Production multimodal GenAI platform decomposing 10,000+ marketing assets into 50+ structured signals per asset for Eli Lilly. Multi-stage LLM pipelines with token budgeting, backpressure, and KEDA-autoscaled microservices.

Gemini 2.5 Pro · Claude 4.6 · FastAPI · Kubernetes · PyTorch · Pydantic
shipping · prod traffic since Jun 2025
Based in
Bengaluru, IN
IST · UTC+5:30
Recent ship
Dossier

7-agent autonomous job intel pipeline · $0.06/app · LaTeX resume gen

GPT-5 · Claude 4.6 · Tavily
Daily driver
Python
PyTorch · FastAPI · Pydantic
py · ts · go · sql
Currently

Obsessed with structured outputs, LLM evaluation, and production reliability under burst traffic.

Shipping daily
Coding to
Lo-fi · electronica · The xx
95% manual time cut
10K+ assets analyzed
1K+ concurrent req
$0.06 per Dossier app
01 · About

survive production traffic.

I design and operate LLM pipelines that handle real traffic. At Publicis Sapient I lead Bodhi Atomize, a multimodal GenAI platform that turns images, videos, and GIFs into structured signals for enterprise clients like Eli Lilly. Previously, I shipped object detection and defect detection systems that improved accuracy and inference speed at scale.

My work sits at the intersection of GenAI systems, computer vision, and production ML engineering — where latency, token limits, retries, backpressure, and failure modes matter as much as model quality.

Building AI that reliably works in production
Understanding failure modes early
Writing clean, scalable ML + backend code
Learning from real usage, not just papers
/ 01
95%
Manual analysis time cut
/ 02
10K+
Assets processed for Eli Lilly
/ 03
1K+
Concurrent requests handled
/ 04
1.21%
EER on biometric thesis
02 · Experience

at scale.

Publicis Sapient
Current
AI Engineer · Senior Associate Data Science L1
Jun 2025 – Present · Bengaluru, India
  • Architected Bodhi Atomize — production multimodal GenAI platform cutting marketing asset analysis from hours to ~2 min per asset (95% reduction) across 10,000+ assets for Eli Lilly. Outputs 50+ structured JSON signals per asset.
  • Engineered multi-stage LLM inference pipelines with Gemini 2.5 Pro and Pydantic-validated structured outputs. Implemented token budgeting, exponential-backoff retry, and backpressure control to sustain production throughput under rate limits.
  • Integrated YOLO and PaddleOCR into LLM workflows, extracting 50+ typed visual components (text, characters, emotions, branding) per asset. Established LLM evaluation with DeepEval (LLM-as-judge, G-Eval).
  • Built FastAPI microservices with Redis (caching + task queuing) and Celery. Deployed on Kubernetes with KEDA autoscaling to sustain 1,000+ concurrent requests under burst traffic with low latency.
Gemini 2.5 Pro · FastAPI · Pydantic · YOLO · PaddleOCR · Redis · Celery · Kubernetes · KEDA · DeepEval
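
The retry and token-budgeting pattern named in the bullets above can be illustrated with a minimal, stdlib-only sketch. This is not the Bodhi Atomize code: `RateLimitError`, `backoff_delays`, `call_with_retry`, `fits_budget`, and every default value are hypothetical names and numbers chosen for illustration, and the jitter strategy (full jitter) is an assumption.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical stand-in for a provider's rate-limit error."""


def backoff_delays(retries: int, base: float = 0.5, cap: float = 8.0) -> list[float]:
    """Delay ceilings before jitter: base * 2**attempt, capped at `cap`."""
    return [min(cap, base * 2 ** attempt) for attempt in range(retries)]


def call_with_retry(
    fn: Callable[[], T],
    retries: int = 4,
    base: float = 0.5,
    cap: float = 8.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on RateLimitError with full-jitter exponential backoff."""
    for ceiling in backoff_delays(retries, base, cap):
        try:
            return fn()
        except RateLimitError:
            # Sleep a random amount up to the current ceiling to spread retries out.
            sleep(random.uniform(0, ceiling))
    return fn()  # final attempt; any error propagates to the caller


def fits_budget(prompt_tokens: int, max_output_tokens: int, context_limit: int) -> bool:
    """Token budgeting: admit a request only if prompt plus reserved output fits."""
    return prompt_tokens + max_output_tokens <= context_limit
```

Passing `sleep` as a parameter keeps the helper testable without real waiting. Backpressure (for example, bounding in-flight requests with a semaphore or queue) is a separate concern and is not shown here.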
Lincode Vision Labs
Data Science Intern → Trainee
Oct 2024 – Jun 2025 · Bengaluru, India
  • Integrated RF-DETR into production pipelines — 1.8× faster inference and +7% mAP50 improvement over YOLOv8 baseline on industrial defect detection.
  • Curated and preprocessed 30,000+ industrial images through targeted augmentation and annotation QA pipelines, lifting defect detection accuracy by 10%.
RF-DETR · YOLOv8 · PyTorch · OpenCV
Omdena
Junior Machine Learning Engineer
May 2024 – Aug 2024 · Remote
  • Led the supervised modelling team predicting urban farming zones in Milan using geospatial data.
  • Engineered an XGBoost model achieving 93.68% accuracy. Conducted EDA on 106,000 rows with GeoPandas.
  • Implemented real-time predictions, optimising data handling and model efficiency.
XGBoost · GeoPandas · Python
Epoch · IIIT SriCity
Domain Lead — Computer Vision
Jan 2024 – May 2024 · Sri City
  • Led the Computer Vision domain for the campus AI/ML club. Mentored juniors, ran workshops, organized hackathons.
Matrix · IIIT SriCity
Co-Lead
Oct 2023 – May 2024 · Sri City
  • Co-led campus tech club. Organized events, hosted talks, fostered project-driven learning.
03 · Selected projects

shipped.

/ project 01

Dossier

Autonomous Agentic Job Search Intelligence

$0.06
per application

7-agent autonomous pipeline (Job Discovery, Watchlist, Company Intel, Market Intel, Gap Analysis, Resume Agent, Referral Finder) that discovers, scores, researches, and generates tailored applications end-to-end. LLM scoring runs in parallel across 550+ jobs per run, pre-LLM rule filters cut 65% of API calls, and Claude generates ATS-optimised LaTeX resumes via 3-pass self-evaluation.

GPT-5 · Claude Sonnet 4.6 · Claude Haiku 4.5 · Tavily · ThreadPoolExecutor · SQLite · LaTeX
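
The filter-then-score idea behind Dossier can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's code: the `Job` fields, the rule lists, and the `score_job` placeholder (which stands in for an LLM call) are all hypothetical. Only the use of `ThreadPoolExecutor` comes from the tag list above.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    title: str
    location: str


# Hypothetical rules; the real filters are not described in the source.
BLOCKED_TITLE_WORDS = ("intern", "sales")
ALLOWED_LOCATIONS = ("bengaluru", "remote")


def passes_rules(job: Job) -> bool:
    """Cheap deterministic check that runs before any LLM call."""
    title = job.title.lower()
    if any(word in title for word in BLOCKED_TITLE_WORDS):
        return False
    return job.location.lower() in ALLOWED_LOCATIONS


def score_job(job: Job) -> float:
    """Placeholder for an LLM scoring call; returns a dummy keyword score."""
    return 1.0 if "ml" in job.title.lower() else 0.5


def score_jobs(jobs: list[Job], max_workers: int = 8) -> dict[Job, float]:
    """Filter first, then score only the survivors in parallel threads."""
    survivors = [job for job in jobs if passes_rules(job)]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so zip lines scores up with jobs.
        scores = list(pool.map(score_job, survivors))
    return dict(zip(survivors, scores))
```

Threads suit this shape of workload because a real scoring call would spend most of its time waiting on network I/O, and every job rejected by `passes_rules` is an API call that never happens.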
/ project 02

FedFV-CV

Federated Deep Learning for Biometric Auth

1.21%
EER

Federated deep learning framework for finger-vein biometric authentication using MobileNetV2. Engineered custom FedWPR aggregation on 122,600 images across 5 clients, outperforming FedAvg benchmarks. B.Tech Thesis, IIIT SriCity.

PyTorch · MobileNetV2 · Federated Learning
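
For context on the FedAvg baseline mentioned above: FedAvg averages client parameters weighted by each client's local sample count. The sketch below shows only that baseline on plain Python lists. The custom FedWPR aggregation is not described in this page and is not reproduced here, and the function name `fedavg` and the flat-list representation of parameters are assumptions made for illustration.

```python
def fedavg(client_weights: list[list[float]], client_sizes: list[int]) -> list[float]:
    """Average client parameter vectors, weighted by local sample count."""
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]


# Two clients: the second holds three times as much data, so it dominates.
print(fedavg([[1.0, 2.0], [3.0, 4.0]], [1, 3]))  # [2.5, 3.5]
```

In a real PyTorch setup the same weighted average would be applied tensor by tensor over each client's model parameters.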
/ project 03

slackAgent

AI-Powered Slack Bot with RAG

40%
response time cut

Scalable FastAPI backend with LlamaIndex + ChromaDB semantic search over 20+ documents. Cut query response time by 40% and served 50+ daily queries via Slack API with end-to-end automation through n8n.

FastAPI · LlamaIndex · ChromaDB · OpenAI · n8n
/ project 04

RAG-QA on AWS

Retrieval-Augmented QA, fully CI/CD

70B
params served

Retrieval-augmented QA system using LangChain, FAISS, and AWS Bedrock (LLAMA 3.1-70B). Deployed to AWS ECR + App Runner via Docker with full CI/CD through GitHub Actions.

LangChain · FAISS · AWS Bedrock · LLAMA 3.1-70B · Docker · GitHub Actions
04 · Toolkit

reach for.

LLM & GenAI

10 items
Gemini 2.5 Pro · GPT-5 / GPT-4o · Claude Sonnet 4.6 · LangChain · LangGraph · LlamaIndex · RAG · Pydantic · Prompt Engineering · DeepEval

Computer Vision

5 items
YOLO · RF-DETR · PaddleOCR · OpenCV · MobileNetV2

MLOps & Backend

7 items
FastAPI · Docker · Kubernetes · KEDA · Redis · Celery · MLflow

Cloud & Infra

8 items
GCP · AWS Bedrock · AWS ECR · App Runner · Azure · GitHub Actions · ChromaDB · FAISS

Programming & ML

6 items
Python · PyTorch · scikit-learn · pandas · NumPy · SQL
Certifications
Google Cloud Computing Foundations: Data, ML, AI
Google Cloud Computing Foundations: Cloud Fundamentals
Google Cloud Computing Foundations: Infrastructure
Kaggle Pandas Certification
Stanford Unsupervised Machine Learning
★ Words from people I've worked with

from real teams.

Shivang treats production AI like a first-class engineering problem — not a research demo. His pipelines actually survive real traffic.
[Replace with real name]
Tech Lead · Publicis Sapient
manager
One of the few engineers who can hold both the model intuition and the infrastructure tradeoffs in his head at the same time. Rare combo.
[Replace with real name]
Senior ML Engineer · Lincode Vision Labs
peer
He delivered Bodhi Atomize end-to-end — from prompt design to KEDA autoscaling. The kind of person you want building the AI layer of your product.
[Replace with real name]
Engineering Manager · Publicis Sapient
manager
06 · Get in touch

that ships.

Open to conversations around GenAI systems, LLM infrastructure, ML engineering, and production AI challenges. Drop a line — I reply fast.

© 2026 Shivang Singh. Crafted with care.
Next.js · Tailwind · Motion · R3F · AI SDK