Available for Opportunities

Shivansh Srivastav

AI Product & ML Systems Engineer

Building scalable, production-ready AI systems and intelligent applications.

About Me

AI Product & ML Systems Engineer building scalable, production-grade LLM platforms.

I specialize in converting ambiguous business problems into high-ROI AI systems—optimizing for cost efficiency, latency, reliability, and long-term architectural durability. My work sits at the intersection of LLM infrastructure, applied machine learning, and product strategy.

Recently, I led the transition from API-dependent LLM usage to a self-hosted inference architecture—preventing projected six-figure annual costs while improving latency by 70% and achieving 8× throughput gains. I designed multi-layer caching systems (95% hit rate), fault-isolated inference pipelines, and secure deployment boundaries.

Beyond infrastructure, I’ve driven measurable product outcomes:

  • • 40% reduction in workflow time
  • • 35% ARR growth via AI-enabled features
  • • 42% increase in application conversion through ranking optimization
  • • 30%+ operational efficiency improvements in manufacturing use cases

I’m particularly interested in:

  • • Scalable LLM orchestration
  • • Retrieval-augmented systems (RAG)
  • • Cost-aware AI infrastructure
  • • Reliability engineering for AI products
  • • Designing systems that balance experimentation velocity with production stability

Philosophy

I value strong architectural decisions, clean trade-offs, and eliminating complexity that doesn’t translate into business impact.

Skills & Technologies

Programming

Python SQL JavaScript C / C++ Java

ML/AI Frameworks

PyTorch TensorFlow Scikit-learn XGBoost LightGBM Keras Hugging Face Transformers

NLP & Conversational AI

LLMs (GPT, BERT, Llama) ASR TTS/STT Intent Recognition Semantic Search RAG

MLOps & Cloud

Docker Kubernetes AWS (EC2, S3, Lambda) Azure MLflow CI/CD Jenkins

Databases

PostgreSQL MySQL MongoDB Redis Neo4j Milvus (Vector DB) SQL Query Optimization

Tools

Git Jupyter Jira Postman REST APIs Excel

Featured Projects

vision network

Adversarial and Unsupervised Learning for Vision Networks

Fortified network resilience, achieving a 30% reduction in adversarial vulnerability and error rates across complex visual datasets. Identified that Token Masking outperforms traditional Image Masking in ViT-based encoders, while adding FD loss or HRDA leads to a decrease in mIoU, suggesting these components may be redundant in the final VFM-UDA method.

PyTorchPythonComputer VisionGANs
video chatbot

Automated HRM with Video Interview Platform

Reduced recruitment time by 40% using advanced analytics, with Milvus DB for fast similarity searches and AWS cloud for scalable infrastructure. Increased assessment accuracy by 30% by integrating machine learning algorithms and leveraging Llama for prompt engineering, and employing computer vision techniques for Resume parsing.

FastapiPythonMLVideo Processing
chatbot

GPT-based Chatbot for Q&A

Built an intelligent question-answering chatbot leveraging GPT models. Implemented context-aware responses, conversation memory, and domain-specific fine-tuning for enhanced accuracy.

OpenAI GPTPythonNLPFastAPI
Cloud CPU Utilization Prediction with LSTM

Cloud CPU Utilization Prediction with LSTM

Designed and deployed an LSTM-based time series forecasting model to predict cloud CPU utilization. Enabled proactive resource scaling and cost optimization for cloud infrastructure.

LSTMTensorFlowTime SeriesPython
Diagnosing cancerous and non-cancerous breast tumours

Cancer Diagnosis ML Model

The objective of the project is to Predict whether the tumor is cancerous (Malignant) type or non-cancerous (Benign).
Plotted Model performance graph of different ML algorithms and, Concluded prediction accuracy of 99% with Logistic Regression classifier.

Scikit-learnPythonMedical ImagingML
bypass login credentials

Bypass Credential & Captcha using Selenium

Created A web automation project that bypasses the Government of India Goods and Services Tax website using login credential information and auto-fills captcha text by recognizing the speech of the captcha and decoding it into text. *learning purpose only

SeleniumPythonAutomationTesting
Inventory management

Django Dynamic Inventory Software

Built a full-featured inventory management system with real-time tracking, automated reordering, and comprehensive reporting. Designed scalable architecture for multi-location support.

DjangoPostgreSQLPythonREST APIs
mobile application

Comic Book Reading Application

Developed an interactive comic book reading platform with features like bookmarking, offline reading, and personalized recommendations using collaborative filtering.

AndroidiOSMobile developmentFirebase

Articles & Writing

Understanding Transformer Architecture in Modern NLP

A deep dive into the transformer architecture and its impact on natural language processing tasks.

March 2024

Production ML Systems: Best Practices

Key considerations and best practices for deploying machine learning models in production environments.

February 2024

LSTM vs Transformers for Time Series Forecasting

Comparing traditional LSTM approaches with modern transformer-based methods for time series prediction.

January 2024

What my Topmate's Mentees have to say?

topmate

Testimonials

Shweta

"You did a wonderful guidance session ...thank you Shivansh for sharing ur valuable thoughts with me and helping me to take a better decision for my future self."

- Shweta

Harshita

"Shivansh has a remarkable capacity to listen intently and deliver insightful feedback that is both constructive and uplifting. I am so appreciative of his mentorship and guidance."

- Harshita Sinha

Sanchit

"I was searching for words that could describe my happiness about your service and how confident I became when I knew exactly what I should do in the next 5-10 years of my life..."

- Sanchit

Get In Touch

I'm always open to discussing new projects and opportunities


Contact Number

+91 79 8576 3648

© 2024 Shivansh Srivastav