About
I'm an AI/ML engineer fascinated by systems that learn.
From seq2seq translators to retrieval-augmented agents, I love the moment a model
stops guessing and starts understanding.
I'm in my final year at IIT Jodhpur studying Artificial Intelligence
& Data Science. Over the last year I've shipped a production RAG system at a
stealth startup, built multi-object trackers on MOT17, trained seq2seq models with
attention, and worked on code-switched speech understanding and voice cloning.
Education
Indian Institute of Technology, Jodhpur
Aug 2022 – May 2026
B.Tech · Artificial Intelligence & Data Science · CGPA 7.65
Narayana e-Techno School
2022
Senior Secondary · CBSE Board · 90.0%
Focus areas
- Retrieval-Augmented Generation
- LLM Agents & Routing
- Computer Vision
- Deep Learning
- MLOps
Experience
AI Intern
Stealth Startup · Remote
May 2025 – Jul 2025
- Built a Retrieval-Augmented Generation system with Azure OpenAI and Ollama, integrating HyDE, step-back prompting, and query analysis for a 10–15% gain on RAGAs evaluation metrics.
- Developed Hybrid Search and persistent Chat Memory using Qdrant, Weaviate, and Mem0 with metadata-enriched retrieval pipelines.
- Designed a Router Agent for multi-task LLM routing via LiteLLM and containerized the deployment with Docker for production readiness.
- Azure OpenAI
- Ollama
- Qdrant
- Weaviate
- Mem0
- LiteLLM
- Docker
- RAGAs
- HyDE
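As a rough illustration of the hybrid search idea, one common way to merge a vector-search ranking with a keyword-search ranking is reciprocal rank fusion (RRF). This is a minimal sketch only: the fusion method, the function name `reciprocal_rank_fusion`, the constant `k=60`, and the document IDs are all assumptions for illustration, not details of the production system.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores 1 / (k + rank) per list it appears in;
    k=60 is a commonly used default, assumed here.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical results from a dense retriever and a keyword retriever.
dense_hits = ["doc_a", "doc_b", "doc_c"]
keyword_hits = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([dense_hits, keyword_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, which is the main appeal of combining the two retrieval modes.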
Open to full-time AI/ML & research roles from May 2026.
Projects
A mix of coursework, research projects, and self-initiated builds. Click through to GitHub for code & write-ups.
Code-Switched ASR + Zero-Shot Voice Cloning
Code-switched lecture transcription and cross-lingual voice cloning pipeline. Fine-tunes speech foundation models for Indian-English classroom audio and generates zero-shot speaker-preserving speech in multiple languages.
- PyTorch
- Whisper
- XTTS
- Speech Processing
Neural Machine Translation (Seq2Seq + Attention)
Encoder-decoder LSTM with Bahdanau attention for German→English and Hindi→English. Trained on 29K DE–EN and 10K HI–EN pairs. Applied scheduled teacher forcing, gradient clipping, and checkpointing over 100 epochs.
BLEU 36.05 on German→English · validated on 500+ test sentences
- PyTorch
- TorchText
- SpaCy
- LSTM
- Attention
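Scheduled teacher forcing can be sketched as a ratio that decays over training, so the decoder sees ground-truth tokens early on and its own predictions later. The linear schedule, the start and end values, and the helper names below are assumptions for illustration; the project's actual schedule may differ.

```python
import random


def teacher_forcing_ratio(epoch, total_epochs=100, start=1.0, end=0.0):
    """Linearly decay the teacher-forcing probability from start to end."""
    if total_epochs <= 1:
        return end
    # Clamp the epoch into [0, total_epochs - 1] before interpolating.
    clamped = min(max(epoch, 0), total_epochs - 1)
    progress = clamped / (total_epochs - 1)
    return start + (end - start) * progress


def use_teacher_forcing(epoch, rng=random):
    """Decide, per decoding step, whether to feed the ground-truth token."""
    return rng.random() < teacher_forcing_ratio(epoch)


for epoch in (0, 33, 99):
    print(epoch, round(teacher_forcing_ratio(epoch), 3))
```

In a training loop, `use_teacher_forcing(epoch)` would be checked at each decoder step to choose between the target token and the model's previous prediction.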
Multi-Object Tracking on MOT17
Three tracking-by-detection pipelines benchmarked on MOT17. Built Faster R-CNN + SORT with anchor tuning, focal loss, and adaptive Kalman filtering; integrated YOLOv8 + DeepSORT for robust ID preservation.
MOTA 0.529 · IDF1 0.578
- PyTorch
- OpenCV
- YOLOv5/v8
- Faster R-CNN
- DeepSORT
- Hungarian
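For readers unfamiliar with the two tracking metrics, the standard definitions are MOTA = 1 - (FN + FP + IDSW) / GT and IDF1 = 2·IDTP / (2·IDTP + IDFP + IDFN). The sketch below computes both from raw counts; the function names and the example counts are hypothetical and are not the project's actual numbers.

```python
def mota(false_negatives, false_positives, id_switches, num_gt):
    """Multi-Object Tracking Accuracy from aggregate error counts."""
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / num_gt


def idf1(idtp, idfp, idfn):
    """ID F1 score from identity-level true/false positives and negatives."""
    denom = 2 * idtp + idfp + idfn
    return 2 * idtp / denom if denom else 0.0


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
print(mota(false_negatives=30, false_positives=15, id_switches=5, num_gt=100))
print(idf1(idtp=60, idfp=20, idfn=20))
```

MOTA penalizes detection errors and identity switches together, while IDF1 focuses on how consistently each identity is preserved over time.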
Marketing Research AI Agent
Autonomous agent that runs end-to-end market research — topic discovery, web search, synthesis, and report generation — orchestrated with LLM tool-use and structured output.
- Python
- LangChain
- LLM Tool-Use
- Agents
Brain Tumour Detection
CNN-based classifier for brain MRI scans. Compared transfer-learning backbones and data-augmentation strategies on a multi-class tumour dataset.
- TensorFlow/Keras
- CNN
- Transfer Learning
- Medical Imaging
Real-Time Traffic Monitoring
Real-time traffic monitoring system using object detection and tracking on live video. Flags congestion, counts vehicles, and surfaces lane-level analytics.
- Python
- OpenCV
- YOLO
- Real-Time
COVID-19 Chest X-ray Analysis
Preprocessed 9,535 chest X-rays with PCA and lung segmentation. Benchmarked classical ML (Random Forest, XGBoost, SVM) against deep CNNs (VGG-19, ResNet-50, EfficientNet-B3).
ML 93% · DL 97.5% accuracy
- TensorFlow
- Keras
- Scikit-learn
- OpenCV
- ResNet-50
Low-Light Image Enhancement
Deep-learning pipeline for enhancing dark, noisy photographs. Explored retinex-based and CNN-based approaches and compared PSNR / SSIM across methods.
- PyTorch
- CNN
- Image Processing
- PSNR/SSIM
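PSNR, one of the two metrics used to compare methods, is 10·log10(MAX² / MSE) between a reference image and an enhanced one. The following is a minimal sketch over flat lists of 8-bit pixel values; the function name `psnr` and the toy inputs are assumptions for illustration, and a real pipeline would typically operate on image arrays.

```python
import math


def psnr(reference, enhanced, max_value=255.0):
    """Peak signal-to-noise ratio (in dB) between two equal-length pixel lists."""
    if len(reference) != len(enhanced) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, enhanced)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy example: every pixel is off by one intensity level.
print(round(psnr([10, 20, 30], [11, 21, 31]), 2))
```

Higher values mean the enhanced image is closer to the reference; SSIM complements this by comparing local structure and is omitted here for brevity.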
Stock Price Prediction — LSTM
Sequence model for short-term stock price forecasting. Engineered technical indicators as features and evaluated LSTM vs. baseline regressors on multiple tickers.
- PyTorch
- LSTM
- Pandas
- Time-Series
Skills
Programming
AI / ML Frameworks
- PyTorch
- TensorFlow
- LangChain
- LangGraph
- HuggingFace Transformers
- Keras-OCR
- PEFT (LoRA / QLoRA / Adapters)
- Weights & Biases
Developer Tools
- Git
- Jupyter
- Google Colab
- Linux
- Windows
- Docker
- Apache Spark
- VS Code
Libraries
- Pandas
- NumPy
- Scikit-learn
- Matplotlib
- Seaborn
- OpenCV
Web
Current Focus
Achievements
- 2025 · Oracle Cloud Infrastructure: Generative AI Professional
  Certified in generative AI on OCI, covering LLM deployment and tuning patterns.
- 2025 · Oracle Cloud Infrastructure: Data Science Professional
  Certified in end-to-end data science workflows on OCI.
- 2024 · Amazon ML Summer School
  Selected for the July 2024 cohort of Amazon's selective ML program.
- 2022 · JEE Main: Top 0.7%
  Ranked in the top 0.7% out of 1M+ students.
- 2022 · JEE Advanced: AIR 6616
  All India Rank 6616, within the top 4% of qualifying candidates.
Contact
Open to full-time AI/ML roles, research collabs, and anything ML at the edge of interesting. The fastest way to reach me is email.