About

I'm an AI/ML engineer fascinated by systems that learn. Whether I'm building seq2seq translators or retrieval-augmented agents, I love the moment a model stops guessing and starts understanding.

I'm in my final year at IIT Jodhpur studying Artificial Intelligence & Data Science. Over the last year I've shipped a production RAG system at a stealth startup, built multi-object trackers on MOT17, trained seq2seq models with attention, and worked on code-switched speech understanding and voice cloning.

Education

Indian Institute of Technology, Jodhpur · Aug 2022 – May 2026

B.Tech · Artificial Intelligence & Data Science · CGPA 7.65

Narayana e-Techno School · 2022

Senior Secondary · CBSE Board · 90.0%

Focus areas

  • Retrieval-Augmented Generation
  • LLM Agents & Routing
  • Computer Vision
  • Deep Learning
  • MLOps

Experience

AI Intern

Stealth Startup · Remote

May 2025 – Jul 2025
  • Built a Retrieval-Augmented Generation system with Azure OpenAI and Ollama, integrating HyDE, step-back prompting, and query analysis, which improved RAGAs evaluation scores by 10–15%.
  • Developed Hybrid Search and persistent Chat Memory using Qdrant, Weaviate, and Mem0 with metadata-enriched retrieval pipelines.
  • Designed a Router Agent for multi-task LLM routing via LiteLLM and containerized the deployment with Docker for production readiness.
  • Azure OpenAI
  • Ollama
  • Qdrant
  • Weaviate
  • Mem0
  • LiteLLM
  • Docker
  • RAGAs
  • HyDE

Open to full-time AI/ML & research roles from May 2026.

Projects

A mix of coursework, research projects, and self-initiated builds. Click through to GitHub for code & write-ups.

2026 · Speech / ASR

Code-Switched ASR + Zero-Shot Voice Cloning

Code-switched lecture transcription and cross-lingual voice cloning pipeline. Fine-tunes speech foundation models for Indian-English classroom audio and generates zero-shot speaker-preserving speech in multiple languages.

  • PyTorch
  • Whisper
  • XTTS
  • Speech Processing
2025 · NLP

Neural Machine Translation (Seq2Seq + Attention)

Encoder-decoder LSTM with Bahdanau attention for German→English and Hindi→English. Trained on 29K DE–EN and 10K HI–EN pairs. Applied scheduled teacher forcing, gradient clipping, and checkpointing over 100 epochs.

BLEU 36.05 on DE→EN · validated on 500+ test sentences

  • PyTorch
  • TorchText
  • SpaCy
  • LSTM
  • Attention
2025 · Computer Vision

Multi-Object Tracking on MOT17

Three tracking-by-detection pipelines benchmarked on MOT17. Built Faster R-CNN + SORT with anchor tuning, focal loss, and adaptive Kalman filtering; integrated YOLOv8 + DeepSORT for robust ID preservation.

MOTA 0.529 · IDF1 0.578

  • PyTorch
  • OpenCV
  • YOLOv5/v8
  • Faster R-CNN
  • DeepSORT
  • Hungarian Algorithm
2025 · LLM Agents

Marketing Research AI Agent

Autonomous agent that runs end-to-end market research — topic discovery, web search, synthesis, and report generation — orchestrated with LLM tool-use and structured output.

  • Python
  • LangChain
  • LLM Tool-Use
  • Agents
2025 · Medical CV

Brain Tumour Detection

CNN-based classifier for brain MRI scans. Compared transfer-learning backbones and data-augmentation strategies on a multi-class tumour dataset.

  • TensorFlow/Keras
  • CNN
  • Transfer Learning
  • Medical Imaging
2025 · Computer Vision

Real-Time Traffic Monitoring

Real-time traffic monitoring system using object detection and tracking on live video. Flags congestion, counts vehicles, and surfaces lane-level analytics.

  • Python
  • OpenCV
  • YOLO
  • Real-Time
2024 · Medical CV

COVID-19 Chest X-ray Analysis

Preprocessed 9,535 chest X-rays with PCA and lung segmentation. Benchmarked classical ML (Random Forest, XGBoost, SVM) against deep CNNs (VGG-19, ResNet-50, EfficientNet-B3).

Classical ML 93% · Deep CNN 97.5% accuracy

  • TensorFlow
  • Keras
  • Scikit-learn
  • OpenCV
  • ResNet-50
2025 · Computer Vision

Low-Light Image Enhancement

Deep-learning pipeline for enhancing dark, noisy photographs. Explored retinex-based and CNN-based approaches and compared PSNR / SSIM across methods.

  • PyTorch
  • CNN
  • Image Processing
  • PSNR/SSIM
2025 · Time Series

Stock Price Prediction — LSTM

Sequence model for short-term stock price forecasting. Engineered technical indicators as features and evaluated LSTM vs. baseline regressors on multiple tickers.

  • PyTorch
  • LSTM
  • Pandas
  • Time-Series

Skills

Programming

  • C / C++
  • Python
  • SQL

AI / ML Frameworks

  • PyTorch
  • TensorFlow
  • LangChain
  • LangGraph
  • HuggingFace Transformers
  • Keras-OCR
  • PEFT (LoRA / QLoRA / Adapters)
  • Weights & Biases

Developer Tools

  • Git
  • Jupyter
  • Google Colab
  • Linux
  • Windows
  • Docker
  • Apache Spark
  • VS Code

Libraries

  • Pandas
  • NumPy
  • Scikit-learn
  • Matplotlib
  • Seaborn
  • OpenCV

Web

  • HTML
  • CSS
  • JavaScript

Current Focus

  • RAG & Agents
  • LLM Fine-tuning
  • Computer Vision
  • Speech / Multimodal
  • MLOps

Achievements

  • 2025

    Oracle Cloud Infrastructure — Generative AI Professional

    Certified in generative AI on OCI, covering LLM deployment and tuning patterns.

  • 2025

    Oracle Cloud Infrastructure — Data Science Professional

    Certified in end-to-end data science workflows on OCI.

  • 2024

    Amazon ML Summer School

    Admitted to the July 2024 cohort of Amazon's selective ML program.

  • 2022

    JEE Mains — Top 0.7%

    Ranked in the top 0.7% of 1M+ students.

  • 2022

    JEE Advanced — AIR 6616

    All India Rank 6616 — top 4% of qualifying candidates.