Vaibhav Tanwar

Hi, I'm Vaibhav 👋

Early-career full stack AI developer passionate about Backend engineering and Infra for Next-Gen Agentic LLM Systems.

GitHub Activity

About

I’m a 21 year old software developer and a recent CS + Applied Math graduate, building end-to-end robust AI driven workflows and platforms with a focus on reliability, scale, and impact. Previously worked across Computer Vision and Full Stack systems. and have extensively built interdisciplinary projects which can be viewed here. I'm an avid learner who likes algorithmic problem solving and am heavily curious in learning about multi modal inference systems at scale. Currently seeking roles in AI, backend, or data engineering where I can own critical paths, move quickly with lean teams, and ship value. Founder‑curious and startup‑minded, I’m excited by zero‑to‑one challenges, building with small, high‑trust teams, and partnering closely with founders to take AI products from prototype to reliable, scalable systems. If that resonates, let’s connect.

Check My Resume

Work Experience

Education

IIIT Delhi

2022 - 2025

B.Tech in Computer Science and Applied Mathematics

Skills

Python

TypeScript

C++

PyTorch

Hugging Face

FAISS

FastAPI

PostgresQL

Sqlite

Redis

KuzuDB

DSPy

LangChain

LangGraph

Google Agent Development Kit

MLFlow

SpaCy

Triton

OpenCV

Open3D

Github

NGINX

Docker

My Projects

Check out my latest work

I've worked on a variety of projects, from simple websites to complex web applications. Here are a few of my favorites.

Youtube Multimodal RAG Pipeline

August 2025

This project implements a sophisticated multimodal RAG system transforming YouTube videos into queryable knowledge bases through advanced frame extraction and caption analysis. Leveraging Gemini for inference and Qdrant for efficient vector storage, the system processes both visual and textual content to generate precise, timestamped responses to natural language queries.

Python

HuggingFace

Qdrant

Llamaindex

Gemini SDK

Streamlit

Source

WhatsApp Multimodal Memory Bot

August 2025

Architected a multimodal WhatsApp memory assistance pipeline (FastAPI + AsyncIO) that ingests text/voice/images via Twilio webhooks, classifies intent in real time via Groq LLM inference, embeds memories in Mem0's vector store for semantic recall, and serves analytics through idempotent SQLite transactions.

Python

Twilio

Groq

Whisper

Mem0

FastAPI

SQLAlchemy

Source

LLM powered Resume Analyzer

July 2025

Developed a full-stack AI-powered resume analyzer using React, TypeScript, and Claude Sonnet integration, featuring real-time PDF processing, multi-dimensional scoring system (ATS, content, structure, skills), and comprehensive feedback generation for job seekers, integrating Zustand state management, Tailwind CSS, React Router, and Puter.js services for authentication, file system operations, and data persistence, delivering a responsive user interface with drag-and-drop functionality and visual score components.

React

TypeScript

Puter.js

Tailwind

React Router

Zustand

Live Deployment

Source

Multi Agent Tutoring System (Work In Progress)

May 2025

Developed a sophisticated tutoring chatbot leveraging Google's Agent Development Kit (ADK) principles with intelligent orchestration between specialized Math and Physics agents powered by Gemini API, integrating context-aware conversation management, autonomous query classification pipeline routing student queries to domain-specific agents and provide personalized responses through prompt engineering and tool integration.

Python

FastAPI

Gemini

Agent Development Kit

Javascript

Live Deployment

Source

AI Powered App Developer

August 2025

Coding assistant built with LangGraph, simulating a multi-agent developer workflow to generate complete projects from natural language prompts. It utilizes Planner, Architect, and Coder agents to sequentially design, structure, and implement applications, leveraging tools for file I/O and code execution. The system is deployed with a FastAPI backend and a NiceGUI frontend for user interaction and project management

Python

FastAPI

Groq

LangGraph

LangChain

NiceGUI

Source

Distributed KV Store with Modified Raft Consensus

July 2024

Implemented a database storing string key-value pairs using Raft Consensus Algorithm, ensuring consistent data replication and fault recovery across the distributed network of nodes and utilized the leader lease mechanism, similar to those used by geo distributed databases such as Cockroach DB and YugaByte DB.

Python

ZeroMQ

Source

Vision-Language Assistant for Navigation Aid in Urban Metro Systems

March 2025

Developed MetroSense, a novel web-based platform to empower visually impaired individuals navigate the Delhi Metro system, achieving 65.1% mAP@50 for identifying environmental elements from real-time image captures. Integrated LLAMA Vision 3.2 90B for sophisticated VQA, engineered with context-rich, few-shot prompting and optimized decoding parameters to achieve a BERT F1 score of 0.85, delivering semantically accurate, context-aware voice-synthesized responses to user queries for improved safety and autonomy.

Python

PyTorch

Transformers

HuggingFace

LLMs

OnRender

Demo

Multi Model Analysis for Stock Market Trend Prediction

July 2024

Developed and benchmarked novel models (GAN, Neural ODE VAE, Neural ODE Classifier) for stock market analysis, achieving a 15% F1 improvement and 85% faster training via Neural ODEs.Implemented a CNN-LSTM architecture delivering high-accuracy regression (R² 0.99, MAE 143.58 on S&P 500) across five major indices on the CNNPred dataset.

Pytorch

Transformers

Scikit-Learn

Pandas

Source

Cloud Native Online Commodity Trading Platform

June 2024

Created a distributed online marketplace system, architected to facilitate direct transactions between buyers and sellers through a central platform hosted on Google Cloud VM instances, leveraging gRPC for communication and Protocol Buffers for efficient data serialization.

Python

gRPC

Protobuf

Source

K Means using Map Reduce Framework

April 2024

Implemented a distributed Map-Reduce framework comprising of Master, Mapper and Reducer components to perform K Means Clustering on a given dataset ensuring fault tolerance for both components and utlized gRPC for communication among the three processes for each iteration.

Python

gRPC

Source

Cycle Accurate Simulator for a 5 stage RISC CPU

Implemented a simulator for a processor based on RV32I variant of RISC-V ISA where the microarchitecture included a 5 stage pipeline allowing forwarding/bypassing and separate execution unit for Network on Chip operations,along with a 2-way set associative cache following Least Recently Used replacement policy.

C++

Source

Get in Touch

Shoot me an email or a dm on twitter and I'll be happy to connect with you and discuss about potential opportunities.

GitHub Activity

About

Work Experience

MIDAS Research Group

Infosys Centre for Artificial Intelligence

Scale AI

Networked Systems and Security Research Lab

Education

IIIT Delhi

Skills

Check out my latest work

Youtube Multimodal RAG Pipeline

WhatsApp Multimodal Memory Bot

LLM powered Resume Analyzer

Multi Agent Tutoring System (Work In Progress)

AI Powered App Developer

Distributed KV Store with Modified Raft Consensus

Vision-Language Assistant for Navigation Aid in Urban Metro Systems

Multi Model Analysis for Stock Market Trend Prediction

Cloud Native Online Commodity Trading Platform

K Means using Map Reduce Framework

Cycle Accurate Simulator for a 5 stage RISC CPU

Get in Touch