Welcome

Nice to meet you

Drop your name and email if you'd like to stay in touch. Completely optional.

Portfolio Views

Last 7 days 0
Last 90 days 0
Last 365 days 0
VK.
Home Impact Education Experience Projects Writing Achievements Reviews Contact
Open to full-time AI/ML roles · Starting May 2026

I build AI that
ships to production.

Hi, I'm Vikas Kumar, an AI/ML Engineer specializing in RAG pipelines, LLM systems, agentic AI, and knowledge graphs. $3.2M production impact across Snap, Aditya Birla, and Lawroom AI.

Book a Call Resume
Python PyTorch LangChain LlamaIndex Neo4j FastAPI Azure OpenAI HuggingFace Docker PostgreSQL

By the Numbers

Real Impact. Real Production.

These are not portfolio demos. These are outcomes from production AI systems I built under real accountability.

$3.2M
Logistics cost savings
in production
97%
Ontology accuracy
Knowledge Graph
96%
Document extraction
accuracy
23%
Delivery efficiency
improvement
3K+
Active users
Lawroom AI
10
Internships &
engagements

I treat prompt engineering as a systems design problem.

Context window architecture, output grounding, and hallucination mitigation are engineering constraints I treat with the same rigor as any systems design problem. I have solved all of these under real production pressure with real accountability.

I build at the hard end of the stack: RAG pipelines under retrieval pressure, Neo4j knowledge graph architectures over unstructured enterprise data, transformer-based document intelligence, and agentic systems that actually ship.

🧠 RAG & LLM Systems

Production-grade retrieval pipelines with vector search, context window management, and low-hallucination outputs deployed at scale using Azure OpenAI and LlamaIndex.

🕸️ Knowledge Graphs

Neo4j architectures modeling 500+ entities across semantic dimensions, integrated with LangChain to unify structured and unstructured enterprise data at 97% ontology accuracy.

Agentic AI

Multi-step autonomous systems with tool-use, persistent memory, and real-world API integration, designed to serve real users and not just run in notebooks.

📊 ML & Analytics

End-to-end pipelines from data engineering and feature design through model training, evaluation, and stakeholder-facing dashboards that drive actual decisions.

Academic Background

Education

Two fully funded degrees, awarded on merit across two continents.

Wake Forest University

MS in Business Analytics

2025 – 2026
100% Merit Scholarship · $85,000+
Current Work: Building RAG-powered career analytics system over Handshake data serving 500+ students and 40+ advisors using Azure OpenAI.
Courses: ML for Business, Python, Data Engineering, Optimization, Visualization, Career Management
Tools: Python, SQL, Power BI, Tableau, Scikit-learn, Azure OpenAI

Plaksha University

B.Tech in Artificial Intelligence

2021 – 2025
Full-ride Bharati Scholarship · $42,000+
Focus: Deep Learning, Econometrics, Time Series, NLP, Knowledge Representation, Design Thinking
Projects: LSTM predictive models, NLP chatbots, ESG-financial correlation, Knowledge Graphs
Tools: Python, PyTorch, TensorFlow, Selenium, Firebase, Git, Neo4j

Career Timeline

Work Experience

From production AI systems to research labs, a journey built on real outcomes.

Wake Forest University

AI Engineer

Mar 2026 – Present

North Carolina, USA

  • Building a RAG-powered generative AI system that unifies Handshake, career portals, and institutional databases, enabling natural language querying over student employment and placement data using Azure OpenAI and vector embeddings, serving 500+ students and 40+ advisors.
  • Designing ETL workflows to ingest, normalize, and index Handshake API data into a unified knowledge layer powering real-time LLM-driven analytics dashboards for faculty.
  • Architecting prompt evaluation frameworks to reduce hallucination risk and ensure reliable structured data retrieval across diverse query types.
RAG Azure OpenAI LangChain PostgreSQL

Darden

Graduate Consultant

Oct 2025 – Present
  • Designed and executed A/B tests across 3 marketing campaign cohorts using chi-square hypothesis testing, identifying high-value customer segments and improving conversion decisions.
  • Built K-means customer segmentation models in Python (scipy, statsmodels) identifying 4 distinct clusters, reducing targeting spend waste and improving messaging relevance.
  • Automated ETL workflows and built real-time A/B monitoring dashboards, cutting analyst reporting time by ~40%.
Python K-Means scipy A/B Testing

Snap Inc.

AR & ML Extern

Jun 2025 – Sep 2025
  • Engineered 5+ production AR Lenses in Lens Studio using computer vision and 3D rendering optimization, deployed across distributed cloud infrastructure serving millions of concurrent users at sub-50ms latency.
  • Optimized rendering pipelines across 3 development cycles through systems-level performance profiling, improving frame rates and cutting memory footprint by 15%.
  • Built A/B testing infrastructure with event tracking and distributed analytics pipelines, identifying UX bottlenecks and driving a 20% lift in key engagement metrics.
Lens Studio Computer Vision 3D Rendering A/B Testing

Aditya Birla Group

AI/Software Engineer Intern

Jan 2025 – Jun 2025
  • Built end-to-end distributed logistics optimization system using Python, scikit-learn, and TensorFlow, delivering $3.2M in annual cost savings with 23% delivery efficiency improvement across six enterprise clusters.
  • Engineered a domain-specific Knowledge Graph in Neo4j modeling 500+ entities across 12 semantic dimensions, integrating Azure OpenAI with advanced prompt engineering to achieve 97% ontology accuracy on a production-grade LLM knowledge base.
  • Developed automated document intelligence pipeline using fine-tuned transformer models and Hugging Face, achieving 96% extraction accuracy and reducing manual processing overhead by 80%.
Neo4j LangChain HuggingFace Azure OpenAI TensorFlow

Lawroom AI

Founding AI Engineer

Sep 2024 – Jan 2025
  • Architected and deployed a multilingual generative AI legal chatbot using InLegalBERT and ElevenLabs TTS, delivering accurate NLP responses across 10 languages to 2,000+ users.
  • Designed systematic prompt engineering and LLM evaluation pipelines, improving translation reliability by 35% for low-resource Indian regional languages.
  • Built scalable modular backend using FastAPI and Firebase with secure REST APIs, role-based authentication, and behavioral analytics supporting 3,000+ active users at production-grade uptime.
InLegalBERT FastAPI Firebase ElevenLabs TTS

Scale AI

AI Trainer, RLHF & Red Teaming

Aug 2024 – Dec 2024
  • Designed 500+ adversarial and edge case prompts to systematically expose failure modes in large language models including hallucination, instruction drift, and reasoning collapse across complex multi-turn conversations.
  • Conducted red teaming evaluations across 3,000+ model responses, stress-testing alignment boundaries and surfacing vulnerabilities in LLM reasoning consistency.
  • Maintained 90%+ agreement rate against ground truth labels, contributing high-quality feedback to RLHF pipelines used in production model training.
RLHF Red Teaming Hallucination Mitigation LLM Evaluation

Indian School of Business (ISB)

Research Intern

Jun 2024 – Sep 2024
  • Designed and deployed a production data engineering pipeline integrating PostgreSQL and MongoDB via REST APIs, improving data processing efficiency by 30% for research stakeholders.
  • Automated metadata governance workflows using the OpenMetadata API, improving data discoverability and lineage tracking across the organization's data ecosystem.
  • Built an interactive Streamlit data application delivering a self-serve interface for stakeholders to manage and explore organizational data assets independently.
PostgreSQL MongoDB Streamlit OpenMetadata

Omdena

Machine Learning Engineer

Feb 2024 – Apr 2024
  • Architected and deployed a generative AI mental health support chatbot using GPT-3 and transformer-based NLP, boosting user engagement by 40%.
  • Fine-tuned NLP algorithms combining BERT embeddings and GPT-3 language generation, improving personalized advice accuracy by 30% for end users.
  • Implemented a continuous feedback loop updating chatbot response quality from live user interaction data, improving conversational accuracy by 25%.
GPT-3 BERT NLP Python

Airtel

Machine Learning Intern

May 2023 – Aug 2023
  • Led optimization of a large-scale Python web scraping framework using multithreading and async I/O, reducing data extraction time by 30% across high-throughput production scraping jobs.
  • Enhanced geospatial preprocessing pipelines using DBSCAN clustering and Folium, improving spatial accuracy by 20% for real estate property evaluation at scale.
  • Designed a cross-validation algorithm reducing data discrepancies by 25% across multiple large-scale property data sources.
Python DBSCAN Folium Multithreading

Millennium Campus Network (MCN)

Campus Director, UN Millennium Fellowship

Aug 2022 – Dec 2022
  • Oversaw execution of a five-month UN Millennium Fellowship program, managing 30+ events and activities aligned with fellowship goals.
  • Conducted bi-weekly progress reviews for 10 fellows, providing targeted feedback resulting in a 95% goal achievement rate.
  • Served as primary liaison for 100+ stakeholders including fellows, mentors, and program administrators, fostering a high-collaboration environment across the full fellowship cohort.

Selected Work

Featured Projects

Production-grade AI systems and research-backed data solutions.

NLP · Chatbot

AI Chatbot for United Airlines

Built an NLP-based conversational agent that reduced response time by 75% and improved CSAT scores using intent classification and contextual response generation.

Python NLP TensorFlow Flask
LSTM · Forecasting

Predictive Maintenance System

LSTM time series models to forecast equipment failures, cutting downtime by 60% by enabling proactive maintenance scheduling for industrial clients.

Python LSTM Time Series Scikit-learn
Finance · Analytics

ESG vs Financial Performance

Statistical analysis of ESG scores vs stock performance correlation across the S&P 500, using Selenium-scraped data and regression models to surface investment insights.

Python Selenium Matplotlib Statistics
ML · Security

Real-time Fraud Detection

Ensemble ML model (XGBoost + Random Forest) for real-time credit card fraud detection achieving 99.2% accuracy, streamed via Apache Kafka and containerized with Docker.

XGBoost Random Forest Kafka Docker
Optimization · Supply Chain

Logistics Optimization Engine

Distributed route clustering system using ML and operations research that delivered $3.2M in annual cost savings and 23% faster delivery across six enterprise clusters at Aditya Birla.

Python scikit-learn TensorFlow PostgreSQL
LegalTech · Multilingual AI

Multilingual Legal AI Chatbot

Generative AI legal assistant using InLegalBERT + ElevenLabs TTS supporting 10 Indian languages, serving 3,000+ active users with production-grade availability at Lawroom AI.

InLegalBERT FastAPI Firebase ElevenLabs

Latest Writing

Insights

Writing on AI systems, technology, and the future of work. Published on Medium with 474+ reads on a single piece.

Recognition

Achievements

Scholarships, medals, and recognition across academics, research, and leadership.

🥈 National Competition

2nd Place, East Region at DataCamp's Data for Good Analytics Competition among 120+ graduate teams. Built an AI system to detect AI-generated misinformation in educational content, tackling factual reliability as AI becomes a primary learning tool.

🎓 Graduate Scholarship

Awarded 100% merit-based scholarship (Havells Foundation) to pursue MSBA at Wake Forest University, valued at $85,000+.

🏛️ Undergraduate Scholarship

Awarded the Bharati Scholarship, a full-ride covering four years of undergraduate education at Plaksha University ($42,000+).

🌐 International Recognition

Recognized by the National Academy of Engineering (USA) for research on entrepreneurial mindset in engineering education.

🥇 Case Study Champion

Gold Medalist at Caseify, a consulting-based case study competition on guestimation, defeating 12 competing teams.

⚙️ Engineering Innovation

Gold Medalist at PSIP and SP Dutta Innovation Award for designing a wind turbine with 8% higher output, selected from 40 competing teams.

🌍 Leadership

Selected as Campus Director for the UN Millennium Fellowship, managing 30+ events and leading 10 fellows to a 95% goal completion rate.

🏏 Sports

Intra-University Cricket Champion at Plaksha University, leading the team to the inter-department cricket tournament title.

Spirit of Plaksha

Awarded the Spirit of Plaksha Award for outstanding contribution to peer learning, mentorship, and making technical concepts accessible to fellow students across the university community.

What People Say

Recommendations

From managers and colleagues who worked with me directly.

"

He took ownership of our website from design to Firebase integration, always with a user-centric focus. He also contributed to benchmarking AI models and integrating voice-to-text features. Vikas is constantly growing his skills and would be a valuable asset to any team.

Bhawna Rupani

Lead Gen AI Engineer · Lawroom AI

"

Vikas has consistently demonstrated a strong commitment to learning and delivering high-quality work on our Knowledge Graph implementation. He brings enthusiasm and curiosity to every task, communicates clearly, and collaborates effectively. His positive attitude and eagerness to learn make him a pleasure to work with.

Kunkum Poovamma

Senior GenAI Engineer · Aditya Birla Group

"

Vikas is a dedicated and hardworking individual with a strong passion for data science, machine learning, and deep learning. He consistently seeks opportunities to upgrade his skills and stays at the forefront of new technologies. His enthusiasm for ML is evident as he combines technical expertise with a genuine love for learning.

Tanisha Saraf

Robotics Engineer · Plaksha University

Open Source

GitHub Activity

--
Contributions (6 mo)
--
Contributions (18 mo)
--
Public Repos
--
Languages

Contribution Activity

GitHub contribution chart

github.com/thisisvk45 →

Get in Touch

Let's Build Something

Open to full-time AI/ML engineering roles starting May 2026. Also happy to connect about research, writing, or interesting problems.

Send a Message

Email

kumav25@wfu.edu

Location

San Francisco Bay Area

Open to relocation

Seeking

Full-time AI/ML Engineering Roles · Starting May 2026

Currently building at WFU

RAG-powered career analytics system serving 500+ students using Azure OpenAI + LangChain.

vikas@portfolio ~ ai-assistant
vikas@ai:~$ Hello! I'm Vikas's AI assistant. Ask me anything about his experience, projects, or skills.
$
404
vikas@portfolio:~$ _
bash: page not found
This path does not exist in the repository.
exit code 404 • no such file or directory
↩ cd ~/home
Currently Building ×
Agentic RAG system with real-time knowledge graph updates and multi-hop reasoning
ESC to close