NISHIKA YADAV
Hi, I am a Backend & AI Engineer with 2+ years of experience building production LLM platforms, multi-provider orchestration, RAG pipelines, vector stores, and model fine-tuning workflows. Python is my primary language, and I also bring experience in data science, machine learning, and core AI.
My interests range from geospatial, remote-sensing, and satellite data to responsible AI, AI governance, and tech/public policy. I hope to work in these fields in the near future, and in the meantime to delve into them as much as I can on my own.

Experience and Skills
Where I've worked
Tech4Dev — Kaapi
Backend & AI Engineer
Dec 2024 – Present
- Architected a Python-based multi-provider LLM gateway (OpenAI, AWS Bedrock, Gemini) with a provider-agnostic abstraction layer in FastAPI + SQLModel.
- Built a production pipeline that takes documents from upload, through an optional transformation step, into a vector store, with asynchronous processing via Celery + Redis + gevent.
- Contributed to a guardrails microservice with autodiscovery, A/B testing, and service-to-service auth, so that validators can be added without modifying core platform code.
- Delivered features on a production LLMOps frontend (Next.js 16, React 19, TypeScript): guardrails config UI, document management, knowledge base creation, speech-to-text evaluation.
- Engineered a complete pipeline for fine-tuning LLMs and evaluating the resulting models with OpenAI: stratified data splits, background job tracking, JSONL preprocessing.
- Experimented with multiple file search setups, including AWS Bedrock + OpenSearch + S3 and Google Gemini File Search, to evaluate best-fit RAG setups for NGO clients.
- Owned the full prompt engineering and evaluation workflow for a public policy chatbot serving government stakeholders, continuously optimizing prompt templates and RAG configurations using structured evaluations and real-world user feedback.
Dalgo (via Tech4Dev)
Data Engineer
Sept 2024 – Dec 2024
- Built dbt transformation pipelines converting raw NGO field data into analytics-ready models powering Jal Jeevan Mission dashboards.
- Delivered Apache Superset dashboards tracking key mission metrics, replacing manual reporting workflows.
Calfus Inc.
AI Engineering Intern
Feb 2024 – Aug 2024
- Delivered a production chatbot (Ollama + LlamaIndex + LangChain) letting customers query their cash flow database in natural language, replacing a 3-step manual workflow.
- Fine-tuned Llama 2 and CodeLlama for Text-to-SQL using SFTTrainer, ORPOTrainer, and DPOTrainer; explored QLoRA 4-bit quantisation to run 7B models on constrained hardware.
PACTA
Research & Data Analyst Intern
Dec 2023 – Feb 2024
- Produced daily analysis reports in Python and Tableau, surfacing trends in Tamil Nadu's disability data for the research team.
IIIT
ML Research Intern
May 2023 – Jun 2023
- Trained UX-Net, a modified U-Net architecture that combines convolutional and recurrent processing for low latency, in PyTorch, and applied it to speech separation and acoustic echo cancellation tasks.
Skills
Projects & Writing
Things I've built
Llama 2 Fine-Tuning (QLoRA)
Fine-tuned Llama 2-7B on a free Colab T4 GPU using QLoRA with 4-bit quantisation, reducing VRAM footprint by ~75% with no meaningful accuracy loss. Published multiple fine-tuned model variants on Hugging Face.
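As a rough sanity check on the ~75% figure, the sketch below compares weight memory alone at 16-bit and 4-bit precision. The 7e9 parameter count is a round-number assumption, and the calculation ignores activations, LoRA adapter weights, optimizer state, and quantisation constants, so measured VRAM usage will differ.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Approximate memory needed to hold the model weights, in gigabytes."""
    return num_params * bits_per_param / 8 / 1e9


NUM_PARAMS = 7e9  # assumed round figure for a "7B" model

fp16_gb = weight_memory_gb(NUM_PARAMS, 16)  # half-precision baseline
int4_gb = weight_memory_gb(NUM_PARAMS, 4)   # 4-bit quantised weights
reduction = 1 - int4_gb / fp16_gb           # fraction of weight memory saved

print(f"16-bit weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:  {int4_gb:.1f} GB")
print(f"reduction:      {reduction:.0%}")
```

Going from 16 bits to 4 bits per weight cuts weight storage to a quarter, which is where a saving of roughly 75% comes from.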
Multi-Provider LLM Gateway
Provider-agnostic LLM abstraction layer supporting OpenAI, AWS Bedrock, and Gemini. Built with FastAPI + SQLModel, hardened with SlowAPI rate limiting and SSRF-protected webhook callbacks.
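To illustrate what a provider-agnostic layer can look like, here is a minimal sketch, not the project's actual code. Every name in it (`LLMProvider`, `ChatRequest`, `ChatResponse`, `EchoProvider`, `Gateway`) is hypothetical, and the stand-in adapter echoes the prompt instead of calling any vendor SDK.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass


@dataclass
class ChatRequest:
    model: str
    prompt: str


@dataclass
class ChatResponse:
    provider: str
    text: str


class LLMProvider(ABC):
    """Common interface that every provider adapter implements (hypothetical)."""

    name: str

    @abstractmethod
    def complete(self, request: ChatRequest) -> ChatResponse:
        ...


class EchoProvider(LLMProvider):
    """Stand-in adapter; a real adapter would call the vendor SDK here."""

    def __init__(self, name: str) -> None:
        self.name = name

    def complete(self, request: ChatRequest) -> ChatResponse:
        text = f"[{self.name}:{request.model}] {request.prompt}"
        return ChatResponse(provider=self.name, text=text)


class Gateway:
    """Routes a request to a registered provider by name."""

    def __init__(self) -> None:
        self._providers: dict[str, LLMProvider] = {}

    def register(self, provider: LLMProvider) -> None:
        self._providers[provider.name] = provider

    def complete(self, provider_name: str, request: ChatRequest) -> ChatResponse:
        if provider_name not in self._providers:
            raise ValueError(f"unknown provider: {provider_name}")
        return self._providers[provider_name].complete(request)


gateway = Gateway()
for provider_name in ("openai", "bedrock", "gemini"):
    gateway.register(EchoProvider(provider_name))

reply = gateway.complete("gemini", ChatRequest(model="demo-model", prompt="hello"))
print(reply.text)
```

Callers depend only on `Gateway` and the request/response types, so supporting another provider means writing one more adapter rather than touching call sites.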
Anemia Detection from Conjunctiva Images
Detected anemia from conjunctiva eye images using 5 models: CNN, XGBoost, Logistic Regression, VGG16, and ResNet50 (transfer learning). Best performer was logistic regression. Dataset sourced from Mendeley Data (Ghana conjunctiva images by Appiahene et al.).
Education
MITS Gwalior
2020 – 2024
B.Tech, Electrical Engineering
Madhav Institute of Technology & Science, Gwalior
Certifications
— PEDP Data Science for Social Impact — Ashoka University (2025–26)
— Neo4j & LLM Fundamentals — Neo4j (2024)
— Summer Analytics 2023 — Consulting & Analytics Club, IIT Guwahati
Volunteering
Save Mumbai Mangrove
April 2026 – Present
Tech Volunteer
People+AI
March 2024 – Aug 2024
Volunteer