Mumbai, India · Backend & AI Engineer

NISHIKA YADAV

Hi, I am a Backend & AI Engineer with 2+ years of experience building production LLM platforms, multi-provider orchestration, RAG pipelines, vector stores, and model fine-tuning workflows. Python is my primary language, and I also bring experience in data science, machine learning, and core AI.

My interests range from geospatial, remote-sensing, and satellite data to responsible AI, AI governance, and tech/public policy. I hope to work in these fields in the near future, and in the meantime I will delve into them as much as I can on my own.

Nishika

Experience and Skills

Where I've worked

Tech4Dev — Kaapi

Backend & AI Engineer

Dec 2024 – Present

  • Architected a Python-based multi-provider LLM gateway (OpenAI, AWS Bedrock, Gemini) with a provider-agnostic abstraction layer in FastAPI + SQLModel.
  • Built a production pipeline that takes documents from upload, through optional transformation, into a vector store, with asynchronous processing via Celery + Redis + gevent.
  • Contributed to a guardrails microservice with autodiscovery, A/B testing, and service-to-service auth; validators are added without modifying core platform code.
  • Delivered features on a production LLMOps frontend (Next.js 16, React 19, TypeScript): guardrails config UI, document management, knowledge base creation, speech-to-text evaluation.
  • Engineered a complete pipeline for fine-tuning LLMs and evaluating them with OpenAI, including stratified data splits, background job tracking, and JSONL preprocessing.
  • Experimented with multiple file search setups, including AWS Bedrock + OpenSearch + S3 and Google Gemini File Search, to identify the best-fit RAG setup for NGO clients.
  • Owned the full prompt engineering and evaluation workflow for a public policy chatbot serving government stakeholders, continuously optimizing prompt templates and RAG configurations using structured evaluations and real-world user feedback.

Dalgo (via Tech4Dev)

Data Engineer

Sept 2024 – Dec 2024

  • Built dbt transformation pipelines converting raw NGO field data into analytics-ready models powering Jal Jeevan Mission dashboards.
  • Delivered Apache Superset dashboards tracking key mission metrics, replacing manual reporting workflows.

Calfus Inc.

AI Engineering Intern

Feb 2024 – Aug 2024

  • Delivered a production chatbot (Ollama + LlamaIndex + LangChain) letting customers query their cash flow database in natural language, replacing a 3-step manual workflow.
  • Fine-tuned Llama 2 and CodeLlama for Text-to-SQL using SFTTrainer, ORPOTrainer, and DPOTrainer; explored QLoRA 4-bit quantisation to run 7B models on constrained hardware.

PACTA

Research & Data Analyst Intern

Dec 2023 – Feb 2024

  • Produced daily analysis reports in Python and Tableau, surfacing trends in Tamil Nadu's disability data for the research team.

IIIT

ML Research Intern

May 2023 – Jun 2023

  • Trained UX-Net (a modified U-Net architecture combining convolutional and recurrent processing for low-latency speech separation) in PyTorch and applied it to speech separation and acoustic echo cancellation tasks.

Skills

Projects & Writing

Things I've built

Llama 2 Fine-Tuning (QLoRA)

Fine-tuned Llama 2-7B on a free Colab T4 GPU using QLoRA with 4-bit quantisation, reducing the VRAM footprint by ~75% with no meaningful accuracy loss. Published multiple fine-tuned model variants on Hugging Face.

PyTorch · QLoRA · Hugging Face · Transformers · 4-bit Quantisation

Multi-Provider LLM Gateway

Provider-agnostic LLM abstraction layer supporting OpenAI, AWS Bedrock, and Gemini. Built with FastAPI + SQLModel, hardened with SlowAPI rate limiting and SSRF-protected webhook callbacks.

FastAPI · OpenAI · AWS ECS/EC2 · Gemini · Redis · PostgreSQL

Anemia Detection from Conjunctiva Images

Detected anemia from conjunctiva eye images using five models: CNN, XGBoost, logistic regression, VGG16, and ResNet50 (transfer learning). Logistic regression performed best. Dataset sourced from Mendeley Data (Ghana conjunctiva images by Appiahene et al.).

CNN · VGG16 · ResNet50 · XGBoost · Transfer Learning · Medical Imaging

Writing

Kaapi Guardrails: A Tattle-Tech4Dev Collaboration for AI Safety (projecttech4dev.org)

Building a Chatbot for Public Policy Officials (projecttech4dev.org)

Data at Work: Building a Data Pipeline for Jal Jeevan Mission (projecttech4dev.org)

Education

MITS Gwalior

2020 – 2024

B.Tech, Electrical Engineering

Madhav Institute of Technology & Science, Gwalior

Certifications

PEDP Data Science for Social Impact — Ashoka University (2025–26)

Neo4j & LLM Fundamentals — Neo4j (2024)

Summer Analytics 2023 — Consulting & Analytics Club, IIT Guwahati

Volunteering

Save Mumbai Mangrove

April 2026 – Present

Tech Volunteer

People+AI

March 2024 – Aug 2024

Volunteer

Contact

Email: nishikayadav26@gmail.com

LinkedIn: linkedin.com/in/nishika-yadav

GitHub: github.com/nishika26

Phone: +91-8840139142

© 2026 Nishika Yadav · Built with Next.js · Deployed on Vercel