ML · SWE · Data

Building software systems—grounded in engineering, strengthened by ML.

I build at the intersection of software engineering and machine learning—reliable backends, clean APIs, and solid data pipelines, plus modeling and evaluation when it actually improves the product.

About Me

Software engineer and deep learning researcher from Surat, India.

On the engineering side: full-stack systems, APIs, data pipelines, distributed infrastructure. On the research side: bilevel optimization, implicit differentiation, GPU-scale experiments — published at NeurIPS 2025, extended work submitted to ICLR 2026.

I care about both equally. Systems that hold up. Models that actually work. The intersection is where the interesting problems are.

Dreams — Fleetwood Mac
What I Do

Engineering: full-stack from specs to UX — auth, databases, CI/CD, observability, clean APIs.

Research: bilevel optimization, implicit differentiation, A100 training runs, reproducible experiments, publication-grade code.

Both inform each other. I write systems that survive contact with real data. I write experiments that survive contact with real constraints.

Operating style

Deep dives across software, models, and systems.

I like work where a topic is worth digging into properly: the model, the data, the interface, the infrastructure, and the trade-offs that connect them. Surface-level understanding does not interest me much; I want to know why a system works, where it fails, and how to make it usable.

The intersection of fields is what keeps me engaged. Computer science gives me leverage, AI/ML gives me a way to model messy behavior, and my minors in physics and statistics push me toward stronger reasoning about uncertainty, structure, and evidence.

Why Engineering and Deep Learning, Together

CS gives leverage. Deep learning gives expressive power. Engineering makes things work. Research makes things correct.

I'm most engaged when both are in tension: when a model needs to be theoretically sound and practically deployable. When code needs to be reproducible and production-ready.

Infrastructure and evaluation matter as much as the architecture. Strong baselines, reproducible runs, measurable improvements.

My Principles (How I Work)
  1. 01 Start simple → iterate. Build the smallest honest baseline, then improve with evidence.
  2. 02 Reproduce everything. Scripts, seeds, configs, and data assumptions live with the code. Plots are programmatic.
  3. 03 Engineer for scale. Clear APIs, tests, and observability make systems maintainable.
  4. 04 Explain the trade-offs. Users should see why a route/model was chosen—not just the output.
  5. 05 Keep it legible. Code, docs, and UX should make future work easier, not harder.
Hobbies & Interests
  • Triathlon prep — endurance, structure, and consistency
  • Learning French
  • Tennis on the regular
  • Electronic music (EDM) — Spotify →
  • Philosophy, psychology, chess, and reading — Goodreads →
  1. May 2026
  2. 2025/10/24

    Preprint published

    Released the scalable gradient-based contract design preprint covering bilevel optimization, implicit gradients, and GPU-scale experimentation.

  3. Oct 2025
  4. Jan 2025
  5. Sep 2024-Apr 2025
  6. Feb-Sep 2024
  7. Aug 2022