RESEARCH ENGINEERING INTERN
AUTONISE
APPLIED ML // BENGALURU
● ACTIVEWorking across the full stack of a production AI system — generation pipelines, LLM reliability research, evaluation methodology, and inference infrastructure.
[01]
PROGRAMMATIC IR PIPELINE
Proved that direct coordinate output from autoregressive LLMs is unstable due to spatial lookahead planning failures. Built deterministic wrappers (LLM → Python / Matplotlib / RDKit / Schemdraw) where domain libraries handle layout constraints. Token cost: ₹10–30 → ~₹0.5 per video. Errors and latency dropped dramatically.
[02]
SWISS-TOURNAMENT RANKING
Engineered Swiss-tournament-style matchmaking to rank JEE question banks using simulated intransitive preference systems. Comparison overhead: O(N²) round-robin → O(N log N). Core finding: pairwise comparison design matters as much as the ranking algorithm itself.
[03]
ITERATIVE VERIFICATION HARNESSES
Designed benchmarking rigs for generation-verification loops using DeepSeek and Qwen. Implemented error categorization separating protocol failures (JSON truncation from token limits) from semantic rule failures (business logic errors) to prevent loop decay.
[04]
ML INFRASTRUCTURE
Evaluated serving performance, memory constraints, and scalability tradeoffs using vLLM (RTX 3090), LoRA fine-tuning, and Mixture-of-Experts routing. Built intuition for when training-time architecture choices constrain serving options.