Aditya Challapally, Author at Engineering@Microsoft

Posts by this author

Feb 27, 2026

Engineering and algorithmic interventions for multimodal post-training at Microsoft scale

Aditya Challapally leads post-training research and infrastructure for Copilot agent capabilities that process millions of multimodal interactions. This post builds on the diagnostics from Diagnosing instability in production-scale agent reinforcement learning with the engineering and algorithmic interventions we developed to get the best results ...

Jan 28, 2026

Diagnosing instability in production-scale agent reinforcement learning

On January 28, 2026, Hugging Face announced that they have upstreamed the Post-Training Toolkit into TRL as a first-party integration, making these diagnostics directly usable in production RL and agent post-training pipelines. This enables closed-loop monitoring and control patterns that are increasingly necessary for long-running and continuously...