The AI world moves fast—but few people think rigorously about how we know what’s actually working. In our latest episode of the Ignite Podcast, we spoke with Minha Hwang, Principal Applied Scientist at Microsoft, to break down the messy, high-stakes world of AI experimentation, model evaluation, and what comes after training your models.
With a unique journey spanning MIT, McKinsey, academia, and Microsoft, Minha combines deep technical expertise with a pragmatic business lens. This blog unpacks the most valuable lessons from our conversation.
🎯 From Data Storage to Decision Science
Minha’s path is anything but linear:
PhD #1 in materials science at MIT
McKinsey consultant working across industries
PhD #2 in marketing science, then a professorship at McGill
Today, leading high-impact experimentation systems at Microsoft
What ties it all together? A relentless focus on data-driven decision making and understanding the real impact behind the numbers.
🧪 Why A/B Testing Isn’t Enough Anymore
Most companies lean heavily on A/B testing. But Minha warns of a harsh reality:
“False positives are shockingly common—especially when teams run too many tests, with too many metrics, on too little traffic.”
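To see why the numbers pile up so quickly, here is a back-of-the-envelope illustration (mine, not a figure from the episode): assuming independent metrics, each tested at a 5% significance level, the chance of at least one spurious "win" grows fast with the number of metrics.

```python
# Why false positives pile up: with many independent metrics each tested at
# alpha = 0.05, the chance of at least one spurious "significant" result under
# the null grows quickly. Purely illustrative arithmetic.
alpha = 0.05
for n_metrics in (1, 5, 20, 100):
    p_any_false_positive = 1 - (1 - alpha) ** n_metrics
    print(f"{n_metrics:>3} metrics -> P(at least one false positive) = {p_any_false_positive:.0%}")
```

With 20 metrics, roughly two out of three null experiments will flag at least one "significant" result, which is exactly why the safeguards below matter.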
He outlines how Microsoft tackles this:
Proxy metrics to detect signal faster
Variance reduction techniques using ML (sketched below)
Repeat experiments to validate surprising results (“solidification”)
These practices help Microsoft scale experimentation without sacrificing trust in the data.
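The episode doesn't spell out the exact machinery, but a widely used pattern for ML-assisted variance reduction is CUPED-style regression adjustment: use pre-experiment data to explain away predictable noise in the metric, so the same traffic yields tighter confidence intervals. The sketch below is illustrative (simulated data, hypothetical variable names), not Microsoft's internal implementation.

```python
# Illustrative CUPED-style variance reduction on simulated data.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n = 10_000

pre = rng.normal(10, 3, size=n)           # pre-experiment engagement (covariate)
assign = rng.integers(0, 2, size=n)       # 0 = control, 1 = treatment
post = 0.8 * pre + rng.normal(0, 2, size=n) + 0.1 * assign   # small true lift

# CUPED: subtract theta * (pre - mean(pre)) to remove variance predictable
# from pre-experiment behavior; the treatment effect is left untouched.
theta = np.cov(post, pre)[0, 1] / np.var(pre)
adjusted = post - theta * (pre - pre.mean())

for name, y in [("raw", post), ("cuped", adjusted)]:
    lift = y[assign == 1].mean() - y[assign == 0].mean()
    t_stat, p_val = stats.ttest_ind(y[assign == 1], y[assign == 0])
    print(f"{name:>5}: estimated lift = {lift:.3f}, p-value = {p_val:.4f}")
```

The adjusted comparison typically detects the same small lift with a noticeably smaller p-value than the raw one, which is the whole point: more sensitivity without more traffic.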
🔍 Causal Inference: The Most Underrated Skill in ML
While machine learning is great for prediction, Minha argues that causal inference—understanding what actually caused an outcome—is what truly drives business impact.
“Most ML teams are mapping X to Y. But businesses want to know: if I change X, what happens to Y?”
He highlights tools like observational causal inference, counterfactual reasoning, and A/B tests—but notes most data science programs underemphasize them.
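To make that quote concrete, here is a toy simulation (my illustration, not from the episode): a hidden confounder makes X an excellent predictor of Y, yet intervening on X, as a randomized experiment does, changes nothing.

```python
# Toy contrast between "X predicts Y" and "changing X changes Y".
import numpy as np

rng = np.random.default_rng(1)
n = 50_000

# Observational world: a confounder drives both X and Y, so X predicts Y well.
confounder = rng.normal(size=n)
x_obs = confounder + rng.normal(scale=0.5, size=n)
y_obs = 2.0 * confounder + rng.normal(scale=0.5, size=n)       # X has no direct effect
slope_obs = np.cov(x_obs, y_obs)[0, 1] / np.var(x_obs)
print(f"observational slope (prediction): {slope_obs:.2f}")    # around 1.6

# Interventional world: randomize X, as an A/B test does.
x_rand = rng.normal(size=n)                                    # X assigned independently
y_rand = 2.0 * confounder + rng.normal(scale=0.5, size=n)      # still no direct effect
slope_causal = np.cov(x_rand, y_rand)[0, 1] / np.var(x_rand)
print(f"randomized slope (causal effect): {slope_causal:.2f}")  # around 0.0
```

The predictive fit happily reports a strong slope; the randomized version, which answers the "if I change X, what happens to Y" question, correctly reports roughly zero.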
🤖 Evaluating LLMs: The New Frontier
As Microsoft integrates large language models (LLMs) into more products, experimentation gets trickier:
A/B testing LLM features often lacks clean control groups
Standard metrics don’t always reflect user preference or quality
Evaluation becomes more about human preferences and offline metrics (illustrated below)
This shift demands a new mindset—one that blends rigorous experimentation with deep qualitative insight.
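One common pattern behind preference-based, offline evaluation is pairwise comparison: judges see responses from a baseline and a candidate model side by side, and the votes are summarized as a win rate. The sketch below is a generic illustration with made-up counts, not Microsoft's evaluation pipeline.

```python
# Offline pairwise-preference eval: win rate of a candidate model over a
# baseline, from per-prompt judgments. Counts are made up for illustration.
import math

judgments = ["candidate"] * 142 + ["baseline"] * 98 + ["tie"] * 60

wins = judgments.count("candidate")
losses = judgments.count("baseline")
ties = judgments.count("tie")
decided = wins + losses

# Win rate among decided comparisons, with a normal-approximation 95% CI.
win_rate = wins / decided
se = math.sqrt(win_rate * (1 - win_rate) / decided)
ci_low, ci_high = win_rate - 1.96 * se, win_rate + 1.96 * se

print(f"wins={wins} losses={losses} ties={ties}")
print(f"win rate = {win_rate:.1%} (95% CI: {ci_low:.1%} to {ci_high:.1%})")
```

The same summary works whether the judges are humans or an LLM acting as a judge; the confidence interval keeps small evaluation sets from overstating a "win".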
🧠 The Case for Open Source and Reinforcement Learning
Minha is optimistic about:
Open-weight models like DeepSeek as democratizers of AI innovation
Reinforcement learning as a path beyond the limits of human-labeled data
“If we want models to go beyond human-level intelligence, we’ll need them to learn from experience—not just our data.”
He predicts RL and simulated environments will play a growing role in training next-gen AI.
🚀 What Comes After LLMs?
While LLMs dominate headlines, Minha is thinking ahead:
Smarter pricing agents for small businesses
Non-LLM applications with direct business value
Eventually, robotics and physical AI, where visual and tactile learning replaces pure text-based intelligence
The future, he believes, will demand more than language—it will require systems that understand, act, and adapt.
💡 Final Thought
Amid the AGI debates and benchmark hype, Minha offers a grounded view:
“As an engineer, I don’t care if it’s AGI. What matters is—does it solve the problem? Is it useful?”
That’s a philosophy worth holding onto in today’s rapidly evolving AI landscape.
🎧 Want to Go Deeper?
Listen to the full episode with Minha Hwang for stories, frameworks, and strategies you won’t hear anywhere else. Whether you're building AI systems or evaluating their business impact, this one’s a masterclass.
👂🎧 Watch, listen, and follow on your favorite platform: https://tr.ee/S2ayrbx_fL
🙏 Join the conversation on your favorite social network: https://linktr.ee/theignitepodcast
Chapters:
00:00 Intro
00:40 Minha’s Engineering Roots and PhD at MIT
01:55 Jumping from Engineering to Consulting at McKinsey
03:15 Why He Went Back for a Second PhD
04:35 Transition from Academia to Applied Data Science
06:00 Building McKinsey’s Data Science Arm
07:30 Moving to Microsoft to Explore Unstructured Data
08:40 Making A/B Testing More Sensitive with ML
10:00 Why False Positives Are a Massive Problem
11:05 How to Validate Experiments Through “Solidification”
12:10 The Importance of Proxy and Debugging Metrics
13:35 Model Compression and Quantization Explained
15:00 Balancing Statistical Rigor with Product Speed
16:30 Why Data, Not Model Training, Is the Bottleneck
18:00 Causal Inference vs. Machine Learning
20:00 Measuring What You Can’t Observe
21:15 The Missing Role of Causality in AI Education
22:15 Reinforcement Learning and the Data Scarcity Problem
23:40 The Rise of Open-Weight Models Like DeepSeek
25:00 Can Open Source Overtake Closed Labs?
26:15 IP Grey Areas in Foundation Model Training
27:35 Multimodal Models and the Future of Robotics
29:20 Simulated Environments and Physical AI
30:25 AGI, Overfitting, and the Benchmark Illusion
32:00 Practical Usefulness > Philosophical Debates
33:25 Most Underrated Metrics in A/B Testing
34:35 Favorite AI Papers and Experimentation Tools
36:30 Measuring Preferences with Discrete Choice Models
36:55 Outro