Skip to main content

Turn any podcast RSS feed into searchable transcripts, summaries, and episode chat.

No card • 10 free transcript credits
Sign up free with Google
No Priors: Artificial Intelligence | Technology | Startups
Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud
No Priors: Artificial Intelligence | Technology | Startups

Baseten CEO Tuhin Srivastava on the AI Inference Crunch, Custom Models, and Building the Inference Cloud

Conviction 42m 4 days ago
At this moment of inflection in technology, co-hosts Elad Gil and Sarah Guo talk to the world's leading AI engineers, researchers and founders about the biggest questions: How far away is AGI? What markets are at risk for disruption? How will commerce, culture, and society change? What’s happening in state-of-the-art in research? “No Priors” is your guide to the AI revolution. Email feedback to show@no-priors.com. Sarah Guo is a startup investor and the founder of Conviction, an investment firm purpose-built to serve intelligent software, or "Software 3.0" companies. She spent nearly a decade incubating and investing at venture firm Greylock Partners. Elad Gil is a serial entrepreneur and a startup investor. He was co-founder of Color Health, Mixer Labs (which was acquired by Twitter). He has invested in over 40 companies now worth $1B or more each, and is also author of the High Growth Handbook.

Show Notes

Tap timecodes to jump
Baseten CEO and co-founder Tuhin Srivastava sits down with Sarah Guo and Elad Gil to discuss the rapid growth of AI inference demand, Baseten’s 30x growth, and why inference is becoming the strategic “last market.” Tuhin Srivastava argues the application layer will persist because companies with unique user signals can encode value into workflows and post-train specialized models, citing examples like Abridge and support workflows. The conversation covers GPU capacity constraints, Baseten’s multi-cloud fabric across 18 clouds and 90 clusters, long-term contracting dynamics, the importance of the software layer for stickiness, evolving workloads, multichip possibilities, and operational lessons at scale.
Sign up for new podcasts every week. Email feedback to show@no-priors.com
Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Tuhinone
 
Chapters:
Baseten growth
Why the app layer wins
Serving frontier customers
Open source model mix
Chinese models and geopolitics
Custom inference dominates
Post training acquisition
When to invest in custom models
Supply crunch and data centerse
Longer GPU Contracts
What Makes a Winner
Multi Chip Future
Runtime Roadmap
Scaling Edge Cases
Hiring and Leadership
Operations Pager Culture
Efficiency Drives Demand
Concierge Everything Future
Conclusion

Transcript not yet processed.

Sign in to unlock (1 credit)

Full transcripts, AI insights,
episode chat — free.

Sign up with Google in one click. 10 unlock credits included. No card needed.

Google sign-in · No credit card · Cancel anytime