Skip to content
All resources

From the guest lectures

Why voice will be the core interface

Sequoia · Training Data2 min readFree

A plain summary, so you can get the gist here without leaving.

This is a 2025 podcast episode from Sequoia's Training Data series, featuring ElevenLabs CEO Mati Staniszewski. The conversation makes the case that voice is becoming a primary way we interact with computers, and that staying focused is how a company wins.

What it is

Training Data is an interview series where the venture firm Sequoia talks with people building AI. In this episode the guest is Mati Staniszewski, who co-founded and leads ElevenLabs, the voice and audio company.

The discussion is less about any single product and more about a direction: where voice technology is heading, why it is becoming central, and what it takes to build a durable company in a fast-moving field. Treat it as a window into how an operator thinks, not a technical tutorial.

The core idea

The headline argument is that voice will be a core interface, not a side feature. For most of computing we have typed and tapped. As speech generation and understanding get good enough, talking to a device becomes natural and fast, and in many situations, hands-free or while moving, it is simply the better way to interact.

The second thread is focus. A common temptation, especially when a company is doing well, is to expand into many areas at once. The conversation argues for the opposite: pick the thing you are best at, here it is voice, and keep going deeper rather than wider. That discipline is presented as a feature of the strategy, not a limitation.

Why it matters

If voice really does become a main way people use software, then how you design products changes. You start thinking about conversations and sound, not only screens and buttons, and that reshapes what is worth building.

For anyone learning to build with AI, this kind of conversation is valuable in a different way than a paper. It teaches judgment: how a founder reads where the technology is going, how they decide what to chase, and why saying no to good options can be as important as saying yes. That is exactly the kind of thinking a guest lecture is meant to share with a community.

Key points
  • A 2025 Sequoia Training Data podcast episode with ElevenLabs CEO Mati Staniszewski.
  • Central claim: voice is becoming a core interface for computers, not just an add-on.
  • As speech tech improves, talking can be faster and more natural than typing in many settings.
  • A second theme is the value of focus: go deeper on what you do best rather than expanding everywhere.
  • It offers founder-level judgment about reading where technology is heading, useful for builders, not just technical detail.
Open the original source

Sequoia · Training Data

New to this? Come build with us.

Reading is good. Building with people is better. Our drop-ins are free and open to total beginners.