Three months after sharing preview demos to a waitlist, Sesame launched its iOS app on May 28, making its conversational AI agents publicly available for the first time.
Sesame was co-founded by former executives from Oculus, the VR company Meta acquired in 2014. The founding team built careers around presence - making people feel genuinely engaged with a technology rather than just using it. They're applying that thinking to voice AI: conversations that feel like talking to a person, not dictating commands to a voice assistant.
The Problem They're Targeting
Most AI voice interactions still feel transactional. Even improved products like ChatGPT's Advanced Voice Mode have latency quirks, awkward interruptions, and responses that don't quite track the natural flow of conversation. These aren't primarily language model problems - they're design and engineering problems around timing, tone, and context retention. Sesame's argument is that building voice-first from scratch produces better results than bolting audio onto a text-first product.
The Oculus analogy is useful here. Early VR headsets adapted from gaming hardware never matched headsets built from the start with immersion as the primary design constraint. Sesame is making the same bet about voice interfaces.
What's Available Now
The iOS app is Sesame's first public consumer release. Pricing details and whether a free tier exists aren't confirmed in the launch announcement. No Android version has been announced.
The conversational AI space on mobile is genuinely crowded - ChatGPT, Claude, and Gemini all have capable voice modes built into established apps with large user bases. Sesame's window for differentiation depends entirely on whether users notice a real qualitative difference in how natural the conversation feels. That's a claim that's hard to evaluate until you've actually used it.