Today, most users interact with AI through a text box messaging interface. But voice interaction is increasingly coming online. ChatGPT, and most of the foundational large language model providers, have implemented voice chat interaction. Companies like Wispr Flow, Sandbar, and dozens of AI transcription companies are building voice-based AI businesses in a myriad of ways.
With the focus on voice interaction, many companies are turning to a form factor we believe is short-sighted: the pin. The pin is the low-hanging fruit of wearable AI technology: usually little more than an ugly mic and speaker with a rigid interface. We believe it will be a technological fossil in short order.
Here is a quick overview of some of the companies going after the pin (confirmed and rumored):
The AI pin was born from a desire to “liberate” us from smartphones. In reality, it prioritizes supposed convenience over utility.
The form factor is a trap. You either turn into a walking privacy violation by recording everything, or you have to manually 'summon' the AI every five seconds. It is supposed to be seamless, but the 'push-to-talk' requirement makes every interaction feel like a performance. Checking your watch passes unnoticed; tapping a pin and talking to your chest makes a scene. In the end, you do not save time, you merely trade a screen for a struggle.
One note to add is that companies like Sandbar are focusing on individual productivity with privacy at the core. Their voice ring is not meant to record the world around you but instead helps organize your thoughts and ideas by giving an easy-to-use interaction layer with a recognizable and acceptable form factor.
Worst of all, the pin lacks high-bandwidth feedback. If you put a billboard in front of someone, they would understand it far more quickly than if you verbally described it to them. We process visual information roughly 60,000 times faster than audio. Instead of a rich display, you are stuck waiting for a slow, linear voice to finish its sentence or squinting at a low-resolution laser projected onto a shaky palm. By removing the screen without replacing it with a superior visual layer (e.g., an optimal form factor such as smart glasses), pins create an information bottleneck.
The Future
We believe the optimal future is one where AI-enabled wearables remove the need for a phone. As we have discussed in previous newsletters, smartphones could move towards a “compute” device instead of an “interaction” device, where the main value is to offload compute, store memory, and serve as a backup ecosystem if needed. This allows for wearables to take on a more comfortable form factor and primary interaction layer.
The real opportunity lies in hardware that augments the senses we already use: vision and hearing. This is why the future is most likely smart glasses:
Takeaway: The AI pin is a dead end that prioritizes the convenience of a mic-and-speaker clip over the necessity of high-bandwidth, visual context. By anchoring the AI to the chest rather than the eyes, these devices create a massive information bottleneck that forces users into socially awkward, low-speed interactions. The future lies in smart glasses that align with our natural sensory inputs, transforming AI from a reactive "clerk" on your shirt into a proactive partner that shares your point of view.