Stories
Slash Boxes
Comments

SoylentNews is people

SoylentNews is powered by your submissions, so send in your scoop. Only 16 submissions in the queue.
posted by hubie on Friday January 09, @12:53PM   Printer-friendly

https://techcrunch.com/2026/01/01/openai-bets-big-on-audio-as-silicon-valley-declares-war-on-screens/

OpenAI is betting big on audio AI, and it's not just about making ChatGPT sound better. According to new reporting from The Information, the company has unified several engineering, product, and research teams over the past two months to overhaul its audio models, all in preparation for an audio-first personal device expected to launch in about a year.

The move reflects where the entire tech industry is headed — toward a future where screens become background noise and audio takes center stage. Smart speakers have already made voice assistants a fixture in more than a third of U.S. homes. Meta just rolled out a feature for its Ray-Ban smart glasses that uses a five-microphone array to help you hear conversations in noisy rooms — essentially turning your face into a directional listening device. Google, meanwhile, began experimenting in June with "Audio Overviews" that transform search results into conversational summaries. And Tesla is integrating Grok and other LLMs into its vehicles to create conversational voice assistants that can handle everything from navigation to climate control through natural dialogue.

It's not just the tech giants placing this bet. A motley crew of startups has emerged with the same conviction, albeit with varying degrees of success. The makers of the Humane AI Pin burned through hundreds of millions before their screenless wearable became a cautionary tale. The Friend AI pendant, a necklace that records your life and offers companionship, has sparked privacy concerns and existential dread in equal measure. And now at least two companies, including Sandbar and one helmed by Pebble founder Eric Migicovsky, are building AI rings expected to debut in 2026, allowing wearers to literally talk to the hand.

The form factors may differ, but the thesis is the same: audio is the interface of the future. Every space — your home, your car, even your face — is becoming an interface.

OpenAI's new audio model, slated for early 2026, will reportedly sound more natural, handle interruptions like an actual conversation partner, and even speak while you're talking, which is something today's models can't manage. The company is also said to envision a family of devices, possibly including glasses or screenless smart speakers, that act less like tools and more like companions.

As The Information notes, former Apple design chief Jony Ive, who joined OpenAI's hardware efforts through the company's $6.5 billion acquisition in May of his firm io, has made reducing device addiction a priority, seeing audio-first design as a chance to "right the wrongs" of past consumer gadgets.


Original Submission

This discussion was created by hubie (1068) for logged-in users only. Log in and try again!
Display Options Threshold/Breakthrough Mark All as Read Mark All as Unread
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
(1)
  • (Score: 4, Insightful) by Anonymous Coward on Friday January 09, @05:44PM (1 child)

    by Anonymous Coward on Friday January 09, @05:44PM (#1429225)

    Not looking forward to getting "spoken" email that will take much longer to wade through than I can do visually on a nice big screen.
    Likewise for texts, the burden belongs on the sender to type, then receiver(s) are much more efficient in reading the message.
    Making it easier to create and send messages is a recipe for reducing the signal to noise ratio.

    (ancient rant)
    When Windows 3.1 started to get useful, a friend introduced me to mousing and otherwise using it (I was doing fine with MS-DOS). At one point I commented that mousing was a lot slower than emacs chord-commands for editing long documents (a big part of my work back then, formatting the final manual/book was a separate process).
    His reply, "Reduce your productivity with Windows."

    • (Score: 3, Insightful) by Anonymous Coward on Friday January 09, @06:31PM

      by Anonymous Coward on Friday January 09, @06:31PM (#1429235)

      Slop-tech came for your keyboard

      Then they came for your screen

      Next, they will scoop out what was left of your brain.

  • (Score: 5, Funny) by Thexalon on Friday January 09, @08:07PM

    by Thexalon (636) on Friday January 09, @08:07PM (#1429255)

    Look, we have a whole new thing that definitely will make AI a wonderful and profitable venture now. I know all of our other attempts at using AI in the real world have been unprofitable without government contracts, but this one will be. And we know that because Amazon Alexa exists and they definitely didn't just cut all the funding for it last year.

    --
    "Think of how stupid the average person is. Then realize half of 'em are stupider than that." - George Carlin
  • (Score: 5, Touché) by anubi on Saturday January 10, @12:45PM

    by anubi (2828) on Saturday January 10, @12:45PM (#1429351) Journal

    Be prepared to spend a day or two. Maybe all week.

    --
    "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]
  • (Score: 2) by Bentonite on Sunday January 11, @12:21PM

    by Bentonite (56146) on Sunday January 11, @12:21PM (#1429570)

    as well, just to serve the NPC obsession about a computer that can be operated by text.

    (Yes, the NPC mind doesn't consider between learning how to use computers, or not using computers - instead the computer must somehow accept ambiguous voice commands and proceed to do everything automatically).

    I guess a LLM duct taped to voice synthesis and tooling would seem to work well enough, as it would do lots of things that must be correct.

(1)