On Monday, OpenAI announced a significant update to ChatGPT that enables its GPT-3.5 and GPT-4 AI models to analyze images and react to them as part of a text conversation. Also, the ChatGPT mobile app will add speech synthesis options that, when paired with its existing speech recognition features, will enable fully verbal conversations with the AI assistant, OpenAI says.

OpenAI is planning to roll out these features in ChatGPT to Plus and Enterprise subscribers "over the next two weeks." It also notes that speech synthesis is coming to iOS and Android only, while image recognition will be available on both the web interface and the mobile apps.

Despite their drawbacks, OpenAI's marketing materials bill these new features as giving ChatGPT the ability to "see, hear, and speak." Not everyone is happy about the anthropomorphism and potential hype language involved. On X, Hugging Face AI researcher Dr. Sasha Luccioni posted, "The always and forever PSA: stop treating AI models like humans. No, ChatGPT cannot 'see, hear and speak.' It can be integrated with sensors that will feed it information in different modalities."

While ChatGPT and its associated AI models are clearly not human—and hype is a very real thing in marketing—if the updates perform as shown, they potentially represent a significant expansion in capabilities for OpenAI's computer assistant.

  • (Score: 5, Interesting) by Beryllium Sphere (r) on Friday September 29 2023, @06:59PM (1 child)

    by Beryllium Sphere (r) (5062) on Friday September 29 2023, @06:59PM (#1326346)

    The fact that ChatGPT output is programmed to say "I" contributes to confusing it with an entity. This decision has drawn criticism from someone in the field.

    I've been actively fighting to stop myself from saying "ChatGPT said" and to say instead "The output included".

    Or we could dust off the old joke. Don't anthropomorphize LLMs. They hate it when you do that.

    • (Score: 4, Informative) by maxwell demon on Saturday September 30 2023, @09:10AM

      by maxwell demon (1608) on Saturday September 30 2023, @09:10AM (#1326433) Journal

      You know, with ChatGPT, you can just ask. I just did, and this is what ChatGPT answered:

      I don't have personal preferences or feelings, so I don't have the capacity to like or dislike anything, including being anthropomorphized. Anthropomorphism is a human tendency to attribute human characteristics, emotions, or intentions to non-human entities or objects. I'm just a computer program, so I don't have consciousness, emotions, or self-awareness. My purpose is to assist and provide information to the best of my abilities based on the text-based input I receive.

  • (Score: 4, Funny) by Opportunist on Friday September 29 2023, @07:47PM

    by Opportunist (5545) on Friday September 29 2023, @07:47PM (#1326350)

    You know, hear nothing, see nothing, say nothing.

  • (Score: 2) by SomeGuy on Friday September 29 2023, @08:41PM (2 children)

    by SomeGuy (5632) on Friday September 29 2023, @08:41PM (#1326358)

    Hasn't the idea of talking to a computer gotten old? I remember when cell phone based digital assistants, like Apple's Siri, were supposedly the hot shit, but even others besides myself would look at people funny when they tried talking to one.

    Me, I don't TALK to machines unless I am SCREAMING at them.

    That said, when we get to the point where "AI" replaces front line tech support, that might not be the worst thing in the world. At least there would be NO FREAKING EXCUSE for awful grating Indian accents any more.

    • (Score: 0) by Anonymous Coward on Friday September 29 2023, @09:26PM

      by Anonymous Coward on Friday September 29 2023, @09:26PM (#1326368)

      I spend so much time not talking to anyone. Not sure I want to talk to computer, but maybe it will make me speak gooder.

    • (Score: 2) by Freeman on Monday October 02 2023, @05:39PM

      by Freeman (732) on Monday October 02 2023, @05:39PM (#1326773) Journal

      I might be convinced that talking to my computer would be okay, if the following conditions were met.

      #1 It wasn't sending all of my data over the internet to some corporate mothership or any "mothership" for that matter.
      #2 It was actually convenient.
      #3 The system didn't hallucinate so much.
      #4 I could change the "trigger" word to whatever I felt like.

  • (Score: 0) by Anonymous Coward on Saturday September 30 2023, @09:57AM

    by Anonymous Coward on Saturday September 30 2023, @09:57AM (#1326435)

