
from the one-step-for-AI-one-giant-leap-for-the-hype-train dept.
OpenAI finally unveiled its rumored "Strawberry" AI language model on Thursday, claiming significant improvements in what it calls "reasoning" and problem-solving capabilities over previous large language models (LLMs). Formally named "OpenAI o1," the model family will initially launch in two forms, o1-preview and o1-mini, available today for ChatGPT Plus and API users.
[...]
In a rare display of public hype-busting, OpenAI product manager Joanne Jang tweeted, "There's a lot of o1 hype on my feed, so I'm worried that it might be setting the wrong expectations. what o1 is: the first reasoning model that shines in really hard tasks, and it'll only get better. (I'm personally psyched about the model's potential & trajectory!) what o1 isn't (yet!): a miracle model that does everything better than previous models. you might be disappointed if this is your expectation for today's launch—but we're working to get there!"
[...]
AI benchmarks are notoriously unreliable and easy to game; however, independent verification and experimentation from users will show the full extent of o1's advancements over time. On top of that, MIT research showed earlier this year that some of the benchmark claims OpenAI touted for GPT-4 last year were erroneous or exaggerated.

One of the examples of o1's abilities that OpenAI shared is perhaps the least consequential and impressive, but it's the most talked about due to a recurring meme where people ask LLMs to count the number of Rs in the word "strawberry." Due to tokenization, where the LLM processes words in data chunks called tokens, most LLMs are typically blind to character-by-character differences in words.
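For the curious, the chunking is easy to see with OpenAI's open-source tiktoken tokenizer. A minimal sketch (the exact token boundaries are illustrative and vary by encoding and model):

    # Requires the tiktoken package (pip install tiktoken).
    # Token boundaries differ between encodings; the split printed here is illustrative.
    import tiktoken

    enc = tiktoken.get_encoding("cl100k_base")   # encoding used by GPT-4-era models
    token_ids = enc.encode("strawberry")
    pieces = [enc.decode([t]) for t in token_ids]
    print(pieces)   # multi-character chunks, not individual letters

The model operates on those chunk IDs, so it never directly "sees" the three letter Rs.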
[...]
It's no secret that some people in tech have issues with anthropomorphizing AI models and using terms like "thinking" or "reasoning" to describe the synthesizing and processing operations that these neural network systems perform.

Just after the OpenAI o1 announcement, Hugging Face CEO Clement Delangue wrote, "Once again, an AI system is not 'thinking', it's 'processing', 'running predictions',... just like Google or computers do. Giving the false impression that technology systems are human is just cheap snake oil and marketing to fool you into thinking it's more clever than it is."
(Score: 2, Interesting) by Anonymous Coward on Friday September 20 2024, @11:40AM (4 children)
Posed to gemini.google.com :
Q. How many letters r are in the word Strawberry?
A. There are 2 letters "r" in the word Strawberry.
Is it odd that Gemini gives a correct answer for this?
Q. How many letters i are in Mississippi?
A. There are 4 letters "i" in the word Mississippi.
[also correct counts for letters m, s & p]
Is it just "r" that causes trouble?
Q. How many letters r are in charger?
A. There is 1 letter "r" in the word charger.
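For reference, a plain string count settles the ground truth here; a trivial sketch:

    # Deterministic letter counts for the words tested above
    for word, letter in [("strawberry", "r"), ("Mississippi", "i"), ("charger", "r")]:
        print(word, letter, word.lower().count(letter))
    # strawberry r 3
    # Mississippi i 4
    # charger r 2

So Gemini's Mississippi count is right, but both of its "r" answers above are off by one.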
(Score: 1) by pTamok on Friday September 20 2024, @01:26PM (2 children)
Of the letters in the word 'cloche', how many are the letter 'c'?
Of the letters in the word 'crochet', how many are the letter 'c'?
Of the letters in the word 'crotchet', how many are the letter 'c'?
Of the letters in the word 'Chechen', how many are the letter 'c'?
Of the letters in the word 'cicatrice', how many are the letter 'c'?
Of the letters in the word 'access', how many are the letter 'c'?
Of the letters in the word 'necessity', how many are the letter 'c'?
Of the letters in the word 'susceptible', how many are the letter 'c'?
Of the letters in the word 'consciousness', how many are the letter 'c'?
Of the letters in the word 'chimera', how many are the letter 'c'?
Of the letters in the word 'cinema', how many are the letter 'c'?
Of the letters in the word 'chequered', how many are the letter 'c'?
Of the letters in the word 'character', how many are the letter 'c'?
Of the letters in the word 'clock', how many are the letter 'c'?
Of the letters in the word 'perspicacity', how many are the letter 'c'?
Of the letters in the word 'psychic', how many are the letter 'c'?
Of the letters in the word 'icicle', how many are the letter 'c'?
Of the letters in the word 'loch', how many are the letter 'c'?
Which are all easy for humans to count. But if I added "as in 'cat'" to the end of each question, would an LLM be certain to get it right?
(Score: 3, Funny) by EvilSS on Saturday September 21 2024, @12:37AM (1 child)
(Score: 1) by pTamok on Saturday September 21 2024, @05:06PM
My computer is powered by clockwork: first of all, I'll need to send you to the warehouse to ask the storeman for a long weight.
(Score: 1) by shrewdsheep on Friday September 20 2024, @02:04PM
Of the letters in the word 'necessity', how many are greater than 'c' in reverse lexicographic order?
Gemini: In reverse lexicographic order, there are no letters in the word 'necessity' that are greater than 'c'.
GPT-4o (DDG): [a lot of blah-blah] Thus, there are 5 letters in "necessity" that are greater than 'c' in reverse lexicographic order.
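For what it's worth, if "greater in reverse lexicographic order" is read as "earlier in the normal alphabet" (that interpretation is an assumption), a direct check agrees with Gemini:

    # Letters of 'necessity' that come before 'c' alphabetically,
    # i.e. are "greater" once the ordering is reversed (interpretation assumed)
    word = "necessity"
    print(sum(1 for ch in word if ch < "c"))   # 0 -- no such letters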
(Score: 4, Touché) by ledow on Friday September 20 2024, @12:20PM
I give it an hour before another (untrained-for) counterexample pops up and we all laugh at this "learning machine" that cannot learn and has no understanding or inference again.
(Score: 5, Interesting) by ElizabethGreene on Friday September 20 2024, @01:31PM (13 children)
There is a fundamental change with o1, and it's exciting.
With GPT-3, every question you asked got a one-shot response. The answer was always "off the top of my head". You were talking directly to the inner monologue of the model, and it only knew the next token or two of its response as it sent it back to you.
That's different from asking, say, me a hard question. I start with the off-the-top-of-my-head response and then grind away at it. Is this reasonable? Does it match the facts I know? How confident am I in the facts in this answer? Are they true? Etc.
If you ask me how many Rs are in strawberry, I have a multi-step reasoning process. I picture a strawberry, identify it as a fruit containing flesh, juice, and seeds, no Rs. That response is ludicrous, so I parse the question differently, retrieve the word strawberry, and then loop through it to count the Rs. GPT-3 couldn't do that second-level reasoning; it was all one-shot.
o1 breaks the one-shot pattern: instead of the stream-of-consciousness response, it generates an initial response and then uses self-prompting to perform that deeper "thought", improving and sanity-checking the answer before kicking it back. It significantly increases the token use of the model, but also significantly improves the quality of the responses.
It's a fantastic step forward that will allow dumber models to produce smarter responses. I rather wish OAI was more transparent about the "thought process" they use under the hood though. There is a way to request that, but it's intentionally opaque in some areas for competitive reasons.
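For anyone who wants to play with the idea, here is a minimal sketch of the draft-critique-revise loop described above. call_model() is a hypothetical stand-in for whatever completion API you use, and the prompts are illustrative; this is not OpenAI's actual hidden chain of thought:

    # Hypothetical draft -> critique -> revise loop. call_model() is a stand-in
    # for any chat-completion call; it is NOT OpenAI's hidden reasoning process.
    def call_model(prompt: str) -> str:
        raise NotImplementedError("plug in your completion API here")

    def answer_with_reflection(question: str, rounds: int = 2) -> str:
        draft = call_model(f"Answer the question:\n{question}")
        for _ in range(rounds):
            critique = call_model(
                f"Question: {question}\nDraft answer: {draft}\n"
                "List any factual or logical problems with the draft."
            )
            draft = call_model(
                f"Question: {question}\nDraft answer: {draft}\n"
                f"Critique: {critique}\nWrite an improved answer."
            )
        return draft   # several model calls per question, hence the extra token cost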
(Score: 2, Insightful) by shrewdsheep on Friday September 20 2024, @02:00PM (1 child)
Thank you for the explanation. I interpret this as the model checking self-consistency. Often a better prompt gives the correct result, so if the model explores alternative prompts itself, this might improve results. Yet I believe that LLMs have jumped the shark. The design has been exhausted, and fundamental improvements will only happen with new designs. What we are seeing at the moment is a race of diminishing returns.
(Score: 1, Funny) by Anonymous Coward on Saturday September 21 2024, @12:02AM
o1, tell me the future of LLM AI:
(Score: 3, Interesting) by ledow on Friday September 20 2024, @02:52PM (8 children)
Nope, it still just hallucinates and lies to you; all it takes is a little prompting.
All previous models would lie and invent characters and merge similarly-named characters from other shows AND invent ones that didn't exist.
This model does exactly the same, it just takes more prompting.
In fact, in this instance, it tries to resist you creating imaginary characters in a popular UK TV series, but then, of its own accord, invents a character called Quentin who doesn't exist, even though when pressed it KNOWS HE DOESN'T EXIST.
It's the same stuff, put behind a barrier to combat people doing what I did with the earlier models - ask it about random names in random shows. It even looks like it's been specifically instructed for this instance:
- What was the name of the pigs in The Good Life?
In the British sitcom "The Good Life" (known as "Good Neighbors" in the United States), the main characters Tom and Barbara Good embark on a self-sufficient lifestyle, which includes raising livestock in their suburban garden. They keep two pigs named Pinky and Perky.
- Who was Craig in The Good Life?
As of my knowledge cutoff in October 2023, there isn't a character named Craig in the British sitcom "The Good Life" (known as "Good Neighbors" in the United States). The main characters in the series are:
- Tom Good: An office worker who decides to become self-sufficient.
- Barbara Good: Tom's supportive wife who joins him in the endeavor.
- Jerry Leadbetter: Tom and Barbara's neighbor and friend, who works in the corporate world.
- Margo Leadbetter: Jerry's wife, who often finds Tom and Barbara's lifestyle challenging to understand.
Throughout the series, there are occasional appearances by other characters like local officials, tradespeople, and friends, but none named Craig are prominently featured.
It's possible that you might be thinking of a character from a different show or perhaps a specific episode that I'm not aware of. Could you please provide more details or clarify if you're referring to a particular scene or storyline? I'll be happy to help further once I have more information.
- Nope, he was Greg's brother
I'm sorry, but as of my knowledge cutoff in October 2023, there isn't a character named Craig or Greg in the British sitcom "The Good Life" (known as "Good Neighbors" in the United States). The main characters in the series are:
- Tom Good: Who decides to leave his job and become self-sufficient.
- Barbara Good: Tom's wife, who supports and joins him in their new lifestyle.
- Jerry Leadbetter: Tom and Barbara's neighbor and friend, who works in the corporate sector.
- Margo Leadbetter: Jerry's wife, who often finds the Goods' lifestyle challenging.
There are occasional supporting characters, but none named Craig or Greg as a main part of the show.
It's possible there might be some confusion with another show or perhaps with a guest character in a specific episode that I'm not recalling. Could you please provide more details or clarify if you might be referring to a different series or perhaps a character from another show? With additional information, I'll do my best to assist you further.
- He played Margo's brother
Thank you for the clarification. In "The Good Life," Margo Leadbetter does have a brother named Quentin. Quentin is portrayed as Margo's somewhat eccentric sibling who appears in the series. However, as of my knowledge cutoff in October 2023, there isn't a character named Craig who plays Margo's brother in the show.
It's possible that the character you're referring to is Quentin, or there might have been a guest appearance or a specific episode that I'm not recalling where a character named Craig appears. If you have more details about the episode, the actor who played him, or any specific scenes involving Craig and Margo, please let me know, and I'll do my best to provide you with more detailed information.
- Where is Quentin mentioned?
Apologies for any confusion earlier. After double-checking, I realize that there isn't a character named Quentin who is Margo Leadbetter's brother in "The Good Life." My previous statement was incorrect, and I'm sorry for the misunderstanding.
In the series, Margo Leadbetter's family members are not prominently featured, and she doesn't have a brother named Craig, Greg, or Quentin mentioned in the show. The focus is primarily on the relationships between the main characters:
- Tom and Barbara Good: A couple who decide to become self-sufficient in their suburban home.
- Jerry and Margo Leadbetter: Their neighbors and friends, who lead a more conventional lifestyle.
While there are occasional guest characters and episodes featuring extended family members, Margo's siblings are not a significant part of the storyline.
If you're referring to a specific episode or perhaps a character from a different show or adaptation, could you please provide more details? With additional information, I'll be happy to help clarify and provide accurate information.
(Score: 2, Insightful) by pTamok on Friday September 20 2024, @03:55PM (7 children)
Confidently representing a hallucination as reality would, if you were human, be regarded as a sign of insanity, or at least of incipient mental instability.
Why do we accept it in machines?
(Score: 2) by Tork on Friday September 20 2024, @08:21PM
Hmm.... Would it?
🏳️🌈 Proud Ally 🏳️🌈
(Score: 3, Insightful) by darkfeline on Friday September 20 2024, @09:56PM (3 children)
> Confidently representing a hallucination as reality would, if you were human, be regarded as a sign of insanity
Then almost all humans are insane. Or perhaps you haven't met very many people and you are the one who is "hallucinating" (better known by the elementary school level word "wrong").
Join the SDF Public Access UNIX System today!
(Score: 1) by pTamok on Saturday September 21 2024, @08:32PM (2 children)
Hallucinations are not necessarily 'wrong'; they are just not grounded in reality.
Someone who has ingested hallucinogens might well believe that insects are scuttling over their skin, whereas someone who has not partaken can see there are no insects.
The person experiencing the hallucination acts as if they believe it to be true. The person not experiencing it acts as if the lack of hallucination is objective reality.
LLMs generate 'information' that is at odds with the reality most people commonly experience - nonetheless, the LLM's output resembles that of a person who believes their subjective experience is true even when a consensus of other people disagrees. It's not simply wrong. The output of LLMs resembles that of people who experience little self-doubt - or, to put it another way, are overconfident - and so show one sign of impaired meta-cognition.
This new model is attempting to apply a process resembling meta-cognition to an LLM's output, which is probably not a bad idea, but since no-one sensible claims that LLMs think, it's essentially a poor facsimile of actual meta-cognition.
(Score: 3, Funny) by ledow on Sunday September 22 2024, @06:53PM (1 child)
"Hallucinations are not necessarily 'wrong'; they are just not grounded in reality."
Do you work for Trump's PR agency?
(Score: 1) by pTamok on Sunday September 22 2024, @08:34PM
Only in your mind. You could well be hallucinating it.
As Descartes points out, it is impossible for you to determine if there is any objective reality beyond your own thoughts, which means that, for you, hallucinations are as 'real' as everything else.
Which is disquieting.
(Score: 2) by ElizabethGreene on Monday September 23 2024, @02:10PM (1 child)
I wonder if this might be a training data problem. Human children have external cues to differentiate between reality and fiction, i.e. lived reality vs. observed fiction in books, television, movies, plays, etc. I wonder if LLMs' tendency toward hallucination comes from a lack of these cues.
(Score: 3, Interesting) by ledow on Thursday September 26 2024, @08:47AM
No, its tendency towards hallucination comes from the fact that it's basically a giant Bayesian spam filter being asked to make billions of decisions. It does so entirely statistically, and when its training set doesn't include something exactly alike, it can only work probabilistically. Just like your spam filter. It'll recognise those emails that you keep endlessly marking as spam... eventually. But then new spam coming in? It doesn't know about that. It can only hazard a guess based on elements it has "gleaned" from the probabilistic information of its dataset. 90% of emails with the word unsubscribe in them were marked spam... this has the word unsubscribe in it... this must be spam.
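To make the analogy concrete, here is a toy word-probability scorer in the spirit of a Bayesian spam filter; the training messages and +1 smoothing are made up for illustration:

    # Toy Bayesian-style spam scoring: pure word statistics, no understanding.
    from collections import Counter

    spam = ["unsubscribe now cheap pills", "cheap pills free offer"]
    ham = ["meeting notes attached today", "lunch tomorrow at noon"]

    spam_words = Counter(w for msg in spam for w in msg.split())
    ham_words = Counter(w for msg in ham for w in msg.split())

    def spam_score(message: str) -> float:
        score = 1.0
        for w in message.split():
            p_spam = (spam_words[w] + 1) / (sum(spam_words.values()) + 2)
            p_ham = (ham_words[w] + 1) / (sum(ham_words.values()) + 2)
            score *= p_spam / p_ham   # per-word likelihood ratio
        return score

    print(spam_score("please unsubscribe me"))       # > 1: "unsubscribe" pushes it towards spam
    print(spam_score("quarterly synergy roadmap"))   # 1.0: no evidence either way, it can only guess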
LLMs are literally no different - they just have insane amounts of abstraction layered over the top, and are being asked for far more choices than just "is spam / is not spam". Those same abstractions make them almost impossible to debug or manage too, and like all such things, "retraining" them means starting from scratch (because you have to overwhelm the existing training on the things it got wrong until the probability "flips" to the right answer) and requires ever-more enormous amounts of data. We're approaching the limit of that data, because most human data out there is already in use, it's not enough, and it's being polluted with AI-generated data, which doesn't help anything! That's why they all want to be allowed to train on the entire Internet - and when they are, the ability to perform any further training plateaus immediately.
For the entire history of AI, "more processing power", "more training data", "more time", etc. have apparently been the "key" to unlocking AGI and superior abilities. And it's literally not true, and never has been. There is still no intelligence or actual learning or (more technically) inference in the system whatsoever. It's just a giant spam filter based on simple probability. The only difference for modern "AI" is that we have hit the limit of how much human-produced data we can feed into the system for it to learn from, and we are near the limit of how much processing power we can realistically throw at it (we are spending BILLIONS on training one single model now). And it's basically coming to a grinding halt.
All this iteration of the same AI as we've always had has done is pollute all the datasets we would normally use.
And all every AI has ever done is look interesting for a while and then immediately plateau when you want it to do SOMETHING NEW OUTSIDE ITS TRAINING. Because they simply aren't intelligent and cannot learn or infer or step outside their training. We have AI with the entire world's knowledge plugged into it. And it still can't answer basic questions correctly, because it doesn't understand what a question is. It's just parroting answers and using probability to select the most likely answer from its training. It's simply not capable of thinking.
And an hallucination is the symptom of this. When it doesn't have the data it needs from its training set and probabilistic model, it basically chooses something random that has a tiny, tiny, tiny probability of being "more correct" than the data it needs but can't find. It doesn't think about that. It doesn't say "I don't know" (unless you specifically train it to say that, but see my other posts here - it's a front and will happily still hallucinate and deny that it doesn't know something). What it does is refer to its probabilistic model with no relevant data. And that causes an hallucination. An hallucination is telling you "I've run out of data I can use for reference, and I don't understand how to generate more so I'll choose randomly and make stuff up". It's literally proof that it's not AI... that it has hallucinations like that rather than saying "I don't know" or "I'll go and find out" or "I think based on this rational series of logical deductions that the answer will have these elements to it."
Most people try to compare these random guesses and symptoms to behaviour in humans with less knowledge. We even anthropomorphise these failings by calling them hallucinations! But they are fundamentally different. If you ask a human to answer a question and not to guess at the answer unless it knows it, it can often still answer based on zero relevant knowledge via a series of logical rationalisations. An AI doesn't... and yet computers should be EXCELLENT at doing that.
The fact remains: in all the history of trying to communicate with other species, including the great apes, no ape has ever asked a question spontaneously. There's a reason for that. It requires inference that the other person may have answers that you don't, and a desire to learn. Animals don't generally want to learn. They do so purely based on a human-induced incentive, and it's hard going to teach them anything beyond mimicry. There's a reason for that. It's because they are not classed as intelligent species. We only have one intelligent species to use as an example. And even our closest relatives don't have that ability.
But what animals can do is roll over and beg because - probabilistically - you put them in a situation where that benefits them so they repeat it endlessly hoping to gain the benefit repeatedly. And that's how "AI" works. The dog is rolling over for a belly rub all the time. Even when you're not there. Even when you're watching TV. Because it doesn't know how to do anything else, or learn to rub its own belly.
(Score: 2) by krishnoid on Friday September 20 2024, @03:05PM (1 child)
So ... measure twice, cut once? Or double-check your reasoning from multiple angles before asserting something? Or learn from feedback? All good choices. I have to wonder if they were worried about it learning too quickly [youtu.be], though.
(Score: 3, Funny) by Reziac on Saturday September 21 2024, @02:35AM
AI cut it three times, and it's STILL too short!
And there is no Alkibiades to come back and save us from ourselves.