Newsweek - WordupNews, Newsbeat, Analysis, Business, Politics, Technology, Entertainment, Fashion, Sports

Google has released what it’s calling a new “reasoning” AI model — but it’s in the experimental stages, and from our brief testing, there’s certainly room for improvement.

The new model, called Gemini 2.0 Flash Thinking Experimental (a mouthful, to be sure), is available in AI Studio, Google’s AI prototyping platform. A model card describes it as “best for multimodal understanding, reasoning, and coding,” with the ability to “reason over the most complex problems” in fields such as programming, math, and physics.

In a post on X, Logan Kilpatrick, who leads product for AI Studio, called Gemini 2.0 Flash Thinking Experimental “the first step in [Google’s] reasoning journey.” Jeff Dean, chief scientist for Google DeepMind, Google’s AI research division, said in his own post that Gemini 2.0 Flash Thinking Experimental is “trained to use thoughts to strengthen its reasoning.”

“We see promising results when we increase inference time computation,” Dean said, referring to the amount of computing used to “run” the model as it considers a question.

It’s still an early version, but check out how the model handles a challenging puzzle involving both visual and textual clues: (2/3) pic.twitter.com/JltHeK7Fo7

— Logan Kilpatrick (@OfficialLoganK) December 19, 2024

Built on Google’s recently announced Gemini 2.0 Flash model, Gemini 2.0 Flash Thinking Experimental appears to be similar in design to OpenAI’s o1 and other so-called reasoning models. Unlike most AI, reasoning models effectively fact-check themselves, which helps them avoid some of the pitfalls that normally trip up AI models.

The drawback is that reasoning models often take longer, usually seconds to minutes longer, to arrive at solutions.

Given a prompt, Gemini 2.0 Flash Thinking Experimental pauses before responding, considering a number of related prompts and “explaining” its reasoning along the way. After a while, the model summarizes what it considers to be the most accurate answer.

Well — that’s what’s supposed to happen. When I asked Gemini 2.0 Flash Thinking Experimental how many R’s were in the word “strawberry,” it said “two.”

Google’s new reasoning model struggles with counting the letters in words, sometimes. Image Credits: Google

Your mileage may vary.
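For reference, the right answer to the letter-counting question is three, and it's trivial to confirm with ordinary code. Here's a minimal Python sketch; the helper name `count_letter` is our own illustration and has nothing to do with Google's tooling or the model itself.

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a single letter in a word."""
    return word.lower().count(letter.lower())


if __name__ == "__main__":
    # "strawberry" has one R after the T and a double R before the Y.
    print(count_letter("strawberry", "r"))  # prints 3
```

A plain string scan like this is deterministic, which is what makes the question a handy sanity check for a model that's billed as reasoning through its answers.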

In the wake of the release of o1, there’s been an explosion of reasoning models from rival AI labs — not just Google. In early November, DeepSeek, an AI research company funded by quant traders, launched a preview of its first reasoning model, DeepSeek-R1. That same month, Alibaba’s Qwen team unveiled what it claimed was the first “open” challenger to o1.

Bloomberg reported in October that Google had several teams developing reasoning models. Subsequent reporting by The Information in November revealed that the company has at least 200 researchers focusing on the technology.

What opened the reasoning model floodgates? Well, for one, the search for novel approaches to refine generative AI. As my colleague Max Zeff recently reported, “brute force” techniques to scale up models are no longer yielding the improvements they once did.

Not everyone’s convinced that reasoning models are the best path forward. They tend to be expensive, for one, thanks to the large amount of computing power required to run them. And while they’ve performed well on benchmarks so far, it’s not clear whether reasoning models can maintain this rate of progress.

