Connect with us

Tech

Google launches Gemini 3.1 Pro, retaking AI crown with 2X+ reasoning performance boost

Published

on

Late last year, Google briefly took the crown for most powerful AI model in the world with the launch of Gemini 3 Pro — only to be surpassed within weeks by OpenAI and Anthropic releasing new models, s is common in the fiercely competitive AI race.

Now Google is back to retake the throne with an updated version of that flagship model: Gemini 3.1 Pro, positioned as a smarter baseline for tasks where a simple response is insufficient—targeting science, research, and engineering workflows that demand deep planning and synthesis.

Already, evaluations by third-party firm Artificial Analysis show that Google’s Gemini 3.1 Pro has leapt to the front of the pack and is once more the most powerful and performant AI model in the world.

A big leap in core reasoning

The most significant advancement in Gemini 3.1 Pro lies in its performance on rigorous logic benchmarks. Most notably, the model achieved a verified score of 77.1% on ARC-AGI-2.

Advertisement

This specific benchmark is designed to evaluate a model’s ability to solve entirely new logic patterns it has not encountered during training.

This result represents more than double the reasoning performance of the previous Gemini 3 Pro model.

Google Gemini 3.1 Pro benchmark chart

Google Gemini 3.1 Pro benchmark chart. Credit: Google

Beyond abstract logic, internal benchmarks indicate that 3.1 Pro is highly competitive across specialized domains:

Advertisement
  • Scientific Knowledge: It scored 94.3% on GPQA Diamond.

  • Coding: It reached an Elo of 2887 on LiveCodeBench Pro and scored 80.6% on SWE-Bench Verified.

  • Multimodal Understanding: It achieved 92.6% on MMMLU.

These technical gains are not just incremental; they represent a refinement in how the model handles “thinking” tokens and long-horizon tasks, providing a more reliable foundation for developers building autonomous agents.

Improved vibe coding and 3D synthesis

Google is demonstrating the model’s utility through “intelligence applied”—shifting the focus from chat interfaces to functional outputs.

One of the most prominent features is the model’s ability to generate “vibe-coded” animated SVGs directly from text prompts. Because these are code-based rather than pixel-based, they remain scalable and maintain tiny file sizes compared to traditional video, boasting far more detailed, presentable and professional visuals for websites and presentations and other enterprise applications.

Other showcased applications include:

Advertisement
  • Complex System Synthesis: The model successfully configured a public telemetry stream to build a live aerospace dashboard visualizing the International Space Station’s orbit.

  • Interactive Design: In one demo, 3.1 Pro coded a complex 3D starling murmuration that users can manipulate via hand-tracking, accompanied by a generative audio score.

  • Creative Coding: The model translated the atmospheric themes of Emily Brontë’s Wuthering Heights into a functional, modern web design, demonstrating an ability to reason through tone and style rather than just literal text.

Business impact and community reactions

Enterprise partners have already begun integrating the preview version of 3.1 Pro, reporting noticeable improvements in reliability and efficiency.

Vladislav Tankov, Director of AI at JetBrains, noted a 15% quality improvement over previous versions, stating the model is “stronger, faster… and more efficient, requiring fewer output tokens”. Other industry reactions include:

  • Databricks: CTO Hanlin Tang reported that the model achieved “best-in-class results” on OfficeQA, a benchmark for grounded reasoning across tabular and unstructured data.

  • Cartwheel: Co-founder Andrew Carr highlighted the model’s “substantially improved understanding of 3D transformations,” noting it resolved long-standing rotation order bugs in 3D animation pipelines.

  • Hostinger Horizons: Head of Product Dainius Kavoliunas observed that the model understands the “vibe” behind a prompt, translating intent into style-accurate code for non-developers.

Pricing, licensing, and availability

For developers, the most striking aspect of the 3.1 Pro release is the “reasoning-to-dollar” ratio. When Gemini 3 Pro launched, it was positioned in the mid-high price range at $2.00 per million input tokens for standard prompts. Gemini 3.1 Pro maintains this exact pricing structure, effectively offering a massive performance upgrade at no additional cost to API users.

  • Input Price: $2.00 per 1M tokens for prompts up to 200k; $4.00 per 1M tokens for prompts over 200k.

  • Output Price: $12.00 per 1M tokens for prompts up to 200k; $18.00 per 1M tokens for prompts over 200k.

  • Context Caching: Billed at $0.20 to $0.40 per 1M tokens depending on prompt size, plus a storage fee of $4.50 per 1M tokens per hour.

  • Search Grounding: 5,000 prompts per month are free, followed by a charge of $14 per 1,000 search queries.

For consumers, the model is rolling out in the Gemini app and NotebookLM with higher limits for Google AI Pro and Ultra subscribers.

Advertisement

Licensing implications

As a proprietary model offered through Vertex Studio in Google Cloud and the Gemini API, 3.1 Pro follows a standard commercial SaaS (Software as a Service) model rather than an open-source license.

For enterprise users, this provides “grounded reasoning” within the security perimeter of Vertex AI, allowing businesses to operate on their own data with confidence.

The “Preview” status allows Google to refine the model’s safety and performance before general availability, a common practice in high-stakes AI deployment.

By doubling down on core reasoning and specialized benchmarks like ARC-AGI-2, Google is signaling that the next phase of the AI race will be won by models that can think through a problem, not just predict the next word.

Advertisement

Source link

Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Tech

Qatar Helium Shutdown Puts Chip Supply Chain On a Two-Week Clock

Published

on

Iranian drone strikes shut down a major helium facility in Qatar, removing about 30% of global helium supply and raising concerns for the semiconductor industry, which relies on the gas for chip fabrication. “QatarEnergy declared force majeure on existing contracts on March 4, freeing it from supply obligations to customers,” reports Tom’s Hardware. The industry outlet Gasworld reports that no imminent restart is planned. From the report: Helium consultant Phil Kornbluth, speaking at a Gasworld webinar on March 4, said that if the outage extends beyond roughly two weeks, industrial gas distributors could be forced to relocate cryogenic equipment and revalidate supplier relationships, a process that could stretch over months regardless of when Qatari output resumes.

South Korea is among the most exposed countries, which, according to the Korea International Trade Association, imported 64.7% of its helium from Qatar in 2025. The country relies heavily on helium imports to cool silicon wafers during fabrication and is understood to have no viable substitute.

The country’s Ministry of Trade, Industry and Resources has reportedly launched an investigation into supply and demand for 14 semiconductor materials and equipment types with high dependence on Middle Eastern sources, Nikkei reported on Wednesday. Bromine, which is used in circuit formation, is another big concern, with South Korea sourcing 90% of its imports from Israel, also party to the ongoing conflict in Iran.

Source link

Advertisement
Continue Reading

Tech

At The WBC: Mark DeRosa Screwed Up & Then MLB Streisanded The Story

Published

on

from the nice-try dept

The World Baseball Classic is currently going on and I absolutely adore it. Essentially a World Cup for baseball, 20 nations are playing against one another in a banger of a tune-up for the Major League Baseball season. It’s a flamboyant delight, with cultural celebrations such as the Italian team doing a shot of espresso after they hit home runs in the dugout.

The American team is managed by former major leaguer Mark DeRosa. While I won’t bore you with too many gory details, DeRosa royally fucked up during the tail end of pool play. Through a complicated series of winning scenarios and tie-breaker rules, the American team headed into its game with Italy needing to win to secure its place in the playoffs. DeRosa, it appears, was under an entirely different impression. These were his comments before the game with Italy.

After the game, he mentioned that some of his players were “dragging” on the field and he essentially put in a lineup that didn’t include many of the normal starting players. If you don’t know professional baseball culture, there’s a reason for the dragging. With nothing at stake, it’s pretty clear DeRosa thought the playoffs were already secured… and told his players to go out and celebrate that night. They likely did, late into the night and with the help of plenty of alcohol. Then they lost to Italy, which meant they needed Italy to win or to get into tie-breaking scenarios against their next game with Mexico. They got lucky in that Italy did beat Mexico in the next game, but the fuck up took things out of the hands of Team USA, leaving it up to their rivals.

You may not care about any of the above, but baseball fans do. DeRosa, in his day job, is also an employee of MLB, serving as a commentator on the MLB channel. MLB itself took down the original video of DeRosa’s comments and put up a version in which you don’t hear DeRosa’s mistake nor his admitting later that he screwed up.

Also, this reporting from The Athletic doesn’t actually make things look better for DeRosa and Team USA:

“The league appears to have taken down video that included DeRosa’s mistaken comments from MLB.com, with attempts by The Athletic to access it yielding error messages early Wednesday morning. A version of the interview that remained on MLB Network’s Facebook page appeared to be condensed and did not include the now-scrutinized remarks.”

Advertisement

I really don’t know what MLB was thinking here. American baseball fans would somehow forget what they heard DeRosa say? A screw up that could have bounced the American team from the WBC entirely would somehow fly under the radar?

Regardless, the Streisand Effect took over and now then the reporting on all of this went into wide circulation. In discussing MLB’s attempt at the hidden ball trick, reporting on DeRosa’s fuck up went through another, and larger, round of reporting. By trying to hide what DeRosa did, MLB made it public all the more.

This is classic Streisand Effect stuff at work and I can barely believe that Major League Baseball thought this isn’t exactly what would occur.

Filed Under: baseball, mark derosa, streisand effect, wbc

Companies: mlb

Advertisement

Source link

Continue Reading

Tech

ChatGPT, Other Chatbots Approved For Official Use In the Senate

Published

on

An anonymous reader quotes a report from the New York Times: A top Senate administrator on Monday gave aides the green light to use three artificial intelligence chatbots for official work, a reflection of how widespread the use of the products has become in workplaces around the globe. The chief information officer for the Senate sergeant-at-arms, who oversees the chamber’s computers as well as security, said in a one-page memo reviewed by The New York Times that aides could use Google’s Gemini chat, OpenAI’s ChatGPT or Microsoft Copilot, which is already integrated into Senate platforms.

Copilot “can help with routine Senate work, including drafting and editing documents, summarizing information, preparing talking points and briefing material, and conducting research and analysis,” the memo said. The document later added that “data shared with Copilot Chat stays within the secure Microsoft 365 Government environment and is protected by the same controls that safeguard other Senate data.” It’s unclear how widely AI is used in the Senate or how widespread it might become, as individual offices and committees set their own rules. The chamber has also not publicly released comprehensive guidance on chatbots, the report notes.

In contrast, the House has clearer policies allowing the general use of AI for limited internal tasks but restricting it from sensitive data or for being used for deepfakes and certain decision-making activities.

Source link

Advertisement
Continue Reading

Tech

A change could be set to make even older Android phones much faster

Published

on

Google is working on a behind-the-scenes change to Android that could make phones feel noticeably quicker – without requiring new hardware.

The company is introducing a new optimisation technique for the Android kernel. This could improve app launches, system performance and even battery efficiency.

The update centres on the Android kernel, the core part of the operating system. The kernel is responsible for managing communication between apps, the processor and the phone’s hardware. According to Google, the kernel accounts for roughly 40% of total CPU activity on Android devices. This means even small improvements here can have a meaningful impact on day-to-day performance.

The new approach uses something called Automatic Feedback-Directed Optimisation (AutoFDO). In simple terms, it allows the software compiler, the tool that converts code into instructions your phone’s processor understands, to learn from how people actually use their devices. This is instead of relying purely on general assumptions.

Advertisement

To gather this data, Google ran controlled tests using Pixel phones that simulated real-world behaviour. The process involved launching and interacting with the top 100 most popular Android apps. Profiling tools tracked which parts of the kernel were used most frequently. The system then identifies these “hot” sections of code and prioritises them when rebuilding the kernel.

Advertisement

By reorganising the code around the parts that matter most, the compiler can make smarter optimisation decisions. The result, Google says, is faster app launches, smoother multitasking and potentially better battery life.

The company has already begun rolling the optimisation out to its android16-6.12 and android15-6.6 kernel branches, which underpin recent Android versions. It also plans to expand the technique to future releases.

Advertisement

Longer term, Google also intends to apply similar optimisations to other parts of the system. This includes additional kernel components and hardware drivers used by phone makers for features like cameras and modems.

It’s the kind of change most users will never see — but if it works as intended, it could make everyday Android performance feel just a little bit snappier.

Source link

Advertisement
Continue Reading

Tech

ICYMI: the week’s 7 biggest tech news stories from Sonos’ big return to our review of the ‘impressively premium’ MacBook Neo

Published

on

When is a quiet week in tech not a quiet week in tech? How about right now. Because while this week lacked the huge launches of the previous one, it was still packed with big stories and impressive new tech.

For starters, we delivered our expert verdicts on the Apple devices that were revealed last week, and the MacBook Neo in particular blew us away. We also sat down for a long chat with Sonos‘ CEO as the audio giant launched two new speakers, and delivered our Google Pixel 10a review.

Source link

Advertisement
Continue Reading

Tech

Meta is killing end-to-end encryption in Instagram DMs

Published

on

Meta is killing end-to-end encryption in Instagram DMs. The feature will “no longer be supported after May 8, 2026,” the company wrote in an update on its support page. Unlike WhatsApp, Meta never made encryption available to all Instagram users and it was never a default setting. Instead, users in “some areas” had the ability to opt-in to encryption on a per-chat basis.

In a statement, a Meta spokesperson said the feature was being retired due to low adoption. “Very few people were opting in to end-to-end encrypted messaging in DMs, so we’re removing this option from Instagram in the coming months,” the spokesperson said. “Anyone who wants to keep messaging with end-to-end encryption can easily do that on WhatsApp.”

Interestingly, Meta’s statement doesn’t mention the status of encryption on Messenger. The company began turning on end-to-end encryption as a default setting in 2023 after years of work on the feature. A support page for Messenger currently states that the company “is in the process of securing personal messages with end-to-end encryption by default.”

Meta’s approach to encrypted messaging has changed several times over the years. It started encrypting WhatsApp chats in 2016. In 2019, Mark Zuckerberg outlined a “privacy-focused” revamp of the company’s apps, saying at the time that “implementing end-to-end encryption for all private communications is the right thing to do.” In 2021, the company’s head of safety said that Meta was delaying its encryption work until 2023 in order to create stronger safety features.

Advertisement

Meta’s use of encryption has been repeatedly criticized by law enforcement and some child safety organizations that say the feature makes it harder to catch predators who target children on social media. Recently, the topic has been raised numerous times during a trial in New Mexico over child safety. Internal documents that have surfaced as part of the trial show Meta executives and researchers debating the trade-offs between safety and privacy as it relates to encryption.

In testimony that was broadcast during the trial, Zuckerberg said that safety issues were “a large part of the reason why it took so long” to bring encryption to Messenger. “There’s been debate about this, but I think the majority of folks, from people who use our products to people who are involved in security overall, believe that strong encryption is positive,” he said.

Source link

Advertisement
Continue Reading

Tech

Today’s NYT Mini Crossword Answers for March 14

Published

on

Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.


Need some help with today’s Mini Crossword? It’s the extra-long Saturday version, and a few of the clues are tricky. Read on for all the answers. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.

If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.

Advertisement

Read more: Tips and Tricks for Solving The New York Times Mini Crossword

Let’s get to those Mini Crossword clues and answers.

completed-nyt-mini-crossword-puzzle-for-march-15-2026.png

The completed NYT Mini Crossword puzzle for March 15, 2026.

Advertisement

NYT/Screenshot by CNET

Mini across clues and answers

1A clue: Book parts: Abbr.
Answer: PGS

4A clue: Silicon Valley company that operates a fleet of robotaxis
Answer: WAYMO

6A clue: To a much greater degree
Answer: WAYMORE

Advertisement

8A clue: Contents of a scuba diver’s tank
Answer: AIR

9A clue: South Korean automaker
Answer: KIA

10A clue: Stop on a train route
Answer: STATION

12A clue: Actress Merman of “Anything Goes”
Answer: ETHEL

Advertisement

13A clue: Find another purpose for
Answer: REUSE

Mini down clues and answers

1D clue: Employee’s hourly calculation
Answer: PAYRATE

2D clue: Workout spot
Answer: GYM

3D clue: “Great” mountains of Tennessee, familiarly
Answer: SMOKIES

Advertisement

4D clue: One giving you the dish?
Answer: WAITER

5D clue: Baltimore M.L.B. player
Answer: ORIOLE

6D clue: Used to be
Answer: WAS

7D clue: Suffix with Caesar or Euclid
Answer: EAN

Advertisement

11D clue: Night that NBC once aired “30 Rock” and “The Office”: Abbr.
Answer: THU

Source link

Advertisement
Continue Reading

Tech

MacOS isn’t too much of a safe haven than Windows as infostealers come for Apple computers

Published

on

I used to be of the opinion that MacBooks are relatively safer than other laptops, but I have been proven wrong. Embarrassingly and demonstrably wrong. A new report from Sophos X-Ops has spared no effort in rubbing my nose in it. 

Researchers at the firm tracked three separate attack campaigns between November 2025 and February 2026, all of which targeted macOS users with something called the MacSync infostealer. For those catching up — it’s a type of malware that quietly rifles through your passwords and saved credentials, acting like a digital pickpocket. 

So, how does it actually work?

The malware used a delivery method called ClickFix, which requires minimal technical effort. It just needs the victims to copy and paste a command into their Mac’s Terminal (designed to run and execute text-based commands) and press enter on the keyboard.

First, bad actors used fake OpenAI download pages, which were circulated via sponsored ads on Google (sitting right above the legitimate link). Then, they got even more creative: attackers started sharing rear ChatGPT shared conversations disguised as “helpful Mac guides.”

These guides routed users into fake GitHub pages, which contained carefully created software installation instructions, but in reality, they asked users to copy a terminal command, allowing the ManSync infostealer to work in the background. That’s it; that’s the whole attack. 

Advertisement

How bad did it get?

Sophos has found out that by December 2025 alone, bad actors had routed more than 50,000 clicks on such malicious domains. A “click” means that someone copied the malicious terminal command, but not necessarily that the malware successfully installed; the actual infection count could be lower. 

The developers put another spin on their attacking method in February 2026, allowing it to run silently in the background, bypassing the competent macOS security tools such as Gatekeeper and XProtect. It can, in a very real way, patch your ledger crypto wallet’s 24-word master key. 

The firm reports that infection clusters were active in key markets, including parts of North and South America and India, as recently as weeks before they published the article (by the end of the beginning of March, possibly). 

Moreover, the notion that “Macs are safe,” is at least, for the time being, not true. As AI platforms grow in popularity, and, more importantly, gain the trust of millions of users, bad actors are coming up with new ways to use the LLMs-driven tools to their advantage. For now, I’d advise you to not paste any text-based command into your Mac’s Terminal.

Advertisement

Source link

Continue Reading

Tech

Samsung says its Micro RGB TVs likely won’t up your sleep cycle

Published

on

We’ve all heard the saying: “screens before bed are bad,.” Yet somehow, I’ve been watching screens to go to sleep after a day of working with the screens for around eight to 10 hours. Well, I might consider switching to Samsung’s micro RGB TVs for both my work and leisure requirements, as they’ve recently got an eye- and sleep-friendly certification. 

In a press release, the Korean tech giant has announced that its Micro RGB TV (the R95H model) has received two certifications from VDE (which is a German testing body). 

What certifications has the Samsung TV received?

The Samsung TV has received the Safety for Eyes certification and the Circadian Rhythm Display (CRD) certification. Without making things too technical for you, the R95H model has been officially tested to not wreck your eyes or sleep, especially during the hours after sunset, when too much blue light consumption can disturb your sleep cycle. 

Here’s how it works. The first certification, Safety for Eyes, takes care of the blue light emissions — the wavelength which is associated the most with eye strain and disturbed sleep — confirming that the television meets the safe thresholds for prolonged viewing sessions. 

The second one, Circadian Rhythm Display (CRD) verification goes a step further by confirming that the TV actually mimics the pattern of natural light. The television leans toward producing cooler tones during the day, warmer tones in the evening, and, most importantly, dials down blue light at night. 

Advertisement

How do the compatible TVs pull this off?

Basically, it doesn’t force your brain into thinking that it’s noon by producing cool light, when it’s midnight, so that viewing the television doesn’t disrupt your sleep cycle. But how does the TV manage all this?

Well, it’s Samsung’s micro RGB LED architecture that allows the display to make the fine-grained adjustments in the overall brightness and color profile of the screen, with an enhanced level of precision that isn’t present on other models. 

While the Safety for Eyes certification is available across the company’s 2026 TV lineup, Circadian Rhythm Display (CRD) is currently available on the premium models.

Source link

Advertisement
Continue Reading

Tech

Harbor Freight Has A Versatile 12-Tray Solution To Workshop Clutter

Published

on





Keeping a workshop organized can feel like a never-ending task, and so any item that helps make organization easier can make a big difference. Fans of Harbor Freight will already be well aware that the retailer is a great place to look for cheap garage and workshop essentials, and one product in particular might come in useful for anyone trying to keep their workshop clutter within manageable levels. The Bauer storage system modular organizer features 12 individual bins that can be arranged in a custom configuration, making it a great place to store those small items that can get lost around the workshop.

All of the bins are removable, so there’s no need to haul around the entire organizer for smaller jobs. However, anyone who prefers to take everything with them on the go should still find the organizer useful, since it’s IP65 rated against dust and water ingress and can be connected to other Bauer storage system products. The brand offers a range of crates, tool boxes, and cases, alongside the modular storage organizer, in a similar manner to Milwaukee’s popular Packout storage system.

The Bauer organizer retails for $39.99 at Harbor Freight, and at the time of writing, it’s only available as an in-store exclusive and not online. However, if its reviews are anything to go by, it might be worth the trip to your local retailer.

Advertisement

The Bauer organizer gets consistently good reviews

Bauer makes plenty of top-rated power tools, and its modular storage organizer gets similarly glowing reviews from buyers. It has amassed just under 400 reviews from Harbor Freight buyers to date, with a near-perfect average score of 4.9 out of 5 stars. Several reviewers note how easy the organizer makes it to store a wide range of items, from screws and drill bits to pens and snacks. Others say that the organizer’s clear lid is a particularly useful feature, since it allows them to see exactly what’s in each bin at a glance.

Advertisement

Complaints about the organizer are few and far between. One reviewer who left a two-star review claimed that the material quality of the organizer wasn’t up to the task, while a few reviewers who left three-star reviews said rival systems were tougher overall. Aside from that, buyers remain consistently impressed with the organizer’s construction and its capabilities.

While plenty of reviewers like the Bauer organizer, it’s far from the only Harbor Freight product that might come in useful if you’re looking to cut down on clutter. The retailer also offers individual $3 stacking tilt bins that can help organize garages and workshops, and they get similarly good reviews from buyers.

Advertisement



Source link

Continue Reading

Trending

Copyright © 2025