Tech

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

Published

3 hours ago

17 June 2026

On Sunday, a team of nine researchers at Sina Weibo — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a 14-page technical report to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can match or exceed the reasoning performance of flagship systems from Google DeepMind, OpenAI, Anthropic, and DeepSeek that are hundreds of times larger.

The model, called VibeThinker-3B, scored 94.3 on AIME 2026 — the American Invitational Mathematics Examination, one of the most demanding standardized math competitions in the world. That figure places it alongside DeepSeek V3.2, a model with 671 billion parameters, and ahead of Gemini 3 Pro, Google’s high-performance flagship reasoning system, which scored 91.7. With a test-time scaling technique the team calls Claim-Level Reliability Assessment, the score climbs to 97.1, edging past virtually every system in the public record.

Within hours of publication, the paper had drawn 62 upvotes on Hugging Face’s daily papers feed, the model repository had accumulated 130 likes, and the GitHub repository had reached 685 stars. But the reaction on social media was not uniformly celebratory. It was, in many cases, deeply skeptical.

“WHAT THE HELL is happening in AI?” wrote the user @orcus108 on X, in a post that accumulated over 161,000 views. “A 3B parameter model just put up coding benchmark scores in the same league as Claude Opus 4.5… I genuinely don’t know if this is a breakthrough or if the benchmarks are broken.”

That tension — between genuine scientific advancement and the growing suspicion that AI benchmarks have become gameable to the point of meaninglessness — sits at the heart of the VibeThinker-3B story. And the answer matters enormously, not just for academic bragging rights, but for the multibillion-dollar question of whether the AI industry’s relentless push toward ever-larger models is the only path to intelligence.

Benchmark scores that defy the scaling laws of modern AI

The results reported in the technical report are, by any conventional standard, extraordinary.

On the mathematics side, VibeThinker-3B achieved 91.4 on AIME 2025, 94.3 on AIME 2026, 89.3 on HMMT 2025 (the Harvard-MIT Mathematics Tournament), 93.8 on BruMO 2025 (the Brown University Math Olympiad), and 76.4 on IMO-AnswerBench, a benchmark comprising 400 problems at the level of the International Mathematical Olympiad. In coding, it posted an 80.2 Pass@1 on LiveCodeBench v6, a benchmark designed to test executable code generation, and achieved a 96.1 percent acceptance rate on unseen LeetCode weekly and biweekly contests from late April through late May 2026. On instruction following, it scored 93.4 on IFEval.

To put the parameter disparity in perspective: DeepSeek V3.2 has 671 billion parameters — roughly 224 times the size of VibeThinker-3B. GLM-5, from Zhipu AI, has 744 billion parameters. Kimi K2.5, from Moonshot AI, exceeds 1 trillion. VibeThinker-3B’s 3 billion parameters could run on a consumer laptop.

The researchers frame this result not as an anomaly but as evidence for a broader theoretical claim. They introduce what they call the “Parametric Compression-Coverage Hypothesis,” which argues that different types of AI capability have fundamentally different relationships to model size. Verifiable reasoning — the kind tested by math competitions and coding challenges, where answers can be definitively checked — is what the paper calls a “parameter-dense” capability: one that can be compressed into a compact core. Open-domain knowledge, by contrast, is “parameter-expansive,” requiring broad coverage across facts, concepts, and edge cases that inherently demands more parameters.

The paper acknowledges this distinction directly. On GPQA-Diamond, a graduate-level science knowledge benchmark, VibeThinker-3B scored just 70.2 — well behind the 91.9 achieved by Gemini 3 Pro and the 87.0 scored by Claude Opus 4.5. The authors write that this gap “is consistent with our claim rather than a contradiction to it: the main finding is not that a 3B model has fully replaced leading general-purpose models, but that a small model can reach first-tier performance on many verifiable reasoning tasks.”

Inside the four-stage training pipeline that powers a tiny reasoning engine

VibeThinker-3B is not built from scratch. It is post-trained on top of Qwen2.5-Coder-3B, a compact foundation model from Alibaba’s Qwen team, through what the Weibo AI researchers call the “Spectrum-to-Signal Principle” — a multi-stage pipeline first introduced in the team’s earlier VibeThinker-1.5B work in November 2025.

The training unfolds in four major phases. The first is a two-stage supervised fine-tuning process that uses curriculum learning: the model first trains on a broad mixture of math, code, STEM reasoning, general dialogue, and instruction-following data, then shifts to a curated subset of harder, longer-horizon reasoning problems. In the second stage, samples with reasoning traces shorter than 5,000 tokens are discarded, and problems that VibeThinker-1.5B can solve more than 75 percent of the time are filtered out, forcing the model to focus on genuinely difficult challenges.

The second phase applies reinforcement learning across multiple domains — mathematics, code, and STEM — using the team’s MaxEnt-Guided Policy Optimization algorithm, or MGPO, which prioritizes training on problems at the model’s current capability boundary rather than problems it already solves easily or finds impossible. Notably, the team found that a strategy that worked well at the 1.5B scale — progressively expanding the context window during RL training — actually hurt performance at 3B. They hypothesize that the stronger starting checkpoint meant that truncating reasoning traces during warm-up was no longer removing noise but disrupting valid reasoning patterns. The solution was to train with a single 64,000-token context window throughout.

Within the math RL phase, the team also introduces what it calls “Long2Short Math RL,” a secondary optimization stage that redistributes rewards to favor shorter correct solutions over longer ones, reducing verbosity without sacrificing accuracy. The technique uses a zero-sum reward redistribution that avoids biasing the overall reward signal while nudging the model toward more efficient reasoning.

The third phase extracts high-quality reasoning trajectories from the RL-trained checkpoints and distills them back into a unified model through supervised fine-tuning. The team uses a “learning-potential score” — essentially the student model’s perplexity on each teacher trajectory — to prioritize traces that are correct but that the student has not yet internalized. The final phase, called Instruct RL, applies reinforcement learning on instruction-following tasks using a combination of rule-based validators for format constraints and rubric-based reward models for open-ended quality assessment.

Francesco Bertolotti, an AI researcher who flagged the paper early on X, described the approach succinctly: “These results were achieved primarily through post-training refinements on Qwen2.5-Coder. The paper doesn’t provide many details, but it appears they distill from RL ckpts and then do a final RL-based instruct RL.” His post drew over 161,000 views.

Real-world testing reveals the gap between benchmark scores and practical AI performance

For every enthusiastic reaction, the paper drew an equally forceful objection. The AI research community in mid-2026 has grown deeply wary of benchmark-driven claims, and VibeThinker-3B arrived in an environment primed for suspicion.

“The benchmarks are literal pattern matching single file coding,” wrote @BigMoonKR on X. “It has no relation to actual coding work. I don’t know how people still don’t get this.”

“Benchmaxxing,” declared @oflu_bedirhan, using a term that has become shorthand in the AI community for models that appear optimized specifically for benchmark performance at the expense of real-world utility.

The most pointed criticism came from users who actually downloaded and tested the model. “Just tried the full precision,” wrote @politilols. “It doesn’t even know what a uv script (so the most popular Python dev tool) is. Haven’t seen that in a single LLM in at least a year now. Benchmaxxed.” When Bertolotti responded that the model seemed more focused on mathematical reasoning than practical coding, the user countered: “They include a livecodebench score. Zero chance that is reflective of the model.”

@Itsdotdev raised a structural criticism: “Look into the benchmarks themselves and it probably won’t be so shocking. Why no DeepSWE? Why none of the standard benchmarks SOTA providers use?” The user @AvenirReym posed a more diagnostic question: “If it holds on a benchmark made after the model’s training cutoff, it’s real. If it only wins on AIME-style sets that have been circulating for years, it’s leakage.”

The paper’s authors appear to have anticipated these objections. The technical report states that training sets “have undergone strict benchmark decontamination,” including n-gram-based filtering to remove “n-gram overlaps with evaluation sets.”

The LeetCode contest evaluation — which covers contests from April 25 to May 31, 2026, dates that postdate any plausible training data cutoff — represents the most robust guard against data contamination concerns. On those contests, VibeThinker-3B passed 123 out of 128 first-attempt submissions, a 96.1 percent rate that exceeded GPT-5.2, Doubao Seed 2.0 Pro, Kimi K2.5, and Claude Opus 4.6 under identical evaluation conditions.

Still, real-world user reports suggest a significant gap between benchmark performance and practical utility — a phenomenon that has become familiar across the industry. “In LM Studio it only responds well to first question, next questions reply to the first question,” reported @luismolinaab.

Why a social media company may have found a crack in the scaling hypothesis

Even the sharpest critics acknowledged that achieving these benchmark numbers at 3 billion parameters — regardless of how transferable they are to production use cases — is a meaningful engineering achievement. “Even if it’s benchmaxxing doing so with 3B parameters is fascinating, goes to show how fast this field is progressing,” wrote @rohityin.

The observation cuts to a question that has consumed the AI industry since the advent of the scaling hypothesis: Is bigger always better? The conventional wisdom, articulated most famously in the Chinchilla scaling laws and reinforced by the commercial dominance of ever-larger foundation models, holds that more parameters and more training data reliably yield better performance. The economic corollary is stark: training and deploying frontier models costs tens or hundreds of millions of dollars, creating enormous barriers to entry.

VibeThinker-3B challenges that consensus — but only partially. The paper is careful to draw a boundary around its claims, distinguishing between tasks with “clear verification signals” and those that require broad factual knowledge. The Parametric Compression-Coverage Hypothesis explicitly argues that small models cannot replace large ones across the board.

“The true significance of VibeThinker-3B does not lie in proving that a 3B model can replace large-scale generalists,” the paper states, “but rather in providing a concrete empirical signal: the development of compact models is no longer merely a passive compromise for deployment efficiency or cost control; it emerges as a promising research trajectory that is fundamentally complementary to the traditional parameter scaling paradigm.”

Perhaps the most surprising element of the work is its provenance. Sina Weibo — publicly traded on Nasdaq and Hong Kong, with a market capitalization that fluctuates in the single-digit billions — is not a company typically associated with frontier AI research. Yet the VibeThinker series is Weibo’s second major open-source AI contribution in seven months.

VibeThinker-1.5B, released in November 2025, demonstrated that a model with just 1.5 billion parameters could outperform the original DeepSeek R1 on several math benchmarks — a result the team achieved for what it claimed was a post-training cost of just $7,800, compared to the $294,000 estimated for DeepSeek R1.

The research team is compact — nine authors, all listed as Sina Weibo Inc. employees. The model is released under the MIT License, one of the most permissive open-source licenses available, and the weights are freely downloadable from both Hugging Face and ModelScope. Within the first day of release, community members had already created GGUF quantizations and derivative models.

Small models, big implications, and the question the AI industry can no longer avoid

The most honest assessment of VibeThinker-3B may be that it is simultaneously less and more than what the benchmarks suggest. Less, because a model that struggles with basic knowledge of popular developer tools is unlikely to replace any production-grade coding assistant anytime soon. More, because the underlying insight — that reasoning ability and factual knowledge are partially decoupled, and that the former can be compressed far more aggressively than previously assumed — has profound implications for how the industry thinks about model design, deployment economics, and the accessibility of advanced AI capabilities.

If the Parametric Compression-Coverage Hypothesis holds, it suggests a future in which small, specialized reasoning engines operate alongside large knowledge-rich models in hybrid architectures — a vision where a 3-billion-parameter model handles the logical heavy lifting while a larger system supplies the factual grounding. Such an architecture could dramatically reduce the cost of deploying AI reasoning capabilities, potentially bringing competition-level mathematical and coding performance to devices with modest hardware.

“The interesting part is that we’re starting to separate knowledge from reasoning,” wrote @RealLambdaFlux on X. “A small model with strong post-training can punch way above its size on tasks with clear feedback.”

@cmitsakis suggested the practical endgame: “I think small models are the future for agents because they can use tools to get the knowledge and they can run fast and cheap.”

Whether that future arrives through VibeThinker-3B specifically, or through the dozens of teams now racing to reproduce and extend these results, the paper has already accomplished something that no benchmark score can fully capture.

It has forced the AI community to confront an uncomfortable possibility: that for years, the industry may have been spending billions of dollars scaling up parameters to improve a kind of intelligence that could have fit, all along, on a laptop. The weights are public. The code is open. And the most important test isn’t on any leaderboard — it’s whether anyone can make a model this small actually useful in the real world.

Source link

Tech

Today’s NYT Mini Crossword Answers for June 17

Published

13 minutes ago

17 June 2026

NewsAdmin

Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.

Need some help with today’s Mini Crossword? It was a bit tricky today, I thought, especially 6-Across and 2-Down. Read on for all the answers. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.

If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.

Let’s get to those Mini Crossword clues and answers.

completed-nyt-mini-crossword-puzzle-for-june-17-2026.png — The completed NYT MIni Crossword puzzle for June 17, 2026.
Advertisement

NYT/Screenshot by CNET

Mini across clues and answers

1A clue: Witty one-liners
Answer: QUIPS

6A clue: Common poster in a geography class
Answer: USMAP

7A clue: Country that’s won the World Cup four times (but failed to qualify in 2018, 2022 and 2026)
Answer: ITALY

8A clue: The one for Starbucks has a wavy-haired mermaid
Answer: LOGO

9A clue: ___ socks (1970s fad)
Answer: TOE

Mini down clues and answers

1D clue: Patchwork blanket
Answer: QUILT

2D clue: “We feel the same!”
Answer: USTOO

3D clue: Word after spitting or mirror
Answer: IMAGE

4D clue: ___ Alto, Calif.
Answer: PALO

5D clue: Secret agent
Answer: SPY

Source link

Tech

FIFA Wants Jamal Musiala To Forget About Dre (During The World Cup)

Published

26 minutes ago

17 June 2026

NewsAdmin

The organization isn’t going to let a non-sponsor brand show up on the field.

Jessica Kirsh/Shutterstock

FIFA is known for having a strict policy about making sure brands, which aren’t official sponsors and advertisers, don’t appear on World Cup fields and stadium. For instance, it recently made sure that Beats wasn’t getting any free advertisement on the field and had Bayern Munich player Jamal Musiala literally cover the logo of his headphones with tape during warmup.

At FIFA’s request, Jamal Musiala had to cover the logo of his Beats by Dre headphones with a tape strip before the Curaçao game. FIFA is cracking down hard on brand logos at the World Cup – even the players have to hide logos if the companies are not official tournament sponsors… pic.twitter.com/PaAPBZYXP5
— Bayern & Germany (@iMiaSanMia) June 16, 2026

X user @iMiaSanMia posted a photo showing Musiala wearing headphones with a covered logo, reportedly at FIFA’s request, before Bayern’s match against Curaçao. If you haven’t heard yet, FIFA also had Levi’s cover its logo with a tarp at the Levi’s Stadium in Santa Clara, California, which is being called the San Francisco Bay Area Stadium for the World Cup. Levi’s, of course, took advantage of the buzz around it and replaced its social media profile picture with a tarp-covered version of its logo.

While the Beats branding isn’t showing up on the field, it’s been popping up on a lot of football/soccer players’ social media posts. In fact, it’s been using the players to tease an unannounced over-ear headphones model, which could have customizable colors based on the variety we’ve seen so far.

Source link

Tech

Anthropic’s latest feud with the Trump admin may actually help it, sales data suggests

Published

40 minutes ago

17 June 2026

NewsAdmin

Anthropic is having a month.

The AI lab finished May by surpassing OpenAI in market share of business spending for the first time, Ramp just revealed. It raised $65 billion at a $965 billion valuation (also besting OpenAI) at the end of May, then waltzed into June by filing confidential paperwork for an IPO, reportedly on the strength of its first-ever profitable quarter.

Then on Friday, the Trump administration renewed its war on the model maker by sending a letter demanding it ban non-Americans, including Anthropic’s employees, from accessing its state-of-the-art models: the limited-release Mythos 5 and the more guarded version of Mythos released to the public three days earlier, called Fable 5.

This essentially forced Anthropic to pull its latest all-powerful model from the market altogether.

Although the White House invoked an obscure export control directive when ordering the ban, the exact cause remains unclear. The chatter was that hackers easily bypassed Fable 5’s guardrails, which were intended to prevent access to Mythos’ capabilities. That model is so good at finding security flaws in software code that Anthropic itself marketed it as dangerous and restricted its public release.

This new drama comes after Anthropic famously refused to allow the government to use its models for mass surveillance of Americans and fully autonomous weapons. As a result, in March, the Trump administration declared the company a supply-chain risk.

That didn’t deter Anthropic’s sales to businesses. Quite the opposite, Ramp’s data shows. Ironically, this latest feud with the Trump administration, which also appears to validate the hubbub over Mythos’ mythological power, may help rather than hurt Anthropic, according to Ramp’s lead economist, Ara Kharazian. Kharazian is the person who compiled the business-spending AI data.

“If anything, it’ll probably boost them,” Kharazian told TechCrunch. “Anthropic’s best month on record, as far as business adoption, was the month that the Department of Defense labeled them a supply-chain risk. There’s a lot of aura that comes with your model specifically being named too dangerous to use.”

Ramp’s data isn’t granular enough for us to see how much of a financial hit the company will take by pulling Mythos and Fable 5 off the market.

Still the data, from more than 70,000 businesses that use its platform, shows that customers heavily use Anthropic’s Opus models and that business use has been growing.

For instance, Ramp reported that Anthropic’s share of AI subscriptions paid for by businesses rose 2.5 percentage points in May to 41%. This compares to OpenAI, which commanded 39.5% of AI subscriptions by its customers, essentially flat from the prior month. (OpenAI still greatly leads Anthropic in overall consumer usage, according to new data from Sensor Tower.)

Beyond subscriptions, the vast majority of what companies spend money on is API calls to the model, which cover token use for activities like coding. Anthropic’s Claude Code has a strong reputation as a powerful AI coding tool.

Ramp can’t always see from the spending data which models most businesses are using. When it can see the model details — in about one-third of transactions — businesses are mostly spending on various flavors of Claude Opus, particularly the later versions. Opus is the model that preceded Mythos and is still openly available.

In fact, in late May, Anthropic released a new version, Opus 4.8.

Mythos had not been on the market for that long, having been released to limited users as of April. And Fable 5 was shut down after a few days.

While we can’t predict how this latest drama with the White House will impact Anthropic’s ability to go public as it hoped to (public-market investors tend to be wary of companies embroiled in controversies with the government), the numbers indicate that Anthropic’s available models are more popular with businesses than ever before.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

Source link

Tech

Evan Spiegel Doesn’t Want You To Call Snap Specs AI Glasses

Published

54 minutes ago

17 June 2026

NewsAdmin

Snap’s newly announced AR Specs might seem similar to other smartglasses, but Snap CEO Evan Spiegel says that’s the wrong way to think about the product. Specs, he says, is “a new type of computer, a see-through computer.”

Shortly after unveiling Specs at AWE, Spiegel sat down with Engadget to tell us more about the device we got a glimpse of onstage. The CEO repeatedly referred to Specs as a “computer” and that really is core to understanding how Snap is positioning the product (and justifying the price). Specs, Spiegel said, “is able to overlay computing on the world around you and bring computing into the world, which is so important if you want to make computing feel more human.”

But Snap will have to do more than just persuade people to buy a computer for their face. When Specs go on sale later this year, the company will face a very different environment than when it first started experimenting with camera-enabled glasses in 2016. For one, it has a lot more competition now. But today, there’s also increasing suspicion of smartglasses, given that there have been some very public cases of people misusing the tech.

There’s the Meta of it all, too. The company was recently caught with an unreleased facial recognition feature on its Ray-Ban glasses (that it removed soon after outside researchers discovered it).

Spiegel, not surprisingly, isn’t a fan of facial recognition.

“There are certain use cases, like facial recognition, that we don’t allow in Lenses, and one of the benefits of having our own developer ecosystem and our own developer tools is that we’re able to moderate the Lenses that are submitted and available on Snap to make sure that they comply with our guidelines,” he told Engadget.

He also said he hopes people will view Specs differently than what’s currently out there. “I think AI glasses are typically being used to record content, that’s sort of the purpose of the glasses as they’re marketed,” he said. “That’s not the purpose of Specs. In fact, I think that might be an almost tangential use case.”

Spiegel said he thinks people will feel more comfortable around Specs once they understand wearers are more likely to be “using a computer, not surreptitiously recording videos.”

Specs will also launch at a time when more governments and regulators are scrutinizing social media companies’ track records on child safety. Earlier this week, UK Prime Minister Keir Starmer said the UK would ban children under 16 from social media, including Snap. Spiegel said that while he anticipates Specs “will mostly be used by adults,” the company has built some parental control features for people who want to share the glasses with their teens. “You can basically swipe a little toggle [in the Specs app] and limit the world of Lenses that they can use when they’re using Specs,” he explained. “So they can have all the fun and play, and still provide comfort to parents that they’re overseeing what their teens are doing.”

At $2,195, Specs will be more expensive than any other smartglasses currently on the market. It’s also more expensive than even most headsets, save for the Apple Vision Pro, which Spiegel drew a clear comparison to during his keynote. I asked if Snap’s goal is for the price of Specs to come down eventually and he said it is a long term goal for the company.

“That’s something we’re really focused on over time, because we want Specs to be as accessible as possible,” he said. “As far as computers go, it’s an incredibly powerful new computer, and we try to price in a way that makes it something that early adopters and developers and folks who are really passionate about this technology can afford.”

Besides price, the biggest question ahead of the Specs reveal was just how much Snap would be able to change their design. Spiegel was wearing the new Specs throughout our conversation, and after seeing them up close I’m able to confirm they are indeed much more refined than the developer version from 2024. The arms are still quite thick, though, and stuck out a bit past Spiegel’s head. But from the front, they are noticeably narrower and rounder than the boxy, more angular frames we’ve seen in the past from Snap.

While he was speaking, I was able to easily see his eyes through the lenses, though I could detect some rainbow-like reflections from the embedded waveguides when he turned his head. I also saw the lenses when the dimming feature was enabled and they looked fully blacked out, like dark sunglasses.

Unfortunately, Snap isn’t offering demos of the glasses just yet, so my impressions are limited to what I was able to observe during my quick chat with Spiegel. But I’m looking forward to seeing how Snap’s “computer” will look and fit on different faces.

Source link

Tech

The “Made in America” Trump phone is just a reskinned HTC U24 built in China

Published

1 hour ago

17 June 2026

NewsAdmin

Facepalm: Claims that Trump Mobile could deliver a “Made in America” smartphone within months sounded dubious when the T1 was initially unveiled a year ago. The ensuing mockups suspiciously resembled existing foreign designs, and a recent teardown confirms the device is nearly identical to one from Taiwan-based HTC.

iFixit’s teardown of the Trump Mobile T1 confirms that the phone is essentially an HTC U24 Pro with a few minor cosmetic changes. The findings settle suspicions that had been circulating since earlier this year and undercut Trump Mobile’s original claim that the device would be American-manufactured.

The T1’s listed specs: a 6.78-inch 120Hz AMOLED display, a Snapdragon processor, a 50MP main camera, a 50MP telephoto, and an 8MP ultrawide – closely mirror what HTC publishes for the U24 Pro. When NBC brought a unit to iFixit, the repair team disassembled it using the same techniques that had worked on the U24.

HTC U24 Pro (left) and Trump Mobile T1 (right). Source: iFixit

Scans revealed nearly identical internal layouts and component placement, and iFixit successfully booted the T1 using a motherboard taken from the HTC device. The LPDDR5 RAM was sourced from Micron rather than SK Hynix, a difference iFixit attributes to supply chain variability rather than any meaningful design divergence.

Other changes are cosmetic or minor: a gold chassis (with the American flag rendered with 11 stripes instead of 13), re-drilled speaker holes, a different camera shell, a repositioned flash, and a larger battery. That battery grows from 4,600mAh to 5,000mAh, though charging speed drops from 60W to 30W.

When Trump Mobile unveiled the T1 alongside its carrier service exactly one year ago, the company claimed the phone would be “designed and built in the United States,” but walked that back quickly. Subsequent language described the device as “designed with American principles in mind,” and the website now simply calls it “Proudly American.”

The earliest mockups depicted a vague design that sparked doubts about whether a real product existed, while later images mirrored a repainted Samsung Galaxy Ultra. When the actual phone leaked in February, observers immediately recognized HTC’s design.

Trump Mobile executives have said the company aims to rely as little as possible on Chinese parts and labor, but Taiwan’s National Communications Commission database lists Guangdong Yuanchang Electronics Co., Ltd., a China-based manufacturer, as the producer of the HTC U24 Pro, and some U24 Pro retail boxes carry a “Made in China” label. Furthermore, when Google acquired a significant portion of HTC’s hardware engineering team in 2017 for $1.1 billion, it left the company with a considerably reduced capacity to design its own handsets. iFixit suspects HTC contracted a Guangdong company to both manufacture and design the U24 in the first place.

President Trump, like Obama before him, has pressured companies including Apple and Samsung to explain why smartphone manufacturing cannot be revived domestically. Supply chain analyst Kevin O’Marah has estimated that a fully domestic smartphone production timeline would span roughly a decade, requiring a phone designed from scratch around automated US production lines and manufacturing equipment that doesn’t currently exist in the country – making it unsurprising that Trump Mobile couldn’t accomplish the feat in a single year.

That said, final assembly of the T1 occurs in Miami, which could represent a first step toward a more domestically produced device. The persistent obstacle is the cost of US labor, and if domestic companies can gradually master the supply chain, fully automated US factories might eventually make it viable, though not for years. Pre-orders for the T1 are open at a promotional price of $499, slightly undercutting the U24 Pro’s $579 MSRP. A successor, the T1 Ultra, is planned.

Source link

Tech

Today’s NYT Connections: Sports Edition Hints, Answers for June 17 #632

Published

1 hour ago

17 June 2026

NewsAdmin

Looking for the most recent regular Connections answers? Click here for today’s Connections hints, as well as our daily answers and hints for The New York Times Mini Crossword, Wordle and Strands puzzles.

Today’s Connections: Sports Edition is a tough one. If you’re struggling with the puzzle but still want to solve it, read on for hints and the answers.

Connections: Sports Edition is published by The Athletic, the subscription-based sports journalism site owned by The Times. It doesn’t appear in the NYT Games app, but it does in The Athletic’s own app. Or you can play it for free online.

Hints for today’s Connections: Sports Edition groups

Here are four hints for the groupings in today’s Connections: Sports Edition puzzle, ranked from the easiest yellow group to the tough (and sometimes bizarre) purple group.

Yellow group hint: Almost time to draft!

Green group hint: U.S. Bank is another one.

Blue group hint: Sharp items on sports shoes.

Purple group hint: Big Red Machine.

Answers for today’s Connections: Sports Edition groups

Yellow group: Fantasy football moves.

Green group: NFL stadiums.

Blue group: Soccer cleat makers.

Purple group: Cincinnati Reds to win MVP.

What are today’s Connections: Sports Edition answers?

The completed NYT Connections: Sports Edition puzzle for June 17, 2026.
NYT/Screenshot by CNET

The yellow words in today’s Connections

The theme is fantasy football moves. The four answers are add, drop, sit and start.

The green words in today’s Connections

The theme is NFL stadiums. The four answers are Arrowhead, Highmark, MetLife and SoFi.

The blue words in today’s Connections

The theme is soccer cleat makers. The four answers are Adidas, Diadora, Lotto and Puma.

The purple words in today’s Connections

The theme is Cincinnati Reds to win MVP. The four answers are Bench, Larkin, Morgan and Votto.

Source link

Tech

The Problem Of Making A Good Metal-To-Glass Seal

Published

1 hour ago

17 June 2026

NewsAdmin

If you’ve ever taken a close look at a vacuum tube, you’ll have seen the seals around the pins that keep everything air-tight while providing the the device’s electrical contacts. As [maurycyz] finds out, it’s not an easy process to get right.

The problem is one of both chemistry and thermal expansion, as while a good seal can be made between glass and red copper oxide, it remains very difficult indeed to stop the glass cracking on cooldown due to differing thermal expansion properties. We’re led through a variety of experiments including surface treatments and flattening the metal to a sheet, with varying pros and cons. The most successful seal on the page comes from very thin tungsten wire, though hardly the most practical conductor for a vacuum tube.

It’s a fascinating investigation for the casual reader, taking them into the properties of metal-glass bonds and the difficulties involved in making them. We have even more respect for the people who make their own tubes after reading it.

Source link

Tech

Next-gen nuclear company TerraPower plants flag in UK

Published

2 hours ago

17 June 2026

NewsAdmin

TerraPower test equipment — TerraPower’s lab tests the equipment and processes for next-generation nuclear reactors. (GeekWire File Photo / Kevin Lisota)

TerraPower, the Bellevue, Wash.-based nuclear energy company, announced Tuesday the opening of a subsidiary office in the United Kingdom as it pursues its first international power plant.

“TerraPower is entering the UK market with a long-term commitment to supporting the nation’s clean energy future and establishing ourselves as a serious and reliable deployment partner,” Chris Levesque, company president and CEO, said in a statement.

In October 2025, TerraPower submitted its Generic Design Assessment (GDA) application to UK regulators and in February received formal acceptance from the country’s Department for Energy Security and Net Zero. The company has now officially started Step 1 of the GDA process.

Nuclear power has seen a resurgence of interest in recent years, driven by spiking energy demand from data center expansion, the electrification of transportation and other economic sectors, and energy security concerns tied to fossil fuel dependence.

TerraPower is among the companies developing next-generation nuclear technologies that aim to be safer, less expensive and faster to deploy than traditional reactors.

@media (max-width: 600px) {
aside.callout { float:none !important; max-width:100% !important; margin-left:0 !important; margin-right:0 !important; }
aside.callout .callout-img { display:none !important; }
}

The company broke ground on its Natrium demonstration plant in Kemmerer, Wyo., in 2024, starting with non-nuclear construction. In April, it began work on the nuclear components after approval from the Nuclear Regulatory Commission.

The facility features a 345-megawatt, sodium-cooled fast reactor paired with a molten salt thermal storage system that captures excess heat. Drawing on that salt battery can boost the plant’s output to 500 megawatts for more than five hours. By comparison, Seattle uses around 2,000 megawatts during extreme weather events. TerraPower aims to have the reactor splitting atoms by the end of 2030.

The company also has a deal with Meta to build up to eight Natrium reactors in the U.S., with the first two targeted to come online by 2032.

The UK office extends that growth beyond American borders. Ian Hudson, the newly appointed head of TerraPower UK, said a permanent presence will allow the company to work closely with British partners.

Source link

Tech

Security Camera Gets Several Defensive Upgrades

Published

2 hours ago

17 June 2026

NewsAdmin

Ever since the early web, people have been streaming video with inexpensive webcams, and since the advent of the Raspberry Pi and its dedicated camera slot we’ve really seen how easy it can be to build security cameras or any other webcam and get it online quickly. But these cameras notably lack defensive capabilities if anyone tries to break into an area they shouldn’t be, and [John] added some features to this webcam to help defend his garage.

The webcam itself is a custom build, mounted on a custom-built tilt-and-pan mount that lets it freely rotate to view any location in the garage. Some custom software running on a Raspberry Pi lets it operate in autonomous mode or be controlled manually from an Android tablet. But for the defensive capabilities, it also carries a Nerf machine gun with a laser sight and spotlights which can all be controlled autonomously by the Raspberry Pi, including a computer vision system that lets it track various objects. While this is mostly a fun novelty for his security camera, the noise it makes might be enough to startle any would-be burglar.

[John] added a few other features to this build as well, including a speaker, which allows the system to be voice-controlled and to communicate back to the user. This lets him activate and deactivate the system using a verbal password. These types of Nerf guns are fairly popular for turrets as well, and some have practical uses as well like keeping cats from walking on the kitchen counters.

Source link

Tech

The new Siri makes one of Apple’s most convenient OS features a cumbersome mess

Published

2 hours ago

17 June 2026

NewsAdmin

ai + ml

Goodbye, useful Spotlight; hello force-fed Apple intelligence bloatware that feels distressingly like Google AI Overviews

HANDS ON That new AI-juiced Siri that Apple rolled out last week at WWDC was supposed to set a new paradigm for on-device AI.

But don’t believe the hype coming out of Tim Cook’s final big event. After a week-long test drive, it seems like Apple just crammed Google AI Overviews on top of the most useful parts of its various operating systems and made the whole ecosystem more cumbersome to use. But hey, it has more AIs!

I’ve been running the iOS and macOS 27 developer betas since they were made available on June 8, and I was blessed by the waitlist gods with access to the new version of Siri a few days after that. There are definitely some useful new features: Siri now carries on actual conversations, which makes it far more useful than the ask, get a response, we’re-done-here flow of the old Siri that left no room for clarifying questions or follow ups. Siri is now able to find things on my device more easily too – at least on my M1 MacBook. My iPhone 15 Pro has been telling me it’s still re-indexing my device after the update for more than a week, but I was still able to use it to conduct web searches and find some things on my phone – it’s possible this message itself was an error.

The dedicated Siri app is also nice in its own way, as it shows a record of every conversation I’ve had with the new Apple Intelligence front end for later review, but that comes with a caveat, too. Even the most brief questions – the overnight weather forecast, for example – is now stored in perpetuity, cluttering up the list of chats we’ve had until I manually delete it. The only apparent alternative is setting an expiration window for past chats and losing records of the more useful conversations we’ve had.

Who turned out my Spotlight?

Those are small inconveniences, however, compared to my biggest gripe with Siri AI: It’s completely ruined Spotlight.

I’ve come to rely on Apple’s embedded search/launcher feature almost exclusively for digging up apps that I don’t keep a shortcut for, and on my iPhone, it’s the main method I use to kick off a web search because it’s so simple. Swipe down from the center of the screen, type what I want to search for, and tap on the item that points to my query as a Google search in Safari. Swipe, type, and a tap and I’m perusing a search result page.

Not anymore.

The new Siri-first interface that presumes that if you’re searching for anything but an app or file, you must want Siri to feed you a few links of Apple Intelligence’s choosing.

Getting to a web search from a Spotlight query now requires multiple taps: Type your query, tap “Show Results” (careful: hitting enter will trigger Siri to craft a response, eliminating the possibility of seeing any actual Spotlight content), tap on “Show More” next to the list of Siri-surfaced web results, scroll down until you see Search Google (or whatever engine you have set as your default), then tap that.

Maybe I’m being a grumpy old journalist who likes things the way they used to be, the transformation of Spotlight into a Siri interface seems like intentional degradation of a basic feature in order to front-load an AI that in my experience so far is largely an inconvenience.

Overall, the experience reminds me of Google’s much-maligned and often wrong AI Overviews, which push actual search results down the page in favor of force-fed info from Google Gemini.

There’s a logical reason for the similarity. At the end of 2025, Apple replaced its former AI chief John Giannandrea, formerly Google’s SVP of search and AI, in a bid to right the Siri ship. Taking his place was another Google alum with even closer ties to The Chocolate Factory’s AI strategy, Amar Subramanya, who spent 16 years there, including a turn as the head of Gemini engineering. Subramanya, now Apple’s VP of AI, now reports directly to Apple’s SVP of software engineering, Craig Federighi, who himself has assumed responsibility for Apple’s machine learning initiatives, including the construction of Apple foundation models.

As we learned at WWDC last week, Apple has leaned heavily on a partnership with Google to build its foundation models, and it appears Subramanya has brought some of that Google AI ethos with him as well.

So, what’s the alternative to the new AI bloat in iOS 27? Siri can still be turned off entirely in the Settings app, so there’s that, but I’ve decided to take another tack and use one of Apple’s other AI features to get what I want.

As the iMaker mentioned at WWDC, you can now create shortcuts (tiny scripts that automate basic tasks) by making a natural language request to Siri. In my case, I asked it to build a shortcut I could drop on my home screen to do a Google search with whatever text I input. It works perfectly, and is available to duplicate on your own iDevice should you see fit.

Again, this is a developer beta, so it’s entirely possible that Apple will wise up and stop burying basic Spotlight search functionality before its 27 series of OSes release to the public this fall. We asked Apple if the change was intentional, but didn’t hear back. ®

Source link

Powerball tickets rest on a 7-Eleven store register January 9, 2016 in Chicago, Illinois.

Business3 days ago

No Jackpot Winner as $257 Million Prize Rolls Over to $269 Million Monday Draw

SpaceX related party maze puts Valor and Musk in creditors’ spotlight

Crypto World6 days ago

Oppenheimer backs SpaceX as $70 billion retail frenzy builds

Fashion5 days ago

Weekend Open Thread: Tuckernuck – Corporette.com

Crypto World6 days ago

Markets Rally as SpaceX IPO Looms Amid Iran Tensions and Inflation Surge

Brian Armstrong's Bold Prediction: AI Agents Will Soon Dominate Global Financial

Crypto World2 days ago

Zimbabwe Requires Crypto Businesses to Register Annually Under New FIU Regulations

Tech4 days ago

NanoClaw integrates JFrog registries to secure AI agent downloads

Tech5 days ago

This Week In Security: Microsoft On Microsoft, Register Your Domains, Linux On ARM, And FreeBSD Joins The File Cache Club

Argentina bill targets crypto gambling payments

Crypto World4 days ago

Bitget enters Argentina’s regulated crypto market through PSAV registration

Tech6 days ago

Dutton Ranch star claims they ‘didn’t see any disruption’ on set following Chad Feehan’s exit from Yellowstone spinoff fueled by Taylor Sheridan clash rumors