
Is Anthropic ‘nerfing’ Claude? Users increasingly report performance degradation as leaders push back


A growing number of developers and AI power users are taking to social media to accuse Anthropic of degrading the performance of Claude Opus 4.6 and Claude Code — intentionally or as an outcome of compute limits — arguing that the company’s flagship coding model feels less capable, less reliable and more wasteful with tokens than it did just weeks ago.

The complaints have spread quickly on GitHub, X and Reddit over the past several weeks, with a number of high-reach posts alleging that Claude has become worse at sustained reasoning, more likely to abandon tasks midway through, and more prone to hallucinations or contradictions.

Some users have framed the issue as “AI shrinkflation” — the idea that customers are paying the same price for a weaker product.

Others have gone further, suggesting Anthropic may be throttling or otherwise tuning Claude downward during periods of heavy demand.


Those claims remain unproven, and Anthropic employees have publicly denied that the company degrades models to manage capacity. At the same time, Anthropic has acknowledged real changes to usage limits and reasoning defaults in recent weeks, which has made the broader debate more combustible.

VentureBeat has reached out to Anthropic for further clarification on the recent accusations, including whether any recent changes to reasoning defaults, context handling, throttling behavior, inference parameters or benchmark methodology could help explain the spike in complaints.

We have also asked how Anthropic explains the recent benchmark-related claims and whether it plans to publish additional data that could reassure customers. An Anthropic spokesperson did not address the questions individually, instead referring us to X posts by Claude Code creator Boris Cherny and Claude Code team member Thariq Shihipar regarding Opus 4.6 performance and usage limits, respectively. Both X posts are also referenced and linked below.

Viral user complaints, including from an AMD Senior Director, argue Claude has become less capable

One of the most detailed public complaints originated as a GitHub issue filed on April 2, 2026, by Stella Laurenzo, whose LinkedIn profile identifies her as a Senior Director in AMD’s AI group.


In that post, Laurenzo wrote that Claude Code had regressed to the point that it could not be trusted for complex engineering work, then backed that claim with a sprawling analysis of 6,852 Claude Code session files, 17,871 thinking blocks and 234,760 tool calls.

The complaint argued that, starting in February, Claude’s estimated reasoning depth fell sharply while signs of poorer performance rose alongside it, including more premature stopping, more “simplest fix” behavior, more reasoning loops, and a measurable shift from research-first behavior to edit-first behavior.
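
To give a concrete (and purely illustrative) sense of what such an audit involves, here is a minimal sketch of how one might tally thinking blocks and tool calls from local session logs. The field names (“content”, “thinking”, “tool_use”) are assumptions for illustration, not a documented schema, and this is not Laurenzo’s actual methodology:

```python
import json
from pathlib import Path

def session_stats(log_path: Path) -> dict:
    """Tally thinking blocks and tool calls in one JSONL session log."""
    thinking_blocks = 0
    thinking_chars = 0
    tool_calls = 0
    for line in log_path.read_text().splitlines():
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines rather than aborting the scan
        for block in event.get("content", []):
            if block.get("type") == "thinking":
                thinking_blocks += 1
                thinking_chars += len(block.get("thinking", ""))
            elif block.get("type") == "tool_use":
                tool_calls += 1
    return {
        "thinking_blocks": thinking_blocks,
        "avg_thinking_chars": thinking_chars / max(thinking_blocks, 1),
        "tool_calls": tool_calls,
    }
```

Aggregated over thousands of sessions, trend lines in metrics like these are the kind of evidence the complaint cites for shrinking reasoning depth.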

The post’s broader point was that for advanced engineering workflows, extended reasoning is not a luxury but part of what makes the model usable in the first place.

The GitHub thread then spilled into the broader social media conversation: X users including @Hesamation posted screenshots of Laurenzo’s GitHub post on April 11, turning it into an even more viral talking point.


That amplification mattered because it gave the wider “Claude is getting worse” narrative something more concrete than anecdotal frustration: a long, data-heavy post from a senior AI leader at a major chip company arguing that the regression was visible in logs, tool-use patterns and user corrections, not just gut feeling.

Anthropic’s public response focused on separating perceived changes from actual model degradation. In a pinned follow-up posted to the same GitHub issue a week ago, Claude Code lead Boris Cherny thanked Laurenzo for the care and depth of the analysis but disputed its main conclusion.

Cherny said the “redact-thinking-2026-02-12” header cited in the complaint is a UI-only change that hides thinking from the interface and reduces latency, but “does not impact thinking itself,” “thinking budgets,” or how extended reasoning works under the hood.

He also said two other product changes likely affected what users were seeing: Opus 4.6’s move to adaptive thinking by default on Feb. 9, and a March 3 shift to medium effort, or effort level 85, as the default for Opus 4.6, which he said Anthropic viewed as the best balance across intelligence, latency and cost for most users.


Cherny added that users who want more extended reasoning can manually switch effort higher by typing /effort high in Claude Code terminal sessions.
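
For developers calling the model through the API rather than through Claude Code, the closest documented control is the extended-thinking token budget on Anthropic’s Messages API. A minimal sketch follows; the model ID and budget value are illustrative examples, not recommendations:

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-opus-4-6",  # illustrative model ID
    max_tokens=16000,
    # Pin an explicit reasoning budget rather than relying on shifting defaults.
    thinking={"type": "enabled", "budget_tokens": 8000},
    messages=[{"role": "user", "content": "Walk through this refactor step by step."}],
)
print(response.content[-1].text)  # the final content block is the text answer
```

Pinning parameters explicitly in this way is one route power users have for insulating themselves from default changes like the ones Cherny described.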

That exchange gets at the core of the controversy. Critics like Laurenzo argue that Claude’s behavior in demanding coding workflows has plainly worsened and point to logs and usage patterns as evidence.

Anthropic, by contrast, is not saying nothing changed. It is saying the biggest recent changes were product and interface choices that affect what users see and how much effort the system expends by default, not a secret downgrade of the underlying model. That distinction may be technically important, but for power users who feel the product is delivering worse results, it is not necessarily a satisfying one.

External coverage from TechRadar and PC Gamer further amplified Laurenzo’s post and the larger wave of agreement from some power users.


Another viral post on X from developer Om Patel on April 7 made the same argument in even more direct terms, claiming that someone had “actually measured” how much “dumber” Claude had gotten and summarizing the result as a 67% drop.

That post helped popularize the “AI shrinkflation” label and pushed the controversy beyond hard-core Claude Code users into the broader AI discourse on X.

These claims have resonated because they map closely onto what many frustrated users say they are seeing in practice: more unfinished tasks, more backtracking, more token burn and a stronger sense that Claude is less willing to reason deeply through complicated coding jobs than it was earlier this year.

Benchmark posts turned anecdotal frustration into a public controversy

The loudest benchmark-based claim came from BridgeMind, which runs the BridgeBench hallucination benchmark. On April 12, the account posted that Claude Opus 4.6 had fallen from 83.3% accuracy and a No. 2 ranking in an earlier result to 68.3% accuracy and No. 10 in a new retest, calling that proof that “Claude Opus 4.6 is nerfed.”


That post spread widely and became one of the main anchors for the broader public case that Anthropic had degraded the model.

Other users also circulated benchmark-related or test-based posts suggesting that Opus 4.6 was underperforming versus Opus 4.5 in practical coding tasks.

Still other posts pointed to TerminalBench-related results as supposed evidence that the model’s behavior had changed in certain harnesses or product contexts.

The effect was cumulative: benchmark screenshots, side-by-side tests and anecdotal frustration all began reinforcing one another in public.


That matters because benchmark claims tend to travel farther than more subjective complaints. A developer saying a model “feels worse” is one thing. A screenshot showing a ranking drop from No. 2 to No. 10, or a dramatic percentage swing in accuracy, gives the appearance of hard proof, even when the underlying comparison may be more complicated.

Critics of the benchmark claims say the evidence is weaker than it looks

The most important rebuttal to the BridgeBench claim did not come from Anthropic. It came from Paul Calcraft, an outside software and AI researcher on X, who argued that the viral comparison was misleading because the earlier Opus 4.6 result was based on only six tasks while the later one was based on 30.

In his words, it was a “DIFFERENT BENCHMARK.” He also said that on the six tasks the two runs shared in common, Claude’s score moved only modestly, from 87.6% previously to 85.4% in the later run, and that the bigger swing appeared to come mostly from a single fabrication result without repeats. He characterized that as something that could easily fall within ordinary statistical noise.
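
The small-sample point is easy to quantify. As a back-of-the-envelope sketch (not part of Calcraft’s analysis, and treating each score as a simple proportion, which is itself a simplification), a 95% confidence interval on six tasks is enormous:

```python
from math import sqrt

def wilson_interval(p: float, n: int, z: float = 1.96) -> tuple[float, float]:
    """Approximate 95% Wilson score interval for proportion p over n trials."""
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half, center + half

print(wilson_interval(0.876, 6))   # ~(0.48, 0.98): six tasks say very little
print(wilson_interval(0.854, 30))  # ~(0.69, 0.94): still wide at 30 tasks
```

The two intervals overlap almost entirely, which is consistent with the claim that the movement on the shared tasks falls within ordinary noise.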

That outside rebuttal matters because it undercuts one of the cleanest and most viral claims in circulation. It does not prove users are wrong to think something has changed. But it does suggest that at least some of the benchmark evidence now driving the story may be overstated, poorly normalized or not directly comparable.


Even the BridgeBench post itself drew a community note to similar effect. The note said the two benchmark runs covered different scopes — six tasks in one case and 30 in the other — and that the common-task subset showed only a minor change. That does not make the later result meaningless, but it weakens the strongest version of the “BridgeBench proved it” argument.

This is now a key feature of the controversy: the claims are not all equally strong. Some are grounded in first-hand user experience. Some point to real product changes. Some rely on benchmark comparisons that may not be apples-to-apples. And some depend on inferences about hidden system behavior that users outside Anthropic cannot directly verify.

Earlier capacity limits gave users a reason to suspect more changes under the hood

The current backlash also lands in the shadow of a real, confirmed Anthropic policy change from late March. On March 26, Anthropic technical staffer Thariq Shihipar posted that, “To manage growing demand for Claude,” the company was adjusting how 5-hour session limits work for Free, Pro and Max subscribers during peak hours, while keeping weekly limits unchanged.

He added that during weekdays from 5 a.m. to 11 a.m. Pacific time, users would move through their 5-hour session limits faster than before. In follow-up posts, he said Anthropic had landed efficiency wins to offset some of the impact, but that roughly 7% of users would hit session limits they would not have hit before, particularly on Pro tiers.


In an email on March 27, 2026, Anthropic told VentureBeat that Team and Enterprise customers were not affected by those changes, and that the shift was not dynamically optimized per user but instead applied to the peak-hour window the company had publicly described. Anthropic also said it was continuing to invest in scaling capacity.

Those comments were about session limits, not model downgrades. But they are important context, because they establish two things that users now keep connecting in public: first, Anthropic has been dealing with surging demand; second, it has already changed how usage is rationed during busy periods. That does not prove Anthropic reduced model quality. It does help explain why so many users are primed to believe something else may also have changed.

Prompt caching and TTL become a new flashpoint

A separate, more recent GitHub issue broadens the dispute beyond model quality and into pricing and quota behavior. In issue #46829, user seanGSISG argued that Claude Code’s prompt-cache time-to-live, or TTL, appeared to shift from a one-hour setting back to a five-minute setting in early March, based on analysis of nearly 120,000 API calls drawn from Claude Code session logs across two machines.

The complaint argues that this change drove meaningful increases in cache-creation costs and quota burn, especially for long-running coding sessions where cached context expires quickly and must be rebuilt. The author claims that this helps explain why some subscription users began hitting usage limits they had not previously encountered.


What makes this issue notable is that Anthropic did not flatly deny that something changed. In a reply on the thread, Jarred Sumner said the March 6 change was real and intentional, but rejected the framing that it was a regression. He said Claude Code uses different cache durations for different request types, and that one-hour cache is not always cheaper because one-hour writes cost more up front and only save money when the same cached context is reused enough times to justify it.
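
Sumner’s tradeoff is easy to illustrate with Anthropic’s published pricing multipliers: cache reads cost roughly 0.1x the base input price, five-minute cache writes roughly 1.25x, and one-hour writes roughly 2x. Treat those figures as assumptions that may change; the sketch below only shows the shape of the break-even:

```python
def relative_cost(reuses: int, write_multiplier: float, rebuilds: int = 0) -> float:
    """Cost of one cached prefix, in units of its base input price."""
    writes = 1 + rebuilds  # initial write plus any re-writes after expiry
    return writes * write_multiplier + reuses * 0.1

# Long session: a prefix reused 10 times over an hour. A 5-minute TTL expires
# between slow turns and forces rebuilds; a 1-hour TTL does not.
print(relative_cost(reuses=10, write_multiplier=1.25, rebuilds=4))  # 7.25
print(relative_cost(reuses=10, write_multiplier=2.0))               # 3.0

# Short one-off query: the cheaper 5-minute write wins.
print(relative_cost(reuses=1, write_multiplier=1.25))  # 1.35
print(relative_cost(reuses=1, write_multiplier=2.0))   # 2.1
```

Which TTL is cheaper depends on session shape, which is Sumner’s point: neither setting is uniformly a downgrade.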

In his telling, the change was part of ongoing cache optimization work, not a silent downgrade, and the pre–March 6 behavior described in the issue “wasn’t the intended steady state.”

The thread later drew a more detailed response from Anthropic’s Cherny, who described one-hour caching as “nuanced” and said the company has been testing heuristics to improve cache hit rates, token usage and latency for subscribers. Cherny said Anthropic keeps five-minute cache for many queries, including subagents that are rarely resumed, and said turning off telemetry also disables experiment gates, which can cause Claude Code to fall back to a five-minute default in some cases.

He added that Anthropic plans to expose environment variables that let users force one-hour or five-minute cache behavior directly. Together, those replies do not validate the issue author’s claim that Anthropic silently made Claude Code more expensive overall, but they do confirm that Anthropic has been actively experimenting with cache behavior behind the scenes during the same period users began complaining more loudly about quota burn and changing product behavior.


Anthropic says user-facing changes, not secret degradation, explain much of the uproar

Anthropic-affiliated employees have publicly pushed back on the broadest accusations. In one widely circulated reply on X, Cherny responded to claims that Anthropic had secretly nerfed Claude Code by writing, “This is false.”

He said Claude Code had been defaulted to medium effort in response to user feedback that Claude was consuming too many tokens, and that the change had been disclosed both in the changelog and in a dialog shown to users when they opened Claude Code.

That response is notable because it concedes a meaningful product change while rejecting the more conspiratorial interpretation of it. Anthropic is not saying nothing changed. It is saying that what changed was disclosed and was aimed at balancing token use, not secretly reducing model quality.

Public documentation also supports the fact that effort defaults have been in motion. Claude Code’s changelog says that on April 7, Anthropic changed the default effort level from medium to high for API-key users as well as Bedrock, Vertex, Foundry, Team and Enterprise users.


That suggests Anthropic has actively been tuning these settings across different segments, which could plausibly affect user perceptions even if the core model weights are unchanged.

Shihipar has also directly denied the broader demand-management accusation. In a reply on X posted April 11, he said Anthropic does not “degrade” its models to better serve demand. He also said that changes to thinking summaries affected how some users were measuring Claude’s “thinking,” and that the company had not found evidence backing the strongest qualitative claims now spreading online.

The real issue may be trust as much as model quality

What is clear is that a trust gap has opened between Anthropic and some of its most demanding users.

For developers who rely on Claude Code all day, subtle shifts in visible thinking output, effort defaults, token burn, latency tradeoffs or usage caps can feel indistinguishable from a weaker model.


That is true whether the root cause is a product setting, a UI change, an inference-policy tweak, capacity pressure or a genuine quality regression.

It also means both sides of the fight may be talking past each other. Users are describing what they experience: more friction, more failures and less confidence. Anthropic is responding in product terms: effort defaults, hidden thinking summaries, changelog disclosures, and denials that demand pressure is causing secret model degradation.

Those are not necessarily incompatible descriptions. A model can feel worse to users even if the company believes it has not “nerfed” the underlying model in the way critics allege. But the timing is awkward: Anthropic’s chief rival OpenAI has recently pivoted and put more resources behind Codex, its competing enterprise- and vibe-coding-focused product, even offering a new mid-range ChatGPT subscription tier in an effort to boost usage of the tool. That is not the kind of publicity that helps Anthropic retain customers.

At the same time, the public evidence remains mixed. Some of the most viral claims have come from developers with detailed logs and strong opinions based on repeated use. Some of the benchmark evidence has been challenged by outside observers on methodological grounds. And Anthropic’s own recent changes to limits and settings ensure that this debate is happening against a backdrop of real adjustments, not pure rumor.



Corsair Vengeance RGB Pro 32GB deal is $240 with this promo code


RAM prices are still high, but they are no longer rising the way they were, and if you shop around, there are bargains to be had.

Case in point, I’ve found a great deal on a Corsair Vengeance RGB Pro 32GB DDR4 memory kit, which is now $239.99 (was $279.99) at Newegg when you use promo code SSF5764 at checkout. That’s a solid saving on a capacity that’s ideal for modern systems.


Techdirt Podcast Episode 450: Infrastructure For The New Private Internet


from the next-steps dept

As we work our way towards a better future for the internet, the most encouraging and exciting part is the people out there building towards that future. Kickstarter founder Yancey Strickler is one such person, and his new company Metalabel has some extremely interesting projects in the works, including the Dark Forest Operating System. This week, Yancey joins the podcast to talk all about his projects and their role in building a better internet.

You can also download this episode directly in MP3 format.

Follow the Techdirt Podcast on Soundcloud, subscribe via Apple Podcasts or Spotify, or grab the RSS feed. You can also keep up with all the latest episodes right here on Techdirt.


Filed Under: decentralization, dfos, podcast, resonant computing, yancey strickler


Data breach at edtech giant McGraw Hill affects 13.5 million accounts



The ShinyHunters extortion group has leaked data from 13.5 million McGraw Hill user accounts, stolen after breaching the company’s Salesforce environment earlier this month.

Founded in 1909, McGraw Hill is a leading global educational publisher with annual revenue of $2.2 billion, providing education content and solutions for PreK–12, higher education, and professional learning.

The company confirmed ShinyHunters’ breach claims in a statement shared with BleepingComputer on Tuesday, saying the threat actors exploited a misconfiguration in the compromised Salesforce environment and that the incident didn’t affect its Salesforce accounts, courseware, customer databases, or internal systems.


“McGraw-Hill recently identified unauthorized access to a limited set of data from a webpage hosted by Salesforce on its platform. This activity appears to be part of a broader issue involving a misconfiguration within Salesforce’s environment that has impacted multiple organizations that work with Salesforce,” a McGraw-Hill spokesperson told BleepingComputer.

This came after ShinyHunters added the company to the gang’s dark web leak site, claiming to have stolen 45 million Salesforce records containing personally identifiable information (PII) and threatening to leak the allegedly stolen documents online unless a ransom is paid.

McGraw Hill entry on ShinyHunters’ data leak site (BleepingComputer)

While McGraw Hill has yet to share how many individuals were affected by the resulting data breach, data breach notification service Have I Been Pwned says ShinyHunters has now leaked over 100GB of files containing data linked to 13.5 million accounts.

The exposed information includes names, physical addresses, phone numbers, and email addresses, which threat actors could use to target McGraw Hill customers in spear-phishing attacks.

“In April 2026, education company McGraw Hill confirmed a data breach following an extortion attempt. Attributed to a Salesforce misconfiguration, the company stated the incident exposed ‘a limited set of data from a webpage hosted by Salesforce on its platform’,” Have I Been Pwned said today.

“More than 100GB of data was later publicly distributed, containing 13.5M unique email addresses across multiple files, with additional fields such as name, physical address and phone number appearing inconsistently across some records.”

This week, ShinyHunters has also started leaking data stolen after breaching the Snowflake environment of American video game publisher Rockstar Games. The stolen data includes internal analytics used to monitor Rockstar’s online services and support tickets, as well as in-game revenue and purchase metrics, player behavior tracking, and game economy data for Red Dead Online and Grand Theft Auto Online.


In recent months, the extortion gang was also behind security breaches affecting the European Commission, Infinite Campus, Hims & Hers, Telus Digital, Wynn Resorts, CarGurus, Panera Bread, SoundCloud, and dating giant Match Group.


DeepL, known for text translation, now wants to translate your voice


DeepL, a translation company best known for its text tools, released a voice-to-voice translation suite today that covers use cases like meetings, mobile and web conversations, and group conversations for frontline workers through custom apps. The company is also releasing an API that lets outside developers and businesses build on top of DeepL’s tech for customized use cases, such as call centers.

“After spending so many years in text translation, voice was a natural step for us,” DeepL CEO Jarek Kutylowski told TechCrunch in an interview. “We have come a long way when it comes to text translation and document translation. But we thought there wasn’t a great product for real-time voice translation.”

Kutylowski said that the challenges in creating a real-time translation product center on striking a balance between reducing latency — the delay between someone speaking and the translated audio playing back — and maintaining accurate results.

DeepL is releasing add-ons for platforms like Zoom and Microsoft Teams, where listeners can either hear real-time translation while others are speaking in their native languages or follow real-time translated text on screen. The program is currently in early access, and the company is inviting organizations to join a waitlist. DeepL also has a product for mobile and web-based conversations that can take place in person or remotely.


DeepL also lets users participate in group conversations in settings like training sessions or workshops, with participants joining through a QR code.

DeepL said that its voice-to-voice tech can also learn and adapt to custom vocabulary, such as industry-specific terms and company and personal names.

Kutylowski said that AI is reimagining what customer service will look like in the coming years. He noted that a translation layer helps companies provide support in languages where qualified staff are scarce and expensive to hire.


The company said that it controls the entire voice-to-voice stack. However, the current system converts the speech to text, applies translation, then converts that back to speech. DeepL believes that since it has worked on text translation for years, it has an edge in translation quality. Going forward, the company wants to develop an end-to-end voice translation model that skips the text step entirely.
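
As a rough sketch of that cascaded architecture (all three stage functions are stubs standing in for real ASR, machine-translation and TTS models, and nothing here reflects DeepL’s actual implementation):

```python
def transcribe(audio_chunk: bytes) -> str:
    """ASR stage (stub): speech audio in, source-language text out."""
    return "hallo zusammen"

def translate(text: str, target_lang: str = "en") -> str:
    """MT stage (stub): source text in, target-language text out."""
    return {"hallo zusammen": "hello everyone"}.get(text, text)

def synthesize(text: str) -> bytes:
    """TTS stage (stub): target text in, speech audio out."""
    return text.encode("utf-8")  # placeholder bytes, not real audio

def speech_to_speech(audio_chunk: bytes) -> bytes:
    # Each hop adds latency, which is why real systems stream partial chunks
    # through the stages, and why an end-to-end model that skips the text
    # step entirely is the stated long-term goal.
    return synthesize(translate(transcribe(audio_chunk)))

print(speech_to_speech(b"\x00\x01"))  # -> b'hello everyone'
```

Collapsing the three hops into one model would trade the inspectability of the intermediate text for lower latency.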


DeepL faces competition from several well-funded startups working in adjacent corners of the space. Sanas, which last year raised $65 million from Quadrille Capital and Teleperformance, uses AI to modify a speaker’s accent in real time — a tool aimed primarily at call center agents.

Dubai-based Camb.AI focuses on speech synthesis and translation for media and entertainment companies as well as Amazon Web Services, helping them dub and localize video content at scale.

Palabra, backed by Reddit co-founder Alexis Ohanian’s firm Seven Seven Six, is building a real-time speech translation engine designed to preserve both the meaning and the speaker’s original voice, putting it in more direct competition with what DeepL is now building.


Microsoft counters MacBook Neo with free Game Pass and Office bundle on Windows laptops



College students who purchase eligible Windows laptops before July 31 can receive a free year of Microsoft 365 Premium, Xbox Game Pass Ultimate, and a customized Xbox controller. Retailers including Best Buy, Amazon, Walmart, and Dell are leaning into the promotion, surfacing models designed to rival the MacBook Neo on…

Administration Apparently Planning To Blow Off FISA Court’s Ordered Fixes For Section 702


from the domestic-surveillance-is-fine-if-we-do-it dept

It wasn’t all that long ago that GOP legislators were collectively stonewalling a clean reauthorization of Section 702. Three years ago, these legislators were seeking to end the FBI (and other IC components’) access to Americans’ communications via “backdoor” searches of the NSA’s supposedly “foreign facing” collections.

It wasn’t that the Republicans cared that Joe Public was being subjected to warrantless domestic surveillance. It was that they were being subjected to warrantless searches of their communications — something that came to light as the result of multiple investigations pertaining to Trump’s first administration.

Now that the GOP has control of the White House again, Republicans are back to not caring about the warrantless searches of US persons’ communications enabled by FISA loopholes very few congressional reps seriously want to see closed.

Another Section 702 reauthorization attempt is only weeks away. Reps who want more of the same thing we’ve been subjected to for decades have until the end of April to push a clean reauthorization through. Unfortunately for them, the FISA Court — while allowing the program to continue whether or not Congress can pass an extension — has made it clear the program needs to be overhauled because it’s still being routinely abused to perform warrantless searches targeting Americans’ communications.


The annual recertification, issued last month in a classified ruling, means that the program can continue to collect phone calls and emails through March 2027 — even if Congress fails later this month to renew the statute that underlies it.

But the judge who issued the March 17 ruling also objected to tools that agencies with access to the raw data — like the C.I.A., F.B.I. and National Security Agency — have created to allow analysts to process messages, according to unclassified talking points the administration sent to lawmakers in recent days.

The main issue is the filtering tool utilized by agencies with access to the NSA’s collections. The filter allows analysts to drill down into the data to return only results pertaining to specific people who have communicated with a foreign person. It would appear agencies like the FBI are using this filter to search for US persons, something that is supposed to be subject to additional limitations.

From the talking points detailed by the New York Times, it seems that isn’t the case, which is why the FISA Court is ordering the government to “re-engineer the filter” to force analysts to comply with restrictions pertaining to access of US persons’ communications.

The Trump administration is allegedly “weighing” whether or not to comply with this FISA court order. The only thing that could make it comply would be to codify the order during the reauthorization process. This administration simply isn’t willing to do that.


The Trump administration wants Congress to extend the statute without changes. 

And that’s why Senator Ron Wyden is, again, letting the American public know the current administration is actively arguing against the privacy interests of millions of American citizens:

“The compliance problems are bad enough, but, incredibly, rather than fix them, the Trump Administration is considering appealing the court ruling so that they never have to. This is a highly aggressive and unusual move indicative of an administration that would exploit every angle to expand its surveillance at the expense of Americans’ rights.

“Instead of addressing these problems, opponents of reform are going to try to jam a straight reauthorization of section 702 through Congress next week, while the American people are still in the dark. That’s unacceptable. This court ruling needs to be declassified so that Americans can understand what the Trump administration is actually up to. And Congress must vote for real reforms to protect Americans’ rights.”

I won’t even factor in Trump’s opinion here, because it doesn’t really matter. He doesn’t know enough about anything to be considered qualified to engage in this discussion. Further, this isn’t even necessarily a Trump thing. Pretty much every presidential administration has been unwilling to upset this particular apple cart, even when plenty of evidence of extensive rot has been made public.

But this one’s particularly problematic for the GOP, which spent most of the Biden years claiming Section 702 abuse was evidence of a “deep state” conspiracy against Trump and his congressional supporters. Now, they’re arguing the opposite: that the “deep state” it so recently opposed should be allowed to do what it wants for as long as it wants to… so long as it’s not sweeping up their communications.


Status quo seems likely to prevail yet again, especially with the Trump Administration clearly interested in increasing the amount of domestic surveillance perpetrated by Intelligence Community components. After all, without it, the “worst of worst” day laborers and factory workers can’t be kidnapped by federal officers and members of the fearsome, centrally organized terrorist group known as “antifa” can’t get caught in dragnets that are supposed to be targeting foreign adversaries. It’s going to be more abuse for the stupidest imaginable reasons because that’s just how things are going to go as long as this iteration of the GOP remains in power.

Filed Under: backdoor searches, fbi, fisa, fisc, nsa, ron wyden, section 702, trump administration, warrantless searches


Today’s NYT Mini Crossword Answers for April 16


Looking for the most recent Mini Crossword answer? Click here for today’s Mini Crossword hints, as well as our daily answers and hints for The New York Times Wordle, Strands, Connections and Connections: Sports Edition puzzles.


Need some help with today’s Mini Crossword? It’s pretty simple, but 1-Across is a bit tricky. Read on for all the answers. And if you could use some hints and guidance for daily solving, check out our Mini Crossword tips.

If you’re looking for today’s Wordle, Connections, Connections: Sports Edition and Strands answers, you can visit CNET’s NYT puzzle hints page.


Read more: Tips and Tricks for Solving The New York Times Mini Crossword

Let’s get to those Mini Crossword clues and answers.

The completed NYT Mini Crossword puzzle for April 16, 2026. (NYT/Screenshot by CNET)

Mini across clues and answers

1A clue: Bow ties and ribbons that you can’t wear?
Answer: PASTA

6A clue: Opposite of lower
Answer: UPPER

7A clue: Flappable origami creation
Answer: CRANE


8A clue: Where the Hangul alphabet is used
Answer: KOREA

9A clue: Apparatus under a trapeze
Answer: NET

Mini down clues and answers

1D clue: Disc dropped on center ice
Answer: PUCK

2D clue: One might read “Kiss the Chef”
Answer: APRON


3D clue: Unlikely outcome after a 7-10 split
Answer: SPARE

4D clue: Fundamental belief
Answer: TENET

5D clue: Bay ___ (part of California)
Answer: AREA


Meta researchers introduce ‘hyperagents’ to unlock self-improving AI for non-coding tasks


Creating self-improving AI systems is an important step toward deploying agents in dynamic environments, especially enterprise production environments, where tasks are not always predictable or consistent.

Current self-improving AI systems face severe limitations because they rely on fixed, handcrafted improvement mechanisms that only work in narrow domains such as software engineering.

To overcome this practical challenge, researchers at Meta and several universities introduced “hyperagents,” a class of self-improving AI agents that continuously rewrite and optimize their own problem-solving logic and underlying code.

In practice, this allows the AI to self-improve across non-coding domains, such as robotics and document review. The agent independently invents general-purpose capabilities like persistent memory and automated performance tracking.


More broadly, hyperagents don’t just get better at solving tasks; they learn to improve the self-improvement cycle itself, accelerating progress.

This framework can help develop highly adaptable agents that autonomously build structured, reusable decision machinery. This approach compounds capabilities over time with less need for constant, manual prompt engineering and domain-specific human customization.

Current self-improving AI and its architectural bottlenecks

The core goal of self-improving AI systems is to continually enhance their own learning and problem-solving capabilities. However, most existing self-improvement models rely on a fixed “meta agent.” This static, high-level supervisory system is designed to modify a base system.

“The core limitation of handcrafted meta-agents is that they can only improve as fast as humans can design and maintain them,” Jenny Zhang, co-author of the paper, told VentureBeat. “Every time something changes or breaks, a person has to step in and update the rules or logic.”


Instead of an abstract theoretical limit, this creates a practical “maintenance wall.” 

The current paradigm ties system improvement directly to human iteration speed, slowing down progress because it relies heavily on manual engineering effort rather than scaling with agent-collected experience.

To overcome this limitation, the researchers argue that the AI system must be “fully self-referential.” These systems must be able to analyze, evaluate, and rewrite any part of themselves without the constraints of their initial setup. This allows the AI system to break free from structural limits and become self-accelerating.

Darwin Gödel Machine (source: Sakana AI)


One example of a self-referential AI system is Sakana AI’s Darwin Gödel Machine (DGM), an AI system that improves itself by rewriting its own code.

In DGM, an agent iteratively generates, evaluates, and modifies its own code, saving successful variants in an archive to act as stepping stones for future improvements. DGM proved open-ended, recursive self-improvement is practically achievable in coding.

However, DGM falls short when applied to real-world applications outside of software engineering because of a critical skill gap. In DGM, the system improves because both evaluation and self-modification are coding tasks. Improving the agent’s coding ability naturally improves its ability to rewrite its own code. But if you deploy DGM for a non-coding enterprise task, this alignment breaks down.

“For tasks like math, poetry, or paper review, improving task performance does not necessarily improve the agent’s ability to modify its own behavior,” Zhang said.


The skills needed to analyze subjective text or business data are entirely different from the skills required to analyze failures and write new Python code to fix them. 

DGM also relies on a fixed, human-engineered mechanism to generate its self-improvement instructions. In practice, if enterprise developers want to use DGM for anything other than coding, they must heavily engineer and manually customize the instruction prompts for every new domain.

The hyperagent framework

To overcome the limitations of previous architectures, the researchers introduce hyperagents. The framework proposes “self-referential agents that can in principle self-improve for any computable task.”

In this framework, an agent is any computable program that can invoke LLMs, external tools, or learned components. Traditionally, these systems are split into two distinct roles: a “task agent” that works on the specific problem at hand, and a “meta agent” that analyzes and modifies the task agent. A hyperagent fuses both roles into a single, self-referential, editable program.


Because the entire program can be rewritten, the system can modify the self-improvement mechanism, a process the researchers call metacognitive self-modification.

DGM with hyperagents (source: arXiv)

“Hyperagents are not just learning how to solve the given tasks better, but also learning how to improve,” Zhang said. “Over time, this leads to accumulation. Hyperagents do not need to rediscover how to improve in each new domain. Instead, they retain and build on improvements to the self-improvement process itself, allowing progress to compound across tasks.”

The researchers extended the Darwin Gödel Machine to create DGM-Hyperagents (DGM-H). DGM-H retains the powerful open-ended exploration structure of the original DGM, which prevents the AI from converging too early or getting stuck in dead ends by maintaining a growing archive of successful hyperagents.


The system continuously branches from selected candidates in this archive, allows them to self-modify, evaluates the new variants on given tasks, and adds the successful ones back into the pool as stepping stones for future iterations.
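
A minimal sketch of that archive-and-branch loop appears below. The `self_modify` and `evaluate` functions are stubs: in DGM-H, the agent rewrites its own source via LLM calls and is scored on real benchmark tasks.

```python
import random

def self_modify(source: str) -> str:
    """Stub: in DGM-H the agent itself rewrites its own source code."""
    return source + f"\n# variant {random.randint(0, 10**6)}"

def evaluate(source: str) -> float:
    """Stub: run the candidate agent on benchmark tasks and return a score."""
    return random.random()

# Growing archive of successful hyperagents; each entry is a stepping stone.
archive = [{"source": "# seed agent", "score": 0.0}]

for _ in range(100):
    parent = random.choice(archive)        # branch from any archived agent,
    child = self_modify(parent["source"])  # not just the current best, to
    score = evaluate(child)                # keep the search open-ended
    if score >= parent["score"]:           # keep variants that hold their own
        archive.append({"source": child, "score": score})

print(f"archive size: {len(archive)}, best: {max(a['score'] for a in archive):.3f}")
```

Because the mutated source includes the improvement logic itself, a successful variant can change not only how tasks are solved but how future variants are generated.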

By combining this open-ended evolutionary search with metacognitive self-modification, DGM-H eliminates the fixed, human-engineered instruction step of the original DGM. This enables the agent to self-improve across any computable task.

Hyperagents in action

The researchers used the Polyglot coding benchmark to compare the hyperagent framework against previous coding-only AI. They also evaluated hyperagents across non-coding domains that involve subjective reasoning, external tool use, and complex logic.

These included paper review to simulate a peer reviewer outputting accept or reject decisions, reward model design for training a quadruped robot, and Olympiad-level math grading. Math grading served as a held-out test to see if an AI that learned how to self-improve while reviewing papers and designing robots could transfer those meta-skills to an entirely unseen domain.


The researchers compared hyperagents against several baselines, including domain-specific models like AI-Scientist-v2 for paper reviews and the ProofAutoGrader for math. They also tested against the classic DGM and a manually customized DGM for new domains.

On the coding benchmark, hyperagents matched the performance of DGM despite not being designed specifically for coding. In paper review and robotics, hyperagents outperformed the open-source baselines and human-engineered reward functions. 

When the researchers took a hyperagent optimized for paper review and robotics and deployed it on the unseen math grading task, it achieved an improvement metric of 0.630 in 50 iterations. Baselines relying on classic DGM architectures remained at a flat 0.0. The hyperagent even beat the domain-specific ProofAutoGrader.

The experiments also highlighted interesting autonomous behaviors from hyperagents. In paper evaluation, the agent first used standard prompt-engineering tricks like adopting a rigorous persona. When this proved unreliable, it rewrote its own code to build a multi-stage evaluation pipeline with explicit checklists and rigid decision rules, leading to much higher consistency.


Hyperagents also autonomously developed a memory tool to avoid repeating past mistakes. Furthermore, the system wrote a performance tracker to log and monitor the result of architectural changes across generations. The model even developed a compute-budget aware behavior, where it tracked remaining iterations to adjust its planning. Early generations executed ambitious architectural changes, while later generations focused on conservative, incremental refinements.

For enterprise data teams wondering where to start, Zhang recommends focusing on tasks where success is unambiguous. “Workflows that are clearly specified and easy to evaluate, often referred to as verifiable tasks, are the best starting point,” she said. “This generally opens new opportunities for more exploratory prototyping, more exhaustive data analysis, more exhaustive A/B testing, [and] faster feature engineering.” For harder, unverified tasks, teams can use hyperagents to first develop learned judges that better reflect human preferences, creating a bridge to more complex domains.

The researchers have shared the code for hyperagents, though it has been released under a non-commercial license.

Caveats and future threats

The benefits of hyperagents come with clear tradeoffs. The researchers highlight several safety considerations regarding systems that can modify themselves in increasingly open-ended ways.


These AI systems pose the risk of evolving far more rapidly than humans can audit or interpret. While the researchers contained DGM-H within safety boundaries such as sandboxed environments designed to prevent unintended side effects, those initial safeguards also double as practical deployment blueprints.

Zhang advises developers to enforce resource limits and restrict access to external systems during the self-modification phase. “The key principle is to separate experimentation from deployment: allow the agent to explore and improve within a controlled sandbox, while ensuring that any changes that affect real systems are carefully validated before being applied,” she said. Only after the newly modified code passes developer-defined correctness checks should it be promoted to a production setting.
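
A minimal sketch of that promotion gate, assuming candidates live as Python source and a project test suite stands in for the developer-defined checks (the paths and test command are placeholders, not part of any released hyperagent tooling):

```python
import shutil
import subprocess
import tempfile
from pathlib import Path

def promote_if_valid(candidate_source: str, production_path: Path) -> bool:
    """Stage a self-modified agent in a sandbox; promote it only if checks pass."""
    with tempfile.TemporaryDirectory() as sandbox:
        staged = Path(sandbox) / "agent.py"
        staged.write_text(candidate_source)
        # Developer-defined correctness checks, run with a hard timeout so a
        # runaway candidate cannot consume unbounded resources.
        result = subprocess.run(
            ["python", "-m", "pytest", "--quiet", sandbox],
            capture_output=True,
            timeout=300,
        )
        if result.returncode != 0:
            return False  # reject the variant; production stays unchanged
        shutil.copy(staged, production_path)  # promote the validated variant
        return True
```

The key property is the one Zhang describes: exploration happens in the sandbox, and nothing touches real systems until it has passed validation.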

Another significant danger is evaluation gaming, where the AI improves its metrics without making actual progress toward the intended real-world goal. Because hyperagents are driven by empirical evaluation signals, they can autonomously discover strategies that exploit blind spots or weaknesses in the evaluation procedure itself to artificially inflate their scores. Preventing this behavior requires developers to implement diverse, robust, and periodically refreshed evaluation protocols alongside continuous human oversight.

Ultimately, these systems will shift the day-to-day responsibilities of human engineers. Just as we do not recompute every operation a calculator performs, future AI orchestration engineers will not write the improvement logic directly, Zhang believes.


Instead, they will design the mechanisms for auditing and stress-testing the system. “As self-improving systems become more capable, the question is no longer just how to improve performance, but what objectives are worth pursuing,” Zhang said. “In that sense, the role evolves from building systems to shaping their direction.”


2026 is the year payroll stacks break, and AI must grow up


For years, payroll has mostly lived out of sight. Many organizations still treat it as a background task, something that only reaches senior leaders when a crisis appears. In 2026, that approach is under real pressure.

New HMRC rules and wider Employment Rights Act changes in the UK are bringing pay accuracy and timeliness into sharper regulatory focus.

Callum Pennington

CEO & Co-Founder of HealthboxHR.


More California 4-Year-Olds Are in Publicly Funded Preschool Than Ever


When it comes to universal pre-kindergarten, California has made significant progress: 62 percent of 4-year-olds were enrolled in publicly funded early childhood programs in 2024–25, up from 42 percent in 2019–20, according to a new Learning Policy Institute report.

Transitional kindergarten (TK) alone enrolled 55 percent of 4-year-olds, or about 177,000 children. But access remains uneven: nearly 4 in 10 4-year-olds still aren’t enrolled, and the share of eligible children actually signing up has declined. Families may be unaware that transitional kindergarten is an option for their children, or they face other barriers to enrolling. This school year marks the first time every 4-year-old in California was guaranteed a transitional kindergarten spot.

The number of California 4-year-olds enrolled in transitional kindergarten and other publicly funded early childhood education programs rose from about 208,300 in 2019-20 to more than 264,000 in 2024-25, a 27 percent increase.

Transitional kindergarten had the largest number of participants, with 177,570 4-year-olds enrolled in 2024-25.
