Tech

New AI optimization framework beats Claude Code and Codex by 2.5x on the same compute budget

Published

1 month ago

20 June 2026

Imagine your engineering team just deployed an AI agent to search through internal company documents and answer employee questions. It works perfectly in development, but in production, it consistently hallucinates or misses key constraints. Fixing this is rarely a simple patch. It requires a tedious, trial-and-error process of tweaking chunking strategies, retrieval methods, and system prompts simultaneously. Because these adjustments are entangled, it becomes nearly impossible to attribute which specific tweak actually solved the problem.

To address this challenge, researchers at Renmin University of China and Microsoft Research introduced Arbor, a framework that upgrades AI-driven research and optimization from a sequence of trial-and-error guesses into a cumulative learning process. Arbor organizes hypotheses, experiments, and insights into a tree that helps the system learn from prior failures to make smarter, verified improvements over time.

In practical tests, Arbor delivered more than 2.5 times the verifiable performance gains of standard AI coding agents across real-world engineering tasks while operating under the same resource budget.

For enterprise AI, this technique directly translates to automating the continuous improvement of complex, real-world engineering systems.

Understanding the bottleneck in autonomous optimization

As large language models and AI systems become more capable, they are expected to carry out more complex operations such as autonomous optimization (AO) of software systems such as agent harnesses or model training algorithms.

AO captures the fundamental loop of autonomous research. An AI agent starts with an initial mutable artifact, such as a machine learning codebase or data pipeline, and a specific objective. The agent’s goal is to iteratively improve this artifact through experimental feedback without step-by-step human supervision.

The main challenge of AO is often misunderstood. Many engineering teams find that simply giving a coding agent more time or compute to optimize a codebase doesn’t lead to better results. “Automation can keep an AI working for a very long time — but a loop is not the same as progress,” Jiajie Jin, co-author of the paper, told VentureBeat. “If the goal is vague, or the metric is easy to hack, long-running automation often just produces ‘improvements’ faster that nobody actually wants.”

Jin explains that complex tasks take many attempts to get right, and standard agent architectures are missing the critical data structure to maintain state. “How do you make sure the insight and experience from each attempt actually accumulate, instead of getting lost in a scrollback buffer?” he said. Without this structure, agents simply repeat the same mistakes.

Current agent systems can run experiments for many hours against well-specified goals: editing code, invoking tools, running tests autonomously. But they treat each attempt in isolation, missing the structural mechanisms that would let them accumulate and act on what they’ve learned.

They lack the capacity to simultaneously maintain and compare multiple competing research directions. Without this, they cannot interpret both successes and failures to reshape their future exploration, which is the core mechanism that makes human research cumulative.

General coding agents typically rely on conversation transcripts for their memory. Because AO tasks span hundreds of turns and easily exceed context window limits, these agents struggle to preserve and reuse factual evidence over long histories. As a result, they lose the overarching structure of the research process and are prone to stalling on early failures or chasing noisy evaluation swings. The system needs a structured, durable memory that records what directions have been tried, what factual evidence was produced, and how each result changes the space of future hypotheses.

Existing frameworks are also prone to reward hacking and overfitting to development metrics. This makes them create the illusion of progress without producing improvements that transfer to real-world performance.

Finally, general-purpose coding agents typically chain their tool calls on a single shared working tree. This architectural limitation prevents them from testing parallel hypotheses in isolated environments without corrupting the main codebase or obscuring which hypothesis caused a specific outcome.

The Arbor framework

Arbor solves the challenges of AO with a framework that automates the long-horizon loop of exploration, experimentation, and abstraction that characterizes human research. Arbor separates the strategic direction of research from the ground-level coding tasks with two key components:

The coordinator: A long-lived AI agent that acts like a principal investigator. It never directly edits the target codebase. Instead, it owns the general state of the optimization research, observes accumulated evidence, comes up with new hypotheses and directions to explore, and decides what to do with the results of experiments.

Executors: Short-lived, highly focused AI agents. When the coordinator wants to test an idea, it spins up an executor and places it in an isolated environment, essentially a fresh git worktree. Each executor is handed one hypothesis. It implements the assigned idea, runs evaluations, debugs errors, and reports back to the coordinator with the results and created artifacts.

These two components collaborate through a mechanism that the researchers call “Hypothesis Tree Refinement” (HTR). HTR represents the entire research process as a persistent, branching tree where every node binds together four things: a hypothesis, the executable artifact, the factual evidence produced, and a distilled insight. This means the coordinator can explore multiple competing directions at the same time without losing its place.

The coordinator builds the tree by placing broad ideas near the root, while concrete refinements branch out as leaves. This allows Arbor to safely explore multiple competing hypotheses simultaneously. If an executor’s experiment fails, the tree records why it failed as a negative constraint, ensuring the system doesn’t endlessly repeat the same mistake.

To understand why Arbor’s isolation matters, consider a common enterprise scenario: optimizing a Retrieval-Augmented Generation (RAG) pipeline for an internal AI assistant. “When you ask a single agent like Claude Code or Codex to ‘improve accuracy,’ it will typically change a bunch of things in one pass — chunking, the prompt, the retrieval method,” Jin said. This entangles the changes, making it impossible to attribute which one actually helped. It also directly mutates the repository without isolation.

Arbor solves this by treating each lever as a separate hypothesis. Chunking becomes one branch, retrieval another, and the prompt another — each implemented and evaluated in its own isolated git worktree. “So you get clean attribution: ‘constraint decomposition on the retrieval side gave +X; breadth-first search actually hurt,’” Jin said.

When an executor returns a report, the coordinator writes the evidence to the tree and backpropagates the insight upward to parent nodes. This means a local observation becomes a generalized constraint that shapes the coordinator’s future idea generation.

To prevent reward hacking or overfitting to the development data, HTR enforces a strict “merge gate.” Even if an executor reports a fantastic development score, the coordinator will spin up an isolated worktree to test the candidate against a held-out test evaluator. The artifact is only merged into the current best trunk if it demonstrably improves the test score, verifying that the progress is real.

Arbor generally falls under the concept of “loop engineering,” popularized by industry figures like OpenClaw creator Peter Steinberger and Claude Code lead Boris Cherny. The idea is to move beyond single prompts to design iterative cycles (observe, reason, act, verify) that drive autonomous agents. However, as Jin points out, “A loop can fill up with messy, untraceable attempts, and you end up with nothing to show and no way to reconstruct what changed.”

Arbor in action

The researchers evaluated Arbor on an autonomous optimization task suite built from real-world research settings and the MLE-Bench Lite machine learning engineering benchmark. The AO suite featured tasks from different areas of AI development, including model training, harness engineering, and data synthesis.

The researchers used different backbone models for the coordinator and executor agents, including Claude Opus 4.6, GPT-5.5, and Gemini-3-Flash. They tested Arbor against the strongest coding agents, Codex and Claude Code. Arbor and the baselines were given the same resources. For the MLE-Bench Lite tasks, Arbor was also compared against top-tier agentic research systems like AI-Scientist, ML-Master, and AIDE.

Arbor consistently outperformed the baselines. It achieved the best held-out test result on all tasks, attaining more than 2.5 times the average relative gain of Codex and Claude Code. On the BrowseComp task, which involves optimizing a search agent, Arbor improved the system’s held-out accuracy from a baseline of 45.33% to 67.67%. Meanwhile, Codex and Claude Code stalled at 50% and 53.33%, respectively. On MLE-Bench Lite, when equipped with GPT-5.5, Arbor achieved the strongest result among all benchmarked systems.

Arbor proved to be resilient against overfitting. For example, during the Terminal-Bench 2.0 task experiments, Claude Code achieved a high development score of 75 but its score dropped to 71 on the held-out data. Arbor had a lower development score of 72.22 but achieved the highest held-out score of 77.36, ensuring its results transfer to real-world applications.

Arbor also showed generalization in a cross-task transfer experiment. After Arbor finished optimizing the search harness for the BrowseComp task, researchers took the optimized codebase and tested it on two unrelated search-agent tasks, HLE and DeepSearchQA. Arbor’s optimized codebase significantly improved performance on those unseen tasks as well.

Deploying Arbor: Sweet spots and hidden costs

For engineering leads looking to drop Arbor into their existing tech stack, the framework is designed to sit on top of existing Git workflows rather than replacing them. “Its output is an ordinary git branch that your existing code review, CI, and human review can inspect directly,” Jin said. Only verified gains are merged into a per-run trunk, leaving the main repository untouched until a developer manually chooses to promote the code.

However, deploying Arbor comes with specific tradeoffs. Jin points out that the biggest catch is token cost, as maintaining a long-lived coordinator that continuously manages the tree and dispatches executors is the dominant expense. Running multiple isolated worktrees concurrently also requires genuine compute and disk resources to process real experiments.

So where is Arbor’s sweet spot? According to Jin, it excels at tasks with a clear, trustworthy metric, tolerance for a long time horizon, and a real search space with several plausible directions, such as pipeline optimization, data-synthesis quality, and model-training recipe tuning.

Conversely, teams should explicitly avoid using Arbor for real-time latency tasks, obvious one-line fixes, or when the underlying evaluation metric is flawed. The quality ceiling of the entire run is strictly bounded by the quality of the evaluator. “If the metric isn’t trustworthy, Arbor will just optimize toward an untrustworthy result faster,” Jin said.

Jin sees the next evolution going beyond single scalar metrics. “A natural evolution is to have each node’s artifact carry a vector — accuracy, latency, cost — instead of a single score,” Jin said. “Going from a single scalar to a multi-objective Pareto search is a very natural extension of the framework.”

Source link

Tech

Microsoft steps in after LG monitors were found auto-installing apps with McAfee ads

Published

3 minutes ago

25 July 2026

NewsAdmin

The big picture: McAfee was a trusted name in the antivirus software market during the MS-DOS days. Many computers came with a copy of the company’s VirusScan freeware tool to fend off trojan horses and other virus threats. Today, McAfee is a name that’s almost exclusively trusted in the advertising and B2B markets. End users would very much prefer to avoid McAfee products at this point, which is why they tend to get angry when someone tries to push the McAfee brand through bundled advertising pop-ups.

The controversial advertising partnership between LG and McAfee has apparently come to an end after Microsoft got involved in the matter.

Pavan Davuluri, executive vice president of Windows and Devices, recently confirmed that Redmond got in touch with LG, the company that started it all. LG has “agreed” to remove the McAfee pop-up from its Windows Store app, Davuluri said, which likely means that Microsoft forced the South Korean manufacturer to stop pushing McAfee-related advertising to customers who purchased a new gaming monitor.

The controversy began earlier this month, when users discovered that the “LG Monitor App Installer” app was displaying McAfee promotions. Windows installed the app after affected users connected their brand-new LG monitors, with no easy way to uninstall the tool without resorting to third-party applications or disabling the Microsoft Store itself.

– Pavan Davuluri (@pavandavuluri) July 22, 2026

According to Davuluri’s post, Microsoft “connected” with LG before the latter agreed to remove the McAfee adware message from the LG Monitor App Installer app. Both companies apparently share a common goal of improving the Windows ecosystem and providing customers with the best possible software experience.

Per user complaints, the list of LG monitors reportedly found to automatically push the LG Monitor App Installer app onto users’ PCs includes the following models: 34GX900A-B, 45GX950-B.AEU, 32GS95UE-B, 39GX950B, 27GP83B-B, 27GN800, 32GS95UE-B, and 27GN850-B.

Apps developed through Microsoft’s Universal Windows Platform can be designed to install automatically when a user connects a specific device to a Windows machine. Put simply, Microsoft approved this practice for UWP app developers, and the LG monitor tool was simply working as designed.

The official UWP documentation even states that automatic installation can be a source of confusion because users do not receive any notification about the installation process.

Other companies known to take advantage of this auto-installation feature include Razer, Logitech, Asus, and Gigabyte.

Source link

Tech

iPhone exploit fight highlights who owns security research

Published

16 minutes ago

25 July 2026

NewsAdmin

A federal judge has ordered a public iPhone exploit taken offline after Magnet Forensics argued it wasn’t independent security research at all, but instead a stolen trade secret.

U.S. District Judge Victoria Marie Calvert partially approved Magnet’s request for a preliminary injunction. She directed Paradigm Shift and former Magnet exploit engineer Mario Del Gaudio to delete the usbliter8 article, code, technical details, and related materials in their possession by 11:59 p.m. Eastern on July 23.

By July 23, Paradigm Shift had replaced the original article with a page indicating the blog post was unavailable. The preliminary injunction will continue throughout the litigation unless the court removes it in a separate order.

Magnet’s July 7 complaint asserts that usbliter8 originated from a confidential A12 and A13 SecureROM access capability integrated into a commercial forensic product. The company alleges Del Gaudio acquired the technique while employed by Magnet and later shared it through Paradigm Shift.

Paradigm Shift originally presented usbliter8 as newly published security research before releasing it on June 18.

We reported at the time that the exploit affects devices including the iPhone XS, iPhone XR, iPhone 11 lineup, and second-generation iPhone SE. The court hasn’t made a final ruling on liability.

Calvert found that Magnet had established a likelihood of success on its trade-secret and contract claims for purposes of the preliminary injunction, based on evidence the defendants didn’t contest at the July 16 hearing.

The iPhone exploit requires physical access

Usbliter8 targets SecureROM, the immutable code that starts Apple’s secure boot process. It combines a flaw in a USB controller with security settings used on A12 and A13 devices to execute code while a device is in Device Firmware Update mode.

Because SecureROM is built into the processor during manufacturing, Apple can’t replace the vulnerable code through an ordinary software update. It may still be able to develop mitigations that interfere with exploitation or reduce its usefulness.

The flaw doesn’t create a remote attack or automatically expose everything stored on an iPhone. Using usbliter8 requires physical access to the device, a USB connection, DFU mode, and programmable hardware capable of sending specially constructed USB traffic.

The exploit can run unsigned code before the operating system starts, but it doesn’t directly compromise the Secure Enclave or automatically reveal a user’s passcode and encrypted data. Additional vulnerabilities or forensic techniques would be needed to cross those protections.

Those requirements make usbliter8 especially relevant to forensic investigations involving seized devices. Magnet sells investigation products to law enforcement agencies, intelligence services, government bodies, and private organizations.

Magnet says usbliter8 came from a secret capability

Del Gaudio worked as an exploit engineer placed with Magnet from November 2023 through November 2024. He signed an agreement covering confidential information, intellectual property, and continuing restrictions that survived the end of his placement.

Magnet says Del Gaudio had access to a zero-day capability internally called “MSG,” which targeted the same A12 and A13 SecureROM vulnerability later described in the usbliter8 publication.

According to the complaint, Magnet engineers discussed the vulnerability in meetings attended by Del Gaudio by April 2024. The company says it integrated MSG into one of its products in May 2024 and that Del Gaudio used the capability dozens of times while testing another tool.

Two modern iPhones standing upright on a wooden table, one dark gray and larger, one silver and smaller, both showing triple rear cameras, with blurred home decor in the background

Magnet says Del Gaudio had access to a zero-day capability internally called “MSG.”

Magnet supplied additional details in a July 17 declaration addressing questions Calvert raised at the July 16 hearing. The company’s director of iOS research said Del Gaudio attended restricted sessions during a company gathering in Denver from March 11 through March 15, 2024.

Fewer than 20 people attended the smaller iOS sessions, according to the declaration. Magnet said the group discussed the SecureROM vulnerability, MSG’s technical architecture, and its development into an access capability for the company’s products.

Magnet links Del Gaudio to the publication

After his placement ended in November 2024, Del Gaudio became affiliated with Paradigm Shift, according to the complaint. The Spanish security company published “Introducing usbliter8: An A12/A13 SecureROM Exploit” on June 18.

A preserved screenshot connected Del Gaudio’s name and photograph to the @NotHdesk account associated with the research, according to Magnet. The company also says the account was linked to an email address known to belong to him.

Those details form part of Magnet’s case that Del Gaudio had access to MSG and was connected to the usbliter8 publication. The public record doesn’t include source-code comparisons, file-transfer records, or a detailed technical analysis showing exactly how MSG and usbliter8 match.

The original usbliter8 article was deliberately omitted from the complaint because Magnet argued that attaching it would further distribute the information it sought to protect. The company offered to provide the material privately for the court to review.

On June 18, Magnet sent Del Gaudio a cease-and-desist demand and contacted Paradigm Shift the next day. The demand sought removal of the article and code, identification of anyone who received the information, preservation of evidence, and return or destruction of Magnet material.

In letters dated June 22 and June 28, Paradigm Shift’s attorneys disputed Magnet’s claims and pressed the company to identify the information it considered a trade secret. Magnet filed the lawsuit on July 7 after the parties failed to reach an agreement.

Neither Del Gaudio nor Paradigm Shift appeared at the July 16 injunction hearing, despite receiving electronic notice. Calvert therefore considered an uncontested record when deciding whether temporary relief was warranted.

The court questioned whether the flaw should remain secret

Magnet argues that publication let competitors study the technique without making the same investment. The company also says Apple could reduce the exploit’s value through mitigations, while the disclosure may weaken customer trust in Magnet’s ability to protect sensitive capabilities.

The dispute raises a security question over whether companies should keep zero-days secret for forensic use or disclose them so manufacturers and device owners can respond.

Two iPhones held up, one light green and one purple, both showing their backs with dual cameras and Apple logos, against a blurred red brick wall background

Magnet argues that publication let competitors study the technique without making the same investment.

Calvert described the public-interest issue as the most difficult part of the case. The judge addressed concerns about companies and government actors stockpiling zero-day vulnerabilities rather than reporting them to affected manufacturers.

The court also questioned whether consumers were better protected by knowing about the vulnerability once its existence had become public. Calvert concluded the court couldn’t resolve that policy debate through an unopposed preliminary injunction motion.

Paradigm Shift, as reported by MacRumors, said it informed Apple Product Security before publishing on June 18. The order doesn’t prevent Apple from using information it already has to mitigate the vulnerability.

Since Apple already has the disclosure, the injunction can’t fully restore the secrecy Magnet claims gave the capability commercial value. However, Calvert found that removing the material could still limit further harm and prevent Paradigm Shift from using the research for promotion.

The central trade-secret question remains unresolved

Magnet also sought extensive forensic access to the defendants’ computers, accounts, and storage. Calvert declined to grant that relief outside the normal discovery process.

The court observed that Magnet accepted Del Gaudio might not have required company hardware or files to replicate the capability. In Magnet’s view, familiarity with the research could have been enough.

A key question remains for future steps, such as whether Del Gaudio copied protected Magnet data or drew on technical understanding and experience kept after departing the firm.

The injunction covers material held by the defendants but can’t remove copies already downloaded or shared elsewhere. The case now turns on whether usbliter8 represents independent research or the disclosure of Magnet’s confidential forensic capability.

Source link

Tech

Nvidia is challenging 20 years of datacenter CPU design with its new Vera chip

Published

30 minutes ago

25 July 2026

NewsAdmin

The big picture: Nvidia lifted the embargo on its Vera CPU deep dive this week, and the disclosure amounts to a direct challenge to two decades of x86 datacenter design philosophy. This is the most detail the company has shared on the chip since it first appeared on the Rubin roadmap, and it confirms something I have suspected for a while: Nvidia is not treating the CPU as an attach story anymore. It is treating it as a battleground with a lot of potential dollars at play.

Vera is built around the Olympus core, the first custom CPU core Nvidia has ever brought to the datacenter and the first custom core the company has designed anywhere since the Denver and Carmel efforts of the Tegra era nearly a decade ago.

Ryan Shrout is a longtime technology analyst and industry veteran who has spent over two decades covering PC hardware, graphics, and semiconductors. He previously led technical marketing at Intel and was the founding editor of PC Perspective. He is currently President and GM at Signal65. You can follow him on X @ryanshrout.

Grace used licensed off-the-shelf Arm Neoverse V2 cores. Olympus is an Nvidia design from the ground up, a wide, high-IPC core with a 10-wide decode front end that reorders aggressively and prefetches based on patterns like graph structures in memory.

88 of those cores sit on a monolithic compute die, running 176 threads through a partitioned scheme Nvidia calls Spatial Multithreading, a deliberate departure from the opportunistic resource sharing of traditional SMT.

That monolithic choice matters. Nvidia still uses chiplets for the memory controllers and I/O, but the compute die is one piece of silicon connected by a second-generation scalable coherency fabric. The company measures bisection bandwidth across the die at roughly 3.4 terabytes per second.

The memory subsystem is LPDDR5X hardened for the datacenter with ECC and full telemetry, delivering up to 1.2 TB/s of bandwidth, roughly 3x the memory bandwidth per core and about 5x the bandwidth per watt of conventional DDR-based server designs (all based on Nvidia claims).

The headline claims stack up as roughly 2x faster performance from the Olympus core, 3x the core-to-core bandwidth of chiplet-based competition, and 40% lower memory latency under load through the LPDDR5X subsystem.

Vera ships in two forms: a dense liquid-cooled rack packing 256 CPUs and more than 22,000 cores, and a conventional air-cooled 2U with two sockets. Dell has committed to multiple PowerEdge systems built on it. Nvidia sizes the opportunity as a $200 billion expansion of the CPU market, which explains a lot of the recent market dynamics.

The argument behind the architecture

In 2014, a top Xeon carried 14 to 18 cores. Today an Epyc Turin part carries 128. Core counts grew roughly 9x over that stretch because cloud economics rewarded rentable vCPUs, while per-core performance only about doubled. Chiplets kept costs down but taxed memory bandwidth, data movement, and latency along the way.

Nvidia argues that agentic AI breaks this trade. An agent reasons on the GPU, then drops to the CPU for tool calls, SQL queries, API work, and scripting, then goes back to the GPU, sometimes hundreds of times per task. That loop is sequential.

You cannot throw more cores at a sequential loop and make it shorter. Only a faster core, fed with data faster, compresses it. The loaded-latency data Nvidia showed makes the point visually, with chiplet designs hitting a saturation wall just shy of 400 GB/s of memory traffic while Vera keeps scaling.

The company has landed on “max single-threaded CPU at scale” as the category name. It is a mouthful, and I will get to that.

The proof points

The customer data is early but notable. Perplexity ran coding sandboxes on Vera and completed jobs 1.5x faster than the production Xeon fleet it runs today, with concurrent sandbox startup 1.9x faster.

The New York Stock Exchange, which processes 1.1 trillion records a day, tested Vera with the Redpanda streaming engine on HPE systems and measured 6x lower p99 latency versus Epyc Turin, and is now evaluating it as a replacement. Los Alamos National Laboratory saw 7x on an agentic workload and 3x on radiation transport and multigrid simulation codes.

These are real workloads rather than synthetic benchmarks, which I give Nvidia credit for, and the supporting documentation puts names and configurations on most baselines. They remain vendor-supplied and worth reading closely.

The SPEC CPU 2026 numbers carry an estimated label from a pre-production reference system, and every comparison lands on Zen 5 Turin or current and older Intel silicon. The Los Alamos runs were measured against a Sapphire Rapids based supercomputer that launched in early 2023. Beating shipping parts is the right first test, but Venice and Diamond Rapids arrive within the year, and that is the fight that will settle this.

Where there is more detail needed

I asked the Nvidia team directly during the analyst briefing last week what actually separates an agentic CPU from a plain, very good datacenter CPU. We have had big, fast processors running back-to-back loops of VMs and containers for years. Is this genuinely a new workload class, or a fast CPU wearing new marketing?

To their credit, the team acknowledged that “agentic CPU” is the wrong label and would pigeonhole the part. The honest answer is that the fundamentals have not changed, but the rates have.

Agent pipelines hydrate and tear down environments constantly rather than occasionally. Memory pressure is continuous. Latency under load becomes the whole game, because every millisecond the CPU stalls is a millisecond a very expensive GPU sits idle. That is a real architectural argument, and the decision to spend die area on per-core speed instead of core count is a genuine philosophical break from where x86 roadmaps have been heading.

What the market needs now is independent, rigorous CPU measurement built around these agentic pipelines, run across current and next generation parts from every vendor. That is exactly the kind of work we are looking forward to diving into at Signal65.

The ecosystem arrived on day one, to no surprise

The partner roster attached to this launch is unusually deep for a CPU announcement. OpenAI says it will deploy Vera at scale beginning in Q3, and the early adopter list also includes Anthropic, SpaceX, and Perplexity, with Los Alamos, NERSC, and TACC representing the supercomputing side.

Dell, HPE, Lenovo, Supermicro, and Bull all have Vera systems coming, backed by the full ODM bench.

The rack-scale platform is ramping just as visibly. CoreWeave was the first cloud to bring up and validate Vera Rubin NVL72 and published the first measured numbers from live hardware, a 10x gain in tokens per second per megawatt over Grace Blackwell NVL72 on DeepSeek-R1.

Google Cloud stood up the first A5X instance on Vera Rubin for the reinforcement learning startup Ineffable Intelligence, Azure and OCI have racks running, and a newly expanded Microsoft and Mistral agreement puts Vera Rubin at the center of a multibillion-dollar European buildout.

On the CPU specifically, DeepInfra, which serves nearly five trillion tokens a week, measured support for 1.6x more concurrent agents and 2.2x faster orchestration versus Granite Rapids. Nvidia counts 300 partners and more than 350 factory sites in 30 countries behind the ramp.

What this means for AMD, Intel, and the hyperscalers

The timing is not subtle. This disclosure lands the day before Advancing AI opens in San Francisco, the flagship AMD event where the 256-core Zen 6 Venice generation of Epyc and the Instinct MI450 family are expected to headline the keynote from Lisa Su on Thursday. Nvidia just set the terms of the datacenter CPU conversation roughly 24 hours before its biggest rival takes the stage.

If a faster CPU returns GPUs to work sooner, the CPU price becomes a rounding error in rack TCO, and the fight shifts from dollars per core to tokens per rack. That framing is the one AMD and Intel now have to answer, and it is a very different conversation than the one that produced 128-core roadmaps.

AMD is not conceding the frame. It has already argued that rack-level performance per watt favors high-core-count Epyc, and the 256-core Venice generation will sharpen that response. Intel has Diamond Rapids coming. The hyperscalers have Graviton, Axion, and Cobalt, all designed around the scale-out economics Vera explicitly rejects.

But Nvidia is not selling a merchant CPU into a commodity socket. It is selling the CPU as the utilization lever for the most expensive assets in the AI factory. If a faster CPU returns GPUs to work sooner, the CPU price becomes a rounding error in rack TCO, and the fight shifts from dollars per core to tokens per rack.

That framing is the one AMD and Intel now have to answer, and it is a very different conversation than the one that produced 128-core roadmaps.

Nvidia has promised deeper head-to-head benchmark data against both x86 and Arm competition in the coming weeks. That data, and the independent validation that should follow it, will tell us whether Vera resets the datacenter CPU conversation or simply carves out a well-defended niche inside the Nvidia rack.

Source link

Tech

Anne Rice’s Vampire Series Is Renewed as ‘Queen of the Damned’ for Season 4

Published

57 minutes ago

24 July 2026

NewsAdmin

Hot on the heels of the season three finale, AMC has announced the fourth season of Anne Rice’s Interview With the Vampire. Revealed during a panel for The Vampire Lestat at San Diego Comic-Con on Friday, season 4 will be titled Anne Rice’s Queen of the Damned.

On hand for the announcement were cast members of the vampire series Sheila Atim, Jacob Anderson, Assad Zaman and Eric Bogosian, as well as executive producers Mark Johnson and Hannah Moscovitch.

They also dropped a teaser trailer for the series.

Queen of the Damned will see “our established and beloved characters confront Akasha with their world, and ours, very much hanging in the balance,” said AMC Studios President Dan McDermott in a statement.

Moscovitch will serve as showrunner for Queen of the Damned, with Johnson and Rolin Jones executive producing.

No word yet on when season 4 will air, but it will stream exclusively on AMC Plus. You can catch the first two series of Interview With the Vampire on Netflix.

Corinne Reichert

Senior Editor

Corinne Reichert (she/her) grew up in Sydney, Australia and moved to California in 2019. She holds degrees in law and communications, and currently writes news, analysis and features for CNET across the topics of electric vehicles, broadband networks, mobile devices, big tech, artificial intelligence, home technology and entertainment. In her spare time, she watches soccer games and F1 races, and goes to Disneyland as often as possible.

See full bio

Source link

Tech

Early Childhood Education Teachers Grapple with Screen Time

Published

1 hour ago

24 July 2026

NewsAdmin

When the American Academy of Pediatrics updated its guidelines around screen time for children and teenagers for the first time in 10 years this past January, many teachers and educators applauded the news. The document focused less on television and more on digital devices. It offered educators, families, pediatricians, and other stakeholders research-backed guidance on using digital devices to support children’s learning, rather than prioritize prolonged engagement.

However, many of the recommendations were not tailored to specific age groups. Recommendations like “create a family media plan” and “protect sleep” were helpful but offered few concrete steps for pre-K students.

That left early childhood educators asking what they could do for their young learners.

The Risks in Pre-K Through K

Unlike the physical world, digital content comes fast, in short bursts, and when kids get used to that, it has negative consequences. “Too much of this screen and things jumping in front of them, it could be damaging overall,” says Latoya Jones, a pre-K through fifth-grade media specialist in Broward County, Florida. “They are developing their cognitive processes, and if we’re teaching them to think in six-second bursts, maybe that’s not what we’re aiming for.”

When kids interact with devices rather than people, their emotional regulation can suffer, says Kristina Turner, a first-grade teacher at the Paterson, New Jersey, campus of College Achieve Public Schools, a network of three K-12 charter schools. “A lot of the students, they are quick-tempered because they’re just used to quick things. If something takes too long, their behavior starts to heighten,” she says.

For the youngest learners in particular, “it is so important that they are learning how to interact, and listen, and even advocate for themselves,” says Colleen Francisco, a kindergarten teacher at Laurel Springs School, an online school in West Chester, Pennsylvania.

“There’s beauty in students having an issue or a problem, because then an adult can support them in working through it,” she says. When kids are tethered to their screens, those opportunities don’t arise.

Clearly, the AAP guidance is well timed and much needed. But how to put that guidance into practice?

Classroom Strategies for Reducing Screen Time

“I never want my children using screens passively,” says Devon Caldwell, a pre-K and K teacher at Canupawakpa Education in Manitoba, Canada, and an instructor at Nipissing University Schulich School of Education in Ontario. “I want to see kids’ bodies moving, their mouths talking, little hands making something.”

To that end, she prioritizes co-viewing and co-creating, using interactive teaching software approved by her and the school. When students are using a screen, “they’re always using it with a partner. Right away they’re negotiating rules, talking to each other, developing those really important skills,” she says.

For her first graders, Turner tries as much as possible to create hands-on activities instead of relying on screens. For example, she uses a whiteboard, but tries to avoid it when giving instructions. “Students have to use their listening skills so they can be able to follow directions without using a screen,” she says. “By end of school year, I’m able to give them different tasks, and they’re able to do that independently, without depending on technology.”

Tyler Brown, an English teacher at Indian River Middle School in Philadelphia, New York, encourages teachers to be thoughtful about the digital tools they share with the youngest learners. “Don’t throw [the digital tool] at everything,” he says. “Think about what you’re using it for. Think about what it’s supposed to do. Is this an enhancement of learning or is this a replacement?”

These self-directed questions can drive practical choices in the classroom. “You’re doing AT words: cat, mat, bat. You could easily have a website or a Google slide where they’re dragging those items onto the correct word,” says Brown.

“Or you could do what we’ve done in the past, which is a cut-and-glue activity,” he continues. “Why are we using screen time for something that can easily be a manipulative activity, where kids are doing things hands-on? Little kids are tactile learners, and those hands-on exercises have real value.”

Even as a teacher in an online school, Francisco is finding ways to limit screen time. “One of our assignments is taking a nature walk and writing down what you see. That is something that extends the learning off the screen,” she says.

Even math can be hands-on for early learners, she adds. “You don’t have to have anything fancy. You don’t have to buy the tools: Even just using cereal in order to practice adding and subtracting,” she says.

Jones was a kindergarten teacher before becoming a media specialist. “Anytime I had a word of the week, we used Play-Doh to build a word. Or we did those little sandboxes where they drew the word in the sand,” she says.

Big picture: Active and tactile often are better than passive and screen-based. “Could this be accomplished with blocks or Play-Doh or markers and paper? If it could be, let’s do it that way,” Caldwell says.

But teachers can only do so much in the classroom when kids are immersed in digital media at home. In support of the AAP’s call to prioritize family time over screen time, teachers may need to engage more with parents.

Engaging with Families

Pre-K and K teachers don’t have control over what kids do outside of school. “At home, a lot of my students go straight to their screens,” Turner says. To help steer her own kids and her students when they are at home, she gives parents resources. “I’m able to provide them with index cards, and you can keep them in the back seat,” she says.

When kids get in the car, “instead of reaching for an iPad, they can reach for their index cards, whether it’s math problems or sight words,” she says. “Or you can put that as a magnet on your refrigerator: If you want a snack, you got to read this word.”

When Francisco sends homework through the learning management system, “we have workbooks and other hands-on activities that support the lesson,” she says. And she encourages parents to talk to their kids, as a pedagogic exercise.

“Phonemic awareness is especially important to early-literacy learners. The learner is listening to sounds and how sounds are manipulated,” she says. “I’ve given parents additional resources so that [kids] are not reading or not listening or watching on the video. Instead, they’re doing it with their student, face to face.”

It takes some creativity and extra effort by teachers and parents to pivot kids off of screens, but experts say it’s worth the effort. Too much screen time “can negatively impact executive functioning, socio-emotional and language skills,” says Bryana Casas M.A., master teacher at Pacific Oaks Children’s School in Pasadena, California.

For those honing their interpersonal skills — a main goal in pre-K through K education — “nothing can replace relationships and face-to-face interactions between people,” she says.

Source link

Tech

Prentis, new AI lab co-founded by Reid Hoffman, Marc Pincus in talks to raise $100M

Published

1 hour ago

24 July 2026

NewsAdmin

Prentis, a new AI research lab focused on computer use models, co-founded by serial entrepreneur Ritankar Das, and tech heavyweights Reid Hoffman and Marc Pincus, is in talks to raise $100 million at a $1 billion valuation, according to two people familiar with the discussions.

Launched in April, Prentis is training models to learn how office workers navigate routine workflows across documents and systems, with the goal of building AI agents that can control computers to automate those tasks.

Prentis will ostensibly develop agents tailored to these customers’ needs, such as handling insurance claims and automating customs duty refund exceptions without needing a human to hunt down paperwork.

The startup has already signed contracts worth up to $50 million with several customers, including healthcare management service organization, a manufacturer, and goods and clothing manufactures, the two people familiar with the discussions tell TechCrunch, echoing investor materials obtained by TechCrunch that predict an estimated $75 million annualized run rate by the third quarter of this year. (Prentis’s pitch deck notes those figures reflect estimated annualized value based on a contracted fee equal to 20% of savings realized, not recognized revenue, and are “performance-dependent and subject to final execution.”)

By its own account, Prentis says its Hive-32B model outperforms rivals, including OpenAI’s GPT-5.4 and Anthropic’s Claude Opus 4.6, on two computer-use benchmarks: WindowsAgentArena, which measures end-to-end task completion on real Windows applications; and ScreenSpot-v2, which tests a model’s ability to locate the right on-screen control.

In its pitch deck, the company argues its edge comes from running a much smaller, cheaper model. In fact, it claims roughly 10 times lower cost per task than frontier APIs, saying it’s more economical to deploy across everyday workflows. TechCrunch hasn’t independently verified the company’s benchmark results.

The startup is betting that automating everyday office tasks will soon outpace coding as AI’s biggest use case, but it’s a crowded market. Anthropic, Open AI, and Mira Murati’s Thinking Machines are also working on developing AI agents for computer use, one of sources said. Anthropic has also been acquiring talent in the category directly — it bought the Seattle computer-use startup Vercept earlier this year, folding in its founders and shutting down its product.

Prentis didn’t respond to TechCrunch’s request for comment.

Ritankar Das, CEO of Prantis, is also the founder of Titan, a holding company that builds and operates AI companies. Das, now 31, was UC Berkeley’s youngest University Medalist in more than a century, graduating at 18 with a double major in bioengineering and chemical biology before earning a master’s in biomedical engineering at Oxford.

He founded Titan in 2014 after dropping out of an AI PhD program at Cambridge, where he’d been a Gates Cambridge Scholar. Das has described Titan as an intentional throwback to an old-fashioned holding-company model like Berkshire Hathaway, one that’s funded by its own exits rather than outside limited partners.

Other businesses launched and operated by Titan include AI-powered virtual care provider Tala Health, which raised a $100 million seed round last year, and Forta Health, an autism care startup that raised $55 million led by Insight Partners in 2024. Titan-founded disease prediction company Dascena was acquired by CirrusDx in 2022.

Prentis is a side project of sorts for its two other co-founders. Hoffman, the LinkedIn co-founder and Greylock partner, said last month that he was stepping down from Microsoft’s board after nearly a decade to go “founder mode” on Manas AI, an AI drug-discovery startup he’s also backing; he was an early OpenAI investor and co-founded Inflection AI with Mustafa Suleyman before Microsoft absorbed most of that team in 2024.

Pincus, the Zynga founder, now runs the investment firm Reinvent Capital with Hoffman as a senior adviser, and published a memoir, “Life at the Speed of Play,” last month.

Prentis has already hired more than 25 employees, including researchers who previously worked at OpenAI, Google DeepMind, Meta, Tencent and Alibaba, according to its website.

When you purchase through links in our articles, we may earn a small commission. This doesn’t affect our editorial independence.

Source link

Tech

Remote worker with anxiety wins discrimination case after employer refused to let her turn off her camera

Published

2 hours ago

24 July 2026

NewsAdmin

TL;DR

UK tribunal rules Holiday Extras discriminated against a remote worker by refusing to let her keep her camera off during calls

A UK employment tribunal has ruled that forcing a remote worker with anxiety, ADHD, and autism to turn on her camera during a video training session amounted to disability discrimination. Laura Tait, a home-based travel consultant at Holiday Extras, was awarded compensation after the Croydon tribunal found the company failed to make reasonable adjustments for her conditions. The ruling does not ban camera-on policies outright, but it establishes that employers must consider individual accommodations for disabled workers who find video calls distressing.

Tait joined Holiday Extras in June 2021 as a remote consultant selling travel insurance, a role in which voice calls accounted for roughly three-quarters of customer interactions. By 2022, she had informed managers that work-induced stress was triggering repeated absences and that she could manage her anxiety more effectively through live chat and email channels. She requested that two or three days each week be allocated to text-based work instead of phone or video calls.

The company offered temporary adjustments during periods of phased return but refused to guarantee a permanent shift in her workload, arguing that voice calls were the core business function and that changes would be unfair to other staff. On August 24, 2023, during a remote training session, Tait asked to keep her camera off because she felt “super anxious,” but was told to start with it on and see how she managed. She was unable to cope and had to leave the session.

The 💜 of EU tech

The latest rumblings from the EU tech scene, a story from our wise ol’ founder Boris, and some questionable AI art. It’s free, every week, in your inbox. Sign up now!

Tait went on sick leave in October 2023 and has not returned. The tribunal found that Holiday Extras failed to make several reasonable adjustments, including allowing her to join meetings with her camera off and permanently increasing her share of chat and email shifts. It concluded that accommodating Tait would have had minimal impact on more than 50 other travel consultants and that the company’s refusal left her at a substantial disadvantage.

Employment lawyers cautioned that the decision does not mean all camera-on policies are automatically unlawful, since it turned on Holiday Extras’ specific failure to adjust for a worker whose combined disabilities made video calls particularly burdensome. The case arrives as courts on both sides of the Atlantic increasingly scrutinise how workplace policies interact with disability protections, from camera requirements in remote meetings to AI systems that penalise workers on medical leave. Compensation will be decided at a later hearing.

Source link

Tech

How EU tariffs on Apple and Google affect everyone

Published

2 hours ago

24 July 2026

NewsAdmin

President Donald Trump is escalating the fight over European Union antitrust penalties against Apple, Google, and other U.S. tech companies by opening a trade investigation that could lead to new tariffs.

Trump announced the investigation in a social media post after the European Commission fined Google 890 million euros on July 23 for violating the Digital Markets Act.

The fine included 460 million euros, or about $517 million, for favoring Google services in search results and 430 million euros, or about $483 million, for restricting how businesses direct Google Play users to alternative purchasing options.

“The United States of America is not a PIGGYBANK’ for Europe, nor will we allow it to be,” Trump wrote. He accused the EU of unfairly taking money from American companies and said its penalties should be reversed.

Trump predicted that the investigation would result in a “substantial tariff” against the EU. He didn’t identify a tariff rate, affected products, or a timetable for completing the investigation.

Section 301 of the Trade Act of 1974 allows the U.S. trade representative to investigate foreign government practices considered unjustifiable, unreasonable, or discriminatory and harmful to American commerce. A finding against the EU could support tariffs or other trade restrictions.

The US-based investigation obviously can’t overturn European Commission decisions or erase fines imposed under EU law. Apple and Google must separately challenge those penalties through the European legal system.

The announcement comes three days after 25 Republican lawmakers urged Trump to use Section 301 against the EU’s Digital Markets Act and Digital Services Act.

The lawmakers argued that the rules disproportionately burden American technology companies and give foreign competitors easier access to the U.S. market. European officials maintain that the regulations apply according to companies’ size and market power rather than their nationality.

Trump also threatened to use Section 301 on September 5, 2025, if foreign governments continued imposing fines and regulations that he said discriminated against American technology companies.

Apple is part of the widening US-EU dispute

Although the July 23 Google fine immediately preceded Trump’s announcement, he also named Apple, Meta, Amazon, and other U.S. companies in his criticism of European regulators.

The European Commission fined Apple 500 million euros on April 23, 2025, after finding that its App Store rules restricted developers from directing customers to offers outside Apple’s payment system.

Apple appealed the fine on July 7, 2025, arguing that the Commission’s demands went beyond what the Digital Markets Act requires and dictated how Apple must operate the App Store.

The company has said it spent hundreds of thousands of engineering hours and made dozens of product and policy changes to comply with the law. Apple has also accused the Commission of repeatedly changing its expectations during the compliance process.

European regulators say the anti-steering rules give consumers access to competing offers and prevent gatekeepers from using control of an app store to disadvantage rivals. The Commission has designated services operated by Apple, Alphabet, Amazon, Meta, Microsoft, and ByteDance as gatekeepers under the DMA.

The disagreement is no longer limited to whether individual App Store or Google Play rules comply with European law. Trump is attempting to treat the EU’s regulatory system itself as a discriminatory trade practice, potentially connecting technology enforcement to tariffs on unrelated European goods.

Tariffs won’t settle the underlying legal fight

Section 301 gives the administration a mechanism to investigate the EU and impose trade penalties if U.S. officials find that European regulations burden American commerce. The investigation marks a meaningful escalation, but it won’t determine whether Apple or Google violated European law.

Tariffs are collected from U.S. importers, which may absorb the cost or pass some of it to businesses and consumers. Any duties covering products outside the technology industry could therefore affect companies that had no role in the EU’s enforcement decisions.

The investigation, reported by The Associated Press, also follows a broader expansion of Section 301 action by the Trump administration. New tariffs on imports from more than 60 countries took effect July 23 over alleged failures to enforce bans on goods produced through forced labor.

For Apple, the investigation adds federal trade pressure to a dispute it is already fighting through European courts. It may raise the political and economic cost of future EU penalties, but only European regulators or judges can withdraw or overturn the existing fine.

Source link

Tech

Chinese Companies Are Selling Vapes With Chemicals Potentially More Potent Than Nicotine

Published

2 hours ago

24 July 2026

NewsAdmin

Earlier this year, I picked up a rather unfortunate habit: vaping. More specifically, I started smoking flavored vapes manufactured in China’s so-called “Vape Valley” and sold throughout the United States.

Flavored nicotine vapes are largely illegal in the US, yet convenience stores and tobacco shops across the country have generated billions of dollars in sales from these products in recent years, despite periodic law enforcement seizures and efforts by the Food and Drug Administration to keep them off the market. More recently, Chinese manufacturers have found a new way to sidestep the law, escaping regulatory oversight altogether.

Like many people, I assumed the only addictive chemical in these products was synthetic nicotine, which vape manufacturers have used for years to avoid FDA oversight. But after Congress closed that loophole in 2022—expanding regulation beyond tobacco-derived nicotine—Chinese manufactures began filling their vapes with little-studied chemicals that mimic nicotine’s effects, known as nicotine analogs.

Because current regulations still narrowly define what counts as nicotine, Chinese companies have been able to sell nicotine analog vapes in the US without having to worry about federal tobacco rules, which generally require companies to submit new products to the FDA for scientific review before they can be legally marketed.

“This is just like whack-a-mole, these companies will do everything they can to circumvent regulation,” says Robert Jackler, an emeritus professor of head and neck surgery at Stanford University and the founder of an interdisciplinary research group studying the impacts of tobacco advertising.

The effects of nicotine analogs on humans haven’t been extensively researched, but animal studies have found that one of the most popular variants, 6-methyl-nicotine, could be more potent and addictive than regular nicotine. Vapes containing nicotine analogs may also expose users to other mysterious chemicals. One 2024 study found that some manufacturers were selling nicotine analog vapes that contained additional unlabeled ingredients, including artificial sweeteners and cooling agents with unknown inhalation risks. “What’s on the label has very little relationship to what’s in it,” says Jackler.

Researchers first documented the emergence of vapes containing 6-methyl-nicotine and other nicotine analogs in the US market about three years ago. But the chemical compounds themselves are nothing new—tobacco companies have been researching them since as far back as the 1970s.

A 2005 review of millions of previously secret internal industry documents found Big Tobacco had long explored nicotine-like compounds as replacements for nicotine, in part because they believed they could help circumvent potential regulation.

But US tobacco companies never wound up marketing mainstream products containing any of these nicotine-like chemicals. Instead, they surfaced decades later in disposable vapes made by Chinese manufacturers, which have proven extremely adept at finding new ways to keep selling their wares in the United States.

Chinese vape companies “are extremely creative, they are extremely smart,” says Rich Marianos, a former official with the US Bureau of Alcohol, Tobacco, Firearms and Explosives who is now executive director of the Tobacco Law Enforcement Network, an advocacy group that does not publicly disclose its funding.

“The Chinese have been flooding the American market with illegal vape products designed to target children for years. These fake nicotine products appear to be a new scheme to trick American consumers into putting illicit, potentially dangerous chemicals into their body,” Tim Sheehy, a Republican senator from Montana, said in a statement to WIRED. “The Trump Administration has made cracking down on these unregulated Chinese products a priority, and I hope they continue to sound the alarm on this problem.”

But Jeckler notes that the Trump administration has effectively dismantled the Centers for Disease Control and Prevention office responsible for the agency’s tobacco prevention programs and significantly reduced the size of the FDA’s Center for Tobacco Products. That means much of the effort to regulate nicotine analogs has fallen to the states. Jackler says he’s aware of four states—California, Nebraska, Indiana, and Tennessee—that have expanded their definitions of tobacco products to explicitly include nicotine-like chemicals. But there is still no comprehensive federal law that treats these substances the same way as tobacco-derived or synthetic nicotine.

Source link

Tech

Codeberg Bans Cryptocurrency And LLM-Generated Code Projects

Published

2 hours ago

24 July 2026

NewsAdmin

Community-led open source project hosting site Codeberg has formally announced that projects whose code is largely or fully machine-generated through LLMs and other ‘AI’ tools will no longer be welcome. This follows on the heels of a similar ban on cryptocurrency-related projects.

The community vote was on two issues, the first being the notion that scraping of project code for the use in LLMs should be forbidden, which was a motion that easily passed. The second motion was on disallowing projects whose code was substantially generated by LLMs like Claude, OpenAI Codex, and similar. This motion passed with 358 in favor versus 144 against.

In the earlier linked blog post the reasoning behind especially this second issue is expanded upon, covering not only ‘license whitewashing’, but also the direct and indirect hardware costs, with the expanding ‘AI’ datacenter hyperscaling having massively increased hardware costs for Codeberg over the past years, as the costs have been largely externalized.

Also covered is also the aspect of these LLM-based tools destroying the OSS community, which is something that is backed up by recent studies. Even if we ignore that such LLM-tools are destroying the cognitive abilities of its users, there’s an argument to be made that if LLM-scraping is disallowed, then it’s consistent to also not allow LLM-generated code.

In the Terms of Use you can see these changes, both for LLMs and for cryptocurrency projects.

Thanks to [mk-fg] for the tip.

Source link