Connect with us
DAPA Banner
DAPA Coin
DAPA
COIN PAYMENT ASSET
PRIVACY · BLOCKDAG · HOMOMORPHIC ENCRYPTION · RUST
ElGamal Encrypted MINE DAPA
🚫 GENESIS SOLD OUT
DAPAPAY COMING

Tech

Uplift Parsons standing desk review

Published

on

Why you can trust TechRadar


We spend hours testing every product or service we review, so you can be sure you’re buying the best. Find out more about how we test.

I’ve said it before, and I’ll say it again: I have tested a vast number of desks. In fact, I would even be able to venture to say that I’m getting close to the arena of saying I’ve tested most of the key desks in the market.

Uplift has been stepping up their game big time. When I saw the Parsons and realized it came in a 48-inch-wide silhouette with those beautiful four legs and a sleek frame, I knew I needed to get my hands on it as soon as possible.

Advertisement

Source link

Continue Reading
Click to comment

You must be logged in to post a comment Login

Leave a Reply

Tech

New Alibaba AI framework skips loading every tool, cutting agent token use 99%

Published

on

As enterprise AI systems scale to handle complex workflows, practitioners face the challenge of routing subtasks to the right tools and skills. Agents can have hundreds of tools and skills and get confused on which one to use for each step of a workflow.

To address this challenge, researchers at Alibaba developed SkillWeaver, a framework that creates an execution graph for a given task and chooses the right skills for each of the nodes. They also introduce Skill-Aware Decomposition (SAD), a novel technique that uses a feedback loop to enable the agent to fetch and vet relevant tool candidates iteratively. This compositional approach and feedback loop mechanism distinguishes SkillWeaver from other tool-routing frameworks that choose tools in a one-shot fashion. 

SkillWeaver relates to real-world AI applications where agents autonomously orchestrate multi-tool ecosystems, such as the Model Context Protocol (MCP), to execute multi-step business operations like downloading datasets, transforming information, and creating visual reports. 

In practice, the researchers’ experiments with SkillWeaver show that implementing this retrieve-and-route approach significantly increases accuracy while reducing token consumption by over 99% compared to naively exposing agents to an entire tool library.

Advertisement

For practitioners building AI agents, the main takeaway is that the granularity of task decomposition is the biggest bottleneck to accurate tool retrieval. 

The challenge of skill routing

Skills are a key pattern in modern LLM agent architectures. A skill is a modular, reusable tool specification that uses structured natural language documentation. 

As enterprise agents integrate with massive tool ecosystems, accurately routing user queries to the right skills becomes a difficult task. Exposing an entire library to an LLM to find the right tool is highly inefficient, quickly overwhelms context limits, and consumes hundreds of thousands of tokens.

Most current tool-use frameworks attempt to solve this through API retrieval, documentation matching, or hierarchical structures that treat routing strictly as a single-skill selection or per-step problem. 

Advertisement

However, this single-skill paradigm is insufficient for enterprise environments because real-world queries are inherently compositional. A standard business request such as “Download the dataset, transform it, and create visual reports” cannot be fulfilled by one tool. It requires breaking the prompt down and sequencing an API client, a data processor, and a visualization tool into a cohesive, multi-step execution plan.

How SkillWeaver and SAD work

To tackle this, the researchers frame the problem of handling complex tasks that require multiple skills as “compositional skill routing.” Given a complex user prompt and a vast library of tools, an agent must simultaneously figure out how to break the request into a sequence of atomic sub-tasks, how to map each sub-task to the single best available skill, and how to compose those skills into an executable plan.

SkillWeaver orchestrates this process through three distinct stages: Decompose, Retrieve, and Compose. In the first stage, an LLM acts as a task decomposer, breaking the user’s complex query down into a sequence of sub-tasks that each require one skill. Once the sub-tasks are clearly defined, the system uses an embedding model to compare each subtask against the skill library to pull a shortlist of the top candidate tools for each step. 

In the final stage, a planner evaluates the retrieved candidates based on how well they work together. It checks for inter-skill compatibility to ensure the outputs of one tool naturally flow into the inputs of the next. It then creates a final execution plan as a Directed Acyclic Graph (DAG) that maps out dependencies so independent tasks can potentially execute in parallel.

Advertisement
SkillWeaver

For example, consider a user asking an AI agent to “Download the dataset, transform it, and create visual reports.” In the decompose stage, the decomposer LLM breaks this into three distinct sub-tasks: downloading the dataset, transforming the data, and creating the reports. 

In the retrieve stage, the system searches the library and finds candidates like “api-client” or “http-fetch” for task one, “csv-parser” or “etl-pipeline” for task two, and so on. Finally, the compose stage evaluates these options, selects the specific combination of “api-client,” “csv-parser,” and “chart-gen” that are most compatible, and wires them together into a final, ready-to-execute workflow.

A key challenge of this pipeline is that LLMs often produce generic step descriptions that fail to match the specific, technical vocabulary of the actual skills available in the library. To fix this, SkillWeaver introduces Iterative Skill-Aware Decomposition (SAD), a novel feedback loop. SAD works by having the LLM draft an initial plan, conducting a preliminary search to find loosely matching skills, and then feeding those retrieved skills back into the LLM as hints. This allows the LLM to rewrite its decomposition so the granularity and vocabulary perfectly align with the actual tools that exist.

SkillWeaver in action

To evaluate how SkillWeaver performs in realistic enterprise scenarios, the researchers created a custom benchmark called CompSkillBench. It consists of 300 multi-step queries of different difficulty levels. To mirror real-world environments, they used a library of 2,209 real-world skills sourced from the public MCP ecosystem, covering 24 functional categories like cloud infrastructure, finance, and databases. 

For the core engine, the researchers primarily used a lightweight 7-billion parameter model (Qwen2.5-7B-Instruct) for task decomposition, paired with a standard semantic search retriever (MiniLM with a FAISS index) to find the tools. SkillWeaver was evaluated against three main setups: a brute-force “LLM-Direct” method where they stuffed all the tool names into the prompt of a large model, a vanilla LLM-based decomposition without SAD, and a ReAct-style agent loop.

Advertisement

The experiments indicate that task decomposition is the main bottleneck. Standard LLM behavior falls short when dealing with large tool libraries, but the SAD feedback loop dramatically moves the needle. In the vanilla setup, the 7B model achieved a decomposition accuracy (i.e., predicting the correct number of steps) only 51.0% of the time. By activating the SAD feedback loop, accuracy jumped to 67.7% (with the larger Qwen-Max model, the accuracy reached 92%). On “hard” tasks requiring four to five distinct skills, SAD improved accuracy by 50%.

SkillWeaver results

In comparison to the naive approach, SkillWeaver reduces token consumption by more than 99% (source: arXiv)

One fascinating finding was that larger models can actually perform worse when unguided. When tested in the vanilla setup, a larger 14-billion parameter model saw its accuracy plummet below the 7B model’s accuracy because it tended to over-decompose tasks into microscopic, unnecessary steps. Once SAD was introduced, the retrieved tool hints anchored the model back to reality and increased its accuracy. This suggests that aligning an agent with the vocabulary of specific tools is often more impactful than paying for a larger, more expensive LLM.

Another important takeaway is token savings. The LLM-Direct baseline, which used the very large Qwen-Max model, showed that feeding all tools into the prompt of a large model fails. Despite near-perfect task breakdown capabilities, the massive model only retrieved the right tool category 21.1% of the time when flooded with tool options. SkillWeaver’s targeted retrieve-and-route approach vastly outperformed this in accuracy while slashing context window consumption from an estimated 884,000 tokens down to roughly 1,160 tokens per query, a 99.9% reduction. For practitioners, this translates directly to drastically lower API costs and faster response times. 

Advertisement

Finally, the traditional ReAct baseline completely failed, achieving 0% decomposition accuracy. Its loop naturally collapses multi-step plans into isolated actions rather than explicitly mapping out a cohesive, multi-tool sequence.

Considerations for developers

While the researchers have not yet released the source code for SkillWeaver, their work was built on off-the-shelf tools that can easily be reproduced. 

Skill-Aware Decomposition (SAD), which is the key innovation at the heart of the framework, is a clever prompt-engineering and retrieval loop. The authors have shared the prompt templates in their paper, and developers can implement it themselves quite easily using standard orchestration libraries like LangChain, LlamaIndex, or even raw Python scripts.

As for the retrieval component, the authors built the core framework using all-MiniLM-L6-v2, an open-source embedding model. They found that swapping in a slightly stronger off-the-shelf encoder (BGE-base-en-v1.5) immediately boosted accuracy without any fine-tuning. While an off-the-shelf bi-encoder is great at getting a relevant tool into the top 10 candidates nearly 70% of the time, it struggles to consistently rank the perfect tool at exactly number one, achieving that only about 37% of the time. To bridge this gap, teams will likely need to implement a secondary cross-encoder or LLM-based reranker to re-order those top 10 candidates.

Advertisement

One upfront preparation requirement is vectorizing the tool library and building a FAISS index in advance. In practice, this is a negligible hurdle. Embedding and indexing all 2,209 skills in the benchmark took a mere 15 seconds. Once built, retrieving tools from the index adds less than 15 milliseconds of latency per query. For enterprise environments, syncing the tool index is a trivial background job. 

A current limitation in SkillWeaver is the lack of error recovery. While SkillWeaver successfully maps out a compatible DAG for execution, the authors’ pilot study revealed the challenges of multi-step tool chains. For example, if an API call fails in step two, the entire chain breaks. The paper’s core contribution is limited to the routing and planning phase. For a true production deployment, practitioners must build their own error recovery, fallback, and retry mechanisms on top of the compose stage to handle real-world API timeouts or malformed outputs.

Source link

Advertisement
Continue Reading

Tech

The US Marines just accepted six F-35Bs carrying lead weights where their radars should have been installed

Published

on


  • Six new F-35Bs entered service carrying ballast inside their noses instead of radar
  • The APG-85 delay created stealth fighters without their primary sensor
  • Lot 17 redesign decisions eliminated compatibility with older radar hardware

The United States Marine Corps has accepted delivery of six newly built F-35B stealth fighters carrying ballast weights where a radar should be installed.

The aircraft left production lines without the AN/APG-85 radar that future F-35 variants are expected to rely upon for combat operations.

Source link

Advertisement
Continue Reading

Tech

Godot Game Engine No Longer Accepts AI Code

Published

on

The Godot Foundation will stop accepting AI-authored code, agent-submitted pull requests, and AI-generated text in contributor communications after maintainers were overwhelmed by low-effort submissions. “It is time for us to recognize that these problems aren’t going away and therefore we need to take steps to reduce the burden on maintainers while ensuring we still have a pipeline to mentor new contributors to become future maintainers,” the Godot Foundation said in a blog post. Contributors may still use AI for limited “menial things” if they disclose it, but humans must understand, own, and be able to fix the code they submit. PC Gamer reports: The Foundation says the pileup of Godot pull requests pending review isn’t all bad: It’s a sign that interest in using and contribution to Godot is increasing. But the influx of contributions authored or submitted by AI is sapping the projects’ maintainers of their willingness to confront the “already tedious” work of reviewing pull requests. “If your feedback on PRs is just being absorbed by a machine and not going towards mentoring a potential future maintainer, it becomes much harder to justify spending your free time on PR review,” the Foundation said.

As the problem becomes increasingly unsustainable, the Godot Foundation says it’s in the process of updating its contribution policies, focusing on “adding barriers to low-effort slop” contributions, encouraging maintainers to review code, developing new contributors into future maintainers, and crucially, requiring that all contributions come from humans who are accountable for their code — and fixing it if it fails. “AI cannot take responsibility, and we can’t trust heavy users of AI to understand their code enough to fix it,” the Foundation said.

The Foundation says we can expect Godot’s contributing policy to soon include explicit rejections of AI-authored code, noting that contributors should only use AI assistance for “menial things” and must disclose its use. Additionally, the Foundation will reject any AI-generated text in human-to-human communications, saying it’s “a basic principle of respect” — though it says machine translations “are still acceptable” if the original text was human-authored. “Things change every day with respect to the current suite of AI tools available,” the Foundation said. “We will continue taking a conservative approach in our policies towards them, but we will re-evaluate as things evolve.”

Source link

Advertisement
Continue Reading

Tech

AI Agent Executes ‘First’ End-To-End Ransomware Attack

Published

on

Sysdig says it has documented the first ransomware attack carried out end to end by an AI agent, which autonomously exploited exposed systems, stole credentials, established persistence, compromised a production database, and destroyed data. The research team named the attacker “JadePuffer” and said it gained initial access to an internet-facing Langflow instance by exploiting CVE-2025-3248. “The most striking characteristic, however, was the LLM’s behavior,” Sysdig director of threat research Michael Clark said in a blog post. An anonymous reader quotes an excerpt from The Register: JadePuffer’s “self-narrating” payloads “contained natural language reasoning, target prioritization, and the kind of detailed annotations that human operators don’t often write but LLM-generated code produces reflexively,” Clark added. “The operation also adapted in real time, retrying failed steps within refined parameters. In one sequence, it went from a failed login to a working fix in 31 seconds.” After exploiting CVE-2025-3248, a missing authentication vulnerability in Langflow that allows remote, unauthenticated attackers to execute arbitrary Python on the host, the AI agent began scanning for and collecting secrets, including LLM provider API keys, cloud credentials “with explicit coverage of Chinese providers” including Alibaba, Aliyun, Tencent, and Huawei, while also scanning for AWS, Azure and Google Cloud Platform, cryptocurrency wallets, and database credentials.

The AI also installed a crontab entry on the Langflow server to maintain persistence and call back to the attacker’s infrastructure every 30 minutes. JadePuffer’s intended target was a separate internet-exposed production server running a MySQL database and an Alibaba Nacos configuration service, we’re told. Nacos is an open-source service-discovery and dynamic configuration platform developed by Alibaba and used in the cloud provider’s microservices applications. The agent connected to the server’s exposed MySQL port using root credentials, although Sysdig doesn’t know how the attacker obtained them. These credentials weren’t stolen from the victim’s environment.

JadePuffer then attacked Nacos via multiple vectors including an authorization bypass flaw (CVE-2021-29441) and forging a valid JSON web token (JWT) using Nacos’s default signing key. Additionally, using its root database access, the LLM injected a backdoor administrator into the Nacos backing database. It ultimately encrypted all 1,342 Nacos service configuration items using MySQL’s built-in AES encryption function, and created an extortion demand, ransom note, Bitcoin payment address, and a Proton Mail contact […]. However, according to the threat hunters, the victim can’t recover the encrypted data, even if they paid the ransom demand, because the agent escalated “from row-level deletion to dropping entire database schemas, narrating its own targeting rationale,” without backing up any of the encrypted data.

Source link

Advertisement
Continue Reading

Tech

Labor Force Participation Rate Falls To Lowest In 50 years

Published

on

The US unemployment rate fell to 4.2% in June largely because 720,000 people left the labor force, pushing participation to 61.5%. Excluding the Covid-era jobs market, that’s the lowest participation rate since June 1976. CNBC reports: The decline in the labor force marks a “massive exodus” driven by multiple factors, said Mike Reid, head of U.S. economics at RBC. “The unemployment rate fell to 4.2% as both the number of unemployed workers and the size of the labor force pulled back,” Reid wrote in a post-report commentary. “This may well be a story of retirements but could also be a story of prior job seekers dropping out of the labor force.”

[…] [T]he rolls of those counted as not in the labor force, a group that includes the unemployed and those not looking for work, jumped by 832,000. And while the establishment survey, which counts jobs filled, showed growth for the month of 57,000, the survey of households, which counts the actual level of those working, tumbled by 507,000. On a year-over-year basis, the labor force is down by just over 1 million, while the level of the employed also has fallen by 1.06 million and the ranks of the unemployed have risen by 40,000. The employment-to-population ratio slipped to 59% in June, the lowest since October 2021. All that has happened while the unemployment rate has risen by just one-tenth of a percentage point to 4.2%.

The drop in participation is sometimes attributed to a shrinking immigrant population and retiring baby boomers and Gen Xers. However, in June the biggest plunge came from what is defined as “prime age” workers, or those between the ages of 25 and 54. That rate fell 0.6 percentage point to 83.3%, its lowest since December 2023. “Looking at the statistics now, that argument doesn’t hold up so well,” North said of the retirement and immigration rationale. “I hate to use the word ‘alarming,’” he added, but said the numbers are cause for concern.

Source link

Advertisement
Continue Reading

Tech

A new attack uses a BioShock-style puzzle to convince AI browsers they're not in the real world

Published

on


Researchers from LayerX recently unveiled BioShocking, a new type of vulnerability designed to target AI-powered browsers capable of executing autonomous tasks on the open web. The security firm explained that BioShocking can “game” an AI-based browser, causing the system to execute malicious instructions after effectively bypassing its intended security guardrails.
Read Entire Article
Source link

Continue Reading

Tech

T-Mobile Jacks Up Prices For Everybody, Ignores Years Of ‘Uncarrier’ Promises

Published

on

from the we-are-now-fully-enshittified dept

In the wake of the Sprint T-Mobile merger, wireless carriers immediately stopped trying to compete on price (exactly what deal critics had warned would happen when you reduce sector competition). T-Mobile, which once tried to differentiate itself as the consumer-friendly “uncarrier,” almost immediately began behaving just like AT&T and Verizon, starting with firing 9,000+ people.

It’s how mindless and harmful consolidation always works. We know this, there’s endless evidence of this, and somehow it never seems to matter in a country too corrupt to function.

In the last few years, T-Mobile’s been facing lawsuits and consumer blowback because it’s constantly jacking up the price for customers who believed they were under a “price lock” guarantee thanks to a 7-year-old promotion promising that their price would never change.

More recently, T-Mobile announced it would be kicking roughly 8 million subscribers off of their traditional (and often cheaper plans), and onto more expensive and shittier new T-Mobile plans. These new price hikes have joined a bunch of other price hikes to make everybody’s bills significantly more expensive and all of their connections less feature rich and useful:

Advertisement

T-Mobile frames the current migration as an average $4-per-line adjustment, according to CNET. That sounds modest until you stack it on the $5-per-line hike that already hit many legacy smartphone plans back in April 2025. PhoneArena reports some customers on older grandfathered plans face total increases approaching 60% compared to their original rates. Meanwhile, administrative fees for voice lines climbed from $3.99 to $4.49 per month — raised twice within a single year, according to tmo.report — with mobile internet line fees moving from $1.60 to $2.10.

This must be more of that deregulatory, consolidative innovation my Libertarian friends at “non profit” “free market” “think tanks” have spent years telling me about.

This was, of course, something merger critics warned about, very vocally, for a long time. I wrote repeatedly, at multiple outlets, about how this deal’s pre-merger promises were utterly worthless. It didn’t matter, because the federal government is too corrupt to function in the public interest, antitrust reform no longer exists, and the electorate very clearly has a head full of cottage cheese.

Meanwhile all the folks responsible — whether corrupt politicians, shitty Libertarian free market think tanks, or cocky executives — have long-since moved on to other terrible ideas and memory holed the entire thing, while consumers and labor — as always — are forced to eat all of the real-world costs.

Anyway, remember when T-Mobile bribed Trump to get the merger approved, eliminated all of its “DEI” requirements like an obedient poodle, or that time they hired Corey Lewandowski as a consultant just days after he mocked a Down Syndrome kid on cable TV? Great stuff. So many memories.

Advertisement

Filed Under: competition, consolidation, enshittification, john legere, layoffs, mergers, price hikes, prices, telecom, wireless

Companies: t-mobile

Source link

Advertisement
Continue Reading

Tech

Anthropic is in talks with Samsung to manufacture a custom AI chip

Published

on

TL;DR

Anthropic is discussing a custom AI chip with Samsung, though the project is early-stage and no design has been finalized.

Anthropic is in talks with Samsung Electronics to explore manufacturing a custom AI chip, The Information reported on Thursday. The project remains at an early stage, and Anthropic has not yet decided what the chip would be used for, how powerful it would be, or how it would fit into a server, according to the report. The company could still abandon the effort entirely.

When asked for comment, Anthropic told TechCrunch that a diversified hardware stack including chips from Google, Amazon, and Nvidia will continue to be central to its compute strategy, and said it had nothing further to add on the Samsung discussions. Samsung already plays a significant role in the AI chip supply chain as a major manufacturing partner for Nvidia, producing chips that power AI training and inference workloads. The two companies are also building an AI chip factory together in South Korea.

The talks follow a Reuters report in April that Anthropic was exploring the idea of building its own chips as Claude’s compute demands outpaced available supply. At the time, the effort was described as preliminary, with no dedicated team assembled and no commitment to a specific design. What has changed since April is that Anthropic has hired Clive Chan, who previously helped build OpenAI’s custom chip programme, a signal that the company is moving from exploration to active development.

Advertisement

The timing also coincides with a move by Anthropic’s main competitor. Last week, OpenAI unveiled its first custom chip, a Broadcom-built inference processor it calls the “Intelligence Processor,” designed to reduce the company’s dependence on Nvidia hardware. Amazon and Google both already offer their own custom silicon through their cloud platforms, and Anthropic currently runs Claude across all three chip families.

Anthropic’s annualized revenue run rate surpassed 30 billion dollars earlier this year, more than tripling from roughly nine billion dollars at the end of 2025, a growth rate that makes the economics of custom silicon increasingly attractive. The company signed a long-term deal with Google and Broadcom in April for roughly three and a half gigawatts of TPU compute starting in 2027, but designing its own chips would give it an additional layer of control over the hardware that runs its models. Whether Samsung or another manufacturer ultimately builds a chip for Anthropic remains an open question, but the direction of travel across the industry, away from total reliance on Nvidia, is now unmistakable.

Source link

Advertisement
Continue Reading

Tech

Nvidia recruits longtime Microsoft sales leader Nick Parker with $40M+ pay package

Published

on

Microsoft executive Nick Parker at a conference in 2018. (Microsoft Photo)

Nick Parker, a 26-year Microsoft veteran who led the company’s worldwide commercial sales business, is leaving to become Nvidia’s new sales chief — a high-profile talent shift between two of the biggest players in the AI boom. 

Parker will join Nvidia as executive vice president of worldwide field operations, effective Aug. 24, according to a regulatory filing. He succeeds Jay Puri, who is retiring after 21 years running Nvidia’s global sales operation and will stay on as a senior adviser. 

“Microsoft and NVIDIA are great partners and I look forward to continuing to nurture that fantastic relationship,” Parker wrote in a LinkedIn post announcing the move.

The regulatory filing by Nvidia sets Parker’s base salary in the new role at $1 million, with a $5 million signing bonus and equity grants targeted at $40 million. The bulk of that, $35 million in restricted stock units, vests over roughly four years, while the additional $5 million in shares is tied to Nvidia outperforming the S&P 500 over three years.

The new role puts him in charge of global sales and customer relationships at the center of the AI boom, reporting directly to Nvidia CEO Jensen Huang — one of the most consequential commercial roles in the industry, overseeing the operation that sells Nvidia’s chips to the world’s largest companies.

Advertisement

Parker, 55, rose through OEM, device and partner sales roles at Microsoft before being named president of industry and partner sales in 2022. After a promotion this year, he served most recently as executive vice president and chief business officer of Microsoft Worldwide Sales & Solutions, reporting to Judson Althoff, CEO of Microsoft’s commercial business.

Puri, 71, is credited with helping transform Nvidia from a consumer gaming brand into an AI infrastructure giant, building the enterprise sales operation Parker will now inherit.

On Thursday, Microsoft unveiled a $2.5 billion initiative called the Microsoft Frontier Company, which will embed AI engineers inside customers. It will be led by Rodrigo Kede Lima, a longtime Microsoft sales and enterprise leader, most recently president of Microsoft Asia. 

Source link

Advertisement
Continue Reading

Tech

Samsung is plotting to replace your M.2 SSD with a storage chip smaller than a fingernail to improve battery life and supercharge ondevice AI inference

Published

on


  • Samsung reveals industry first UFS 5.0 storage: A single fingernail-sized chip that can read at up to 10.8GB/s
  • With capacities of up to 1TB on offer, it makes a solid case for displacing existing M.2 drives in laptops, handhelds and game consoles amongst other hardware
  • The chip is also relevant for AI, offering a significant increase in read speeds for on-device AI solutions as a 40% decrease in power consumption versus UFS 4.1

Samsung has unveiled what it says is the industry’s first look at UFS 5.0, a new storage standard for its customers.

The memory and storage giant unveiled its new storage chip on the 23rd of June while positioning its embedded storage standard as an important breakthrough for localized, or on-device, AI solutions.

Source link

Advertisement
Continue Reading

Trending

Copyright © 2025