Connect with us
DAPA Banner

Tech

Google begins calling out battery-killing Android apps

Published

on

Google is living up to its word and posting warning labels for battery-killing apps. 9to5Google spotted Google’s rollout announcement, which the company previously said would arrive on March 1.

The label says, “This app may use more battery than expected due to high background activity.” If you don’t yet see the warnings, they may not have reached you yet. Google says the banners will “roll out gradually to impacted apps” in the coming weeks.

Play Store battery warning

Play Store battery warning (Google)

Warning labels aren’t the only stick in Google’s fight against infringing apps. They may also be excluded from discovery services like Play Store recommendations.

Google’s definition of battery-draining apps centers around Android’s “partial wake lock” mechanism. This service allows an app to keep the phone’s processor running even while the screen is off. There are logical exceptions where apps do need this: audio playback, location access, etc. But the company apparently sees too many abusing that API for other reasons. And Google wouldn’t want people to assume the problem is with the hardware and switch to an iPhone — because then we’re talking about money.

Advertisement

If you’re a developer, Google’s technical documentation offers much more detail. For everyone else, keep an eye out for those Play Store labels and consider steering clear of those apps until their devs clean things up.

Source link

Continue Reading
Click to comment

You must be logged in to post a comment Login

Leave a Reply

Tech

Apple Can Create Smaller On-Device AI Models From Google’s Gemini

Published

on

Apple reportedly has full access to customize Google’s Gemini model, allowing it to distill smaller on-device AI models for Siri and other features that can run locally without an internet connection. MacRumors reports: The Information explains that Apple can ask the main Gemini model to perform a series of tasks that provide high-quality results, with a rundown of the reasoning process. Apple can feed the answers and reasoning information that it gets from Gemini to train smaller, cheaper models. With this process, the smaller models are able to learn the internal computations used by Gemini, producing efficient models that have Gemini-like performance but require less computing power.

Apple is also able to edit Gemini as needed to make sure that it responds to queries in a way that Apple wants, but Apple has been running into some issues because Gemini has been tuned for chatbot and coding applications, which doesn’t always meet Apple’s needs.

Source link

Continue Reading

Tech

Google & Meta found liable for social media addiction

Published

on

Meta and Google have been found liable for building intentionally addictive social media services, in a trial that sets a strong precedent for hundreds of other lawsuits that are still pending.

Man in a gray t-shirt onstage, shrugging with one arm extended and lips pursed, standing before a dark blue geometric background, wearing a headset microphone.
Meta CEO Mark Zuckerberg

On Wednesday, a jury in Los Angeles Superior Court finished its deliberations over a lawsuit between Meta and Google, and a young woman. The jury found that the tech giants were liable for enabling a woman identified as Kaley to become addicted to social media as a child.
The lawsuit commenced in January, while the jury deliberations started on March 13.
Continue Reading on AppleInsider | Discuss on our Forums

Source link

Continue Reading

Tech

Spotify's SongDNA maps the creative lineage of your favorite music

Published

on


Spotify is working on a new feature to help users discover music. The immensely popular streaming service recently introduced SongDNA, an “immersive” experience that provides extended credits to the people who contributed to one or more music productions.
Read Entire Article
Source link

Continue Reading

Tech

Meta ordered to pay $375m after losing child safety lawsuit

Published

on

Meanwhile, a Los Angeles jury is deliberating a social media addiction lawsuit against Meta and Google.

A New Mexico jury has found that Meta endangered children by misleading users about the safety of its platforms. The decision comes after a nearly seven-week trial, resulting in Meta being told to pay $375m in damages.

“The jury’s verdict is a historic victory for every child and family who has paid the price for Meta’s choice to put profits over kids’ safety,” said New Mexico attorney general Raúl Torrez, who filed the lawsuit against Meta in 2023.

In the suit, he claimed Meta knowingly exposes children to sexual exploitation and mental harm for profit. Meta owns Instagram, Facebook and WhatsApp.

Advertisement

According to the New Mexico Department of Justice, evidence presented at the trial established that Meta’s design features enable bad actors to engage in child sexual exploitation. Evidence also showed platforms are also designed to addict young people, according to the department.

“Meta executives knew their products harmed children, disregarded warnings from their own employees, and lied to the public about what they knew. Today the jury joined families, educators, and child safety experts in saying enough is enough,” Torrez added. The company has been found to have violated parts of the state’s Unfair Practices Act.

Meta plans to appeal the decision. “We respectfully disagree with the verdict and will appeal,” Meta said in statement yesterday (24 March).

“We work hard to keep people safe on our platforms and are clear about the challenges of identifying and removing bad actors or harmful content. We will continue to defend ourselves vigorously, and we remain confident in our record of protecting teens online.”

Advertisement

Meanwhile Torrez will be asking the presiding judge to place additional penalties on Meta during a bench trial scheduled in early May. Torrez will also be asking the court to force Meta to make its apps safer for children.

The New Mexico verdict is a major loss for Meta, which is gearing up for a number of trials set for this year. A jury in Los Angeles is currently deliberating a social media addiction suit against Meta and Google. TikTok and Snapchat were involved in the original suit, but have since settled out of court.

Thousands have filed lawsuits against social media companies over the alleged harm they pose to their users, including more than 40 US state attorney generals.

A coalition suit filed in 2023 accused Meta of designing and deploying “harmful features” on Instagram and Facebook, which get younger people addicted to these platforms.

Advertisement

Don’t miss out on the knowledge you need to succeed. Sign up for the Daily Brief, Silicon Republic’s digest of need-to-know sci-tech news.

Source link

Advertisement
Continue Reading

Tech

Startups pitch big AI ideas during mini-competition at GeekWire’s ‘Agents of Transformation’

Published

on

Big Tech is not alone in the AI innovation race. Four startup founders took the stage at GeekWire’s Agents of Transformation event Tuesday in Seattle for a rapid-fire pitch competition.

Ideas from Pay-i, Cascade, Autessa and GemaTEG were pitched to the crowd and a panel of judges, with Pay-i founder David Tepper emerging as winner and most impressive under pressure.

Judges Bryan Hale of Anthos Capital, Yifan Zhang of AI House, and T.A. McCann of Pioneer Square Labs said they were looking for someone who was “both great at presenting but also fantastic at answering the questions.”

Read more about each pitch:

Advertisement

Pay-i (pitch by David Tepper, founder/CEO)

David Tepper, right, founder and CEO of Pay-i, pitches alongside judges, from left, Bryan Hale of Anthos Capital, Yifan Zhang of AI House, and T.A. McCann of Pioneer Square Labs during GeekWire’s Agents of Transformation event in Seattle on Tuesday. (GeekWire Photo / Kevin Lisota)

An AI spend management platform that tracks ROI across an organization’s entire AI footprint — not just tokens, but the full cost stack including models, tools, and GPU resources.

David Tepper argued that tokens account for only 72% of the total expense associated with AI, and that the complexity multiplies fast when agents are drawing on multiple models, enterprise discounts, and rented GPU banks simultaneously.

Born from his days tracking Microsoft’s internal Gen AI spend on Excel spreadsheets — a period when he says he once saved his division $300,000 a week by simply asking the right questions — the company targets enterprises spending at least $500,000 on AI annually.

“After all the hype and FOMO wears off, there’s three letters that are going to survive the AI revolution, and that’s ROI,” he said.

Advertisement

Cascade AI (pitch by Ana-Maria Constantin, co-founder/CEO)

Ana-Maria Constantin, co-founder and CEO of Cascade, pitches during GeekWire’s Agents of Transformation event in Seattle. (GeekWire Photo / Kevin Lisota)

An agentic HR and IT support platform that deploys AI agents to handle sensitive employee situations — benefits navigation, mental health resources, leave management — confidentially and around the clock, freeing HR teams for higher-stakes human judgment.

Ana-Maria Constantin opened her pitch with a show of hands, asking the audience whether they’d ever hesitated to go to HR because they weren’t sure whose side HR would be on.

“Imagine if that’s the case for the people in this room, senior leaders working for some of the most successful companies in the world,” she said. “Imagine how regular employees are feeling. That’s the problem we’re working on at Cascade AI.”

Autessa (pitch by Roshnee Sharma, CEO)

Advertisement
Roshnee Sharma, CEO of Autessa, pitches at GeekWire’s Agents of Transformation event in Seattle. (GeekWire Photo / Kevin Lisota)

A platform that replaces off-the-shelf SaaS with custom-built software staffed by “AI employees” — agents that handle workflows like lead qualification and order processing.

Roshnee Sharma’s pitch opened with a crowd-participation moment: what does SaaS really stand for? “Software as a spend,” she declared.

The company targets mid-market businesses with $20 million to $500 million in revenue, and prices its AI employees at roughly $7 to $10 each.

Judges pushed back on whether results were truly cost-saving or merely cost-neutral; Sharma argued the savings are real because clients avoid having to hire additional headcount: “We didn’t fire people. We got people able to do more of the work that they wanted to do.”

GemaTEG (pitch by Manfred Markevitch, co-founder/CEO)

Advertisement
Is this any way to cool an overheating data center processor.? Manfred Markevitch, co-founder and CEO of GemaTEG, makes the case for his startup’s alternative at GeekWire’s Agents of Transformation event in Seattle. (GeekWire Photo / Kevin Lisota)

The outlier of the group: a hardware thermal management company targeting AI data centers, using solid-state cooling technology that requires no water and uses 40% less power than conventional systems.

“AI runs on hardware. It’s not only software,” co-founder and CEO Manfred Markevitch told the crowd, noting that a conventional hybrid-scale data center can consume a million gallons of water per day.

GemaTEG’s granular approach cools at the individual chip level rather than the whole building, and the company claims its systems perform twice as well as conventional ones on a per-watt basis. The company already has installations with the U.S. Department of Energy, and partners in Italy and Switzerland.

Hyperscaler deployment is one to two years out, with chip manufacturer design-in conversations already underway. Judges pressed hard on customer lock-in risk; Markevitch compared the stickiness of their solution to Intel Inside — once designed in, it spans multiple chip generations.

Related coverage:

Advertisement

Source link

Continue Reading

Tech

Galway PhD student on what led to her discovery of new exoplanet

Published

on

‘Wispit 2C’ is estimated to be about 5m years old and most likely 10 times the mass of Jupiter.

Galway native Chloe Lawlor has discovered a new planet – the second one to be found forming near an infant star called ‘Wispit 2’, some 437 light years away.

As a child, Lawlor wanted to be an artist, she tells SiliconRepublic.com. However, she changed her mind once she joined university. “I moved into physics because I did like physics in school, so I thought, ‘Oh, maybe I’ll just try this out.’”

The 25-year-old says discoveries such as these feed the innate curiosity humans have in wanting to know how we came to be, how we evolved and why we are here. Lawlor is a PhD student at the University of Galway’s Centre for Astronomy at the School of Natural Sciences and the Ryan Institute.

Advertisement

She is working in collaboration with project lead Richelle van Capelleveen, a PhD student from Leiden Observatory in the Netherlands, and postdoctoral researcher Guillaume Bourdarot from the Max Planck Institute for Extraterrestrial Physics in Germany, to learn more about young planets and how they’re forming.

“Most of the planets that we’ve observed have been much older,” Lawlor says. “We don’t know how they get to those sort of final stages like something like our solar system. This is really key for these formation theories and it’s hopefully going to tell us a lot about these young systems, how they’re forming, and then how they evolve.”

Lawlor’s new discovery, an exoplanet named ‘Wispit 2C’, is thought to be about 5m years old. ‘Wispit 2B’, a nearby planet, was discovered last year by van Capelleveen and Dr Laird Close from the University of Arizona.

Both these exoplanets are at early stages of formation in the disc around Wispit 2, which is located in the Constellation of the Eagle, a prominent equatorial constellation visible in the northern hemisphere summer along the Milky Way.

Advertisement

Lawlor’s discovery makes Wispit 2 the second known young and still forming multi-planet system. The only other system yet discovered with more than one planet developing is PDS 70, some 400 light years away from Earth.

Wispit 2C is a gas giant, likely around 10 times the mass of Jupiter. It is twice as massive as Wispit 2B and orbits four-times closer to its host star, which makes it incredibly difficult to detect with ground-based telescopes.

A mostly black space with a hazy white gaseous looking ring in the middle. A graphic is used to circle the object in the centre of the ring.

Wispit 2B and Wispit 2C forming around Wispit 2. Image: ESO/C Lawlor, R F van Capelleveen et al.

The exoplanet was detected using the European Southern Observatory’s Very Large Telescope in Chile’s Atacama desert. By linking several telescopes together to act as one giant instrument, the research team was able to observe regions very close to the star. In their analysis, the team was able to detect carbon monoxide gas, a chemical commonly found in the atmospheres of young giant planets.

Advertisement

Lawlor said earlier this week: “At first, we weren’t sure if it was a planet or a very large dust clump. We very quickly made follow-up observations using the Very Large Telescope Interferometer, an incredible setup where multiple telescopes can be connected to form a large virtual telescope.

“This allowed us to take what we call a spectrum, which is essentially a chemical fingerprint revealing the elements and molecules in an object’s atmosphere.”

Lawlor led the study, which has been published in The Astrophysical Journal Letters.

Prof Frances Fahy, director of the Ryan Institute, said: “The discovery of the planet Wispit 2C is a remarkable achievement and highlights the world-class astrophysics research taking place at University of Galway.”

Advertisement

The team will continue on with their efforts to hopefully find more planets in the system.

Last year, a study from Scotland’s University of St Andrews showed how giant free-floating planets could make their own miniature planetary systems without needing a star to orbit around. In a different study from 2025, scientists – for the first time – observed the very early stages of the creation of a new solar system around a baby star.

Don’t miss out on the knowledge you need to succeed. Sign up for the Daily Brief, Silicon Republic’s digest of need-to-know sci-tech news.

Advertisement

Source link

Continue Reading

Tech

New Harry Potter TV Series Trailer Reintroduces Hogwarts Magic for HBO

Published

on

Go ahead and get ready to cue your inevitable comparisons: the new trailer for HBO’s Harry Potter series dropped on Wednesday, giving audiences a first look at muggles, magical Hogwarts students and The Boy Who Lived. Due to hit HBO and HBO Max for Christmas 2026, the TV show will be a direct adaptation of the wizarding books, starting off with The Philosopher’s Stone.

To fans familiar with the movie franchise, this may feel like a rediscovery — or reintroduction — to the live-action version of the world of Harry Potter. The trailer shows a young Harry and his signature scar, his tyrant of an aunt and the moment he received his invitation from Hogwarts. Take a look at the first meet with Hagrid, tender moments with Ron and Hermione, and a look at Lucius Malfoy and Snape. 

The series features Dominic McLaughlin as Harry Potter, Arabella Stanton as Hermione Granger and Alastair Stout 
as Ron Weasley, and the expansive cast also includes Nick Frost as Hagrid, John Lithgow as Hogwarts headmaster Dumbledore and Paapa Essiedu as Professor Snape. 

Advertisement

Source link

Continue Reading

Tech

Nintendo might charge less for digital Switch games?

Published

on

Nintendo might finally be doing something gamers have been asking for… forever. The company has officially confirmed that Switch 2 games will have different pricing for digital and physical versions, with digital copies expected to be cheaper.

The change begins in May 2026, starting with titles like Yoshi and the Mysterious Book. For example, early listings on the eShop show the game priced at $59.99 digitally vs $69.99 physically, marking a clear shift in how Nintendo handles game pricing.

Why is Nintendo doing this?

Let’s be real, physical games are comparatively expensive to make. Nintendo says the change reflects the higher costs of manufacturing and distributing cartridges, compared to digital downloads. This aligns with what the industry has been doing for years, except Nintendo has been one of the few holdouts where digital and physical games often cost the same.

There’s also a bigger strategy at play here. By making digital games cheaper, Nintendo could nudge more players toward digital purchases. That essentially translates to higher margins, fewer logistics headaches, and a tighter grip on its ecosystem. In other words, this isn’t just about fairness in pricing… It’s also about where Nintendo wants its future sales to go.

What does this mean for players?

So, does this mean all games going forward will have different prices? Well, not exactly, and this is where things get a little messy. While Nintendo is setting a lower MSRP for digital games, actual pricing can still vary depending on the title and retailer. Plus, not every game will follow the same pattern, so bigger releases could still carry higher price tags, making the gap between digital and physical a bit inconsistent.

For players, though, this is still a win. Digital games are finally getting a clear pricing advantage after years of being oddly equal (or sometimes pricier) than physical copies. That said, the trade-off remains. Physical games can be resold or shared, while digital ones stay locked to your account.

Advertisement

Source link

Continue Reading

Tech

Meta and YouTube Found Negligent in Landmark Social Media Addiction Case

Published

on

A jury found Meta and YouTube negligent in a landmark social media addiction case, ruling that addictive design features such as infinite scroll and algorithmic recommendations harmed a young user and contributed to her mental health distress. The verdict awards $3 million in compensatory damages so far and could pave the way for more lawsuits seeking financial penalties and product changes across the social media industry. “Meta is responsible for 70 percent of that cost and YouTube for the remainder,” notes The New York Times. “TikTok and Snap both settled with the plaintiff for undisclosed terms before the trial started.” From the report: The bellwether case, which was brought by a now 20-year-old woman identified as K.G.M., had accused social media companies of creating products as addictive as cigarettes or digital casinos. K.G.M. sued Meta, which owns Instagram and Facebook, and Google’s YouTube over features like infinite scroll and algorithmic recommendations that she claimed led to anxiety and depression.

The jury of seven women and five men will deliberate further to decide what further punitive damages the companies should pay for malice or fraud. The verdict in K.G.M.’s case — one of thousands of lawsuits filed by teenagers, school districts and state attorneys general against Meta, YouTube, TikTok and Snap, which owns Snapchat — was a major win for the plaintiffs. The finding validates a novel legal theory that social media sites or apps can cause personal injury. It is likely to factor into similar cases expected to go to trial this year, which could expose the internet giants to further financial damages and force changes to their products. The verdict also comes on the heels of a New Mexico jury ruling that found Meta liable for violating state law by failing to protect users of its apps from child predators.

Source link

Continue Reading

Tech

How xMemory cuts token costs and context bloat in AI agents

Published

on

Standard RAG pipelines break when enterprises try to use them for long-term, multi-session LLM agent deployments. This is a critical limitation as demand for persistent AI assistants grows.

xMemory, a new technique developed by researchers at King’s College London and The Alan Turing Institute, solves this by organizing conversations into a searchable hierarchy of semantic themes.

Experiments show that xMemory improves answer quality and long-range reasoning across various LLMs while cutting inference costs. According to the researchers, it drops token usage from over 9,000 to roughly 4,700 tokens per query compared to existing systems on some tasks.

For real-world enterprise applications like personalized AI assistants and multi-session decision support tools, this means organizations can deploy more reliable, context-aware agents capable of maintaining coherent long-term memory without blowing up computational expenses.

Advertisement

RAG wasn’t built for this

In many enterprise LLM applications, a critical expectation is that these systems will maintain coherence and personalization across long, multi-session interactions. To support this long-term reasoning, one common approach is to use standard RAG: store past dialogues and events, retrieve a fixed number of top matches based on embedding similarity, and concatenate them into a context window to generate answers.

However, traditional RAG is built for large databases where the retrieved documents are highly diverse. The main challenge is filtering out entirely irrelevant information. An AI agent’s memory, by contrast, is a bounded and continuous stream of conversation, meaning the stored data chunks are highly correlated and frequently contain near-duplicates.

To understand why simply increasing the context window doesn’t work, consider how standard RAG handles a concept like citrus fruit.

Imagine a user has had many conversations saying things like “I love oranges,” “I like mandarins,” and separately, other conversations about what counts as a citrus fruit. Traditional RAG may treat all of these as semantically close and keep retrieving similar “citrus-like” snippets. 

Advertisement

“If retrieval collapses onto whichever cluster is densest in embedding space, the agent may get many highly similar passages about preference, while missing the category facts needed to answer the actual query,” Lin Gui, co-author of the paper, told VentureBeat. 

A common fix for engineering teams is to apply post-retrieval pruning or compression to filter out the noise. These methods assume that the retrieved passages are highly diverse and that irrelevant noise patterns can be cleanly separated from useful facts.

This approach falls short in conversational agent memory because human dialogue is “temporally entangled,” the researchers write. Conversational memory relies heavily on co-references, ellipsis, and strict timeline dependencies. Because of this interconnectedness, traditional pruning tools often accidentally delete important bits of a conversation, leaving the AI without vital context needed to reason accurately.

Naive RAG vs strucured memory

Naive RAG vs structured memory (source: arXiv)

Advertisement

Why the fix most teams reach for makes things worse

To overcome these limitations, the researchers propose a shift in how agent memory is built and searched, which they describe as “decoupling to aggregation.”

Instead of matching user queries directly against raw, overlapping chat logs, the system organizes the conversation into a hierarchical structure. First it decouples the conversation stream into distinct, standalone semantic components. These individual facts are then aggregated into a higher-level structural hierarchy of themes.

When the AI needs to recall information, it searches top-down through the hierarchy, going from themes to semantics and finally to raw snippets. This approach avoids redundancy. If two dialogue snippets have similar embeddings, the system is unlikely to retrieve them together if they have been assigned to different semantic components.

For this architecture to succeed, it must balance two vital structural properties. The semantic components must be sufficiently differentiated to prevent the AI from retrieving redundant data. At the same time, the higher-level aggregations must remain semantically faithful to the original context to ensure the model can craft accurate answers.

Advertisement

A four-level hierarchy that shrinks the context window

The researchers developed xMemory, a framework that combines structured memory management with an adaptive, top-down search strategy.

xMemory continuously organizes the raw stream of conversation into a structured, four-level hierarchy. At the base are the raw messages, which are first summarized into contiguous blocks called “episodes.” From these episodes, the system distills reusable facts as semantics that disentangle the core, long-term knowledge from repetitive chat logs. Finally, related semantics are grouped together into high-level themes to make them easily searchable.

xmemory

xMemory architecture (source: arXiv)

xMemory uses a special objective function to constantly optimize how it groups these items. This prevents categories from becoming too bloated, which slows down search, or too fragmented, which weakens the model’s ability to aggregate evidence and answer questions.

Advertisement

When it receives a prompt, xMemory performs a top-down retrieval across this hierarchy. It starts at the theme and semantic levels, selecting a diverse, compact set of relevant facts. This is crucial for real-world applications where user queries often require gathering descriptions across multiple topics or chaining connected facts together for complex, multi-hop reasoning.

Once it has this high-level skeleton of facts, the system controls redundancy through what the researchers call “Uncertainty Gating.” It only drills down to pull the finer, raw evidence at the episode or message level if that specific detail measurably decreases the model’s uncertainty.

“Semantic similarity is a candidate-generation signal; uncertainty is a decision signal,” Gui said. “Similarity tells you what is nearby. Uncertainty tells you what is actually worth paying for in the prompt budget.” It stops expanding when it detects that adding more detail no longer helps answer the question.

What are the alternatives?

Existing agent memory systems generally fall into two structural categories: flat designs and structured designs. Both suffer from fundamental limitations.

Advertisement

Flat approaches such as MemGPT log raw dialogue or minimally processed traces. This captures the conversation but accumulates massive redundancy and increases retrieval costs as the history grows longer.

Structured systems such as A-MEM and MemoryOS try to solve this by organizing memories into hierarchies or graphs. However, they still rely on raw or minimally processed text as their primary retrieval unit, often pulling in extensive, bloated contexts. These systems also depend heavily on LLM-generated memory records that have strict schema constraints. If the AI deviates slightly in its formatting, it can cause memory failure.

xMemory addresses these limitations through its optimized memory construction scheme, hierarchical retrieval, and dynamic restructuring of its memory as it grows larger.

When to use xMemory

For enterprise architects, knowing when to adopt this architecture over standard RAG is critical. According to Gui, “xMemory is most compelling where the system needs to stay coherent across weeks or months of interaction.”

Advertisement

Customer support agents, for instance, benefit greatly from this approach because they must remember stable user preferences, past incidents, and account-specific context without repeatedly pulling up near-duplicate support tickets. Personalized coaching is another ideal use case, requiring the AI to separate enduring user traits from episodic, day-to-day details.

Conversely, if an enterprise is building an AI to chat with a repository of files, such as policy manuals or technical documentation, “a simpler RAG stack is still the better engineering choice,” Gui said. In those static, document-centric scenarios, the corpus is diverse enough that standard nearest-neighbor retrieval works perfectly well without the operational overhead of hierarchical memory.

The write tax is worth it

xMemory cuts the latency bottleneck associated with the LLM’s final answer generation. In standard RAG systems, the LLM is forced to read and process a bloated context window full of redundant dialogue. Because xMemory’s precise, top-down retrieval builds a much smaller, highly targeted context window, the reader LLM spends far less compute time analyzing the prompt and generating the final output.

In their experiments on long-context tasks, both open and closed models equipped with xMemory outperformed other baselines, using considerably fewer tokens while increasing task accuracy.

Advertisement
xMemory performance

xMemory increases performance on different tasks while reducing token costs (source: arXiv)

However, this efficient retrieval comes with an upfront cost. For an enterprise deployment, the catch with xMemory is that it trades a massive read tax for an upfront write tax. While it ultimately makes answering user queries faster and cheaper, maintaining its sophisticated architecture requires substantial background processing.

Unlike standard RAG pipelines, which cheaply dump raw text embeddings into a database, xMemory must execute multiple auxiliary LLM calls to detect conversation boundaries, summarize episodes, extract long-term semantic facts, and synthesize overarching themes.

Furthermore, xMemory’s restructuring process adds additional computational requirements as the AI must curate, link, and update its own internal filing system. To manage this operational complexity in production, teams can execute this heavy restructuring asynchronously or in micro-batches rather than synchronously blocking the user’s query.

Advertisement

For developers eager to prototype, the xMemory code is publicly available on GitHub under an MIT license, making it viable for commercial uses. If you are trying to implement this in existing orchestration tools like LangChain, Gui advises focusing on the core innovation first: “The most important thing to build first is not a fancier retriever prompt. It is the memory decomposition layer. If you get only one thing right first, make it the indexing and decomposition logic.”

Retrieval isn’t the last bottleneck

While xMemory offers a powerful solution to today’s context-window limitations, it clears the path for the next generation of challenges in agentic workflows. As AI agents collaborate over longer horizons, simply finding the right information won’t be enough.

“Retrieval is a bottleneck, but once retrieval improves, these systems quickly run into lifecycle management and memory governance as the next bottlenecks,” Gui said. Navigating how data should decay, handling user privacy, and maintaining shared memory across multiple agents is exactly “where I expect a lot of the next wave of work to happen,” he said.

Source link

Advertisement
Continue Reading

Trending

Copyright © 2025