Connect with us

Tech

How to watch Jensen Huang’s Nvidia GTC 2026 keynote

Published

on

Nvidia kicks off its annual GTC developer conference in San Jose, California, next week with CEO Jensen Huang’s keynote scheduled for Monday at 11am PT / 2pm ET.

GTC — which stands for GPU Technology Conference — is Nvidia’s flagship annual event, where the chipmaker typically uses the spotlight to announce new products, champion partnerships, and lay out its vision for the future of computing. Huang’s keynote will focus on Nvidia’s role in the future of computing and AI. You can watch the two-hour address in person at the SAP Center or livestream the talk on the event’s website.

The broader three-day event is focused on what’s coming next for AI across industries including healthcare, robotics, and autonomous vehicles, among others.

On the software side, it’s rumored that Nvidia will release an open source platform for enterprise AI agents, dubbed NemoClaw, as originally reported by Wired. The platform would give businesses a structured way to build and deploy AI agents (software that can carry out multi-step tasks autonomously) and would position Nvidia to mirror similar offerings from companies like OpenAI.

Advertisement

On the hardware side, the company is also rumored to be releasing a new chip designed to accelerate the AI inference process — the process by which an AI model applies what it has learned to generate responses or make decisions, as distinct from the initial training process, which requires far more computing power. Faster, cheaper inference is widely seen as one of the last bottlenecks to scaling AI applications broadly. The chip, if confirmed, would represent Nvidia’s latest bid to dominate not just the training market, where it already commands an estimated 80% share, but the inference market as well, where competition from custom chips built by Google, Amazon and others is fast intensifying.

Kevin Cook, a senior equity strategist at Zacks Investment Research, told TechCrunch that attendees should also expect to learn what the company plans to do with its relationship with Groq, the inference company Nvidia reportedly paid $20 billion late last year to license its technology. There’s a lot of curiosity around this tie-up, given that Jonathan Ross, Groq’s founder, Sunny Madra, Groq’s President, and other members of the Groq team agreed to join Nvidia to help advance and scale that licensed tech.

There will, of course, also be a range of partnership announcements and demonstrations showcasing Nvidia’s AI capabilities across industries.

Techcrunch event

Advertisement

San Francisco, CA
|
October 13-15, 2026

Source link

Advertisement
Continue Reading
Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Tech

‘Uncanny Valley’: Anthropic’s DOD Lawsuit, War Memes, and AI Coming for VC Jobs

Published

on

Brian Barrett: The irony is my favorite part because I feel like venture capitalists have largely positioned themselves as immune to the effects of AI because they’re very special and surely a machine can—

Zoë Schiffer: It’s art, not science.

Brian Barrett: Yeah. It’s art, not science. Machines can take every job, but not us. The ladder stops just below VC for them in a way that is entertaining and fun. So I wonder how many people are actually using this now, especially because venture capitalists themselves are so skeptical of it, it seems like. Who’s the audience? Is it finding real traction out there?

Zoë Schiffer: Yeah. So the way that ADIN works is they have scouts that go out and look for potential deals, and then those scouts can make money on said deals. So I think this would be something where VCs wouldn’t necessarily be adopting the network, but people would be going around them, and they wouldn’t be as necessary, as useful. I think there was another great irony, which Arielle pulled out in her piece, which is that also, if you can start a company with just yourself and a bunch of AI agents, you’re vibe coding your way to success. Do you even need all of that venture capital money to begin with?

Advertisement

Leah Feiger: I don’t know. There’s so much to me, there’s so much fear about AI taking jobs. I feel like every other article that is like, “And these people are nervous, and these people are nervous.” Brian’s right, the part that is funny is these are the folks that have just gone all in on AI, but I’m still waiting. I’m still waiting for AI to take the jobs. Has it yet? Will it yet?

Zoë Schiffer: Yeah. I think that there’s recent research. I was talking to Will Knight, one of our fantastic AI reporters, about this yesterday, and he was saying, “Look, the evidence just isn’t there yet for many, many industries. The hype has, as it often does, gone way out ahead of the actual data here. We don’t know that AI is taking jobs.” But I will say, being in San Francisco, I am hearing a lot of people say engineering teams in particular are very bloated right now. Agents can actually do a lot of the work, and you definitely need humans on top managing those agents, but you could cut a lot of teams by 80 percent, 50 percent, 60 percent. And so I think that we are going to see more AI-related job loss, first in engineering and then in other sectors.

Brian Barrett: Marc Andreessen, famous venture capitalist, cofounder of Andreessen Horowitz, said this very thing in a recent podcast. Listen to how special he thinks his own profession is.

Marc Andreessen, archival audio: Every great venture capitalist in the last 70 years has missed most of the great companies of his generation. If it was a science, you could eventually have somebody who just dials it in and gets 8 out of 10, but in the real world, it’s not like that. It’s just you’re in the fluke business. And so there’s an intangibility to it. There’s a taste aspect, the human relationship aspect, the psychology. And I don’t want to be definitive, but it’s possible that that is quite literally timeless. And when the AIs are doing everything else, that may be one of the last remaining fields that people are still doing.

Advertisement

Source link

Continue Reading

Tech

F1's debut race in the 2026 season beat everyone's expectations, even Apple's

Published

on

Apple’s senior VP of services, Eddy Cue, is happy to share that viewership of the Australian Grand Prix was higher on Apple TV versus ESPN in 2025, but that’s as much detail as we can expect.

The F1 streaming deal brings every race of the season to Apple TV
The F1 streaming deal brings every race of the season to Apple TV

It’s the first race of the 2026 season, but Apple is already out celebrating the turnout with F1. The ecosystem-wide push helped Apple in other areas too, like the Sports app seeing its biggest week ever.
These revelations were shared by Services SVP Eddy Cue via Hollywood Reporter. The success can likely be attributed to Apple’s significant marketing push and the popularity of the F1 movie.
Continue Reading on AppleInsider | Discuss on our Forums

Source link

Continue Reading

Tech

Here’s the pitch: UW students get in the room with key investors to share their AI startup ideas

Published

on

A team of entrepreneurial students from the University of Washington pitches their idea for a startup called Wayfinder to a room full of investors at Pioneer Square Labs in Seattle on Wednesday. (GeekWire Photo / Kurt Schlosser)

It was a pitch-perfect evening at Pioneer Square Labs in Seattle on Wednesday as students from the University of Washington laid out their startup business plans in front of some of the city’s most influential venture capitalists.

The event was the culmination of a 10-week class — “Entrepreneurship: Company-Building from Formation to Successful Exit” — in which students are taught to develop a business plan, pitch, and product demo.

The program is taught by venture capitalist Greg Gottesman, co-founder and managing director at PSL, along with Ed Lazowska, a longtime computer science professor at the UW. Throughout the quarter, the class attracts a “who’s who” of guests from Seattle’s tech industry, including Amazon CEO Andy Jassy, Microsoft CFO Amy Hood, and Zillow and Expedia co-founder Rich Barton.

Seven teams comprising 67 undergraduate and graduate students came from across campus disciplines including business, computer science, design and more. They pitched a variety of tech solutions aimed at helping new parents, college students, seniors and lots of people in between. And everything relied on a healthy dose of artificial intelligence.

The teams gathered at PSL’s Pioneer Square offices to deliver their final pitches and answer questions from VCs. This year’s event included a new wrinkle: one pitch round was held in a conference room meant to mimic a high-stakes, real-world meeting, where students were interrupted and sidetracked throughout their presentations. Another round of eight-minute pitches in front of all the other teams remained uninterrupted.

Advertisement

“One of the reasons that this is so fun is we really do learn a lot more from the students than they even learned from us,” Gottesman told GeekWire. “They’re at the cutting edge of using AI, thinking about new solutions to old problems and some new problems.”

PSL Managing Director Greg Gottesman, second from left, takes a call during the pitch from HeyLily, a team working on a product to detect scam calls. (GeekWire Photo / Kurt Schlosser)

Lazowska said the class, which attracted 150 applicants, starts with coaching and can often mirror the startup emotional rollercoaster: an idea seems unachievable, pivots happen, and depression sets in. Then, by the end, a “miracle” occurs. He attributes much of the success to the diversity of majors.

“A big part of it is those people learning to work together, learning each one has something really exciting to contribute,” Lazowska said. “And for tech people, you always think that tech is what makes a company, and that’s totally wrong in most cases, right?”

The goal is to get to a viable business by the end of the course, and it was clear students had achieved that as they discussed go-to-market strategies, the competitive landscape, potential revenue streams, and more. VCs from PSL, Madrona, Flying Fish Ventures, Fuse, Voyager Capital, Ascend and elsewhere provided input and advice.

UW professor Ed Lazowska, left, listens as Devang Thakkar, global head of Christie’s Ventures, shares feedback with entrepreneurial students at PSL in Seattle. (GeekWire Photo / Kurt Schlosser)

Ideas and reaction from students included:

  • Adelin Mah, a second-year computer science student, pitched with her team Instant Quote, a proposal generator using AI to speed such a process for tradespeople, starting with professional painters. Mah liked how the mix in the class put her alongside people who are already gaining experience at companies including Amazon and Google. “I was really into entrepreneurship, not as a career prospect, but just interest,” she said. “I build a lot of projects from hackathons and places like that, so I was already building, but I wanted to take it a step further.”
  • Tanmay Shah, a computer science graduate student who is a software engineer at Uber, was pitching with his team Wayfinder, a tool for helping college students (and their parents) stay on top of the application and admissions process. “One of the things that I’ve realized in the last couple of years is that there is a big opportunity out there to build something of your own and create a wedge very easily into existing markets,” Shah said. “This class is super good in terms of taking you from zero to somewhere where you can actually pitch your idea to a VC.”
  • Avni Rao is a third-year computer science major at UW who also leads a club called Computing Community. Her team, Nurture, was pitching a wearable baby monitor designed to collect data on infant sleep patterns. “I think I’ve probably learned the most in this class than any other class I’ve taken,” Rao said, adding that the experience does a good job of keeping up with the real world and an industry that is moving very fast.
Investors raise their hands to signal support for the top team pitches following the UW class event held by Greg Gottesman and Ed Lazowska at PSL in Seattle on Wednesday. (GeekWire Photo / Kurt Schlosser)

Second-year MBA Anshula Singh of AuthScript was part of the pitch that received the most top votes from judges. AuthScript’s embedded AI agent serves as an intelligent clinical partner for physicians by securely analyzing patient records in real-time and submitting complex prior authorization forms in seconds.

Singh said their idea is tackling “the most burdensome administrative problem in healthcare” around prior authorization, a tactic used by insurance companies to control costs. Alongside co-founder Jessica Hadley, another MBA student with experience in the health care sector, Singh said the class taught the team to double down and stay connected to an idea they believed in.

Advertisement

“We faced so many hurdles,” Singh said. “I think the first time that Greg heard the idea, he was like, ‘Nah. Why are you guys doing this?’ And we said, ‘No, there is a problem, there’s a market, and there’s people that are willing to pay.’”

Jacob Colker, co-founder and managing director of Seattle’s AI2 Incubator, summed up the room’s reaction to AuthScript: “You impressed 17 of Seattle’s top investors.”

His sentiment was echoed by other investors, who noted that every team would have earned a second meeting based on the quality of their pitches.

Source link

Advertisement
Continue Reading

Tech

Stewie from Family Guy is getting his own two-season spinoff series

Published

on

Fox is expanding the Family Guy universe with a new spinoff centered on one of its most iconic characters. The network has officially ordered ‘Stewie’, a standalone animated series focused on Stewie Griffin, the mischievous baby genius from the long-running comedy show.

Seth MacFarlane, the creator of Family Guy, will return as the voice of Stewie and will also serve as an executive producer on the project. Longtime Family Guy writer and producer Kirker Butler will take on the role of showrunner, helping shape the direction of the new series.

What the new Stewie spinoff series is about

Unlike the main Family Guy series, which follows the chaotic Griffin family in the fictional town of Quahog, the new show focuses entirely on Stewie’s life and adventures. Fox provided an official description of Stewie, which was reported earlier today by Deadline:

“After getting the boot from his old preschool, Stewie is forced to enroll in a new one that’s not exactly top-of-the-line. It’s attended by a handful of kids he doesn’t know, and a 75 year-old class turtle with a half-cocked theory on just about every subject.

Stewie’s miserable, the other kids are miserable, and even the turtle is miserable…until Stewie begins rolling out his trusty array of devices to take them anywhere in space and time, turning every boring day at school into an insane and surreal adventure.”

Advertisement

Brian, the Griffin family dog and Stewie’s closest ally in the best episodes of Family Guy, is not mentioned in the synopsis. However, Deadline reports he will appear as a recurring character.

Family Guy’s Stewie spinoff release date and streaming plans

Fox has ordered two seasons of the series from the start, and the show is currently planned for the 2027–2028 television season.

Both Family Guy and Seth MacFarlane’s other Fox animated series, American Dad, are already renewed through 2029, and the new two-season order for Stewie aligns with that timeline.

When it premieres, Stewie will air on Fox and will stream the next day on Hulu in the US. International audiences will also be able to watch it through Disney+.

Advertisement

Source link

Continue Reading

Tech

Ctrl-Alt-Speech: Writing Some Wrongs | Techdirt

Published

on

from the ctrl-alt-speech dept

Ctrl-Alt-Speech is a weekly podcast about the latest news in online speech, from Mike Masnick and Everything in Moderation‘s Ben Whitelaw.

Subscribe now on Apple Podcasts, Overcast, Spotify, Pocket Casts, YouTube, or your podcast app of choice — or go straight to the RSS feed.

In this week’s round-up of the latest news in online speech, content moderation and internet regulation, Mike and Ben cover:

Advertisement

Play along with Ctrl-Alt-Speech’s 2026 Bingo Card and get in touch if you win!

Filed Under: ai, artificial intelligence, child safety, content moderation, molly russell, publicity rights, trust and safety

Companies: grammarly, meta, whatsapp

Source link

Advertisement
Continue Reading

Tech

Professional Community Investment Yields Big Returns

Published

on

Engineering is so much more than solving problems or writing efficient code. It is about creating solutions that affect billions of lives and contributing to a profession built on innovation, responsibility, and collaboration. Although technical skills remain critical, what truly will accelerate the growth of the next generation of engineers is community and professional involvement.

Learning from communities

University programs provide a strong foundation in theory and practice, but they cannot capture the complexity of real-world engineering. As an IEEE senior member, I believe professional communities such as IEEE can help bridge the gap by offering:

I have served as a mentor and judge for a variety of hackathons across different age groups, including high school competitions United Hacks and NextStep Hacks, as well as graduate-level events such as HackHarvard.

The experiences demonstrate how transformative community-driven opportunities can be for young engineers. They provide exposure to teamwork, innovation, and the realities of solving problems at scale.

Advertisement

The power of mentorship

Engineers don’t develop skills in isolation. Mentorship, whether formal or informal, plays a pivotal role in shaping careers. Senior professionals who invest in guiding students and early-career engineers pass on more than technical knowledge. They share decision-making approaches, ethical considerations, and strategies for navigating careers, thereby expanding the engineering field.

As a keynote speaker at conferences, I have seen how sharing real-world experiences can ignite students’ curiosity and confidence. What they often value most is not a lecture on technology but candid insights into how to be resilient, grow their career, and learn about the different engineering paths.

Building ethical awareness

With the rise of artificial intelligence, biotechnology, and other high-impact innovations, engineers’ ethical responsibilities are more important than ever. Professional organizations such as IEEE and ACM emphasize codes of ethics and standards to help ensure that technology is developed responsibly.

Through my work as a peer reviewer and committee member for IEEE and ACM conferences, including those at the university level, I have seen how the organizations promote rigor and accountability.

Advertisement

When students engage with such communities early, they can not only expand their technical knowledge but also build an understanding of responsible innovation.

Networking as a catalyst for innovation

Engineering breakthroughs often emerge at the intersections of different fields. Professional communities create the space for such interactions. A student working on computer vision, for example, might discover health care applications by collaborating with biomedical engineers.

While reviewing papers for conferences, I have seen how interdisciplinary ideas spark promising innovations.

I bring the same perspective to my role as an IEEE Collabratec mentor, connecting with innovators across different disciplines and industries.

Advertisement

“When we invest in the community, we invest in the future of engineering.”

By collaborating on projects and expanding your reach, you can find the mentors or partners you need to inspire your next breakthrough.

Participating in forums allows students and professionals alike to broaden their horizons and explore solutions that go beyond traditional boundaries.

Giving back shapes leadership

Community involvement is not only about what you gain. It is also about what you give. Engineers who volunteer for educational programs, STEM initiatives, and professional committees can develop leadership skills that extend beyond technical expertise. They can learn to inspire, organize, and guide others.

Advertisement

Judging hackathons and mentoring student teams reminds me that leadership often begins with service. When experienced professionals actively invest in the growth of others, they help create a culture wherein learning and leadership are passed forward.

Preparing for a lifelong journey

Learning how to be an engineer doesn’t end when you earn your degree. It is a lifelong journey of learning, adapting, and contributing. By engaging with communities and professional networks early, students and graduates can develop habits that serve them throughout their career. They can stay current with emerging trends, build trusted professional relationships, and gain resilience through shared challenges.

Community involvement can transform engineers from problem-solvers into change agents.

Investing in the community

The future of engineering depends not only on technological advancement but also on the collective strength of its communities. By fostering mentorship, encouraging collaboration, and embedding ethical responsibility, professional and community involvement can ensure that the next generation of engineers is prepared to meet tomorrow’s challenges with competence and character.

Advertisement

My journey as a mentor, judge, keynote speaker, and peer reviewer has reinforced a clear truth: When we invest in the community, we invest in the future of engineering. The students and young professionals we support today will be the ones building the world we live in tomorrow.

From Your Site Articles

Related Articles Around the Web

Advertisement

Source link

Continue Reading

Tech

John Solly Is the DOGE Operative Accused of Planning to Take Social Security Data to His New Job

Published

on

John Solly, a software engineer and former member of the so-called Department of Government Efficiency (DOGE), is the DOGE operative reportedly accused in a whistleblower complaint of telling colleagues that he stored sensitive Social Security Administration (SSA) data on a thumb drive and wanted to share the information with his new employer, multiple sources tell WIRED.

Since October, according to a copy of his résumé, Solly has worked as the chief technology officer for the health IT division of a government contractor called Leidos, which has already received millions in SSA contracts and could receive up to $1.5 billion in contracts with SSA based on a five-year deal it signed in 2023. Solly’s personal website and LinkedIn have been taken offline as of this week.

Responding to a request for comment, Solly, through his legal counsel, denied engaging in any wrongdoing. A spokesperson for Leidos also said the company found no evidence supporting the whistleblower’s claims against Solly.

Solly was one of 12 DOGE team members at SSA, where, according to the résumé on his personal website, he supported “other DOGE engineers on initiatives including Digital SSN, Death Master File cleanup,” and “SSN verification API (EDEN 2.0).” The “death master file” is an SSA database containing millions of Social Security records of deceased people and is maintained so that their identities can’t be used for fraud. An API, or application programming interface, allows different programs to talk to each other, including pulling data and information from each other. In this case, it could allow Social Security data to be accessed by agencies and institutions outside of SSA.

Advertisement

The allegation was revealed in a complaint filed to SSA’s internal watchdog first reported earlier this week by The Washington Post, which did not name Solly or Leidos. According to the Post, the complaint was filed with the SSA’s Office of the Inspector General earlier this year and alleges that the former DOGE employee told coworkers he took copies of the SSA’s Numerical Identification System, or NUMIDENT, as well as the “death master file.” NUMIDENT is a master SSA database containing all information included in a Social Security number application, including full names, birth dates, race, and more personally identifiable information.

In the complaint, according to the Post, a whistleblower alleges that the former DOGE employee sought help transferring a set of data from a thumb drive to a personal computer so he could “sanitize” it before uploading it for use at a private-sector company. The former DOGE employee allegedly said that he expected to receive a presidential pardon if his actions were unlawful, the complaint reportedly stated.

Solly “did not share, access, or view any personally identifiable information (PII) maintained by SSA, including SSA’s Death Master File (DMF) and Numerical Identification System (Numident). The allegations made by a supposedly anonymous source are patently false and slanderous. Mr. Solly will take all appropriate steps to clear his good name and stellar reputation,” says Seth Waxman, who is representing Solly. “He is certain that any fair review of the facts and circumstances surrounding these spurious allegations will fully exonerate him.”

Leidos is a major contractor for SSA. Between 2010 and 2018, the company brought in millions of dollars in SSA IT contracts. In 2018, Leidos was awarded contracts potentially worth up to $639 million for IT support services and processing disability claims. In 2023, the company announced that it had been awarded an estimated $1.5 billion IT contract with the agency. As part of DOGE’s blitz into the US government in early 2025, Leidos, like many government contractors, saw some of its contracts cut.

Advertisement

Source link

Continue Reading

Tech

Wayve, Uber, Nissan to launch robotaxis in Tokyo

Published

on

A roll-out is planned later this year, with safety riders initially deployed in all robotaxis.

UK start-up Wayve, Uber and Nissan are collaborating to deploy robotaxi services in Tokyo.

In a joint press release, the companies said that the deal will see Nissan’s Leaf electric vehicles equipped with Wayve’s AI technology, made available to customers via Uber’s platform. A roll-out is planned later this year.

During the initial phase, safety riders will be seated in all robotaxis, the three said. Last September, Nissan said that it was testing a driver assistance system that uses Wayve’s technology, with a planned launch in Japan in 2027.

Advertisement

“We have been testing our technology throughout Japan since early 2025, building extensive experience in the country’s unique road environments,” said Alex Kendall, the co-founder and CEO of Wayve.

“Partnering with Uber and Nissan to begin pilot deployment of robotaxi[s] allows us to introduce this technology in a responsible way, while continuing to learn and expand.”

This is Uber’s first robotaxi partnership in Japan. The company recently announced its plans for an international roll-out that also includes London, Madrid, Munich, Hong Kong, and a number of US cities.

Uber’s London roll-out this spring is in partnership with Wayve, a company it backs. The ride-hailing platform recently announced its intentions to become the leading provider of robotaxi services by 2029.

Advertisement

“Autonomous mobility is becoming an increasingly important part of the Uber platform,” said Dara Khosrowshahi, the CEO of Uber, of the three-way partnership.

“Following our planned pilot deployment in London, we look forward to expanding into Tokyo and introducing new, modern ways to travel in some of the world’s largest cities … Our goal is to give riders more ways to move with seamless access through the Uber app.”

Ivan Espinosa, the president and CEO of Nissan, said: “Our work with Wayve to integrate advanced AI technology across our consumer vehicle portfolio has laid strong foundations, and we are excited to take this partnership further with a pilot deployment of robotaxi[s] in Tokyo, bringing together Wayve’s AI technology, Uber’s network and Nissan vehicles.

Nissan supported Wayve in a $1.2bn Series D round announced in February. Big-name backers Nvidia and SoftBank also participated in the round.

Advertisement

Don’t miss out on the knowledge you need to succeed. Sign up for the Daily Brief, Silicon Republic’s digest of need-to-know sci-tech news.

Source link

Advertisement
Continue Reading

Tech

The who, what, and why of the attack that has shut down Stryker’s Windows network”

Published

on

What else is known about Handala Hack?

The group has existed since at least 2023. It takes its name from a character in the political cartoons of Palestinian artist Naji al-Ali. The group’s logo depicts a small Palestinian boy who is a symbol associated with Palestinian resistance.

Check Point and other security firms have said Handala Hack is affiliated with Iran’s Ministry of Intelligence and Security and maintains multiple online personas. Compared to other nation-state-sponsored hacking groups, Handala Hack has kept a comparatively lower profile. Still, it has carried out a series of destructive wiping attacks and influence operations over the years.

Around the same time the Stryker attack came to light, posts to a Telegram account and website controlled by Handala Hack took credit for the takedown. Handala posts cited last week’s killing of 165 civilians at a girls’ school in Iran by an American Tomahawk missile and past hacking operations that the US and Israel have perpetuated on Iran.

What is the point of striking a corporation in retaliation for airstrikes carried out by the US and Israel?

Such actions are taken for their psychological effects, which are often disproportionately larger than the resources required to bring them about. With limited means for Iran to strike back militarily, the Stryker disruption allows an alternative means for the country and its allies to retaliate. The success is intended to demonstrate that pro-Iranian forces can still exact a price that has a material effect on large populations in the US, Israel, and countries allied with them.

Advertisement

As a major supplier of lifesaving medical devices relied on throughout the US and its allies, Stryker plays a strategic and symbolic role in their security, researchers at Flash Point said Thursday. “By operating behind a persona styled as a grassroots, pro-Palestinian resistance movement, Iranian state-nexus actors are able to conduct destructive cyber operations against Western organizations while maintaining a degree of plausible deniability.”

Source link

Advertisement
Continue Reading

Tech

Agents need vector search more than RAG ever did

Published

on

What’s the role of vector databases in the agentic AI world? That’s a question that organizations have been coming to terms with in recent months.

The narrative had real momentum. As large language models scaled to million-token context windows, a credible argument circulated among enterprise architects: purpose-built vector search was a stopgap, not infrastructure. Agentic memory would absorb the retrieval problem. Vector databases were a RAG-era artifact.

The production evidence is running the other way.

Qdrant, the Berlin-based open source vector search company, announced a $50 million Series B on Thursday, two years after a $28 million Series A. The timing is not incidental. The company is also shipping version 1.17 of its platform. Together, they reflect a specific argument: The retrieval problem did not shrink when agents arrived. It scaled up and got harder.

Advertisement

“Humans make a few queries every few minutes,” Andre Zayarni, Qdrant’s CEO and co-founder, told VentureBeat. “Agents make hundreds or even thousands of queries per second, just gathering information to be able to make decisions.”

That shift changes the infrastructure requirements in ways that RAG-era deployments were never designed to handle.

Why agents need a retrieval layer that memory can’t replace

Agents operate on information they were never trained on: proprietary enterprise data, current information, millions of documents that change continuously. Context windows manage session state. They don’t provide high-recall search across that data, maintain retrieval quality as it changes, or sustain the query volumes autonomous decision-making generates.

“The majority of AI memory frameworks out there are using some kind of vector storage,” Zayarni said. 

Advertisement

The implication is direct: even the tools positioned as memory alternatives rely on retrieval infrastructure underneath.

Three failure modes surface when that retrieval layer isn’t purpose-built for the load. At document scale, a missed result is not a latency problem — it is a quality-of-decision problem that compounds across every retrieval pass in a single agent turn. Under write load, relevance degrades because newly ingested data sits in unoptimized segments before indexing catches up, making searches over the freshest data slower and less accurate precisely when current information matters most. Across distributed infrastructure, a single slow replica pushes latency across every parallel tool call in an agent turn — a delay a human user absorbs as inconvenience but an autonomous agent cannot.

Qdrant’s 1.17 release addresses each directly. A relevance feedback query improves recall by adjusting similarity scoring on the next retrieval pass using lightweight model-generated signals, without retraining the embedding model. A delayed fan-out feature queries a second replica when the first exceeds a configurable latency threshold. A new cluster-wide telemetry API replaces node-by-node troubleshooting with a single view across the entire cluster.

Why Qdrant doesn’t want to be called a vector database anymore

Nearly every major database now supports vectors as a data type — from hyperscalers to traditional relational systems. That shift has changed the competitive question. The data type is now table stakes. What remains specialized is retrieval quality at production scale.

Advertisement

That distinction is why Zayarni no longer wants Qdrant called a vector database.

“We’re building an information retrieval layer for the AI age,” he said. “Databases are for storing user data. If the quality of search results matters, you need a search engine.”

His advice for teams starting out: use whatever vector support is already in your stack. The teams that migrate to purpose-built retrieval do so when scale forces the issue.

“We see companies come to us every day saying they started with Postgres and thought it was good enough — and it’s not.”

Advertisement

Qdrant’s architecture, written in Rust, gives it memory efficiency and low-level performance control that higher-level languages don’t match at the same cost. The open source foundation compounds that advantage — community feedback and developer adoption are what allow a company at Qdrant’s scale to compete with vendors that have far larger engineering resources.

“Without it, we wouldn’t be where we are right now at all,” Zayarni said.

How two production teams found the limits of general-purpose databases

The companies building production AI systems on Qdrant are making the same argument from different directions: agents need a retrieval layer, and conversational or contextual memory is not a substitute for it.

GlassDollar helps enterprises including Siemens and Mahle evaluate startups. Search is the core product: a user describes a need in natural language and gets back a ranked shortlist from a corpus of millions of companies. The architecture runs query expansion on every request – a single prompt fans out into multiple parallel queries, each retrieving candidates from a different angle, before results are combined and re-ranked. That is an agentic retrieval pattern, not a RAG pattern, and it requires purpose-built search infrastructure to sustain it at volume.

Advertisement

The company migrated from Elasticsearch as it scaled toward 10 million indexed documents. After moving to Qdrant it cut infrastructure costs by roughly 40%, dropped a keyword-based compensation layer it had maintained to offset Elasticsearch’s relevance gaps, and saw a 3x increase in user engagement.

“We measure success by recall,” Kamen Kanev, GlassDollar’s head of product, told VentureBeat. “If the best companies aren’t in the results, nothing else matters. The user loses trust.” 

Agentic memory and extended context windows aren’t enough to absorb the workload that GlassDollar needs, either.

 “That’s an infrastructure problem, not a conversation state management task,” Kanev said. “It’s not something you solve by extending a context window.”

Advertisement

Another Qdrant user is &AI, which is building infrastructure for patent litigation. Its AI agent, Andy, runs semantic search across hundreds of millions of documents spanning decades and multiple jurisdictions. Patent attorneys will not act on AI-generated legal text, which means every result the agent surfaces has to be grounded in a real document.

“Our whole architecture is designed to minimize hallucination risk by making retrieval the core primitive, not generation,” Herbie Turner, &AI’s founder and CTO, told VentureBeat. 

For &AI, the agent layer and the retrieval layer are distinct by design.

 “Andy, our patent agent, is built on top of Qdrant,” Turner said. “The agent is the interface. The vector database is the ground truth.”

Advertisement

Three signals it’s time to move off your current setup

The practical starting point: use whatever vector capability is already in your stack. The evaluation question isn’t whether to add vector search — it’s when your current setup stops being adequate. Three signals mark that point: retrieval quality is directly tied to business outcomes; query patterns involve expansion, multi-stage re-ranking, or parallel tool calls; or data volume crosses into the tens of millions of documents.

At that point the evaluation shifts to operational questions: how much visibility does your current setup give you into what’s happening across a distributed cluster, and how much performance headroom does it have when agent query volumes increase.

“There’s a lot of noise right now about what replaces the retrieval layer,” Kanev said. “But for anyone building a product where retrieval quality is the product, where missing a result has real business consequences, you need dedicated search infrastructure.”

Source link

Advertisement
Continue Reading

Trending

Copyright © 2025