Technology

Patronus AI launches world’s first self-serve API to stop AI hallucinations


A customer service chatbot confidently describes a product that doesn’t exist. A financial AI invents market data. A healthcare bot provides dangerous medical advice. These AI hallucinations, once dismissed as amusing quirks, have become million-dollar problems for companies rushing to deploy artificial intelligence.

Today, Patronus AI, a San Francisco startup that recently secured $17 million in Series A funding, launched what it calls the first self-serve platform to detect and prevent AI failures in real-time. Think of it as a sophisticated spell-checker for AI systems, catching errors before they reach users.

Inside the AI safety net: How it works

“Many companies are grappling with AI failures in production, facing issues like hallucinations, security vulnerabilities, and unpredictable behavior,” said Anand Kannappan, Patronus AI’s CEO, in an interview with VentureBeat. The stakes are high: Recent research by the company found that leading AI models like GPT-4 reproduce copyrighted content 44% of the time when prompted, while even advanced models generate unsafe responses in over 20% of basic safety tests.

The timing couldn’t be more critical. As companies rush to implement generative AI capabilities — from customer service chatbots to content generation systems — they’re discovering that existing safety measures fall short. Current evaluation tools like Meta’s LlamaGuard perform below 50% accuracy, making them little better than a coin flip.

Patronus AI’s solution introduces several innovations that could reshape how businesses deploy AI. Perhaps most significant is its “judge evaluators” feature, which allows companies to create custom rules in plain English.

“You can customize evaluation to exactly [meet] your product needs,” Varun Joshi, Patronus AI’s product lead, told VentureBeat. “We let customers write out in English what they want to evaluate and check for.” A financial services company might specify rules about regulatory compliance, while a healthcare provider could focus on patient privacy and medical accuracy.

From detection to prevention: The technical breakthrough

The system’s cornerstone is Lynx, a breakthrough hallucination detection model that outperforms GPT-4 by 8.3% in detecting medical inaccuracies. The platform operates at two speeds: a quick-response version for real-time monitoring and a more thorough version for deeper analysis. “The small versions can be used for real-time guardrails, and the large ones might be more appropriate for offline analysis,” Joshi told VentureBeat.

Beyond traditional error checking, the company has developed specialized tools like CopyrightCatcher, which detects when AI systems reproduce protected content, and FinanceBench, the industry’s first benchmark for evaluating AI performance on financial questions. These tools work in concert with Lynx to provide comprehensive coverage against AI failures.

Beyond simple guard rails: Reshaping AI safety

The company has adopted a pay-as-you-go pricing model, starting at $10 per 1000 API calls for smaller evaluators and $20 per 1000 API calls for larger ones. This pricing structure could dramatically increase access to AI safety tools, making them available to startups and smaller businesses that previously couldn’t afford sophisticated AI monitoring.
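To make the pricing concrete, here is a rough sketch of how a team might estimate its bill from the two published rates. It assumes cost scales linearly with call volume, with no minimums or volume discounts, which the article does not confirm; the function name and the sample workload are hypothetical.

```python
# Per-1,000-call prices quoted in the article, in US dollars.
PRICE_PER_1000_CALLS = {"small": 10, "large": 20}


def estimate_cost(calls: int, evaluator_size: str) -> float:
    """Estimate the cost in dollars for a number of evaluator API calls.

    Assumes simple linear pay-as-you-go pricing (an assumption, not a
    confirmed billing rule).
    """
    if calls < 0:
        raise ValueError("calls must be non-negative")
    if evaluator_size not in PRICE_PER_1000_CALLS:
        raise ValueError(f"unknown evaluator size: {evaluator_size!r}")
    return calls * PRICE_PER_1000_CALLS[evaluator_size] / 1000


# Hypothetical monthly workload: 50,000 real-time checks on small
# evaluators plus 5,000 offline checks on large evaluators.
realtime = estimate_cost(50_000, "small")
offline = estimate_cost(5_000, "large")
print(f"real-time: ${realtime:,.2f}, offline: ${offline:,.2f}")
```

Under these assumptions, the sample workload comes to $500 for the real-time checks and $100 for the offline analysis.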

Early adoption suggests major enterprises see AI safety as a critical investment, not just a nice-to-have feature. The company has already attracted clients including HP, AngelList, and Pearson, along with partnerships with tech giants like Nvidia, MongoDB, and IBM.

What sets Patronus AI apart is its focus on improvement rather than just detection. “We can actually highlight the span of the specific piece of text where the hallucination is,” Kannappan explained. This precision allows engineers to quickly identify and fix problems, rather than just knowing something went wrong.

The race against AI hallucinations

The launch comes at a pivotal moment in AI development. As large language models like GPT-4 and Claude become more powerful and widely used, the risks of AI failures grow correspondingly larger. A hallucinating AI system could expose companies to legal liability, damage customer trust, or worse.

Recent regulatory moves, including President Biden’s AI executive order and the EU’s AI Act, suggest that companies will soon face legal requirements to ensure their AI systems are safe and reliable. Tools like Patronus AI’s platform could become essential for compliance.

“Good evaluation is not just protecting against a bad outcome — it’s deeply about improving your models and improving your products,” Joshi emphasizes. This philosophy reflects a maturing approach to AI safety, moving from simple guard rails to continuous improvement.

The real test for Patronus AI isn’t just catching mistakes — it will be keeping pace with AI’s breakneck evolution. As language models grow more sophisticated, their hallucinations may become harder to spot, like finding increasingly convincing forgeries.

The stakes couldn’t be higher. Every time an AI system invents facts, recommends dangerous treatments, or generates copyrighted content, it erodes the trust these tools need to transform business. Without reliable guardrails, the AI revolution risks stumbling before it truly begins.

In the end, it’s a simple truth: If artificial intelligence can’t stop making things up, it may be humans who end up paying the price.


CareYaya is enabling affordable home care by connecting healthcare students with elders

CareYaya, a platform that matches people who need caregivers with healthcare students, is working to disrupt the caregiving industry. The startup, which exhibited as part of the Battlefield 200 at TechCrunch Disrupt, is looking to enhance affordable in-home support, while also helping students prepare for their future healthcare careers.

The startup was founded in 2022 by Neal Shah, who came up with the idea while caring for his wife after she became ill with cancer and various other ailments. At the time, Shah was a partner at a hedge fund and had to wind down his fund to become a full-time caregiver for two years.

To get additional care, Shah hired college students who were studying healthcare to act as caregivers for his wife. He learned that other families were doing the same thing informally, posting flyers at local campuses to find someone qualified to look after their loved one.

“I was like, wouldn’t it be nice to just build a formal system for them to do it, where you don’t have to go to your local nursing school or your local undergrad campus and post flyers,” Shah told TechCrunch. “This is what I was doing. So we were like, if you can bring that into a formal capacity through a tech platform, you can make a big impact.” 

Fast-forward to 2024, and the platform now has over 25,000 students from numerous schools, including Duke University, Stanford, UC Berkeley, San Jose State, University of Texas at Austin, and more.

CareYaya performs background checks on students who want to join the platform and then completes video-based interviews with them. On the user side, people can join the platform and then detail the type of care their loved one needs. CareYaya then matches students to families, whether it’s for one-off sessions or continuous care. After the first session, both parties can leave ratings.

The startup says it can help families save thousands of dollars on recurring senior care. While at-home care costs an average of $35 per hour in the U.S., CareYaya charges between $17 and $20 per hour.
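As a rough check on the "thousands of dollars" claim, the sketch below compares the quoted hourly rates over a year. The 10 hours of care per week is a hypothetical figure chosen for illustration and does not come from CareYaya.

```python
MARKET_RATE = 35           # average U.S. at-home care cost per hour (from the article)
CAREYAYA_RATES = (17, 20)  # CareYaya's quoted hourly range (from the article)


def annual_savings(market_rate: float, platform_rate: float,
                   hours_per_week: float, weeks_per_year: int = 52) -> float:
    """Yearly savings from paying platform_rate instead of market_rate."""
    return (market_rate - platform_rate) * hours_per_week * weeks_per_year


# Hypothetical family needing 10 hours of care per week.
for rate in CAREYAYA_RATES:
    saved = annual_savings(MARKET_RATE, rate, hours_per_week=10)
    print(f"at ${rate}/hour: saves ${saved:,.0f} per year")
```

At that assumed usage, the difference works out to between $7,800 and $9,360 a year, depending on where in the quoted range the hourly rate falls.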

Since the students providing the care are tech savvy, CareYaya is equipping them with AI-powered technology to recognize and track disease progression in patients with Alzheimer’s and dementia. The company recently launched an LLM (large language model) that integrates with smart glasses to gather visual data to help students provide better real-time assistance and conduct early dementia screening.

In terms of the future, CareYaya wants to explore expanding beyond the United States, as the platform has seen interest from people in places like Canada, Australia, and the United Kingdom. 

Windblown shows how good roguelikes can be with friends

Some of the most beloved roguelikes are single-player — the likes of Hades, Balatro, and Dead Cells are all solo titles. But Windblown, the new roguelike from Motion Twin, the studio that created Dead Cells, showed me just how cool it can be to play a roguelike with other people.

In Windblown, your character, one of a few adorable animal adventurers like an axolotl or a bat, is shot out of a cannon into a mysterious giant tornado to fight your way through various zones. Like Dead Cells, you can equip up to two main weapons. I typically have one for close-range bouts and another for long-distance attacks. But with every weapon, you’re also able to pull off a combo that uses a special move from the other weapon called an “Alterattack.”

Here’s an example. I love using a crossbow to attack enemies from a distance, and I pair it with a giant heavy blade. I rarely use the blade on its own; instead, I use its Alterattack that cracks open the earth in a straight line to continue to wallop on enemies at range. That turns a run into a steady rhythm of slinging arrows and using the Alterattack at exactly the right time, and with my five hours so far with the game, I haven’t gotten tired of the pattern.

Windblown just launched in early access, and you can already unlock more than a dozen weapons, meaning there are a lot of combinations that I haven’t messed around with. And with four different biomes to get through on a run, there’s a lot to see, too.

The bosses are no joke.
Image: Motion Twin

All of that would be enough to make Windblown part of my regular rotation of roguelikes I use to wind down at the end of a long day. But the game’s multiplayer is making Windblown the game I turn to every time I turn on my Steam Deck.

Windblown’s multiplayer lobbies, which you unlock fairly early on, let you play a full run with a team of three people. You can use voice and text chat to communicate, but it’s not required; I haven’t used those at all, instead relying on four in-game emoji. I also like that you can name your lobbies. I created one titled “help me get 1st win” and immediately had two helpful people join up to help me tackle the tornado. (Sadly, we did not get the win.) 

When playing solo, I’ve found that I’m somewhat cautious and strategic as I think about how to use weapons and positioning to take on the game’s aggressive enemies and dodge their attacks. With the help of a team, battles are speedier and become delightful explosions of light, color, sound, and damage. It’s so fun to absolutely annihilate baddies with other people, and it’s comforting to know that they’ve got your back in a pinch.

There are a lot of great roguelikes to play right now; Hades II just got a huge update, Balatro is nearly impossible to put down (especially now that it’s on mobile), and I’ve wanted to get back into Shogun Showdown, which I think everyone is sleeping on. Windblown needed more than just its Motion Twin pedigree to stand out, but so far, the multiplayer is the hook that keeps me coming back.

Google could soon make sharing files from Android to iPhone much easier

Quick Share between a laptop and phone

  • Quick Share could come to iOS and macOS soon
  • It enables speedy file transfers between devices
  • Third-party alternative tools are already available

Quick Share on Android is the equivalent of AirDrop, enabling files to be easily transferred between Android devices, Chromebooks, and Windows – and there are signs that Google is planning to add support for iPhones, iPads, and Macs.

As spotted by the team at Android Authority, a comment left by a Google engineer on code essential to Quick Share mentions iOS and macOS specifically – a comment which would make more sense if an app for these platforms was in the works.

Aptera’s 3-wheel solar EV heads to 2025 commercialization

EV drivers may relish that charging networks are climbing over each other to provide needed juice alongside roads and highways.

But they may relish even more not having to make many recharging stops along the way, as their EV soaks up the bountiful energy coming straight from the sun.

That’s the bet from Aptera Motors, a crowdfunded, California-based maker of solar-powered electric vehicles.

Aptera says it just completed a successful test drive of ‘PI-2’, the first production-intent version of its futuristic-looking two-seater, three-wheel solar electric vehicle. The EV’s latest version was engineered to rigorously test performance metrics such as range, solar charging capability, and efficiency, Aptera says.

“Driving our first production-intent vehicle marks an extraordinary moment in Aptera’s journey,” said Steve Fambro, Aptera’s Co-Founder and Co-CEO, in a statement. “It demonstrates real progress toward delivering a vehicle that redefines efficiency, sustainability, and energy independence.”

Aptera says it already has over 50,000 reservations for its EV, with deliveries scheduled to begin in the second quarter of 2025. Last year, it unveiled a $33,200 launch version with a 0-60 mph acceleration time of under 6 seconds, a battery pack providing a range of 400 miles, and a solar charge range of 40 miles per day.

The Aptera EV also features Tesla’s North American Charging Standard (NACS) port to charge its battery.

The company said its production-intent models will continue to evolve over time as they undergo further tests, including for key metrics such as solar charging rates and watt-hours per mile.

Other versions of the Aptera EV were said to provide as much as 1,000 miles of range with a 0-60 mph acceleration in 3.5 seconds.

Aptera has so far raised over $100 million since launching a crowdfunding program three years ago.

Solar-powered electric vehicles are also being developed by the likes of Germany’s Sono Motors and the Netherlands’ Lightyear, and by big automakers such as Hyundai and Mercedes-Benz.







OpenAI just took a shot at Google with this feature

Well, the time has finally come! After months of waiting and speculation, the rumored ChatGPT search feature has finally landed. With that, OpenAI is properly set to take on Google.

We first got news about this feature a few months ago, and people who use ChatGPT often for information will love this feature. If you’re a free user, then we have some bad news. The ChatGPT Search feature is only for ChatGPT Plus users for the moment. OpenAI will make this functionality available for its free and Enterprise users over the next couple of weeks. So, you’ll need to wait a bit if you want to use this feature.

ChatGPT now has a search feature

Since the beginning of this whole AI explosion, one of the things that companies fantasized about was the AI-powered search engine. The AI search engine already exists, thanks to Perplexity. Well, OpenAI’s search engine is similar to that one.

When you search for something, you’ll see an AI-generated explanation of what you searched for. This section will take up most of the screen. That’s not very different from what we’ve seen so far. However, off to the right side, you’ll see a Citations section. This will house the sources where ChatGPT got its information. In the image provided by The Verge, we see five sources listed to the side. We’re not sure if the list includes more sources off-screen.

ChatGPT search
Source: The Verge

Five sources is not a bad amount, and they’re shown pretty prominently. ChatGPT isn’t hiding them behind a button. This shows that the company is thinking about the sources it’s surfacing.

In the screenshot, we see image results as well. This is good, as it shows that ChatGPT is trying to be a proper search engine.

Another way this feature is great is that ChatGPT can now access current events. Before, if you used the chatbot, you’d have to deal with a knowledge cut-off date. For example, when ChatGPT first launched, the model it used was more than a year out of date. Now, if you’re using ChatGPT for research, you’ll have access to more recent events.

Should Google be worried? Probably not yet. However, with ChatGPT’s massive user base, it may only be a matter of time.

Apple’s AirPods Pro 2 drop to $179 in this early Black Friday deal

There’s a great deal on Apple’s AirPods Pro 2 over at Amazon right now. The earbuds are currently 28 percent off, bringing them down to $179. That’s just $10 more than the all-time-low price we saw during October Prime Day, and it will save you $70. The AirPods Pro 2 got an update earlier this year that added new features, most notably a suite of hearing health tools and the capability to be used as hearing aids. On top of that, they now offer new gesture-based Siri Interactions and Voice Isolation to reduce background noise when you’re on a call.

Siri Interactions allow for hands- and voice-free Siri controls; you can respond to Siri’s questions simply by nodding or shaking your head. The second-generation AirPods Pro offer better sound quality than their predecessors and seamless integration with the other devices in the Apple ecosystem. They also offer active noise cancellation and transparency mode, which allows for more natural conversations while they’re in your ear, and they support spatial audio and Dolby Atmos for certain media.

The buds come with four pairs of silicone tips in different sizes and are IP54 rated for protection against dust and sweat. They get up to 6 hours of listening time (though this will be less with certain features, like ANC, enabled) and up to 30 hours with a little help from the USB-C MagSafe Charging Case.

Copyright © 2024 WordupNews.com