Technology

GenAI demands greater emphasis on data quality

Data quality has perhaps never been more important. And a year from now, then a year beyond that, it will likely be even more important than it is now.

The reason: AI, and in particular, generative AI.

Given its potential benefits, including exponentially increased efficiency and more widespread use of data to inform decisions, enterprise interest in generative AI is exploding. But for enterprises to benefit from generative AI, the data used to inform models and applications needs to be high-quality. The data must be accurate for the generative AI outputs to be accurate.

Meanwhile, generative AI models and applications require massive amounts of data to understand how to respond to a user’s query. Their outputs aren’t based on individual data points, but instead on aggregations of data. So, even if the data used to train a model or application is high-quality, if there’s not enough of it, the model or application will be prone to delivering incorrect outputs known as AI hallucinations.

With so much data needed to reduce the likelihood of hallucinations, data pipelines need to be automated. Therefore, with data pipelines automated and humans unable to monitor every data point or data set at every step of the pipeline, it’s imperative that the data be high-quality from the start and there be checks on outputs at the end, according to David Menninger, an analyst at ISG’s Ventana Research.

Otherwise, not only inaccuracies, but also biased and potentially offensive outputs could result.


“Data quality affects all types of analytics, but now, as we’re deploying more and more generative AI, if you’re not paying attention to data quality, you run the risks of toxicity, of bias,” Menninger said. “You’ve got to curate your data before training the models, and you have to do some postprocessing to ensure the quality of the results.”
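Menninger’s two checkpoints, curating records before they reach a model and screening outputs afterward, can be sketched in a few lines of Python. The field names, block list and rules below are hypothetical illustrations, not any vendor’s product.

```python
# Illustrative sketch of pre-training curation plus post-generation screening.
# Field names, the block list and the rules are hypothetical.
BLOCKED_TERMS = {"offensive_example", "slur_example"}  # placeholder terms

def curate_record(record: dict) -> bool:
    """Keep a training record only if it passes basic quality and toxicity gates."""
    has_required = all(record.get(f) not in (None, "") for f in ("id", "text", "label"))
    is_clean = not any(term in record.get("text", "").lower() for term in BLOCKED_TERMS)
    return has_required and is_clean

def screen_output(text: str) -> str:
    """Post-process a generated answer before it reaches a user."""
    if any(term in text.lower() for term in BLOCKED_TERMS):
        return "[withheld: flagged for human review]"
    return text

raw_records = [
    {"id": 1, "text": "Quarterly revenue rose 8%.", "label": "finance"},
    {"id": 2, "text": "", "label": "finance"},               # fails the completeness gate
]
training_set = [r for r in raw_records if curate_record(r)]  # curate before training
print(len(training_set), screen_output("Revenue rose 8% year over year."))
```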

In response, enterprises are placing greater emphasis on data quality than in the past, according to Saurabh Abhyankar, chief product officer at longtime independent analytics vendor MicroStrategy.

“We’re actually seeing it more than expected,” he said.

Likewise, Madhukar Kumar, chief marketing officer at data platform provider SingleStore, said he is seeing increased emphasis on data quality. And it goes beyond just accuracy, he noted. Security is an important aspect of data quality. So is the ability to explain decisions and outcomes.

“The reason you need clean data is because GenAI has become so common that it’s everywhere,” Kumar said. “That is why it has become supremely important.”

However, ensuring data quality to get the benefits of AI isn’t simple. Nor are the consequences of bad data quality.

The rise of GenAI

The reason interest in generative AI is exploding — the “why” behind generative AI being everywhere and requiring that data quality become a priority — is that it has transformative potential in the enterprise.

Data-driven decisions have proven to be more effective than those not informed by data. As a result, organizations have long wanted to put data in the hands of more employees so they can take part in the decision-making process.

But despite the desire to broaden analytics use, only about a quarter of employees within most organizations use data and analytics as part of their workflow. And that has been the case for years, perhaps dating back to the start of the 21st century.

The culprit is complexity. Analytics and data management platforms are intricate. They largely require coding to prepare and query data, and data literacy training to analyze and interpret it.

Vendors have attempted to simplify the use of their tools with low-code/no-code capabilities and natural language processing features, but to little avail. Low-code/no-code capabilities don’t enable deep exploration, and the NLP capabilities developed by data management and analytics vendors have limited vocabularies and still require data literacy training to use.

Generative AI lowers the barriers that have held back wider analytics use. Large language models have vocabularies as large as any dictionary and therefore enable true natural language interactions that reduce the need for coding skills. In addition, LLMs can infer intent, further enabling NLP.

When generative AI is combined with an enterprise’s proprietary data, suddenly any employee with a smartphone and proper clearance can work with data and use analytics to inform decisions.

“With generative AI, for the first time, we have the opportunity to use natural language processing broadly in various software applications,” Menninger said. “That … makes technology available to a larger portion of the enterprise. Not everybody knows how to use a piece of software. You don’t have to know how to use the software; you just have to know how to ask a question.”

Generative AI chatbots — tools that enable users to ask questions using natural language and get responses in natural language — are not foolproof, Menninger added.

“But they’re a huge improvement,” he said. “Software becomes easier to use. More people use it. You get more value from it.”

Meanwhile, data management and analytics processes — integrating and preparing data to make it consumable; developing data pipelines; building reports, dashboards and models — require tedious, time-consuming work by data experts. Even more tedious is documenting all that work.

Generative AI changes that as well. NLP reduces coding requirements by enabling developers to write commands in natural language that generative AI can translate to code. In addition, generative AI can be trained to carry out certain repetitive tasks on its own, such as writing code, creating data pipelines and documenting work.

“There are a lot of tasks humans do,” Abhyankar said. “People are overworked, and if you ask them what they are able to do versus what they’d like to be able to do, most will say they want to do five or 10 times more. One benefit of good data with AI on top of it is that it becomes a lever and a tool to help the human being be potentially multiple times more efficient than they are.”

Eventually, generative AI could wind up being as transformational for knowledge workers as the industrial revolution was for manual laborers, he said. Just as an excavator is multiple times more efficient at digging a hole than a construction worker with a shovel, AI-powered tools have the potential to make knowledge workers multiple times more efficient.

Donald Farmer, founder and principal of TreeHive Strategy, likewise noted that one of the main potential benefits of effective AI is efficiency.

“It enables enterprises to scale their processes with greater confidence,” he said.

However, the data used to train the AI applications that enable almost anyone within an organization to ask questions of their data and use the responses to inform decisions had better be right. Similarly, the data used to train the applications that take on time-consuming, repetitive tasks that dominate data experts’ time had better be right.

The need for data quality

Data quality has always been important. It didn’t just become important in November 2022 when OpenAI’s launch of ChatGPT — which represented a significant improvement in LLM capabilities — initiated an explosion of interest in developing AI models and applications.

Bad data has long led to misinformed decisions, while good data has always led to informed decisions.

A graphic lists six elements of data quality: accuracy, completeness, consistency, timeliness, uniqueness and validity.
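As a rough illustration, four of those dimensions can be measured with a few lines of Python; accuracy and consistency usually require reference data or cross-system comparisons. The table, columns and rules below are hypothetical.

```python
import pandas as pd

# Hypothetical customer table; the columns and the rules are illustrative only.
df = pd.DataFrame({
    "customer_id": [1, 2, 2, 4],
    "email": ["a@example.com", None, "b@example.com", "not-an-email"],
    "updated_at": pd.to_datetime(["2024-09-01", "2024-09-02", "2023-01-01", "2024-09-03"]),
})

completeness = df["email"].notna().mean()                      # share of non-null emails
uniqueness   = 1 - df["customer_id"].duplicated().mean()       # share of non-duplicate IDs
validity     = df["email"].str.contains("@", na=False).mean()  # crude format rule
timeliness   = (df["updated_at"] > "2024-01-01").mean()        # refreshed this year

print(f"completeness={completeness:.2f} uniqueness={uniqueness:.2f} "
      f"validity={validity:.2f} timeliness={timeliness:.2f}")
```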

But the scale and speed of decision-making were different before generative AI. So were the checks and balances. As a result, both the benefits of good data quality and consequences of bad data quality were different.

Until the onset of self-service analytics spurred by vendors such as Tableau and Qlik some 15 years ago, data management and analytics were isolated to teams of IT professionals working in concert with data analysts. Consumers — the analysts — usually had to submit a request to data stewards, who would then take the request and develop a report or dashboard that could be analyzed to inform a decision.

The process took days at a minimum and often stretched to months. And even when the report or dashboard was developed, it often had to be redone multiple times as the end user realized the question they asked wasn’t quite right or the resulting data product led to follow-up questions.

During the development process, IT teams worked closely with the data used to inform the reports and dashboards they built. They were hands-on, and they had time to make sure the data was accurate.

Self-service analytics altered the paradigm, removing some of the control from centralized IT departments and enabling end users with the proper skills and training to work with data on their own. In response, enterprises developed data governance frameworks to both set limits on what self-service users could do with data — to protect against self-service users going too far — and also give the business users freedom to explore within certain parameters.

The speed and scale of data management and analytics-based decision-making increased, but it was still limited to a group of trained users who, with their expertise, could usually recognize when something in the data seemed off and avoid acting on it hastily.

Now, just as generative AI changes who within an organization can work with data and what experts can do with it, it changes the speed and scale of data-informed decisions and actions. To feed that speed and scale with good data, automated processes — overseen by humans who can intervene when necessary — are required, according to Farmer.

“It puts an emphasis on processes that can be automated, identifying data-cleaning processes that require less expertise than before,” Farmer said. “That’s where it’s changing. We’re trying to do things at much greater scale, and you just can’t have a human in the loop at that scale. Whether the process can be audited is very important.”

Abhyankar compared the past and present to the difference between a small, Michelin-starred gourmet restaurant and a fast-food chain.

The chef at the small restaurant, each day, can shop for the ingredients of every dish and then oversee the kitchen as each dish gets made. At a chain, the scale of what needs to be bought and the speed with which the food needs to be made make it impossible for a chef to oversee every detail. Instead, a process ensures no bad meat or produce makes it into meals served to consumers.

“[Data quality] is really important in a world where you’re going from hand-created dashboards and reports to a world where you want AI to do [analysis] at scale,” Abhyankar said. “But you can’t scale unless you have a system in place so [the AI application] can be precise and personalized to serve many more people with many more insights on the fly. To do that, the data quality simply has to be there.”

Benefits and consequences

The whole reason enterprise interest is rising in developing AI models and applications and using AI to inform decisions and automate processes — all of which need high-quality data as a foundation — is the potential benefits.

The construction worker who now has an excavator to dig a hole rather than a shovel can be multiple times more efficient. And in concert with a few others at the controls of excavators, they can dig the foundation for a new building perhaps a hundred times faster than they could by hand.

A construction worker with a cement mixer can follow up and pour the foundation multiple times faster than if they had to mix the cement and pour it by hand. Next, the girders can be moved into place by cranes rather than carried by humans, and so on.

It adds up to an exponentially more efficient construction process.

The same is true of AI in the enterprise. Just as construction teams can rely on the engines and controls in excavators, cement mixers, cranes and other vehicles that scale the construction process, if the data fueling AI models and applications is trustworthy, organizations can confidently scale business processes with AI, according to Farmer.

And scale in the business world — being able to do exponentially more without having to expand staff — means growth.

“Data quality enables enterprises to scale their processes with greater confidence,” he said. “It enables them to build fine-grained processes like hyperpersonalization with greater confidence. Next-best offers, recommendation engines, things that can be highly optimized for an individual — that sort of thing becomes very possible.”

Beyond retail, another common example is fraud detection, according to Menninger. Detecting fraud amid millions of transactions can be nearly impossible. AI models can check all those transactions, while not even teams of humans have the capacity to look at them all, much less find patterns and relationships between them.

“If accurate data is being fed into the models to detect fraud, and you can improve the detection even just slightly, that ends up having a large impact,” Menninger said.

But just as the potential benefits of good-quality data at the core of AI are greater than good data without AI, the consequences of bad data at the core of AI are greater than the consequences of bad data without AI. The speed and scale that AI models and applications enable result in the broader and faster spread of fallout from poor decisions and actions.

Back when IT teams controlled their organizations’ data and when a limited number of self-service users contributed to decisions, the main risk of bad data was lack of trust in data-informed decisions and the resulting loss of efficiencies, according to MicroStrategy’s Abhyankar. In rare cases, it could lead to something more severe, but there was usually time for someone to step in and stop something from happening before it spread.

Now, the potential exists to not only scale previous problems, but also create new ones.

If AI models and applications are running processes and making decisions without someone checking them before actions are taken, it could lead to significant ethical problems such as baselessly denying an applicant a credit card or mortgage. Similarly, if a human uses AI outputs to make decisions, but the output is misinformed, it could result in serious ethical issues.

“You scale the previous problems,” Abhyankar said. “But it’s actually worse than that. In scenarios where the AI is making decisions, you’re making bad decisions at scale. If you run into ethical problems, it’s catastrophically bad for an organization. But even when AI is just delivering information to a human being, you’re scaling the problems.”

Farmer noted that AI doesn’t deliver outputs based on single data points. AI models and applications are statistical, looking at broad swaths of data to inform their actions. As long as most of the data used to train a model or application is correct, the model or application will be useful.

“If a data set is poor quality, you’ll get poor results,” Farmer said. “But if one piece of data is wrong, it’s not going to make much difference to the AI because it’s looking at statistics as a whole.”

That is, unless it’s that fine-grained decision about an individual such as whether to approve a mortgage application. In that case, if the data is wrong, it can lead to serious ethical consequences. Even more catastrophically, in a healthcare scenario, bad data could lead to the difference between life and death.

“If we’re using AI to make decisions about individuals — are we going to give someone a mortgage — then having high-quality individual data becomes extremely important, because then we have given this system over,” Farmer said. “If we’re talking about AI making fine-grained decisions, then the data has to be very high-quality.”

Ensuring data quality

With data quality so critical to the success of AI, and to reaping the benefits of broader technology use and exponentially increased efficiency, the obvious question is how enterprises can ensure that good data goes into models and applications so that good outputs result.

There is, unfortunately, no simple solution — no fail-safe.

Data quality is difficult. Enterprises have always struggled to ensure only good-quality data is used to inform decisions. In the era of AI, including generative AI, that’s no different.

“The problem is still hard,” Abhyankar said.

But there are steps that organizations can take to lessen the likelihood of bad data slipping through the cracks and affecting the accuracy of models and applications. There are technologies they can use and processes they can implement.

Ironically, many of the technologies that can detect bad data use AI to do so.

Vendors such as Informatica and Oracle offer tools designed specifically to monitor data quality. These tools can look at data characteristics such as metadata and data lineage, sometimes have master data management capabilities, and in general are built to detect problematic data. Other vendors such as Alation and Collibra provide data catalogs that help enterprises organize and govern data, including descriptions of data, to provide users with information before they operationalize any data.

Still other vendors including Acceldata and Monte Carlo offer data observability platforms that use AI to monitor data as it moves through data pipelines, detecting irregularities as they occur and automatically alerting customers to potential problems. But unlike data quality tools and data catalogs that address data quality while data is at rest before being used to train AI models and applications, observability tools monitor data while it is in motion on its way to a model or application.
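Stripped to its essence, that kind of in-motion monitoring is a baseline comparison plus an alert. A generic sketch, not any vendor’s API, with made-up thresholds:

```python
from statistics import mean, stdev

def check_batch(row_count: int, null_rate: float, history: list, z_max: float = 3.0) -> list:
    """Flag a pipeline batch whose row count drifts far from its historical baseline
    or whose null rate exceeds a fixed tolerance. The thresholds are illustrative."""
    alerts = []
    counts = [h["row_count"] for h in history]
    if len(counts) >= 2 and stdev(counts) > 0:
        z = abs(row_count - mean(counts)) / stdev(counts)
        if z > z_max:
            alerts.append(f"row count {row_count} is {z:.1f} std devs from baseline")
    if null_rate > 0.05:
        alerts.append(f"null rate {null_rate:.1%} exceeds the 5% tolerance")
    return alerts

history = [{"row_count": c} for c in (10_050, 9_980, 10_110, 10_020)]
print(check_batch(row_count=4_200, null_rate=0.12, history=history))
```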

“Increasingly, AI is actually in a sense running its own data quality,” Farmer said. “Many of those tools work on inferences, work on discovering patterns of the data. It turns out that AI is very good at that and doing it at scale.”

More important than any tooling, however, is that humans always remain involved and check any output before it is used to take action.

Just as a hybrid approach emerged as ideal for cloud computing — including on-premises, private cloud and public cloud — a hybrid approach that uses technology to augment humans is emerging as the ideal approach to working with the data used to train AI, according to SingleStore’s Kumar.

“First and foremost is to allow humans to have control,” he said.

Humans simply know more about their organization’s data than machines and can better spot when something seems off. People have been working with their organization’s data since its founding, which in some cases means decades’ worth of code behind dashboards and reports, code that humans can faithfully reproduce but that a machine might not know about.

Humans, in a simple example, know whether their company’s fiscal year starts on Jan. 1 or some other date, while a model might assume it starts on Jan. 1.
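One simple remedy is to encode that kind of organizational context explicitly instead of letting a model assume it. A hypothetical sketch:

```python
from datetime import date

# Hypothetical organizational context supplied to the AI system rather than assumed.
ORG_CONTEXT = {"fiscal_year_start_month": 7}   # e.g., the fiscal year begins July 1

def fiscal_year(d: date, ctx: dict = ORG_CONTEXT) -> int:
    """Return the fiscal year a date belongs to under the company's own convention."""
    start = ctx["fiscal_year_start_month"]
    return d.year + 1 if start > 1 and d.month >= start else d.year

print(fiscal_year(date(2024, 8, 15)))   # -> 2025, not the 2024 a Jan. 1 assumption implies
```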

“Hybrid means human plus AI,” Kumar said. “There are things AI is really good at, like repetition and automation, but when it comes to quality, there’s still the fact that humans are a lot better because they have a lot more context about their data.”

If there’s a human at the end of the process to check outputs, organizations can better ensure actions taken will have their intended results, and some potentially damaging actions can be avoided.

If there’s a person to confirm whether a mortgage application should be approved or rejected, the organization’s bottom line benefits. A correctly approved mortgage generates profit and avoids the serious consequences of mistakenly declining someone’s application based on biased data, while a correctly declined mortgage avoids potential losses from a default.

If there’s a healthcare worker to check whether a patient is allergic to a recommended medication or that medication might interact badly with another medication the patient is taking, it could save a life.
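In practice, keeping that person in the loop can be as simple as routing high-stakes or low-confidence decisions into a review queue before any action is taken. The categories and threshold below are hypothetical:

```python
# Hypothetical routing rule: high-stakes or low-confidence AI decisions wait for a person.
HIGH_STAKES = {"mortgage_denial", "credit_denial", "medication_recommendation"}

def route_decision(decision: dict, review_queue: list) -> str:
    """Send risky or uncertain decisions to a human; let routine ones through."""
    if decision["type"] in HIGH_STAKES or decision["confidence"] < 0.9:
        review_queue.append(decision)          # a person approves or rejects later
        return "pending_human_review"
    return "auto_approved"

queue = []
print(route_decision({"type": "mortgage_denial", "confidence": 0.97}, queue))  # pending_human_review
print(route_decision({"type": "faq_answer", "confidence": 0.95}, queue))       # auto_approved
```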

The AI models and applications, fueled by data, can be left to do their work. They can automate repetitive processes, generate code to develop applications, write summaries and documentation, respond to user questions in natural language and so on. They’re good at those tasks, when informed by good-quality data.

But they’re not perfect, even when the data used to train them is as good as possible.

“There always has to be human intervention,” Menninger said.

Eric Avidon is a senior news writer for TechTarget Editorial and a journalist with more than 25 years of experience. He covers analytics and data management.


Technology

Guitar Hero meets Earthbound in 2024’s strangest game


For a good chunk of my young-adult life, I was obsessed with the idea of creating my masterpiece. It’s not even that I wanted to create a great work of art with something to say; I felt I had to. My fear of death led me to believe that I needed to find a way to leave a lasting legacy behind, like the filmmakers and playwrights I revered at the time. While that feeling dissipated in later years, it reformed as a constant imposter syndrome that I still grapple with from time to time. There are moments when I feel that my writing or music isn’t good enough. At other times, I become bitter when a work I’m proud of doesn’t get the attention I feel it deserves. It’s a vicious ouroboros that I struggle to break out of.

This may sound like a strangely dramatic way to introduce Starstruck: Hands of Time. If you look at the new PC game’s Steam page, you’ll find what looks like a goofy adventure that takes notes from Earthbound, Guitar Hero, and Katamari Damacy. While that’s all true, the avant-garde adventure is hiding something much more grotesque below its bubbly surface. It’s a slow-bubbling anxiety attack, one that makes for one of 2024’s most unexpectedly vital games.

Spiraling out of orbit

Starstruck: Hands of Time begins in a playful fashion. An astronaut travels back to the past after the Earth of the future is overtaken by a mysterious mold. With the help of their cheerful robo companion, they set out to find the source of this sludge. That takes them to an unassuming small town inhabited by a happy-go-lucky kid named Edwin. It’s a normal, and very misleading, start to a wild four-hour odyssey that doesn’t go anywhere you’re expecting.

In those early moments, Starstruck sets the stage for a charming suburban adventure about Edwin, a young guitarist, trying to rise to stardom within his town. His first mission is to head to a local venue and play a gig with his pals. It’s a sweet start that immediately calls Earthbound to mind, a game that’s become an important touchstone for indie developers in recent years. It makes sense; Nintendo’s classic RPG is one of the few games that really feels like it understands young people and the personal struggles they face in everyday life. In its most direct reference, Starstruck’s characters are displayed as handmade clay models that call back to the physical figures used in Earthbound’s original marketing materials.

A girl plays guitar in Starstruck: Hands of Time.
Createdelic, LLC

The more Starstruck sets up its story, the more light-hearted it becomes. When I get to the venue for my show, I’m introduced to an entire Guitar Hero-type rhythm game where I play along to songs (Starstruck is even compatible with some guitar controllers). It’s a messy minigame due to some hard-to-parse guitar riffs and sloppy controller integration, but it’s another callback that puts me into a time and place. I’m once again in the mindset of a young adult wondering when my life is going to begin in between Freebird solos.

Even then, Starstruck still hasn’t played all its gameplay cards. When Edwin has trouble getting into the venue, the astronaut observing them steps in to help by sending their hand down to Earth. In a minigame reminiscent of Katamari Damacy, I need to smash as much stuff as I can around town until I can summon a hammer to knock an opening into the fence surrounding the venue. It’s a bizarre visual, but another filled with a familiar youthful energy.

Things get much weirder from there.

Only near its halfway mark, after going through those minigames a few times and meeting a few friends, does Starstruck show its hand. Edwin and his friends begin to let their different anxieties slip. It turns out that the gang is suffering from different identity issues. One character struggles with imposter syndrome over her music; another is desperate to be the center of attention and have his work celebrated. The more those feelings come out, the more the game itself corrupts.

Three kids stand in a room in Starstruck: Hands of Time.
Createdelic, LLC

There’s no way to easily describe what unfolds in Starstruck’s back half; you’ll really have to see it for yourself to fully soak in its overwhelming panic attack. A cute adventure veers into eldritch horror territory as each character succumbs to their anxieties. The cheery visuals give way to avant-garde eeriness, in a turn that calls Neon Genesis Evangelion’s striking midseason direction shift to mind. The deeper these characters get into their minds, wishing they could be anywhere else than where they are in life, the farther they spin away from Earth. There’s nothing up there but darkness. It slowly swallows the entire adventure like a snake eating its own tail.

If this all sounds like a baffling mess, it is at times. Starstruck takes some wild swings that don’t always feel like they cleanly connect. Its personal story takes several detours to showcase the history of art theft, delve into the history of the Roman empire, revisit the moon landing, and more. Its gameplay can similarly feel unfocused as it hops between ideas at a rapid-fire pace. It’s confounding, but effective too. Starstruck feels like a mental breakdown in motion; it’s a throbbing brain that can’t keep its focus as it spirals deeper and deeper into philosophical despair.

Despite how out there it is, Starstruck tells a down-to-earth story that’s still sticking with me days after rolling credits. I can see myself in its insecure heroes, so desperate to be the center of the universe that they’re left alone in the cold vacuum of space. Maybe we take how miraculous it is to be a face in a crowd here on this planet for granted.

Starstruck: Hands of Time is available now on PC.




Technology

AT&T’s 2023 breach exposed data that should have been deleted


In terms of cybersecurity, 2024 has been especially unfortunate for AT&T. Both regulators such as the SEC and the carrier itself have confirmed data breach incidents affecting millions of customers. Now, the FCC says that AT&T could have prevented one of the customer data leaks related to the hack of its cloud vendor, but it didn’t.

AT&T got a $13 million fine for a 2023 data breach related to a cloud vendor

In April of this year, AT&T found that a team of hackers had breached the security of one of its cloud vendors and disclosed the incident publicly. The hackers were able to download millions of the carrier’s customers’ call and text records. The mobile carrier now faces a $13 million fine for its failure to protect the data, and the government agency has revealed more details regarding the incident.

The name of the cloud vendor whose security was breached is not known, as the FCC’s public report refers to it as “Vendor X.” According to the report, AT&T gave “Vendor X” access to customer data from 2015 to 2017 to create personalized videos related to billing and marketing. A clause in the deal stated that the data must be “securely destroyed or deleted” by 2018. However, neither AT&T nor the cloud vendor guaranteed the destruction of the data.

The data breach originated in early 2023, several years after the 2018 deadline. So, basically, the hackers had access to information that was supposed to be destroyed years ago. The FCC revealed that the hacking team managed to download data from about 8.9 million AT&T wireless customers.

It was forced to establish new procedures for handling customer data

AT&T’s failure to take appropriate action represented a violation of data protection laws that all carriers must follow. As a result, the company was fined $13 million and forced to establish new methods for managing customer information. The monetary fine is “symbolic” considering the company’s billion-dollar profits. Investing in new security systems and procedures will likely cost more.

Fortunately, the hackers did not access extremely sensitive data such as Social Security or credit card numbers. However, it is surprising that AT&T left the security of millions of customers’ data up in the air. This year, AT&T confirmed a separate data breach involving Snowflake, another cloud provider. This hack was especially severe, affecting call and SMS records from May to October 2022 from “nearly all” AT&T customers.


Technology

Couchbase launches database tools to foster AI development


Couchbase on Tuesday made Capella Columnar generally available on AWS in a move aimed at helping customers streamline application development by centralizing real-time data analysis and operational workloads together in a single location.

In addition, the vendor launched Couchbase Mobile with vector search so that users can conduct hybrid and similarity searches in mobile applications at the edge rather than just their traditional database environment.

Based in Santa Clara, Calif., Couchbase is a NoSQL database vendor that competes with other database specialists such as Redis and MongoDB, as well as tech giants including AWS, Google, Microsoft and Oracle that offer database platforms.

Despite a crowded database market, Couchbase has been able to differentiate itself with forward-thinking product development such as its launch of Capella Columnar, according to Stephen Catanzano, an analyst at TechTarget’s Enterprise Strategy Group.

“Couchbase is seen as an innovative player,” he said. “Compared to its peers, Couchbase stands out for its ability to handle both transactional and analytical workloads in a unified platform. Columnar adds to this.”


Doug Henschen, an analyst at Constellation Research, likewise noted that Couchbase stands out despite strong competition, saying the vendor provides a leading NoSQL database.

Neither columnar capabilities nor vector search are new, he continued. For example, Couchbase first unveiled vector search in February. Meanwhile, MongoDB offered columnar capabilities as part of its Atlas Data Lake launch in 2022.

However, vector search for mobile is unique.

“The move makes sense, given the rise of edge applications and mobility demands,” Henschen said.

First known as Membase before a 2011 merger with CouchOne, Couchbase now provides Capella, a database-as-a-service platform first launched in 2021 and geared toward cloud-based customers. In addition, the vendor offers Couchbase Enterprise for on-premises users.

New capabilities

Couchbase first unveiled Capella Columnar in preview during AWS re:Invent 2023. The service, which is only available on AWS at this point, aims to bring together operational database workloads with real-time analytics in a columnar format that analytics tools can understand.

Many developers, including Couchbase customers, use JSON — a data interchange format used to move data between web clients and web servers — when building enterprise applications. JSON, however, can be difficult to use with analytics systems that use different, more rigid formats for storage and analysis, the vendor noted.

As a result, unstructured JSON data often goes unused and lies dormant in a database. Meanwhile, with enterprises now developing generative AI applications that require huge amounts of proprietary data to understand the enterprise’s business and respond accurately to business-specific queries, unstructured data is becoming critical.

Unstructured data such as text, images, videos and audio files is estimated to make up more than 80% of all data, with the structured data traditionally used to inform analytics just a small part of an enterprise’s overall cache of information. Without accessing unstructured data, enterprises don’t get a complete view of their business, and AI applications trained on their data are more prone to deliver incorrect outputs.

Capella Columnar transforms JSON data so that it can be recognized by analytics tools, making previously inaccessible data accessible for informing decisions and training AI models and applications. The feature reduces the cumbersome extract, transform and load (ETL) process by supporting real-time data ingestion, using Capella iQ to automatically write SQL to calculate an analytical metric and writing back the metric to the operational side of Capella, where it can be used in an application.
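To make the underlying idea concrete, here is a generic sketch, not Capella Columnar’s implementation, of why nested JSON resists tabular analytics and what flattening it into columns looks like. The documents and field names are made up.

```python
import pandas as pd

# Made-up nested JSON documents of the kind an operational database might hold.
orders = [
    {"order_id": 1, "customer": {"id": "c1", "region": "EMEA"}, "total": 120.0},
    {"order_id": 2, "customer": {"id": "c2", "region": "APAC"}, "total": 80.0},
    {"order_id": 3, "customer": {"id": "c3", "region": "EMEA"}, "total": 45.5},
]

# Flattening turns nested fields into columns (e.g., "customer.region"),
# which column-oriented analytics can then aggregate directly.
flat = pd.json_normalize(orders)
print(flat.groupby("customer.region")["total"].sum())
```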

Because Capella Columnar enables operational processing and real-time analytics in one database, its release is an important development for Couchbase users, according to Catanzano.

“The launch of Couchbase Columnar is significant,” he said. “It addresses a longstanding challenge of making JSON data useful for analytics, which has traditionally been difficult due to its unstructured nature.”

An added benefit could be cost reduction, Catanzano continued, noting that it adds expenses to do operational processing and real-time analytics on separate platforms.

Matt McDonough, Couchbase’s senior vice president of product and partners, said that while many enterprises are attempting to build more AI applications, including generative AI tools, such applications remain more an idea than a reality. Tools such as Capella Columnar aim to make it easier to develop AI-powered applications that can be used widely across organizations rather than by just data science teams.

“AI-powered apps have been a relatively abstract concept,” McDonough said. “With the availability of these new features in Capella, developers can bring AI apps to life because they’re no longer bogged down with rigid systems or complex ETL processes.”

Like Capella Columnar, Couchbase Mobile with vector search aims to speed and simplify application development.

Vector search has become a key component of retrieval-augmented generation (RAG) pipelines commonly used to train generative AI models and applications. Vector embeddings are a way to give structure to unstructured data by assigning it a numerical value so that it can be searched and used in training. In addition, vectors enable similarity search that makes data discovery easier than the more limiting keyword search, helping users find enough data to properly inform AI tools.
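As a rough illustration of what similarity search adds over keyword matching, here is a minimal cosine-similarity lookup over embedding vectors. The vectors are invented; in a real RAG pipeline they would come from an embedding model and be served by a vector index.

```python
import numpy as np

# Hypothetical 4-dimensional embeddings; real embeddings have hundreds of dimensions.
docs = {
    "refund policy":  np.array([0.90, 0.10, 0.00, 0.20]),
    "shipping times": np.array([0.10, 0.80, 0.30, 0.00]),
    "return window":  np.array([0.85, 0.20, 0.10, 0.15]),
}
query = np.array([0.88, 0.15, 0.05, 0.18])   # e.g., an embedded user question about returns

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

best = max(docs, key=lambda name: cosine(query, docs[name]))
print(best)   # "refund policy" and "return window" score far above "shipping times"
```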

Following its initial introduction of vector search capabilities in February, Couchbase is now extending those capabilities beyond its traditional database environment to edge devices in a move that stands to benefit customers, according to Henschen.

With Couchbase Lite, a document database that can be embedded into edge devices to enable real-time decisions, developers can build applications for mobile devices that can subsequently be consumed on those devices.

“The availability of vector information supports similarity search and improves search accuracy, so it’s nice to see in the mobile database as well as the core product,” Henschen said.

The impetus for developing both the new mobile feature and Capella Columnar came from Couchbase’s recognition that enterprises are struggling to build AI applications, according to McDonough.

Many organizations have complex data systems that include numerous different platforms that don’t natively integrate with one another. As a result, the pieces don’t always work smoothly together, leading to data quality issues. In addition, if different departments within organizations use different tools, data often gets isolated.

As Couchbase develops new features, one of its primary goals is to consolidate capabilities in a single database platform.

“For developers to evolve in the age of AI, they have to clean up complex architectures, which means consolidating platforms, eliminating data silos [and] making sure they’re working with trustworthy data,” McDonough said. “To do this, they need the right resources.”

Beyond Capella Columnar and Couchbase Mobile with vector search, Couchbase unveiled a new free tier that will be available starting Sept. 9.

Plans

Toward Couchbase’s goal of making it faster and easier to build AI applications, the vendor’s roadmap includes improving the developer experience through partnerships and integrations that create an ecosystem and provide key capabilities, according to McDonough.

Catanzano, meanwhile, said Couchbase’s focus on enabling users to develop AI tools is appropriate.

In particular, the vendor would be wise to concentrate on helping customers ensure trusted, high-quality data is used to inform models and applications, he said. Given the decision-making speed and scale generative AI enables, it is increasingly critical that the data used to inform generative AI tools is accurate.

“[Couchbase should] continue to innovate around bringing highly trusted enterprise data into GenAI models in a secure way, using RAG and vector capabilities to help create new and innovative solutions,” Catanzano said.

Eric Avidon is a senior news writer for TechTarget Editorial and a journalist with more than 25 years of experience. He covers analytics and data management.


Technology

PlayStation’s 30th anniversary PS5 and PS5 Pro consoles are so very pretty


The original PlayStation console, otherwise called the PS1, came out in Japan in late 1994. So we are quickly coming up on the console’s 30th birthday. To commemorate the occasion, Sony just revealed nostalgia-tinged redesigns of both the PS5 and the forthcoming PS5 Pro. They look like the original PlayStation, with that classic gray colorway and the old-school logo. Gamers of a certain age will have a hard time resisting these things. Sony did something similar in 2014 with the PS4 for the console line’s 20th anniversary.

This isn’t a quick and dirty redesign. There was legitimate thought put into this. The updated DualSense controller doesn’t quite match the original design, but does mesh with the overall aesthetic. Sony’s throwing in a retro-looking cable connector housing, PlayStation-shaped cable ties and a themed vertical stand. The box even looks like it came from a Toys “R” Us in the 1990s.

There are two bundles to choose from. The PS5 bundle ships with the digital version of the console (so no disc drive), a standard DualSense controller, the aforementioned accessories and additional goodies like a sticker, a poster and, uh, a PlayStation paperclip.

The PS5 Pro bundle includes everything mentioned above, along with both a standard controller and the DualSense Edge. It also includes a retro cover for the optional disc drive and the charging stand. It’s easy to dunk on that costly PS5 Pro when it looks basically the same as a regular PS5. It’s much harder to do when it looks like it stepped out of a 1995 fever dream.

A retro redesign.

Sony

Even the bizarre pseudo-portable PlayStation Portal is getting a themed refresh, which features the iconic gray exterior. Sony fans can even pick up redesigned controllers without springing for an entire console.

Preorders start on September 26 at participating retailers and via the company itself. These items will be released on November 21. That’s just a couple of weeks after the PS5 Pro launches. To that end, Sony’s only making 12,300 of the PS5 Pro retro consoles, so we recommend getting that preorder in early. The company hasn’t released pricing information, unfortunately, and it’s likely that the PS5 Pro bundle will absolutely obliterate bank accounts. We reached out to ask about pricing and will update this post when we hear back.

While we wait for preorders to start, senior reporter Jessica Conditt got a brief glimpse of the 30th anniversary edition PS5 Pro and DualSense controllers, which you can see below:

PlayStation 5 Pro and DualSense controllers — 30th anniversary edition

Photos by Jessica Conditt / Engadget

Update, September 20, 2024, 2:00PM ET: This story has been updated with photos of the 30th-anniversary PlayStation 5 Pro console and its controller.


Technology

Last Day to Apply: Boost your brand at Disrupt 2024


Keep the energy of TechCrunch Disrupt 2024 alive and leverage your brand by hosting an after-hours Side Event. 

Act fast — today is your last chance to apply!

Showcase your brand to 10,000 Disrupt attendees and the vibrant Bay Area tech scene during “Disrupt Week” — taking place from October 26 to November 1. From cocktail parties to workshops, happy hours to silent discos, craft an event that perfectly reflects your brand’s unique personality.

Perks of hosting a side event

Boost your visibility! Connect with thousands of Disrupt 2024 attendees and the Silicon Valley tech community. We’ll promote your Side Event across multiple platforms, ensuring it reaches a wide and diverse audience.

  • Disrupt 2024 Side Event page
  • Disrupt 2024 Agenda
  • Disrupt 2024 Mobile App Agenda
  • Disrupt 2024 attendee emails
  • Disrupt 2024 articles

It’s cost-free! There are no fees to apply, and we’ll cover the promotion of your Side Event. All you need to handle are the logistical expenses.

Enjoy exclusive savings for you and your network! As a Side Event host, you’ll be given a unique discount code for Disrupt 2024 tickets. Pass it on to your team and contacts to let them benefit from the deal.

Boost your brand before applications close tonight

Hoping to stand out at one of the biggest tech events this year? Submit your Side Event application before today’s deadline.

It’s easy to apply! Submit a concise proposal highlighting your event’s vision, goals, and logistics. After approval, the TechCrunch Disrupt team will support you in making your event a hit.

Apply before today’s deadline.


Technology

This little box provides on-demand power when off the grid


EcoFlow’s Alternator Charger is a device you install in your pickup truck, van, or RV to charge the giant power station you carry to keep all your gear running.

While your vehicle’s on, the Alternator Charger produces up to 800W. That’s about eight times more power than you can typically extract from a 12V cigarette lighter jack, and it’s enough to charge EcoFlow’s new 1kWh Delta 3 from zero to full in a little over one hour of driving. It takes five hours if you’re traveling with EcoFlow’s larger 4kWh Delta Pro 3.
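Those charge times follow directly from dividing battery capacity by charge power, ignoring the conversion losses that stretch real-world times a bit:

```python
def hours_to_charge(capacity_wh: float, charge_w: float) -> float:
    """Ideal charge time; real charging is a little slower due to conversion losses."""
    return capacity_wh / charge_w

print(hours_to_charge(1_000, 800))   # Delta 3 (~1 kWh): 1.25 h, "a little over one hour"
print(hours_to_charge(4_000, 800))   # Delta Pro 3 (~4 kWh): 5 h
```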

It’s also clever enough to reverse the flow of electrons, using the power station to maintain your starter battery with a trickle charge or jump-start it back to life. When you return home from the job site or vacation, those big-ass portable batteries can be connected to EcoFlow’s $200 balcony solar kit to help offset your energy bill and provide emergency power during a blackout.

The vehicle’s alternator sends up to 800W through EcoFlow’s Alternator Charger to an EcoFlow power station.
GIF: EcoFlow

EcoFlow’s Alternator Charger is far from an industry first, and it uses proprietary connectors that only work with EcoFlow’s own batteries. But the company brings simplicity, elegance, and a superior user experience to a product usually designed for electricians and mechanics.

After 3,700 miles (6,000km) of testing, I can say that the $599 Alternator Charger could be a game-changer for many. It allowed my wife and me to live and work carefree from a Sprinter van this summer, comforted by all the modern conveniences afforded by so much on-demand power.

It’s fairly common for RV builders to install aftermarket DC-to-DC chargers on a vehicle’s alternator. They’re incredibly adept at keeping stacks of leisure batteries charged to power off-grid luxuries like e-bikes, projectors, 3-in-1 refrigerator-freezers with ice makers, coffee makers, and air conditioners. Some basic chargers cost less and others are more powerful than EcoFlow’s, especially when built around a secondary alternator — but those offer fewer features and require professional installation. 

To avoid overloading the vehicle’s alternator, EcoFlow’s charger regulates itself so that only surplus power, which can be less than 800W, is sent to the power station. (The Alternator Charger can pull a maximum of 76 amps.) In my case, the Sprinter’s beefy alternator has enough capacity to easily deliver a near-continuous 800W even with the A/C running and the wipers and lights on.

I also travel with 420W of solar panels installed on the roof for an extra boost, resulting in just over 1,100W of simultaneous real-world charge when driving on sunny days. This combo also works while the van is parked and idling if I ever need the Sprinter to act like an emergency diesel generator.

Installation

EcoFlow’s installation qualifies as a DIY project for many Verge readers, though in my case I turned to an expert for help: Fabian van Doeselaar, who was already outfitting my stock cargo van with his Solo interiors and previously helped out with my review of the EcoFlow Power Kit.

EcoFlow offers a few helpful videos, including one showing the Alternator Charger being installed in a Ford F-150 pickup and another showing it installed in an older Sprinter-based RV.

Installing the Alternator Charger requires wiring it back to the starter battery, not the alternator itself. The specific steps for each vehicle will vary, but in the case of my Sprinter, we ran the thick 16-foot (five-meter) cable up to the busbar in the auxiliary battery fuse box, which meant removing the driver’s seat. The cable was long enough to reach the Alternator Charger box mounted inside a cabinet in the back where I manage my electricity.

My Sprinter van is designed from the ground up to be powered by any portable solar generator, which is just a large power station that includes an MPPT charge controller for solar panels. For this review, we connected my van’s circuitry to EcoFlow’s original Delta Pro, which in turn was connected to the Alternator Charger using a proprietary EcoFlow cable and adapter.

Testing EcoFlow’s giant Delta Pro power station connected to the Alternator Charger.

The Alternator Charger mounted inside a wheel well cabinet where I manage my van’s electrical connections.

The five-meter cable that runs to the starter battery is more than long enough for 6-meter L2 Sprinter vans.

It’s better than it looks. Here we were staging the installation, testing that big Alternator Charger cable connected directly to the starter battery (to the left of the cordless screwdriver), and on the busbar located beneath the driver’s seat.

The Delta Pro keeps my laptops, phones, drones, and headphones charged, in addition to powering my Starlink internet, lights, fridge, water pump, induction cooktop, and rooftop ventilation, as well as EcoFlow’s Wave 2 air conditioner and heater combo I just reviewed. So having a way to reliably charge it was critical this summer since I wanted to live and work as remotely as possible.

Performance

After a straightforward installation, it was time to configure the Alternator Charger in the excellent EcoFlow app, which makes monitoring performance both fun and addictive.

The Alternator Charger only sends power to the power station after two conditions are met. First, the charger has to be turned on with a button on the unit itself or from a “start working” toggle in the EcoFlow app. Then, the voltage measured at the starter battery has to surpass the “start voltage” threshold you set in the EcoFlow app. If left on, it should automatically charge the attached power station when driving — but that didn’t quite work for my setup.

With the “start voltage” set to 13V, the Alternator Charger charged at 800W while driving but then dropped off whenever the voltage produced by the alternator fell to 13.0V or below. Setting it to start at 12.5V produced a near-constant 800W but also started draining my starter battery when parked. Sigh.

I initially went with the app’s default 13.0V start voltage. Starting the van causes the starter battery’s voltage to jump from about 12.6V – 12.8V to beyond 14V, thus triggering the 800W charging session. But my van’s fitted with a smart alternator which causes the voltage to fluctuate over time, occasionally dipping below that 13.0V threshold. This causes the Alternator Charger to shut off and on repeatedly, thus reducing the speed at which the Delta Pro is charged.
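A rough model of the trigger logic as described, not EcoFlow’s firmware, shows why a smart alternator that wanders around the threshold makes charging cut in and out:

```python
# Illustrative model of the two start conditions described above; not EcoFlow's firmware.
def charger_output(working: bool, battery_volts: float, start_volts: float = 13.0) -> int:
    """Watts sent to the power station for a given starter-battery voltage."""
    return 800 if working and battery_volts >= start_volts else 0

# A smart alternator fluctuating around 13.0V toggles charging off and on...
for volts in (14.2, 13.4, 12.9, 13.2, 12.8):
    print(volts, charger_output(True, volts))            # 800, 800, 0, 800, 0

# ...while a 12.5V threshold keeps it on, at the cost of draining the starter
# battery toward 12.5V after the engine stops.
print(charger_output(True, 12.7, start_volts=12.5))      # still 800 with the engine off
```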

To “fix” this, I lowered the charger’s start voltage to 12.5V (it’s limited to 0.5V adjustments) in the app with a predictable side effect — when I arrived and shut off the motor, the Alternator Charger began depleting my van’s battery and would have continued doing so until it reached the 12.5V threshold and stopped. 

That’s not the end of the world, but it is below the 12.6V resting threshold considered healthy for a lead-acid starter battery. EcoFlow does make it easy to manually move that stored energy from the Delta Pro’s battery back to the Sprinter’s by switching the Alternator Charger into Reverse Charge or 100W Battery Maintenance modes — but this is far from ideal.

Ideally, all this would work automatically, so that every time I drive I know that 800W is being fed back into my power station, and I don’t have to worry about the health of my starter battery after I park. Lacking those assurances, I decided to play it safe, and leave the start voltage at 12.5V but toggle the “start working” switch in the app manually every time I started and stopped driving. 

Still, after testing EcoFlow’s Alternator Charger, I can tell you $599 is a small price to pay for the peace of mind of having all that power available any time I needed it for two months this summer — rain or shine, even in the middle of nowhere. Shame that it has to be turned on and off manually in my case, and only works with EcoFlow’s own batteries.

EcoFlow’s products can often be found on sale throughout the year with reductions also found in bundles. An $848 bundle that includes the Alternator Charger and new $649 Delta 3 Plus looks pretty compelling for a 1kWh solar generator that can grow with your needs.

All photos by Thomas Ricker / The Verge

