“Just as a student learns more by reading more books, large language models can better pinpoint patterns in text and be more accurate with more information.”

– Cade Metz, Cecilia Kang, Sheera Frenkel, Stuart A. Thompson and Nico Grant

Describing what it calls “a desperate hunt for digital data,” The New York Times reports that Meta (formerly Facebook) discussed buying the publishing house Simon & Schuster as a way to gain more long-form text. Another prominent example the reporters cite is how other AI model-makers are using speech-to-text algorithms to transcribe YouTube videos into training text.

The Times says AI companies, including OpenAI, Google, and Meta, are “using the data faster than it is being produced.”

SEE FULL STORY

How Tech Giants Cut Corners to Harvest Data for A.I. | THE NEW YORK TIMES | April 5, 2024 | by Cade Metz, Cecilia Kang, Sheera Frenkel, Stuart A. Thompson and Nico Grant
