Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
Scientists Investigate Superagers for 25 Years Heres What They Discovered
Science

Scientists Investigate “Superagers” for 25 Years: Here’s What They Discovered

The Labour Party Needs To Spread Their Message, Regardless Of
Technology

The Labour Party Needs to Spread Their Message, Regardless of Their Desires | Social Media

Elon Musk Unexpectedly Withdraws Legal Action Against Sam Altman And
Technology

Elon Musk unexpectedly withdraws legal action against Sam Altman and OpenAI

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    Ancient Human Habitation Uncovered at 2000 Meters Experts Stunned by

    Ancient Human Habitation Uncovered at 2,000 Meters: Experts Stunned by Mountain Discovery

    June 2, 2026
    7 Reasons We Overtrust AI and the Hidden Costs Were

    7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

    June 2, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

    June 2, 2026
    Newly Discovered Axolotl Fossil Unearthed in Mexico

    Newly Discovered Axolotl Fossil Unearthed in Mexico

    June 2, 2026
    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment

    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

    June 2, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » Meta’s AI Memorable Book Verbatim – Can Cost Billions
Metas AI Memorable Book Verbatim – Can Cost Billions
Science June 10, 2025

Meta’s AI Memorable Book Verbatim – Can Cost Billions

Share
Facebook Twitter LinkedIn Pinterest Email

In April, authors and publishers protested utilizing copyrighted books for AI training

Vuk Valcic/Alamy Live News

Amid legal battles, billions are at stake as courts in the US and UK deliberate on whether technology firms can legitimately train AI models using copyrighted literature. Numerous lawsuits have been filed by authors and publishers, revealing that at least one AI model has not only utilized popular texts for training but has also memorized portions of these works verbatim.

The crux of the dispute lies in whether AI developers hold the legal authority to employ copyrighted materials without obtaining prior permission. Previous research highlighted that many large language models (LLMs) powering popular AI chatbots were trained on the “Books3” dataset. Developers of these models argued they were not infringing copyright, claiming they were generating new combinations of words rather than directly reproducing the copyrighted content.

However, recent investigations have examined various AI models to determine the extent of verbatim recall from their training datasets. While most models did not retain exact texts, one particular model from Meta remembered nearly the entire text of a specific book. Should the ruling be unfavorable to the company, researchers predict damages could exceed $1 billion.

“AI models are not merely ‘plagiarism machines’ as some suggest; they do not just capture general relationships among words,” explained Mark Remley from Stanford University. “The diversity in responses among different models complicates the establishment of universal legal standards.”

Previously, Lemley defended Meta in a copyright case involving generative AI known as Kadrey V Meta Platforms. The plaintiff, whose works were used to train Meta’s AI models, filed a class-action lawsuit against the tech giant for copyright infringement. The case is currently under consideration in Northern California.

In January 2025, Remley announced he had parted ways with Meta as a client, yet he remains convinced of the company’s favorable chances in the lawsuit. Emile Vasquez, a Meta spokesperson, stated, “Fair use of copyrighted materials is crucial. We challenge the plaintiff’s claims, and the full record presents a different narrative.”

In this new study, Lemley and his team evaluated the memory capabilities of the AI by dividing excerpts from a small book into prefix and suffix segments, checking if a model prompted with the prefix could recall the suffix. For instance, one excerpt from F. Scott Fitzgerald’s The Great Gatsby was divided into a prefix that read, “They were careless people, Tom and Daisy—they broke things and creatures and then retreated,” and a suffix that concluded with, “We went back to money and their vast carelessness, which kept them together and allowed them to clean up any mess that other people had made.”

Researchers calculated the probability of each AI model completing the excerpt accurately and compared these probabilities against random chance.

The tested excerpts included selections from 36 copyrighted works, featuring popular titles by authors like George RR Martin’s Games and Cheryl Sandberg’s Lean In. Additionally, excerpts from books authored by plaintiffs in the Kadrey V Meta Platforms case were also examined.

The experiments involved 13 open-source AI models, including those created by Meta, Google, DeepMind, EleutherAI, and Microsoft. Most companies outside of Meta did not provide comments, with Microsoft opting not to comment.

The analysis revealed that Meta’s Llama 3.1 70b model had a significant recall of texts from JK Rowling’s first Harry Potter tome, as well as from The Great Gatsby and George Orwell’s 1984. Other models, however, showed minimal recall of the texts, including those penned by the plaintiffs. Meta declined to comment on these findings.

Researchers estimate that an AI model found to have infringed on merely 3% of the Books3 dataset could incur almost $1 billion in damages.

This technique has potential as a “forensic tool” for gauging the extent of AI memory, as noted by Randy McCarthy from Hallestill Law Office in Oklahoma. Yet, it does not address whether companies are legally permitted to train AI models on copyrighted works under US “fair use” provisions.

McCarthy points out that AI firms generally utilize copyrighted material for training. “The real question is whether they had the right to do so,” he remarked.

Meanwhile, in the UK, memory assessment is crucial from a copyright perspective, according to Robert Lands from Howard Kennedy Law Office in London. UK copyright legislation adheres to “fair dealing,” which presents much narrower allowances for copyright infringement compared to US fair use doctrine. Therefore, he posits that AI models retaining pirated content would not satisfy this exception.

Topics:

  • artificial intelligence/
  • Law

Source: www.newscientist.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleApple Unveils Software Enhancements and New Apps, AI Takes a Back Seat
Next Article Google Addresses Fox’s Incursion on the Roof of Its £1 Billion London Office

Related Posts

Ancient Human Habitation Uncovered at 2000 Meters Experts Stunned by
Science

Ancient Human Habitation Uncovered at 2,000 Meters: Experts Stunned by Mountain Discovery

7 Reasons We Overtrust AI and the Hidden Costs Were
Science

7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS
Science

Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

Newly Discovered Axolotl Fossil Unearthed in Mexico
Science

Newly Discovered Axolotl Fossil Unearthed in Mexico

Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment
Science

Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

How Pigeons Use Superparamagnetic Immune Cells in Their Livers to
Science

How Pigeons Use Superparamagnetic Immune Cells in Their Livers to Detect Earth’s Magnetic Field

Leveraging Human Error as a Tactic Against Large Scale Language Models
Science

Leveraging Human Error as a Tactic Against Large-Scale Language Models

Exploring the Real Health Benefits of Turmeric and Curcumin
Science

Exploring the Real Health Benefits of Turmeric and Curcumin

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A good traveler has no fixed plans, and is not intent on arriving.

Lao Tzu
Exchange Rate

Exchange Rate EUR: Tue, 2 Jun.

Top Insights
U.s. States And Big Tech Companies Clash Over Online Child Technology

U.S. states and big tech companies clash over online child safety bills: Battle lines drawn

Scientists Uncover Mesozoic Carbon Dioxide Levels and Photosynthesis Through Dinosaur Science

Scientists Uncover Mesozoic Carbon Dioxide Levels and Photosynthesis Through Dinosaur Tooth Enamel Analysis

How Fast Does the DNA Repair Leader in Your Cells Science

How Fast Does the DNA Repair Leader in Your Cells Take Control?

Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Ai Invents New Battery Design That Decreases Lithium Usage By

AI invents new battery design that decreases lithium usage by 70%

January 9, 2024
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Most Popular
Lessons From Uninhabitable Venus: Exploring The Potential For Extraterrestrial Life
Science

Lessons from Uninhabitable Venus: Exploring the Potential for Extraterrestrial Life

A Recently Discovered Tiny Moon Orbits Neptune And Uranus
Science

A recently discovered tiny moon orbits Neptune and Uranus

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.