Judge Chhabria commenced with stating that the ultimate goal of copyrights is to promote a broad public availability of literature, music and other arts. Copyright law does this by granting artists incentives to create by giving them exclusive rights. An example is the right to prevent others from reproducing their works. However, copyrights law also seeks to strike a balance between protecting ownership and allowing room for innovation through the fair use doctrine. Fair use is defined in Section 107 of the Copyright Act, which allows copyrighted works to be used without the creator’s permission if certain criteria are met. The Copyright Act outlines four factors that should be considered when assessing whether a particular use is fair:
- The purpose and character of the use;
- The nature of the copyrighted work;
- The amount and substantiality of the portion used in relation to the copyrighted work as a whole;
- The effect of the use upon the potential market for or value of the copyrighted work.
Meta invoked this fair use doctrine. The first factor favors Meta, because the purpose of the thirteen authors and Meta clearly differ. There is no doubt that Meta’s use of the copyrighted works is highly transformative. In this case the use by Meta was to train its LLMs, which are innovative tools that can be used to generate a variety of texts and perform a wide range of functions. By contrast, the author’s books are intended for entertainment or education purposes.
With regard to the second factor the Court was brief. Meta chose these books precisely because of their strongly expressive value. Works that are more creative are more likely to receive greater copyright protection and are less likely to fall under the fair use doctrine. The second factor favors the thirteen authors, as there is consensus that LLMs perform better when trained on high-quality material involving multiple creative choices.
The third factor requires the amount and substantiality of the portion used in the new content to be reasonable in relation to the purpose of copying. This is related to the first factor. The extent to which new content may contain fragments of copyright-protected works is determined by the purpose of its use. Given the transformative purpose, judge Chhabria decided the amount copied by Meta is reasonable. Another relevant consideration is the amount of copyrighted material made available to the public, rather than the amount used by the alleged infringer. Experts from both parties reached the conclusion that Llama could not regurgitate more than fifty words from any of the authors’ books, even when adversarial prompting techniques were deployed.
The Court emphasizes that great importance must be given to the fourth factor, which considers the impact upon the potential market for or value of the copyrighted work. While Meta suggested that the transformative use of the copyrighted works constituted fair use, Judge Chhabria did not follow their reasoning. Even if the use is transformative, it can still have an impact on the market or on the value of the copyrighted work. The Court highlights a promising argument that provides support for this: an LLM can potentially generate works that compete with the original work by acting as indirect substitutes for it. Although the authors are unable to provide satisfactory evidence, the Court zooms in on this point. The training of an LLM can help to enable the rapid creation of countless works that can compete with copyrighted works, even if those works are not infringing any copyrights. This harm of market dilution – also referred to as harm of indirect substitution – will maybe not crowd out bestselling authors, but it is imaginable that lesser-known authors or up-and-coming authors will encounter difficulties due to artificial competition on the market. The plaintiffs did not present any empirical evidence that the training of LLMs is already harming the market or will harm it in the future, but instead solely offered speculation to the Court. Given the combination of the foregoing and Meta’s demonstration of a lack of market harm, the Court followed Meta on its fair use defense.