{"id":129455,"date":"2025-05-13T15:06:29","date_gmt":"2025-05-13T09:36:29","guid":{"rendered":"https:\/\/www.bananaip.com\/intellepedia\/?p=129455"},"modified":"2025-05-13T15:06:29","modified_gmt":"2025-05-13T09:36:29","slug":"generative-ai-training-copyright-report","status":"publish","type":"post","link":"https:\/\/www.bananaip.com\/intellepedia\/generative-ai-training-copyright-report\/","title":{"rendered":"Generative AI Training and Copyright: U.S. Copyright Office\u2019s Pre-Publication Report"},"content":{"rendered":"<blockquote>\n<p style=\"font-weight: 400;\"><em>\u201cA model is not a magical portal that pulls fresh information from some parallel universe into our own.\u201d <\/em><\/p>\n<p style=\"font-weight: 400;\"><span style=\"color: #1a1a1a; font-size: 16px;\">\u2014 A. Feder Cooper &amp; James Grimmelmann, The Files are in the Computer: Copyright, Memorization and Generative AI at 23-24<\/span><\/p>\n<\/blockquote>\n<h2>Background<\/h2>\n<p style=\"font-weight: 400;\">The U.S. Copyright Office has been examining the intersection of copyright law and artificial intelligence (AI) through a three-part report series titled <a href=\"https:\/\/www.copyright.gov\/ai\/\" target=\"_blank\" rel=\"noopener\">Copyright and Artificial Intelligence<\/a>. This initiative, launched in early 2023, aims to address the legal and policy challenges posed by AI technologies. Each part of the series focuses on a different aspect of AI&#8217;s impact on copyright law. The series is divided as follows:<\/p>\n<ul>\n<li><strong>Part 1: Digital Replicas (published July 31, 2024): <\/strong><span style=\"font-size: 16px;\"><span style=\"font-size: 16px;\">This Part addressed the rise of digital replicas: realistic but false depictions of individuals created through AI, commonly known as <em>deepfakes<\/em>. 
This Part may be accessed at: <a href=\"https:\/\/www.copyright.gov\/ai\/Copyright-and-Artificial-Intelligence-Part-1-Digital-Replicas-Report.pdf\" target=\"_blank\" rel=\"noopener\">Part 1<\/a><\/span><\/span><\/li>\n<\/ul>\n<ul>\n<li><strong>Part 2: Copyrightability (published January 29, 2025): <\/strong><span style=\"font-size: 16px;\">This Part delved into the copyrightability of works generated using generative AI. It reaffirmed that human authorship is a fundamental requirement for copyright protection under U.S. law. This Part may be accessed at: <a href=\"https:\/\/www.copyright.gov\/ai\/Copyright-and-Artificial-Intelligence-Part-2-Copyrightability-Report.pdf\" target=\"_blank\" rel=\"noopener\">Part 2<\/a><\/span><\/li>\n<\/ul>\n<ul>\n<li><strong>Part 3: Generative AI Training:<\/strong> The U.S. Copyright Office released a pre-publication version of Part 3 on May 9, 2025. This Part delves into the issues around the use of copyrighted materials in training generative AI models.<\/li>\n<\/ul>\n<h2>Overview of Part 3: Generative AI Training<\/h2>\n<p style=\"font-weight: 400;\">The third and final Part, released in pre-publication form on May 9, 2025, examines the legal implications of using copyrighted materials to train generative AI models. The Report is divided into six sections, as follows:<\/p>\n<h5 style=\"padding-left: 40px;\"><strong style=\"font-size: 19px;\">Introduction<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">This section sets out what the report is about and why the Copyright Office is looking into how AI and copyright law interact. 
It mentions the growing concerns from the public, ongoing court cases, and interest from lawmakers, and gives an overview of what the report will cover.<\/p>\n<h5 style=\"font-weight: 400; padding-left: 40px;\"><strong>Technical Background<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">This section explains in simple terms how generative AI systems work, starting with basic machine learning concepts and continuing through model training and deployment. It covers how these systems are built and trained using large amounts of data, notes that this data often includes copyrighted material, and describes how the systems are used in practice.<\/p>\n<h5 style=\"font-weight: 400; padding-left: 40px;\"><strong>Prima Facie Infringement<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">This section looks at which parts of the AI development process might infringe copyright. It focuses on the reproduction and use of copyrighted works during data collection, training, retrieval-augmented generation (RAG), and the creation of outputs. It explains how copying and using protected works during training might infringe copyright, especially the rights to reproduce or adapt those works.<\/p>\n<h5 style=\"font-weight: 400; padding-left: 40px;\"><strong>Fair Use<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">This is an important section of the report. It explores whether the use of copyrighted material in AI training might be allowed under the fair use doctrine. It goes through the four main factors the law considers and outlines the arguments both for and against fair use in this context. 
The four factors are:<\/p>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li style=\"list-style-type: none;\">\n<ol>\n<li><strong>The purpose and character of the use<\/strong> &#8211; Is the material being used for something new, like education, commentary, or research? Is the use transformative (i.e. does it add something new or change the original)? Also, is the use commercial or non-commercial?<\/li>\n<li><strong>The nature of the copyrighted work<\/strong> &#8211; Is the original work more creative or expressive (like novels, music, or visual art) or more factual or functional (like news articles, computer code, or technical writing)?<\/li>\n<li><strong>The amount and substantiality of the portion used<\/strong> &#8211; How much of the original work is being used, and is it the &#8220;heart&#8221; or most important part of the work? Relevant considerations include how much of each work is used, whether that amount is reasonable in light of the purpose of the use, and how much is made accessible to the public.<\/li>\n<li><strong>The effect on the market for the original work <\/strong>&#8211; Does the use harm the market for the original work? The enquiry must take account not only of harm to the original but also of harm to the market for derivative works.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<h5 style=\"font-weight: 400; padding-left: 40px;\"><strong>Licensing for AI Training<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">This section discusses ways that creators might give permission (or licences) for their work to be used in AI training. It looks at voluntary licensing, including its feasibility, potential for fair compensation, and legal barriers to collective licensing. 
It also examines statutory approaches such as compulsory licensing, extended collective licensing, and opt-out mechanisms. The section concludes with an analysis and recommendations on how these models could be implemented to support both innovation and copyright protection.<\/p>\n<h5 style=\"font-weight: 400; padding-left: 40px;\"><strong>Conclusion<\/strong><\/h5>\n<p style=\"font-weight: 400; padding-left: 40px;\">The final section sums up the main points. It highlights the key issues and stresses the need for further monitoring, clearer laws, and possible new policies as AI technology continues to develop.<\/p>\n<h2>Disclaimer<\/h2>\n<p style=\"font-weight: 400;\">This version of Part 3 comes with the following disclaimer:<\/p>\n<p style=\"font-weight: 400; padding-left: 40px;\">\u201c<em>The Office is releasing this pre-publication version of Part 3 in response to congressional inquiries and expressions of interest from stakeholders. A final version will be published in the near future, without any substantive changes expected in the analysis or conclusions<\/em>\u201d<\/p>\n<p style=\"font-weight: 400;\">You can access the U.S. Copyright Office&#8217;s pre-publication report here: <a href=\"https:\/\/www.copyright.gov\/ai\/Copyright-and-Artificial-Intelligence-Part-3-Generative-AI-Training-Report-Pre-Publication-Version.pdf\" target=\"_blank\" rel=\"noopener\">Copyright and Artificial Intelligence \u2013 Part 3: Generative AI Training (Pre-publication)<\/a>.<\/p>\n<p>Accessibility Review: Ms. Benita Alphonsa Basil<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This post offers a structured summary of Part 3 of the U.S. Copyright Office\u2019s AI report series. 
It highlights the pre-publication report\u2019s focus on the legal concerns surrounding generative AI training and provides a link to the full report.<\/p>\n","protected":false},"author":2,"featured_media":129471,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"iawp_total_views":82,"footnotes":""},"categories":[3,6,2994],"tags":[6736,2,540,5572,6735],"class_list":["post-129455","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-copyrights","category-intellectual-property","category-software","tag-ai-policy","tag-copyright","tag-fair-use","tag-generative-ai","tag-u-s-copyright-office"],"_links":{"self":[{"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/posts\/129455","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/comments?post=129455"}],"version-history":[{"count":20,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/posts\/129455\/revisions"}],"predecessor-version":[{"id":129478,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/posts\/129455\/revisions\/129478"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/media\/129471"}],"wp:attachment":[{"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/media?parent=129455"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/categories?post=129455"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bananaip.com\/intellepedia\/wp-json\/wp\/v2\/tags?post=129455"}],"curies
":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}