インシデント 997: translated-ja-Meta and OpenAI Accused of Using LibGen’s Pirated Books to Train AI Models
概要: translated-ja-Court records reveal that Meta employees allegedly discussed pirating books to train LLaMA 3, citing cost and speed concerns with licensing. Internal messages suggest Meta accessed LibGen, a repository of over 7.5 million pirated books, with apparent approval from Mark Zuckerberg. Employees allegedly took steps to obscure the dataset’s origins. OpenAI has also been implicated in using LibGen.
Editor Notes: Please refer to these two legal filings for more information; the incident date of 02/28/2023 is drawn from (2): (1) Case 3:23-cv-03417-VC, Document 417-6, filed 02/05/2025, Exhibit C, https://storage.courtlistener.com/recap/gov.uscourts.cand.415175/gov.uscourts.cand.415175.449.4.pdf; and (2) Case 3:23-cv-03417-VC, Document 449-4, filed 02/20/2025, Woodhouse Exhibit 4, Exhibit C, https://storage.courtlistener.com/recap/gov.uscourts.cand.415175/gov.uscourts.cand.415175.449.4.pdf. See also Incidents 995 and especially 996 for similarly related cases.
推定: OpenAI , Meta , OpenAI models , Llama 3 , Library Genesis (LibGen) , GPT-4 と BitTorrentが開発し提供したAIシステムで、Writers , publishers , Journalists , Authors と Academic researchersに影響を与えた
インシデントのステータス
インシデントID
997
レポート数
4
インシデント発生日
2023-02-28
エディタ
Dummy Dummy
インシデントレポート
レポートタイムライン
Loading...