Digital Archive Anxiety Emerges as Meta Trains AI on Pirated Books While Cultural Preservation Faces Corporate Threats
According to ArsTechnica, Meta has unlawfully trained its AI systems using at least 81.7 terabytes of stolen data from shadow libraries such as Anna's Archive, Z-Library, and LibGen. While the tech giant faces no repercussions, smaller instances of data theft are currently under criminal scrutiny. Access to information is becoming more limited due to book bans and cuts to library funding. Following a court decision that favored publishers, the Internet Archive lost over 500,000 books. Bad Bunny's 2025 album, 'Debí Tirar Más Fotos,' tackles themes of colonial erasure and has sparked a TikTok trend, particularly among the Palestinian diaspora. In a November 2023 essay, writer Bami Oke addresses 'archive anxiety,' emphasizing the difficulties of preserving digital files.
Key facts
- Meta illegally trained AI models on at least 81.7 terabytes of pirated data from shadow libraries
- Last month, ArsTechnica reported on unsealed Meta emails confirming the data piracy
- Bad Bunny released the album 'Debí Tirar Más Fotos' in 2025, exploring Puerto Rican colonial history
- A TikTok trend emerged from Bad Bunny's title track featuring slideshows of family memories
- The Internet Archive lost over 500,000 books after a court ruling favored major publishers
- Writer Bami Oke published an essay on 'archive anxiety' in e-flux journal in November 2023
- i-D and MTV removed their online archives to save money, erasing decades of cultural journalism
- Anna's Archive is a shadow library formed after the Z-Library shutdown and used for pirated content
Entities
Artists
- Bad Bunny
- Aria Dean
- Bami Oke
- Michelle Santiago Cortés
Institutions
- Meta
- Anna's Archive
- Z-Library
- LibGen
- Internet Archive
- ArsTechnica
- e-flux journal
- i-D
- MTV
- Amazon
- Apple Music
- X
- TikTok
Locations
- Puerto Rico
- Gaza
- Palestine