Showing posts with label digital preservation. Show all posts

Monday, November 17, 2025

Inside the old church where one trillion webpages are being saved; CNN, November 16, 2025

CNN; Inside the old church where one trillion webpages are being saved

"The Wayback Machine, a tool used by millions every day, has proven critical for academics and journalists searching for historical information on what corporations, people and governments have published online in the past, long after their websites have been updated or changed.

For many, the Wayback Machine is like a living history of the internet, and it just logged its trillionth page last month.

Archiving the web is more important and more challenging than ever before. The White House in January ordered vast amounts of government webpages to be taken down. Meanwhile, artificial intelligence is blurring the line between what’s real and what’s artificially generated — in some ways replacing the need to visit websites entirely. And more of the internet is now hidden behind paywalls or tucked in conversations with AI chatbots.

It’s the Internet Archive’s job to figure out how to preserve it all."

Monday, June 9, 2025

Newsmaker: Brewster Kahle; American Libraries, June 4, 2025

Anne Ford, American Libraries; Newsmaker: Brewster Kahle

"How has the work of the Internet Archive been affected since Trump took office?

Well, the biggest effect has been getting a lot of attention for what we do. We spend a lot of time on Democracy’s Library, which is a name for collecting all the born-digital and digitized publications of government at the federal, state, and municipal levels. There’s been so much attention about all of the [digital] takedowns that we’ve received lots and lots of volunteer help toward collecting not only web assets but also databases that are being removed from government websites. It’s all hands on deck.

And you just launched a new YouTube channel.

Yes, we unveiled our next-generation microfiche scanning as part of our Democracy’s Library project, because a lot of .gov sites are on microfiche, and people don’t want to use microfiche anymore. Fortunately, the US government in its early era was pro–access to information and made government documents public domain. So we put out a YouTube livestream of the microfiche being digitized.

What would you like to see libraries and librarians do during this challenging time?

We need libraries to have at least as good rights in the digital world as we have in the physical world. There’s an upcoming website [from the Internet Archive and others] called the Four Digital Rights of Libraries, and that is something libraries can sign onto as institutions. [The website will launch during the Association of European Research Libraries’ LIBER 2025 Conference in Lausanne, Switzerland, July 2-4.]

People generally don’t know that libraries, in this digital era, are prevented from buying any ebooks or MP3s. They are not allowed by the publishers to have them. They spend and spend and spend, but they don’t end up owning anything. They’re not building collections. So the publishers can change or delete anything at any time, and they do. In their dream case, libraries will never own anything ever again. This is a structural attack on libraries. You don’t need to be a deep historian to know what happens to libraries. They’re actively destroyed by the powerful.

So let’s spend [our collection budgets] buying ebooks, buying music, buying material from small publishers or anybody [else] that will actually sell to us. Make it so we are building our own collections, not this licensing thing where these books disappear.

That’s a big ask. But the great thing about that will be that our libraries start buying things from small publishers, where most of the money goes back to the authors, not stopping with the big multinational publishers. Let’s build a system that works for more players than just big corporations that make a habit of suing libraries."

Wednesday, January 8, 2025

The Internet Archive is in danger; WBUR, January 7, 2025

WBUR; The Internet Archive is in danger

"More than 900 billion webpages are preserved on The Wayback Machine, a history of humanity online. Now, copyright lawsuits could wipe it out.

Guests

Brewster Kahle, founder and director of the Internet Archive. Digital librarian and computer engineer.

James Grimmelmann, professor of digital and information law at Cornell Tech and Cornell Law School. Studies how laws regulating software affect freedom, wealth, and power."