Internet Archive


July 6, 2022

The Internet Archive is a non-profit digital library whose stated purpose is to enable "universal access to knowledge". It offers permanent digital storage of, and access to, various types of resources, such as websites, audio, moving images (video) and books. The Internet Archive was founded by Brewster Kahle in 1996 and is a member of the International Internet Preservation Consortium (IIPC). In addition to its primary archiving function, the organization campaigns for a free and open Internet and is an officially recognized nonprofit in the United States.

Its administrative offices are in San Francisco, while its data centers are in San Francisco, Redwood City and Mountain View, California. The library's largest digital collection is the web archive, a collection of "snapshots" of the World Wide Web cataloged by capture date. To ensure the stability and security of the archived data, the entire collection is mirrored on the servers of the Bibliotheca Alexandrina in Alexandria, Egypt.

The archive allows the public to upload and download digital material to and from its servers at no cost. It also provides access to one of the largest book digitization projects in existence, is a member of the American Library Association and is officially recognized by the state of California as a public library.

The organization has 200 employees, many of whom scan paper volumes at specialized centers; the main San Francisco office has thirty employees. The Internet Archive has an annual budget of approximately $10 million, drawn from a variety of sources: revenue from web crawling services, partnerships, grants, donations, and the Kahle-Austin Foundation.

The Internet Archive describes its mission as follows: "Most societies place importance on preserving artifacts of their culture and heritage. Without such artifacts, civilization has no memory and no mechanism to learn from its successes and failures. Our culture now produces more and more artifacts in digital form. The Archive's mission is to help preserve those artifacts and create an Internet library for researchers, historians, and scholars."


Brewster Kahle founded the Internet Archive in 1996, around the same time he founded Alexa Internet, a company providing web crawling services. The Internet Archive began archiving the World Wide Web in 1996, but the collection was not publicly accessible until 2001, when the Wayback Machine was developed. In 1999, the Internet Archive expanded by adding other collections, including the Prelinger Archives. Its collections now include, among other things, texts, audio, moving images and software. It hosts a number of other projects, including a NASA image archive, the Archive-It web archiving service, and Open Library, a catalog of books that can be edited through wiki-like software. On March 25, 2020, in response to the COVID-19 pandemic, the Internet Archive launched the National Emergency Library, an initiative offering free full-text access to the entire digital collection of more than 1 million titles in its catalog.


Wayback Machine

The Wayback Machine is the web interface that the Internet Archive uses to retrieve archived website data. The archived sites are "snapshots" taken when pages are captured by the Internet Archive's crawling software. The name "Wayback Machine" comes from the "WABAC Machine" featured in one of the segments of the animated series Rocky and Bullwinkle. The service, thanks to the Alexa spider, stores