Yahoo has unveiled a project to digitise books in the public domain. The company is partnering with the newly formed Open Content Alliance, which aims to offer PDF documents of non-copyright books to the public free of charge.
The Internet Archive was set up in 1996 to build a library for the internet that offers access to historical collections. Its most well-known online project is the Wayback Machine, which indexes historical snapshots of websites.
The project is using optical character recognition technology to create digital versions of works in the public domain. The Internet Archive will host the content and Yahoo will index the text.
"In-copyright still has some twists and turns to go, but at least we can get substantial work going on the public domain," said Kahle.
The project is similar to Google Print, in which the search engine firm is digitally scanning books.
Google also plans to keep its library closed, offering searchers only excerpts from the books.
Participants in the Open Content Alliance unveiled their plans just 24 hours after Google announced Google Print last year.
Using photocatalysts to convert carbon dioxide into usable energy such as methane or ethane
Trained on curated data from Moorfields Eye Hospital, the neural network also shows clinicians how it reached its judgement
Yokohama National University demonstrate technology that could lead to a fault-tolerant universal quantum computer
Top-of-the-range Threadripper 2990WX now available from Scan, Ebuyer, Overclockers, Novatech and Amazon