Internet Archive expands OCA book digitizing effort

December 20, 2006, 04:19 PM —  IDG News Service — 

The Internet Archive has received a grant from the Alfred P. Sloan Foundation to expand its book-digitizing efforts, which so far have resulted in the scanning of about 100,000 books now available on the group's Web site.

The grant will also benefit the Open Content Alliance, an initiative launched in October 2005 and backed by the Internet Archive, Yahoo Inc. and others to digitize books and multimedia material and make them available online, the Internet Archive announced Wednesday.

The scanned works hosted by the Internet Archive are also available for indexing by any search engine that adheres to the OCA's open-access terms for the content. These principles include providing "the greatest possible degree of access to and reuse of collections in the archive, while respecting the rights of content owners and contributors," according to the OCA Web site.

The Sloan Foundation awarded the grant to support the digitization of historical collections from five major libraries by the Internet Archive, a nonprofit organization building an online library of texts, audio, video, software and Web pages.

The US$1 million grant will be used in part to scan the complete personal library of founding father and U.S. President John Adams, housed at the Boston Public Library. Meanwhile, the Getty Research Institute in Los Angeles is making available art, architecture and performing arts books.

The archive of publications issued by New York City's Metropolitan Museum of Art will also be digitized, as well as California Gold Rush primary texts from the University of California at Berkeley's Bancroft Library. Finally, the Internet Archive will also scan the James Birney Collection of Anti-Slavery materials from Johns Hopkins University libraries in Baltimore.

Scanning books to make them available online has become a controversial practice primarily due to Google Inc.'s approach. The search engine giant is digitizing library collections that include copyright books without always asking for permission from the copyright owners. It indexes the full text of these works and makes them searchable through its Book Search service.

Google faces lawsuits alleging that this is a violation of copyright law. Google claims it is protected by the fair use principle, because it only displays snippets of text from copyright works.

The Internet Archive has refrained from digitizing copyright books, although it is interested in seeing copyright issues worked out, because its ultimate goal is to provide access to as many works as possible for the benefit of people worldwide, said Brewster Kahle, Internet Archive founder.

Sign up for ITworld's Daily newsletter
Follow ITworld on Twitter @IT_world

I like it!
Post a comment
The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.
peer-to-peer

Esther Schindler
If the comments are ugly, the code is ugly

claird
SVG a graphics format for 21st century

pasmith
Take Chrome OS for a test spin

Sandra Henry-Stocker
Solaris Tip: Have Your Files Changed Since Installation?

sjvn
64-bits of protection?

jfruh
Android fragments vs. the iPhone monolith

mikelgan
What Gizmodo missed about the Pro WX Wireless USB disk drive

 

Sidekick: The Good News & the Bad News
Either way you look at it Microsoft Data Center management did not follow standards or best practices in this failure. In which case it makes me wonder more about the outsourcing of corporate data much less personal data.
- mburton325

Join the conversation here

The Daily Tip

The Daily TipQuick, practical advice for IT pros. Made fresh daily.

Hot tips:

Want to cash in on your IT savvy? Send your tip to tips@itworld.com. If we post it, we'll send you a $25 Amazon e-gift card.

Newsletters

Subscribe to ITWORLD TODAY and receive the latest IT news and analysis.

I would like to receive offers via email from ITworld partners.
By clicking submit you agree to the terms and conditions outlined in ITworld's privacy policy.
Featured Sponsor

AISO founders envisioned a Web hosting company that was environmentally friendly. While the company employed energy-efficient innovations like solar panels, its infrastructure produced unacceptable power and cooling requirements. Find out how AISO leveraged AMD technology to overcome their challenge in this case study white paper.

In this whitepaper, Scalar explores the opportunity to change the landscape with respect to mission critical databases built around Oracle. Leveraging technologies such as Linux, high-end commodity processing power and Oracle RAC technology to architect, design, build and maintain database infrastructure that delivers maximum availability, reliability and performance at a fraction of traditional cost.

On a typical day, weather.com, the Web site for The Weather Channel in Atlanta, serves up between 15 million and 20 million page views. But in September 2004, when back-to-back hurricanes ransacked Florida, the peak traffic on one day more than tripled: over 70 million page views by more than 7 million unique visitors. Read the full success story now.

Marketplace