Home > CMU Researcher Uses eCommerce Tool To Digitize Books

News

CMU Researcher Uses eCommerce Tool To Digitize Books

6/4/2007

A researcher at Carnegie Mellon University has found a way to turn the process by which people register at commercial websites into a method for digitizing books, the Associated Press reported.

The method involves putting the time and effort people spend deciphering the short word puzzles used to confirm a registration to better use by having users key-in print materials that need digitizing.

The word puzzles are known as CAPTCHAs, short for "completely automated public Turing tests to tell computers and humans apart."
Computers can't decipher the letters and numbers, ensuring that real people are using the websites.

CMU researchers estimated about 60 million CAPTCHA puzzles are solved every day, taking about 10 seconds each. Researchers have now come up with a way for people to type in snippets of books when registering at a site to help speed up the process of putting texts online.

"Humanity is wasting 150,000 hours every day on these," said Luis von Ahn, an assistant professor of computer science at Carnegie Mellon, who helped develop the original system.

Von Ahn is working with the Internet Archive, which runs several book-scanning projects, to use CAPTCHAs for this instead. The Archive scans 12,000 books a month and sends von Ahn image files that the computer cannot recognize. The files are split up into single words that can be used as CAPTCHAs at sites all over the Internet.

Read More:


Paul McCloskey is a contributing editor for the Campus Technology group of publications.

Cite this Site

Paul McCloskey, "CMU Researcher Uses eCommerce Tool To Digitize Books," Campus Technology, 6/4/2007, http://www.campustechnology.com/article.aspx?aid=48372

copy text (above) for proper citation



Recommended Reading
  • Sun, Stanford Working To Archive History

    In May in San Francisco, experts from leading universities, libraries, and research institutions around the world met as part of an ongoing effort to address a pressing issue: archiving the world's history, right up to today.

  • The Quilt Coalition Rolls Out XO Communications for High-Capacity Network Services

    The Quilt, a coalition of 28 regional network organizations, has added XO Communications Services to its authorized vendor list. The Quilt represents 200 universities and thousands of other educational institutions across the United States. With this new relationship, Quilt members can purchase XO's high-speed IP transit and network transport services at competitive rates.

  • Wimba Classroom 5.2 Expands Classroom Capture Support, Adds MP3 Downloads

    At the NECC 2008 conference in Texas this week, Wimba launched a new version of Wimba Classroom, the virtual classroom component of the company's Collaboration Suite. The new 5.2 release expands options for classroom capture and adds a variety of other functional and ease of use features.

  • Automation Chimera: Education Is Not Management

    The lure of automating workflow online so human intervention is minimized is continually reinforced in the minds of higher education administrators by examples of automated campus systems such as financials, student information systems, and other enterprise systems. But what's good for management is not always good for learning.

  • Cognos Releases BI Software for Linux-based IBM System z Mainframe

    Cognos, which IBM acquired in January, has released an update to its business intelligence software that will run on the Linux operating system on IBM System z mainframes. IBM Cognos 8 BI was being developed by the two companies prior to the acquisition, but assimilation of Cognos into IBM accelerated development.

  • Facebook and Collegiality: A Serendipitous Social Niche

    Facebook is a way to greet a colleague as if she or he is on your own campus: a wave at a distance, a hello at the corner burrito place, a honk as you both leave the campus parking lot. Informal collegiality has been extended over the miles.