Blog Post

Google Book Search Adds Copyright Data

Google Book Search now indicates which books are still under copyright and which are not. As Cameron Parkins says on the Creative Commons website, we can now "use free information to free information." Here's the reblog from CC:


Google Book Search Adds Copyright Renewal Data

Cameron Parkins, June 27th, 2008

Google Book Search recently did a great service for those interested in the public domain by digitizing a huge amount of copyright renewal data for books dating as far back as 1923. From Inside Google Book Search:

How do you find out whether a book was renewed? You have to check the U.S. Copyright Office records. Records from 1978 onward are online but not downloadable in bulk. The Copyright Office hasn't digitized their earlier records, but Carnegie Mellon scanned them as part of their Universal Library Project, and the tireless folks at Project Gutenberg and the Distributed Proofreaders painstakingly corrected the OCR.

Thanks to the efforts of Google software engineer Jarkko Hietaniemi, we've gathered the records from both sources, massaged them a bit for easier parsing, and combined them into a single XML file available for download here.

This allows for a much clearer (although still somewhat problematic) understanding of which books have maintained their copyright status and which have gone into the public domain. Jakob Kramer-Duffield speaks well to the implications of Google's efforts in pointing out "there's a danger [...] that our great knowledge resources from the past are ignored or left to molder, and the difficulty of determining copyright status has been something of a hurdle to digitization efforts thus far." Peter Suber more succinctly states, "I love the way we can now use free information to free information."

