<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>AllThingsD &#187; book scanning</title>
	<atom:link href="http://allthingsd.com/tag/book-scanning/feed/" rel="self" type="application/rss+xml" />
	<link>http://allthingsd.com</link>
	<description></description>
	<lastBuildDate>Sat, 11 Feb 2012 20:29:40 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel="hub" href="http://pubsubhubbub.appspot.com"/>
	<image>
		  <url>http://allthingsd.com/theme/images/logo-rss.jpg</url>
		  <title>All Things Digital</title>
		  <link>http://allthingsd.com/</link>
		  <width>144</width>
		  <height>22</height>
	</image>
		<item>
		<title>Insert Bad &quot;Google Captchas reCAPTCHA&quot; Pun Here</title>
		<link>http://allthingsd.com/20090916/google-captures-recaptcha/</link>
		<comments>http://allthingsd.com/20090916/google-captures-recaptcha/#comments</comments>
		<pubDate>Wed, 16 Sep 2009 18:38:23 +0000</pubDate>
		<dc:creator>John Paczkowski</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[accuracy]]></category>
		<category><![CDATA[archival]]></category>
		<category><![CDATA[book scanning]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[CAPTCHA]]></category>
		<category><![CDATA[challenge response]]></category>
		<category><![CDATA[company blog]]></category>
		<category><![CDATA[computers]]></category>
		<category><![CDATA[deal]]></category>
		<category><![CDATA[digital]]></category>
		<category><![CDATA[digitize]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[humans]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[John Paczkowski]]></category>
		<category><![CDATA[newspapers]]></category>
		<category><![CDATA[old books]]></category>
		<category><![CDATA[programs]]></category>
		<category><![CDATA[reCAPTCHA]]></category>
		<category><![CDATA[robots]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[Turing test]]></category>
		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://digitaldaily.allthingsd.com/?p=24881</guid>
		<description><![CDATA[Evidently, Google’s efforts to create a new CAPTCHA system that requires people to rotate images until they're upright aren’t moving as quickly as the company would like. Because this morning, the search giant said it had acquired reCAPTCHA, developer of the Web’s preeminent CAPTCHA technology.]]></description>
			<content:encoded><![CDATA[<p><img src="http://digitaldaily.allthingsd.com/files/2009/09/recaptcha.jpg" alt="recaptcha" title="recaptcha" width="350" height="200" class="aligncenter size-full wp-image-24882" />Evidently, <a href="http://www.nytimes.com/2009/05/24/business/24novelties.html">Google’s efforts to create a new CAPTCHA system</a> that requires people to rotate images until they&#8217;re upright aren’t moving as quickly as the company would like. Because this morning, the search giant said it had acquired reCAPTCHA, developer of the Web’s preeminent CAPTCHA technology. Terms of the deal were not disclosed.</p>
<p>CAPTCHA, for those of you just joining us, stands for Completely Automated Public Turing test to tell Computers and Humans Apart. Essentially, <a href="http://recaptcha.net/learnmore.html">it’s a challenge-response test used to distinguish between humans and spam-spewing robots</a>. What’s interesting about reCAPTCHA’s implementation is that it&#8217;s used for digitizing books.</p>
<p>&#8220;Since computers have trouble reading squiggly words like these, CAPTCHAs are designed to allow humans in but prevent malicious programs from scalping tickets or obtain millions of email accounts for spamming,&#8221; <a href="http://googleblog.blogspot.com/2009/09/teaching-computers-to-read-google.html">Google explains in a post to the company blog</a>. &#8220;But there’s a twist&#8211;the words in many of the CAPTCHAs provided by reCAPTCHA come from scanned archival newspapers and old books. Computers find it hard to recognize these words because the ink and paper have degraded over time, but by typing them in as a CAPTCHA, crowds teach computers to read the scanned text.&#8221;</p>
<p><a href="http://recaptcha.net/reCAPTCHA_Science.pdf">An ingenious idea, crowdsourcing book transcriptions in this way</a>. An effective one too: reCAPTCHA boasts <a href="http://recaptcha.net/digitizing.html">99.5 percent accuracy</a> at the word level.</p>
<p>Little wonder, then, that Google (GOOG) has acquired it. The company can clearly put reCAPTCHA&#8217;s technology to good use, not just as a security measure, but as a means of improving its own massive book-scanning project.</p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20090916/google-captures-recaptcha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Internet Archive Founder Questions Google Books Settlement</title>
		<link>http://allthingsd.com/20090519/internet-archive-founder-questions-google-books-settlement/</link>
		<comments>http://allthingsd.com/20090519/internet-archive-founder-questions-google-books-settlement/#comments</comments>
		<pubDate>Tue, 19 May 2009 19:10:23 +0000</pubDate>
		<dc:creator>Marisa Taylor</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Voices]]></category>
		<category><![CDATA[American Association of Publishers]]></category>
		<category><![CDATA[Authors Guild]]></category>
		<category><![CDATA[book scanning]]></category>
		<category><![CDATA[Brewster Kahle]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[copyright law]]></category>
		<category><![CDATA[copyrighted works]]></category>
		<category><![CDATA[digital]]></category>
		<category><![CDATA[Digits]]></category>
		<category><![CDATA[frontpage]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Book Search Library Project]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Internet Archive]]></category>
		<category><![CDATA[Marisa Taylor]]></category>
		<category><![CDATA[monopoly]]></category>
		<category><![CDATA[op-ed]]></category>
		<category><![CDATA[plaintiffs]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[The Wall Street Journal]]></category>
		<category><![CDATA[Washington Post]]></category>

		<guid isPermaLink="false">http://voices.allthingsd.com/?p=11928</guid>
		<description><![CDATA[Will the settlement agreement between Google’s Book Search Library Project and authors and publishers put Google in monopoly territory?

That’s the argument that Brewster Kahle, co-founder of the Internet Archive, made in an op-ed in the Washington Post, in which he writes that the settlement “provides a new and unsettling form of media consolidation.”]]></description>
			<content:encoded><![CDATA[<p><img src="http://voices.allthingsd.com/files/2009/05/brewsterkahle-250x187.jpg" alt="brewsterkahle" title="brewsterkahle" width="250" height="187" class="alignright size-medium wp-image-11929" />Will the settlement agreement between Google’s Book Search Library Project and authors and publishers put Google (GOOG) in monopoly territory?</p>
<p>That’s the argument that Brewster Kahle, co-founder of the Internet Archive, made in an op-ed in the Washington Post, in which he writes that the settlement “provides a new and unsettling form of media consolidation.”</p>
<p>Google’s book-scanning project drew outcry and a class-action lawsuit from the Authors Guild and the Association of American Publishers, who said the Internet company was violating copyright laws by scanning copyrighted works. A settlement agreement was reached in October 2008 that would allow publishers and authors to share Google’s profits from the sale of digital versions of copyrighted works. The deadline for plaintiffs to object to or opt out of the settlement was recently extended to Sept. 4, 2009.</p>
<p><a href="http://blogs.wsj.com/digits/2009/05/19/internet-archive-founder-questions-google-books-settlement/">Read the rest of this post on the original site</a></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20090519/internet-archive-founder-questions-google-books-settlement/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nevermind the Baallocks, Here&#039;s the Econolypse</title>
		<link>http://allthingsd.com/20081103/nevermind-the-baallocks-heres-the-econolypse/</link>
		<comments>http://allthingsd.com/20081103/nevermind-the-baallocks-heres-the-econolypse/#comments</comments>
		<pubDate>Mon, 03 Nov 2008 19:00:58 +0000</pubDate>
		<dc:creator>John Paczkowski</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Authors Guild]]></category>
		<category><![CDATA[book scanning]]></category>
		<category><![CDATA[Circuit City]]></category>
		<category><![CDATA[comScore]]></category>
		<category><![CDATA[consumer]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[Digital Daily Live]]></category>
		<category><![CDATA[economic crisis]]></category>
		<category><![CDATA[electronics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Harvard University]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[John Paczkowski]]></category>
		<category><![CDATA[lawsuit]]></category>
		<category><![CDATA[loss]]></category>
		<category><![CDATA[Nielsen Online]]></category>
		<category><![CDATA[retailer]]></category>
		<category><![CDATA[sales]]></category>
		<category><![CDATA[search market]]></category>
		<category><![CDATA[settlement]]></category>
		<category><![CDATA[share price]]></category>
		<category><![CDATA[store closing]]></category>
		<category><![CDATA[video stream]]></category>
		<category><![CDATA[work force]]></category>
		<category><![CDATA[Yahoo]]></category>
		<category><![CDATA[Yahoo Finance]]></category>

		<guid isPermaLink="false">http://digitaldaily.allthingsd.com/?p=7704</guid>
		<description><![CDATA[[ See post to watch video ]]]></description>
			<content:encoded><![CDATA[<p><div class="video-wsj"><embed src="http://s.wsj.net/media/swf/microPlayer.swf" bgcolor="#FFFFFF" flashVars="videoGUID={1898343183}&playerid=4001&plyMediaEnabled=1&configURL=http://m.wsj.net/video-players/&autoStart=false" base="http://s.wsj.net/media/swf/" name="microflashPlayer" width="320" height="240" seamlesstabbing="false" type="application/x-shockwave-flash" swLiveConnect="true" pluginspage="http://www.macromedia.com/shockwave/download/index.cgi?P1_Prod_Version=ShockwaveFlash"></embed><br />[ See post to watch video ]</div></p>
]]></content:encoded>
			<wfw:commentRss>http://allthingsd.com/20081103/nevermind-the-baallocks-heres-the-econolypse/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

