<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>danwin.com &#187; data-cleaning</title>
	<atom:link href="https://danwin.com/tag/data-cleaning/feed/" rel="self" type="application/rss+xml" />
	<link>https://danwin.com</link>
	<description>Words, photos, and code by Dan Nguyen. The &#039;g&#039; is mostly silent.</description>
	<lastBuildDate>Thu, 21 Nov 2019 12:29:57 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.2.39</generator>
	<item>
		<title>Google Refine, a.k.a. Gridworks 2.0 released; ProPublica&#8217;s &#8220;Dollars for Docs&#8221; featured.</title>
		<link>https://danwin.com/2010/11/googlerefine-a-k-a-gridworks-2-0-released-propublicas-dollars-for-docs-featured/</link>
		<comments>https://danwin.com/2010/11/googlerefine-a-k-a-gridworks-2-0-released-propublicas-dollars-for-docs-featured/#comments</comments>
		<pubDate>Wed, 10 Nov 2010 23:39:40 +0000</pubDate>
		<dc:creator><![CDATA[Dan Nguyen]]></dc:creator>
				<category><![CDATA[works]]></category>
		<category><![CDATA[data-cleaning]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Refine]]></category>
		<category><![CDATA[Gridworks]]></category>
		<category><![CDATA[Refine]]></category>

		<guid isPermaLink="false">https://danwin.com/?p=1409</guid>
		<description><![CDATA[<p>Good news for data-nerds everywhere. The 2.0 version of Google&#8217;s fantastic data-cleaning tool, Google Refine (formerly Gridworks), has been released. And they were nice enough to feature ProPublica&#8217;s Dollars for Docs as an example of a use-case. I talked briefly to BusinessJournalism.org about how I used Refine to put together the pharma top earners list. [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://danwin.com/2010/11/googlerefine-a-k-a-gridworks-2-0-released-propublicas-dollars-for-docs-featured/">Google Refine, a.k.a. Gridworks 2.0 released; ProPublica&#8217;s &#8220;Dollars for Docs&#8221; featured.</a> appeared first on <a rel="nofollow" href="https://danwin.com">danwin.com</a>.</p>
]]></description>
				<content:encoded><![CDATA[<p>Good news for data-nerds everywhere. The <a href="http://google-opensource.blogspot.com/2010/11/announcing-google-refine-20-power-tool.html">2.0 version of Google&#8217;s fantastic data-cleaning tool, Google Refine (formerly Gridworks), has been release</a>d. And they were nice enough to feature <a href="http://projects.propublica.org/docdollars/">ProPublica&#8217;s Dollars for Docs</a> as an example of a use-case. I talked briefly to <a href="http://businessjournalism.org/2010/10/21/propublica-uses-google-refine-to-sort-messy-data-for-dollars-for-docs/">BusinessJournalism.or</a>g about how I used Refine to put together the pharma top earners list. </p>
<p>It&#8217;s possible I could&#8217;ve done it using SQL queries and Ruby libraries. But I definitely would&#8217;ve missed a lot of matches, and probably overdosed on over-the-counter pharma-painkillers.</p>
<p><object width="640" height="390"><param name="movie" value="http://www.youtube.com/v/yNccGtn3Wb0&#038;hl=en_US&#038;feature=player_embedded&#038;version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowScriptAccess" value="always"></param><embed src="http://www.youtube.com/v/yNccGtn3Wb0&#038;hl=en_US&#038;feature=player_embedded&#038;version=3" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="640" height="390"></embed></object></p>
<p>The post <a rel="nofollow" href="https://danwin.com/2010/11/googlerefine-a-k-a-gridworks-2-0-released-propublicas-dollars-for-docs-featured/">Google Refine, a.k.a. Gridworks 2.0 released; ProPublica&#8217;s &#8220;Dollars for Docs&#8221; featured.</a> appeared first on <a rel="nofollow" href="https://danwin.com">danwin.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>https://danwin.com/2010/11/googlerefine-a-k-a-gridworks-2-0-released-propublicas-dollars-for-docs-featured/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
