Comments on: Coding for Journalists 101: Go from knowing nothing to scraping Web pages. In an hour. Hopefully.

By: Dataliser » Blog Archive » Tutorial Websites

Dataliser » Blog Archive » Tutorial Websites — Tue, 03 Apr 2012 00:53:41 +0000

[…] Coding for journalists 101: Scraping for Journalism: A Guide for Collecting Data […]

By: Cracking the Code: My Pledge to Learn How to Web Scrape « Newsfangled: Learning to Teach New Media

Fri, 30 Mar 2012 15:42:33 +0000

[…] Coding for Journalists […]

By: Datenjournalismus | Pearltrees

Datenjournalismus | Pearltrees — Fri, 23 Mar 2012 12:13:11 +0000

[…] Coding for Journalists 101: Go from knowing nothing to scraping Web pages. In an hour. Hopefully. | … But now, it’s possible that a public-information officer will just point you to the public website and say, there it is. And it’s not always a case of them being ignorant/disdainful of laws that oblige them to give the dataset, in electronic form, that backs the website. From their viewpoint, the information is there for any idiot with an Internet connection to ask for, so what are you whining about? At this point, you can either go through a weeks-long argument through emails and phone messages that ends with their legal counsel compelling the PI officer to hand over the data. Or, if keeping your story idea secret isn’t a priority, you could explain what your intent is, and why you need a whole dataset to see if a trend exists. […]

By: Screen Scraping (It’s Kinda Like Christmas)! « The One Techeteer

Screen Scraping (It’s Kinda Like Christmas)! « The One Techeteer — Tue, 27 Dec 2011 08:39:12 +0000

[…] found a great tutorial by Dan Nguyen, a developer at ProPublica, to screen scrape with Ruby and the Nokogiri parsing tool. […]

By: Barney

Barney — Fri, 14 Oct 2011 15:58:08 +0000

Wow. You put a lot of effort into this post. Awesome. I would like to add that there are web scraping software options out there that take away the need for any coding (although learning XPath will be helpful to anyone who is serious about web scraping). For instance, I work for Mozenda, and we have people successfully using it who haven't touched a line of code in their lives. Anyway, I don't want to take anything away from such an in-depth tutorial, but if anyone is finding this stuff daunting, you can always check out Mozenda or any of our competitors (we're pretty confident that we offer the easiest-to-use scraper at an affordable price). Again, great job on the tutorial, and good luck to everyone in getting your data!

By: Grepsr

Grepsr — Thu, 13 Oct 2011 11:39:52 +0000

You could always try Grepsr (http://www.grepsr.com/), a newly launched cloud-based data extraction service. We are offering a free trial at the moment.

By: Coding for Journalists 101: Go from knowing nothing to scraping Web pages. In an hour. Hopefully. | Dan Nguyen pronounced fast is danwin | AZ Journalism | Journalism news | Journalism career

Tue, 16 Aug 2011 16:13:38 +0000

[…] Delicious/tag/journalism This entry was posted in Journalism and tagged Coding, danwin, fast, from, Hopefully., hour., journalists, knowing, Nguyen, nothing, pages., pronounced, scraping. Bookmark the permalink. ← Google buys Motorola Mobility for $12.5bn Starbucks guitar man → […]

By: anon

anon — Mon, 30 May 2011 01:53:58 +0000

You could always just use freelancers on sites like Freelancer.com.

They will do it for peanuts!

By: Norm Cimon

Norm Cimon — Mon, 17 Jan 2011 18:39:58 +0000

Good morning. Two things:
– Thank you very much for the tutorial. It’s a nice introduction to Nokogiri and to the use of XPath. That’s much appreciated.

– Given the changes to the wiki page and the dropped content attribute, the correct construct to fetch the president’s name is:

list_of_presidents.xpath("//tr/td[4]/a[1]")[0].content

Again, thank you very much.