google refine

Investigate Companies By Scraping Data Off The Web

Investigate Companies By Scraping Data Off The Web

In order to put together its awesome “Dollars for Docs” database that let readers search to see if their doctor had received pharma company payments ProPublica had to convert data from all sorts of Websites, PDFs, Excel docs and even Flash sites into one system. Not an easy task, but that kind of data wrasslin’ is key for modern investigative journalism, and ProPublica have put together tutorials to show you how you can do it too. [More]