dannguyen / nicar-google-refine Goto Github PK
View Code? Open in Web Editor NEWThe lesson and source files for Dan Nguyen's NICAR 2012 lesson on Google Refine
The lesson and source files for Dan Nguyen's NICAR 2012 lesson on Google Refine
# Google Refine for Investigative Journalism *An introduction to one of the best data tools for any reporter of **any** technical level* This is a hands-on walkthrough for [NICAR 2012](http://www.ire.org/conferences/nicar-2012/). It will take place on Friday, from 2-2:50PM in the **Jeffersonian/Knickerbocker** room. It will be led by Dan Nguyen ([@dancow](http://twitter.com/dancow)) with help from Joe Kokenge ([@josephkokenge](http://twitter.com/josephkokenge)) of ProPublica. ## A tool for cleaning and investigations You may have heard how [Google Refine](http://code.google.com/p/google-refine/) – a "power tool for working with messy data" – is a great data cleaning tool. But if you haven't tried it out yet, then you're missing out on the potential stories and insights that Refine can easily (and sometimes exclusively) find in data. It doesn't matter what skill level you have. Refine is one of those unique tools that is as equally useful to those who have never left their click-and-drag interfaces into command-line world as it is to the most anal-retentive detail-oriented data analysts and power-programmers. In this lesson, I'll start at the very basics: opening a file with Refine, doing basic sorting, searching (things that you can do in Excel, of course) to its easy-to-use data cleaning methods and then to how Refine can help you probe unfamiliar datasets to scout out a story. The two datasets I will be working with are: * [FEC Individual Contribution Data](http://www.fec.gov/finance/disclosure/ftpdet.shtml#a2011_2012) – the list of citizens who have individually contributed $200+ to the political process * The [White House Visitor Logs](http://www.whitehouse.gov/briefing-room/disclosures/visitor-records) – everyone who has visited the White House during the Obama Administration (in the time period it chose to disclose), who they visited, and (sometimes) why. (hopefully we'll have time to do both) ## Contacts * Dan Nguyen ([@dancow](http://twitter.com/dancow)) * Joseph Kokenge ([@josephkokenge](http://twitter.com/josephkokenge)) ## The FEC Data ** This tip sheet is a work-in-progress...I will be filling it out through today and it should be done by the time of my hands-on session** ## The White House Visitor Logs ** This tip sheet is a work-in-progress...I will be filling it out through today and it should be done by the time of my hands-on session**
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.