Parallel big data processing in MapReduce (CS5600 Northeastern University)
This course project focuses on measuring airline performance using various algorithms in plain java MapReduce, HBase, Hive and PigLatin.
See docs/ for details about the project.