This project analyzes and explores the dataset at https://github.com/lukefeilberg/onion . The dataset consists of post titles from the satirical news website The Onion and real news headlines from the subreddit r/NotTheOnion. The Onion titles are labeled 1 and the r/NotTheOnion headlines are labeled 0.
The aim of this project is to explore NLP methods and to build a classifier that identifies whether a news headline is real (r/NotTheOnion, label 0) or satirical (The Onion, label 1).
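
As a rough illustration of the classification task, the sketch below trains a tiny bag-of-words Naive Bayes model using only the Python standard library. It is not the project's actual method: the choice of Naive Bayes is an assumption made for illustration, the headlines in `TRAIN` are invented toy examples rather than rows from the dataset, and the names `tokenize`, `train_naive_bayes`, and `predict` are hypothetical helpers. Only the label convention (1 for The Onion, 0 for r/NotTheOnion) comes from the description above.

```python
import math
import re
from collections import Counter

# Toy headlines invented for illustration (NOT taken from the dataset).
# Label convention from the project description: 1 = The Onion, 0 = r/NotTheOnion.
TRAIN = [
    ("Area man wins argument with himself", 1),
    ("Nation demands more naps", 1),
    ("Area dog declares himself mayor", 1),
    ("Police arrest man for stealing bridge", 0),
    ("City council bans loud chewing", 0),
    ("Police say man called 911 over cold fries", 0),
]


def tokenize(title):
    """Lowercase a headline and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", title.lower())


def train_naive_bayes(examples):
    """Count words per class and documents per class."""
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter()
    for title, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(title))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, doc_counts, vocab


def predict(title, model):
    """Return the label with the higher log-probability under the model."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in (0, 1):
        # Log prior: fraction of training headlines with this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(title):
            if token not in vocab:
                continue  # ignore words never seen in training
            # Laplace (add-one) smoothing avoids zero probabilities.
            count = word_counts[label][token] + 1
            score += math.log(count / (total_words + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train_naive_bayes(TRAIN)
print(predict("Area man declares more naps", model))
print(predict("Police arrest city council", model))
```

With real data, the toy list would be replaced by the (title, label) pairs loaded from the repository, and part of the data would be held out to measure accuracy.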