This project now serve as a toy project for testing purpose.
Extract keywords form given text, using the method of statistics, previously trained with corpus from University of Oxford Text Archive.
-TF-IDF & cosine resemblance
-Keyword Extraction
-Article Resemblance Check
Stop Words list: http://www.ranks.nl/stopwords
British National Corpus: http://www.natcorp.ox.ac.uk/
American National Corpus: http://www.anc.org/
Leipzig Corpora Collection: http://corpora.informatik.uni-leipzig.de/download.html