TextRank

You need Spark and Python installed, along with the nltk and pytube libraries.
- Test with the dataset (50 samples, new.zip), evaluated by precision and recall
- multipre.py -> multipage.scala -> multipost.py
- Test with a YouTube video
- preproc.py -> pagerank.scala -> postproc.py
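
For orientation, the three stages of each pipeline (preprocess, rank, postprocess) and the precision/recall evaluation can be sketched in plain Python. This is only an illustration under assumptions, not the code in this repository: the function names `build_graph`, `pagerank`, and `precision_recall` are hypothetical, the window size and damping factor are typical defaults rather than values taken from the scripts, and the nltk-based filtering and the Spark job are replaced by a whitespace split and a simple in-memory loop.

```python
from collections import defaultdict


def build_graph(tokens, window=2):
    """Link distinct words that co-occur within `window` tokens (undirected)."""
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Score nodes with a fixed number of unweighted PageRank updates."""
    scores = {node: 1.0 for node in graph}
    for _ in range(iterations):
        # Each node collects an equal share of every neighbour's current score.
        scores = {
            node: (1 - damping)
            + damping * sum(scores[n] / len(graph[n]) for n in graph[node])
            for node in graph
        }
    return scores


def precision_recall(predicted, gold):
    """Compare extracted keywords against the reference keywords as sets."""
    predicted, gold = set(predicted), set(gold)
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy input; a real run would first filter tokens (e.g. by part of speech).
    tokens = "linear constraints linear systems linear equations".split()
    scores = pagerank(build_graph(tokens))
    top = sorted(scores, key=scores.get, reverse=True)[:2]
    print(top, precision_recall(top, ["linear", "equations", "constraints"]))
```

Keeping the ranking step separate from graph construction and scoring mirrors the three-script layout above, so the in-memory loop could be swapped for a distributed implementation without touching the other two stages.
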
@InProceedings{Hulth:2003,
author = {Anette Hulth},
title = {Improved automatic keyword extraction given more linguistic knowledge},
booktitle = {Proceedings of the 2003 conference on Empirical methods in natural language processing},
year = {2003},
pages = {216--223}
}