kamron-h / reu_capterra_n-gram_analysis
A collection of Python scripts for N-gram analysis of software reviews, written for an NSF REU AI research internship. The scripts tokenize software review data, remove stopwords, lemmatize the tokens, and generate N-grams using NLTK, pandas, and scikit-learn. They also process review date data. Before running a script, check its directory for the required data files.
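The tokenize, filter, and N-gram steps described above can be illustrated with a minimal, dependency-free sketch. This is not the repository's code: the scripts use NLTK, pandas, and scikit-learn, whereas this sketch swaps in the standard library so it runs without downloading any corpora. The function names, the tiny stopword list, and the sample reviews are all hypothetical, and lemmatization is omitted because it needs a lexical resource such as WordNet.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; NLTK ships a much fuller one).
STOPWORDS = {"the", "is", "a", "an", "and", "to", "of", "it", "this", "but"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [t for t in tokens if t not in STOPWORDS]


def make_ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return list(zip(*(tokens[i:] for i in range(n))))


def count_ngrams(reviews, n):
    """Count n-grams across all reviews, never spanning two reviews."""
    counts = Counter()
    for review in reviews:
        tokens = remove_stopwords(tokenize(review))
        counts.update(make_ngrams(tokens, n))
    return counts


# Hypothetical sample reviews, not taken from the repository's data files.
reviews = [
    "The customer support is great and the pricing is fair.",
    "Great customer support, but the mobile app is slow.",
]

bigram_counts = count_ngrams(reviews, 2)
print(bigram_counts[("customer", "support")])  # → 2
```

Counting per review, rather than over one concatenated string, keeps N-grams from joining the last word of one review to the first word of the next.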