This is a small project on Movie Recommendation System using K Means Clustering. This project was carried out by scraping the top movies of various genres in the IMDB official site. Using the genres as properties these movies were clustered.
-
scraper.py to scrape imdb website
-
main.py to run the code in order (No need to run pre_processing.py and clustering_code.py they are imported by main and run automatically)
-
pre_processing.py to preprocess the scraped data and store in desired format
-
clustering_code.py to prepare data and fit to kmeans clustering model from sklearn
-
plot.py to plot the cluster graph