A Plagiarism Checker web application built with Python-Flask. TF-IDF combined with cosine similarity is a very common technique that lets a system quickly retrieve documents similar to a search query. This application is based on the same concept, but instead of retrieving documents similar to a query, it checks how similar the query is to the existing database file.
- User enters a query
- Query gets processed (uppercase converted to lowercase, punctuation marks removed, etc.)
- Calculations are done (Term Frequency, Cosine Similarity)
- The Plagiarism Percentage is returned on the web page
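
The steps above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the application's actual code: the function names (`preprocess`, `cosine_similarity`, `plagiarism_percentage`) are hypothetical, the database is assumed to be a single string of text, and raw term frequencies are used without any IDF weighting.

```python
import math
import string
from collections import Counter


def preprocess(text):
    """Lowercase the text, strip ASCII punctuation, and split on whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return text.lower().translate(table).split()


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between the raw term-frequency vectors of two token lists."""
    tf_a, tf_b = Counter(tokens_a), Counter(tokens_b)
    # Only terms present in both texts contribute to the dot product.
    dot = sum(tf_a[term] * tf_b[term] for term in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(count * count for count in tf_a.values()))
    norm_b = math.sqrt(sum(count * count for count in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # An empty text has no direction to compare against.
    return dot / (norm_a * norm_b)


def plagiarism_percentage(query, database_text):
    """Similarity of the query to the database text, as a percentage (assumed scale: 0 to 100)."""
    score = cosine_similarity(preprocess(query), preprocess(database_text))
    return round(100 * score, 2)
```

In a Flask app, a view function could call something like `plagiarism_percentage(query, database_text)` with the submitted query and the contents of the database file, then pass the result to the page template.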