Code Monkey home page Code Monkey logo

sentiment-analysis-on-stanford-s-movie-review-corpus's Introduction

Sentiment Analysis on Stanford's Movie Review Corpus

Overview

This project focuses on performing sentiment analysis on Stanford's movie review dataset. Various machine learning models, including Linear SVM, Naive Bayes, Logistic Regression, Random Forest, and Gradient Boosting, are trained and evaluated to classify movie reviews as positive or negative.

Getting Started

1. Dataset

The dataset can be downloaded from Stanford's website at the following link: Stanford Movie Review Dataset

After downloading, unzip the files and place them in a directory named data within the project's root directory.

2. Installation

To set up the required environment:

pip install -r requirements.txt

Results

Based on the conducted experiments, the following accuracies were observed on the test set:

Model Test Accuracy (%)
Linear SVM 88.82
Logistic Regression 87.37
Naive Bayes 85.66
Random Forest 84.49
Gradient Boosting 80.75

From the results, the Linear SVM and Logistic Regression models stood out with their high accuracies, showcasing their effectiveness for this sentiment analysis task.

Model Results

For a more detailed comparison and analysis, refer to the output plots generated by the script or the provided report.

Authors

Parsa Mazaheri, October 2023

sentiment-analysis-on-stanford-s-movie-review-corpus's People

Contributors

parsa-mz avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.