This project crawls data from IMDb and Rotten Tomatoes, extracts structured data from them and stores it in two tables. The schema of the tables is manually defined as per the attributes available on both the websites. Provided below is the count of movies extracted from each site:
- IMDb: 3476
- Rotten Tomatoes: 3061
Attributes | Description |
---|---|
Name | Name of the movie |
Release Year | The year when the movie was released (2005) |
Rating | The rating of the movie as submitted by users |
Runtime | The duration of the movie |
Release Date | Date of release of the movie |
Director's Name | Name(s) of director(s) who directed the movie |
Certificate | Rating of the movie (R: Restricted, G: General Audiences, PG: Parental Guidance Suggested, etc.) |
Genre | The category of the movie (Drama, Comedy, Action, Horror, Romance, etc.) |