Code Monkey home page Code Monkey logo

imdb_forever's Introduction

IMDB Titles Scraper By Company

This actor allows you to scrape titles from IMDB that created or disturbed by a specific company. All you need to do in order to start the scraper is to get the company id which is the main input "for now" in the scraper.

Input configuration

The actor has the following input options:

  • Company ID - Paste the company id you get from IMDB into this field.
  • Testing - Optionally, select yes to scrape only the first 50 movies.
  • Proxy - Optionally, select a proxy to be used by the actor, in order to avoid IP address-based blocking by the target website. The actor automatically executes all the Scrapy's HTTP(S) requests through the proxy.

How to get the Company ID?

It might sound complicated, but it is too easy...

  1. First, from the main IMDB search bar, select Companies...

  2. Then, type the name of the company...

  3. Click the search icon, the pick the company you would like to scrape their title from the results...

  4. Finally, get the company id from the URL. The id in this example is: co0047120.

Results:

Each record represent a title. The following fields is the current data you will get from the scraper...

{
    "id": "tt4574334",
    "title": "Stranger Things",
    "year": "2016โ€“",
    "certificate": "15",
    "runtime": "51 min",
    "genre": "Drama, Fantasy, Horror",
    "rating": "8.7",
    "plot": "When a young boy disappears, his mother, a police chief and his friends must confront terrifying supernatural forces in order to get him back.",
    "stars": "Millie Bobby Brown| Finn Wolfhard| Winona Ryder| David Harbour",
    "votes": "1,031,928",
    "url": "https://www.imdb.com/title/tt4574334/",
    "poster_url": "https://m.media-amazon.com/images/M/MV5BODZlYjQ4NzYtZTg1MC00NGY4LTg4NjQtNGE3ZjRkMjk3YjMyXkEyXkFqcGdeQXVyMTkxNjUyNQ@@._V1_UY98_CR5,0,67,98_AL_.jpg",
    "big_poster_url": "https://m.media-amazon.com/images/M/MV5BODZlYjQ4NzYtZTg1MC00NGY4LTg4NjQtNGE3ZjRkMjk3YjMyXkEyXkFqcGdeQXVyMTkxNjUyNQ@@._V1_SY1000_CR0,0,674,1000_AL_.jpg"
  }

And That's it for now!

If you have any problem or anything does not work, please file an issue on Apify.

imdb_forever's People

Contributors

lxth0rz avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.