Code Monkey home page Code Monkey logo

e-commerce-web-scraper's Introduction

e-commerce-web-scraper

File : amazon_search_image_scraper.py
Description : Get images of available products on amazon for a particular search query
Steps

  1. Go to Amazon.in
  2. Enter the search query for product
  3. Copy the URL
  4. Execute amazon_search_image_scraper.py file
$ python amazon_search_image_scraper.py
  Search Query URL: 

Result : Get the images of products saved in images/ folder

Example :

Usecase : It can be useful for creating dataset for training Machine learning or Deep learning models on Fashion data.



File : asin_scraper.py
Description : Get ASIN of available products on amazon for a particular search query
Steps

  1. Go to Amazon.in
  2. Enter the search query for product
  3. Copy the URL
  4. Execute asin_scraper.py file
$ python asin_scraper.py 
  URL for scraping

Result : Get the ASIN numbers of the products in ASINs.csv

Example :

Usecase : It is useful later for scraping all details of the product using scrape_product_details_using_asins.py

What is ASIN - ASIN stands for Amazon Standard Identification Number. It's a 10-charcter alphanumeric unique identifier that's assigned by Amazon.com and its partners. It's used for product-identification within Amazon.com organization. ASINs are only guaranteed unique within a marketplace.



File : scrape_product_details_using_asins.py
Description : Get the product details like _product_name, brand_name, product_image_urls, price, rating and no_of_reviews
Steps

  1. Go to Amazon.in
  2. Enter the search query for product
  3. Copy the URL
  4. Execute scrape_product_details_using_asins.py file
$ python asin_scraper.py 

Result : Get the products details for all the ASIN numbers from input file ASINs.csv and store details in ASIN_product_details.csv

Example :

Usecase : It gets data which can be useful for various applications like price analysis, Market Analysis, Price Comparison gaining rich insights to help them develop strategies to compete against their competitors and also save time and cost in the process,

e-commerce-web-scraper's People

Contributors

swatimodi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.