Code Monkey home page Code Monkey logo

scrapy.org's Introduction

Scrapy website

This is the website that runs on https://scrapy.org.

Requirements

This website is made with Jekyll. The most universal way to install it is:

bundle install

Usage

To start the web server locally auto-reloading when files change use:

jekyll serve --watch

The website is updated via CI when merging to the main branch.

To generate CSS, you can use .. code:: bash

sass --watch ./css/main.scss:./css/main.css

Adding your company to the website

Are you a company or individual using Scrapy for your product or project? Add yourself to the "Companies using Scrapy" list by sending a pull request:

  • add a "company card" in _data/companies/list/<companyslug>.yml as a YAML file; you need to set a few fields: a name, a "logouser" for your logo filename, a homepage and a blurb in markdown syntax on how you're using Scrapy. Do add links to any article or tweet spreading your love of Scrapy
  • add your logo to img/
  • add the slug of your company card to the list in _data/companies/users.yml.

Are you providing Scrapy consulting?

If you are a company providing Scrapy consulting or spider development, you can submit a pull request to add your company to the "Scrapy Pros" list.

The companies are sorted by their overall contribution to the Scrapy project and ecosystem. There's a small process to manage that:

  1. In order to appear in the list, a company must score at least five points in the table below.
  2. The company must submit an issue to scrapy.org repo whenever they want to update their contributions and ranking.
  3. This issue must contain references to all the contributions that the company made since the last time their rank has been updated.

This is the criteria that will be used:

Contribution Points
Pull request accepted in scrapy repo

50

Creation or contribution to public Scrapy plugins

20

Instructional materials such as screencasts, tutorials or blog posts

10

Blog posts that spread the word about Scrapy

5

Landing page in the company website describing the Scrapy services

2

Scrapy listed in the company's technologies stack (if any)            2  
Answer to StackOverflow question regarding Scrapy

1

In practice, to be listed, you'll need to:

  • add/update your "company card" in _data/companies/list/<companyslug>.yml as a YAML file; it needs a url field (e.g. the landing page of your consulting services offer), a description field describing your services (use markdown), and a logo filename
  • add your logo to img/
  • add/update your contributions in _data/companies/pros/contributions.csv
  • re-run update_ranking.py inside _data/companies/pros/

and send a pull request with these changes.

scrapy.org's People

Contributors

adarshpunj avatar aniversarioperu avatar bigb06 avatar bulatbulat48 avatar cyberplant avatar dangra avatar dogweather avatar eliasdorneles avatar elrull avatar gallaecio avatar giopiazza avatar gthole avatar gtlambert avatar kmike avatar mayankzyte avatar migganimon avatar nyov avatar pablohoffman avatar pandianmn avatar pervillalva avatar preetwinder avatar promediacorp avatar qwurty avatar rajatgarg79 avatar redapple avatar shaneaevans avatar stummjr avatar sylvainti avatar telosantos avatar wrar avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

scrapy.org's Issues

This section needs a little improvement

Summary

While visiting the website I noticed that many features are not interactive. Mainly this section which needs more attention so that it become more interactive to user. We can make a card for each of them or make the each div interactive so that when user hovers it does something.

Describe alternatives you've considered

We can either make divs to do something when hovers or make a card for each of them.

Additional context

Screenshot (252)

P.S : You can assign me the issue.

Handling of outdated company information

What's the policy about outdated information on the scrapy.org site?

I noticed that several of the companies/projects are no longer existing.
Should those be removed or at least de-linked (moved to a 'historically used scrapy' subsection)?

Add link to GSoC page

Scrapy is going to participate (again) in GSoC. Can we put a link to GSoC 2015 ideas page?

A few people don't see headers

Hello everyone.
I use Fedora and Firefox and I don't see headers on https://scrapy.org/
Also, I asked a few people and they tell me they also can't see headers.
One use Firefox and Linux Mint.
The second guy uses Manjaro and Firefox.
Screenshot_2019-06-08 Scrapy A Fast and Powerful Scraping and Web Crawling Framework
On Chromium all good
Screenshot from 2019-06-08 22-16-45
My Firefox is 67.0 (64-bit)
User-Agent: Mozilla/5.0 (X11; Fedora; Linux x86_64; rv:67.0) Gecko/20100101 Firefox/67.0

Fix the build

CI is failing with:

ruby 3.1.1p18 (2022-02-18 revision 53f5fc4236) [x86_64-linux-musl]
Configuration file: /srv/jekyll/_config.yml
            Source: /srv/jekyll
       Destination: /srv/jekyll/_site
 Incremental build: disabled. Enable with --incremental
      Generating... 
  Conversion error: Jekyll::Converters::Scss encountered an error while converting 'css/main.scss':
                    end of file reached
                    ------------------------------------------------
      Jekyll 4.3.2   Please append `--trace` to the `build` command 
                     for any additional information or backtrace. 
                    ------------------------------------------------
/usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/varint.rb:20:in `readbyte': end of file reached (EOFError)
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/varint.rb:20:in `block in read'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/varint.rb:19:in `loop'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/varint.rb:19:in `read'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/connection.rb:62:in `block in read'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/connection.rb:61:in `synchronize'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/connection.rb:61:in `read'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/dispatcher.rb:76:in `receive_proto'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/dispatcher.rb:19:in `block (2 levels) in initialize'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/dispatcher.rb:18:in `loop'
	from /usr/gem/gems/sass-embedded-1.64.1-x86_64-linux-musl/lib/sass/embedded/dispatcher.rb:18:in `block in initialize'
Error: Process completed with exit code 1.

Add Datahut to commercial support

Company description: Datahut provides Scrapy consulting services across different business verticals like e-commerce, content discovery, lead generation, opinion mining etc. They provide clean ready to use data in the most common formats.

Logo is attached here:
logo 1

@giopiazza could you tackle this?. Thanks!

Clarify how to get to the companies list

It should be explained better how to get enlisted here: http://scrapy.org/companies/ and here: http://scrapy.org/support/. Current issues:

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.