zakird / crux-top-lists Goto Github PK
View Code? Open in Web Editor NEWDownloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.
Home Page: https://developer.chrome.com/docs/crux
Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.
Home Page: https://developer.chrome.com/docs/crux
Thanks for the super handy project!
The only gap I ran into: I couldn't find the ,5000
entries in the CSV. This bucket was recently introduced I believe by the CrUX report:
https://developer.chrome.com/docs/crux/release-notes/#202210
Great project, was looking for this! Your cron scheduled downloader stopped working a few months ago though.
Hello,
Thank you for maintaining this repository and cached versions of crux-top-list.
202205.csv
contains only 859188 records instead of the usual 1M. Can the corresponding list be regenerated and updated here or is the data also missing from Google's BigQuery database?
>>> import pandas as pd
>>> df = pd.read_csv("202205.csv")
>>> df
origin rank
0 http://iporntv.net 1000
1 https://eldenring.wiki.fextralife.com 1000
2 https://m.lightinthebox.com 1000
3 https://ssc.nic.in 1000
4 https://ja.m.wikipedia.org 1000
... ... ...
859183 https://www.vulcaodaborracha.com.br 1000000
859184 https://www.vub.be 1000000
859185 https://www.virginianaturalgas.com 1000000
859186 https://www.virtualregatta.com 1000000
859187 https://zamosc.lento.pl 1000000
[859188 rows x 2 columns]
>>> df.groupby("rank").nunique()
origin
rank
1000 904
10000 7806
100000 76566
1000000 773912
Thanks!
Do you know if it's possible to have the URL category as per:
https://support.google.com/a/answer/13306955?hl=en
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.