Comments (5)
I don't think there are, but I had a potentially useful idea for a windowed scrape (at least for events).
Right now, the windowed scrapes attempt to find all events that have been updated after the beginning of the window.
It would probably be helpful if we also scraped events that are scheduled to occur after the beginning of the window.
These recent and upcoming events are the ones most likely to change.
An increased false positive rate seems worth it if this helps us really reduce the false negatives.
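A minimal sketch of the selection rule described above, assuming events arrive as dictionaries with ISO-formatted `EventDate` and `EventLastModifiedUtc` values. Those field names, the sample records, and the helper `in_scrape_window` are illustrative assumptions, not the scraper's actual code:

```python
from datetime import datetime


def in_scrape_window(event, window_start):
    """Return True if the event was updated, or is scheduled, on or after window_start.

    Field names are assumed for illustration.
    """
    updated = datetime.fromisoformat(event["EventLastModifiedUtc"])
    scheduled = datetime.fromisoformat(event["EventDate"])
    # Current behaviour: only the `updated` check.
    # Proposed addition: also keep events still scheduled inside the window.
    return updated >= window_start or scheduled >= window_start


window_start = datetime(2020, 4, 1)

# Hypothetical sample records.
events = [
    # Old meeting, not touched recently: skipped.
    {"EventId": 1, "EventDate": "2020-03-01T00:00:00",
     "EventLastModifiedUtc": "2020-03-02T10:00:00"},
    # Past meeting, recently updated: caught by the existing rule.
    {"EventId": 2, "EventDate": "2020-03-15T00:00:00",
     "EventLastModifiedUtc": "2020-04-02T09:30:00"},
    # Upcoming meeting, not updated recently: caught only by the new rule.
    {"EventId": 3, "EventDate": "2020-04-20T00:00:00",
     "EventLastModifiedUtc": "2020-02-10T08:00:00"},
]

selected = [e["EventId"] for e in events if in_scrape_window(e, window_start)]
print(selected)
```

Event 3 is the kind of false positive the trade-off accepts: it may not have changed, but rescraping it is cheap compared with missing a change to an upcoming meeting.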
from python-legistar-scraper.
@fgregg That's really smart!
Could do something similar with bills by using `MatterAgendaDate`, `MatterIntroDate`, and the other date fields on matters.
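One way this might look for bills is an OData-style `$filter` that matches a matter when any of several date fields falls after the window start. This is a sketch under assumptions: the helper `windowed_filter` is hypothetical, the field names (`MatterLastModifiedUtc`, `MatterAgendaDate`, `MatterIntroDate`) and the `datetime'...'` literal syntax should be checked against the API before use:

```python
from datetime import datetime


def windowed_filter(date_fields, window_start):
    """Build an OData-style filter that is true when any date field is after window_start.

    Hypothetical helper; the literal syntax is an assumption.
    """
    stamp = window_start.strftime("%Y-%m-%d")
    clauses = [f"{field} gt datetime'{stamp}'" for field in date_fields]
    return " or ".join(clauses)


# Assumed field names for matters.
matter_filter = windowed_filter(
    ["MatterLastModifiedUtc", "MatterAgendaDate", "MatterIntroDate"],
    datetime(2020, 4, 1),
)

# Query parameters that could be passed along with a request for matters.
params = {"$filter": matter_filter}
print(params["$filter"])
```

Joining the clauses with `or` widens the scrape on purpose, mirroring the trade-off discussed for events: more matters are rescraped, but fewer changes are missed.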
let me know if you'd like to see a PR
@fgregg We have the go-ahead from Metro to prioritize this for May. I'd love to review a PR at your convenience.