Code Monkey home page Code Monkey logo

covid_dc's People

Contributors

taylorterry3 avatar

Watchers

 avatar

covid_dc's Issues

Update school decodes

A few new school names have shown up in the data set, so they need to be added to the decode dict.

Add side table of school info

Add a side table of ward, address, DCPS codes, and other info for each school. This will also require creating a canonical short name for each school.

Switch to diffing data

Right now this system scrapes everything every time, which will stop working soon because the source page only shows the most recent 500 letters (see https://dcpsreopenstrong.com/health/response/notifications/). I had planned to switch to reading the PDFs and keep the old ones in the repo, which would be slow but stateless. The PDFs have been buggy and occasionally malformed, though, so I need a plan B.

Fix things that assume year in text is current year

When the year rolls over there will be a need to fix things that assume that the dates being parsed are in the current year. This may just amount to versioning the data through December and starting a fresh round in January.

Archive PDF's

Similar to #14 , the PDF's are about to spill off into the dustbin of history and need to be archived.

One-Shots need to be versioned along with the archived data

Because some of the one-shots find and replace data within the scraped data they will need to be versioned along with the archived data so they aren't looking for things that aren't there. Another option would be to handle the error.

Add tests

existing functions could use some tests

Type all the things

I've been pretty inconsistent with adding type annotations outside of function signatures, so at some point I need to fix that.

Add secrets for non-local deploy

The main script needs some sort of secrets storage so that it can be deployed somewhere other than my laptop and run on a cron.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.