Code Monkey home page Code Monkey logo

Comments (3)

ChristopheLambert avatar ChristopheLambert commented on August 31, 2024

While the original data provided identifiers for both the provider and provider institution, there does not appear to be any location information for either in the source data. Hence null is placed for the location of care_site to signify it is not known.

Patients, on the other hand, do have location information down to state and FIPS county code. One might try to take the most frequent residence state for patient visits to a given care_site to infer a good guess of its location, assuming there is some consistency in the identifiers versus some kind of randomization or blinding being done in the synthetic transformation process that created these pseudo-patients. For instance, there are fields for the "Provider Institution Tax Number", within the Carrier Claims records, but there are way more of these than number of healthcare institutions in the US, suggesting they are random numbers.

from etl-cms.

ChristopheLambert avatar ChristopheLambert commented on August 31, 2024

Closing out issue, as it appears to be a limitation on the source data.

from etl-cms.

larchiu avatar larchiu commented on August 31, 2024

Thanks @ChristopheLambert !

from etl-cms.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.