Code Monkey home page Code Monkey logo

clinical-analytics-demo's Introduction

Case Study: Predicting Health Status Through Prescriptions and Diagnoses

Data Overview

In this case study, we will be looking at methods to define a patient’s health status using diagnosis data, and then try to predict a patient’s health status using prescription data only. This is a problem relevant in healthcare, for example as we try to understand a patient’s future therapeutic needs.

For this task, we have available three datasets that simulate data for approximately 85,000 patients over a period of 3 years:

  1. Patient Diagnosis (Diagnosis.csv). Each row of this dataset contains a patient’s diagnosis provided on a specific date. The diagnosis codes are presented in a standard called ICD-10.

  2. Patient Prescriptions (Prescriptions.csv). Each row of this dataset contains a patient’s prescription filled on a specific date. The prescriptions contain drug category, drug group, and drug class.

  3. ICD-to-Clinical Categories Map (CCS). Each row in this file contains an ICD-10 diagnosis code (with a slightly different formatting than in the Patient Diagnosis table) and diagnosis descriptions as explained here. Note that NOT every diagnosis code has a CSS code, so we will have to work around this.

Example Tasks

  1. Defining a Patient’s Health Status. For this task we will focus on defining a patient’s health status based on their diagnostic data. The CCS map will be useful for this. Hint: Use whatever information is appropriate in the given data sets to define a robust characterization of each patient’s health status. Ideally, the characterization should be useful for establishing something like “Patient 0123 has anemia and skin infection.”

  2. Predicting Heath Status using Prescription Data alone. In this task, we will infer a patient’s health status using only their prescription data. In particular, we will build a model that would allow us to potentially predict the health status of patients outside the ones provided.

Example Solutions (Version 0)

a. Solution Files

  • Case Study - Prescriptions vs Health Status.ipynb: Solution notebook
  • Case-Study-tentative-solution-barnett.pdf: Documentation for the solution. Note that the solution notebook includes both the demo code for essential modules and functions - and the thought process by which the aforementioned tasks are addressed.
  • Case-Study-v0-short-presentation.mov: A short presentation for this demo.

b. Modules

  • feature_extractor.py: A module for feature extraction based on (simple) NLP methods.
  • data_pipeline.py: Data preprocessing
  • evaluate.py: Model evalution
  • icd_utils.py
  • plot_utils.py
  • utils.py

Advanced Solutions (Todo)

  • Collaborative filtering
  • Seq2seq

clinical-analytics-demo's People

Contributors

metatron53 avatar pleiadian53 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.