Performing a Kaplan-Meier survival analysis on a dataset containing cancer patient information. The dataset includes details such as patient demographics, dates of diagnosis, treatment received, and outcomes.
- The dataset is loaded from a CSV file.
- The data is preprocessed accordingly.
- Kaplan-Meier survival analysis is performed in order to estimate the survival function for the dataset.
- Survival probabilities are calculated over time using the Kaplan-Meier estimator.
- The result is plotted including the censored subjects.
- Conducting subgroup analyses by stratifying the data based on most relevant attributes and calculate survival probabilities for each subgroup.
- Patient ID: Unique identifier for each patient, presented in a fancier format.
- Date of Birth: The date of birth of each patient.
- Age: The age of each patient at the time of diagnosis.
- Gender: The gender of each patient.
- Date of Diagnosis: The date when each patient was diagnosed with the medical condition.
- Date of Death: The date of death of each patient, if applicable.
- Date of Last Follow-Up: The date of the last follow-up for each patient.
- Date of Admission: The date of admission to the hospital, if applicable.
- Date of Discharge: The date of discharge from the hospital, if applicable.
- Disease Stage: The stage of the disease diagnosed in each patient.
- Treatment Received: The type of treatment received by each patient.
- Comorbidities: Any additional medical conditions present in each patient.
- Smoking Status: The smoking status of each patient.
- Family History: Any family history of the medical condition in each patient. -Vital Signs: The vital signs (e.g., blood pressure) recorded for each patient.