Code Monkey home page Code Monkey logo

medfuse's People

Contributors

farahshamout avatar nasir6 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

medfuse's Issues

modify the code in create_split.py to sovle the problem '0 sample in val and test dataset of CXR_UNI'

First of all thank you for your work, it's very useful!

However I found some problems when using it, in create_split.py the elements in the lists val_subject_ids and test_subject_ids are formatted as strings, not int types. This leads to a problem in

cxr_splits.loc[cxr_splits.subject_id.isin(val_subject_ids_int), 'split'] = 'validate'
cxr_splits.loc[cxr_splits.subject_id.isin(test_subject_ids_int), 'split'] = 'test' 

The corresponding val and test tags could not be matched in these two steps. This causes the val and test sections of the CXR_UNI dataset to be empty.

I added

val_subject_ids_int = [int(i) for i in val_subject_ids]
test_subject_ids_int = [int(i) for i in test_subject_ids]

after val_subject_ids and test_subject_ids, which solves the problem and we can then get the CXR dataset for pretraining. Thanks!

about the create_split.py file

hi, thanks for your interesting work : )
In the README, you mentioned the create_split.py file related to preprocessing. But I can't find this file. Could you please provide this file. Thanks a lot!

unable to download

Due to low system configuration, unfortunately, the download fails intermittently, which I tried many times.
I understand your concern. Recently, I have been certified with MIT, CITI-data use Agreement through physionet. Please share the processed data, which will be helpful in proceeding forward with my research work.

Need of the Processed data from MedFuse.

Hi @farahshamout ,

I tried to extract the data which you mentioned in the ReadMe file of Github. while extracting, my system gets struck up. During our discussion through email, you mentioned that you had preprocessed data from MIMIC-CXR and MIMIC-IV, Can you please share it for my work?

Pretrained Model

Hi, are you planning to release pre-trained model along with sample dataset for inference and validation purposes?
Thanks...

'None' data issue when executing sh ./scripts/radiology/uni_cxr.sh

when executing sh ./scripts/radiology/uni_cxr.sh, error occurs.
Below are the details:
File "/mnt/gsai/brain/MedFuse/datasets/fusion.py", line 202, in
x = [item[0] for item in batch]
TypeError: 'NoneType' object is not subscriptable

In datasets/fusion.py, there is a def my_colate() ,used for dataloader.
I debugged with following code"

`
def my_collate(batch):
# for loop for debugging

for item in batch:

    if item is not None:

        ehr_data, cxr_data, labels_ehr, labels_cxr = item

        print(ehr_data.shape, cxr_data.shape, labels_ehr.shape, labels_cxr.shape)

    else:

        print(item)`

It prints 'None' for every item in batch.
Can you figure it out what's the problem?

Duplicate samples in dataset ‘partial_ehr_cxr’

The code

index = random.randint(0, len(self.ehr_files_unpaired)-1)

in datasets/fusion.py produces duplicate samples in dataset ‘partial_ehr_cxr’ about 20% (depending on the random seed). If you want to get the dataset without duplicate samples, considering use

index = index - len(self.ehr_files_paired)

Thanks.

Request to get test_listfile.csv and val_listfile.csv

hi, I am doing project with your datasets. I cannot able to get the mimic_iv_extracted folder , because i need test_listfile.csv and val_listfile.csv. Can you please lead me how to get those files for my work?

About the ICD mapping in the phenotype classification task

Thank you for your work first, which has been a great inspiration to me.
I have noticed that "we mapped all ICD-10 codes to ICD-9 using the guidelines provided by the Centers for Medicare & Medicaid Services1, and then map them to CCS categories" was mentioned in section 4.1 Datasets and Benchmark Tasks. But I did not find the relevant content in the code. May I ask how it is implemented here?

pre-trained model

Could you please share your pretrained model for evaluation? thanks

>Processed Data

          > 

Due to low system configuration, unfortunately, the download fails intermittently, which I tried many times.
I understand your concern. Recently, I have been certified with MIT, CITI-data use Agreement through physionet. Please share the processed data, which will be helpful tin proceeding forward with my research work.

Originally posted by @AmudhaTK in #7 (comment)

Needs clarity in runing the shell script

Hi @farahshamout,
I can understand the situation of not sharing the original protected datasets. But with your help I come to know how to get preprocessed data from raw datasets. After getting proper system up-gradation, I started downloading the raw data from MIMIC IV and MIMIC-CXR-JPG. Thank you for the proper guidance on the memory needed to download the data.

Now, I am working on the processing of the raw data. I did MIMIC IV data extraction using MIMIC III. I need clarification on this part.

Kindly help me with the details of the output that we can get after the execution of this Shell script:

train the imaging model with 14 radiology labels.

sh ./scripts/radiology/uni_cxr.sh

train LSTM model on extracted time-series EHR data for phenotype task.

sh ./scripts/phenotyping/train/uni_all.sh

train LSTM model for the in-hospital-mortality task

sh ./scripts/mortality/train/uni_all.sh

It is showing error,as needs two positional arguments.
Which arguments I should pass to run the shell script?
It would be helpful to know the proper output of the above command.

thank you,
Amudha T K

needs help in running the shell scripts

Dear @farahshamout ,
Hope you are doing well.

I need help in running the shell scripts.
sh ./scripts/phenotype/uni_csr.sh

sh ./scripts/mortality/train/uni_all.sh

It is showing an error, as it needs two positional arguments.
Do you know which arguments I should pass to run the shell script?
It would be helpful to know the proper output of the above command.

thank you,
Amudha T K

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.