Code Monkey home page Code Monkey logo

cellpainting-gallery's Introduction

Cell Painting Gallery

This page provides a guide to the datasets that are available in the Cell Painting Gallery, hosted by the AWS Registry of Open Data (RODA): https://registry.opendata.aws/cellpainting-gallery

Citation/license

All the data is released with CC0 1.0 Universal (CC0 1.0). Still, professional ethics require that you cite the appropriate resources/publications, listed below, when using individual datasets. For example,

We used the dataset cpg0000 (Chandrasekaran et al., 2022), available from the Cell Painting Gallery on the Registry of Open Data on AWS (https://registry.opendata.aws/cellpainting-gallery/).

Available datasets

All datasets are generated using the Cell Painting assay unless indicated otherwise. Several updates to that protocol exist (Cell Painting wiki).

The datasets are stored with the prefix indicated by the dataset name. E.g. the first dataset is located at s3://cellpainting-gallery/cpg0000-jump-pilot and can be listed using aws s3 ls --no-sign-request s3://cellpainting-gallery/cpg0000-jump-pilot/ (note the / at the end).

The datasets' accession numbers are the first seven characters of the dataset name. E.g. the accession number of the first dataset is cpg0000.

Dataset name Description Publication to cite IDR accession number
cpg0000-jump-pilot 300+ compounds and 160+ genes (CRISPR knockout and overexpression) profiled in A549 and U2OS cells, at two timepoints 3
cpg0001-cellpainting-protocol 300+ compounds profiled in U2OS cells using several different modifications of the Cell Painting protocol 6
cpg0002-jump-scope 300+ compounds profiled in U2OS using different microscopes and settings 7
cpg0003-rosetta 28,000+ genes and compounds profiled in Cell Painting and L1000 gene expression 5
cpg0004-lincs 1,571 compounds across 6 doses in A549 cells 4 idr0125
cpg0012-wawer-bioactivecompoundprofiling 30,000 compound dataset in U2OS cells 1,2 idr0016
cpg0015-heterogeneity 2,200+ compounds and 200+ genes profiles in U2OS cells 8 idr0016,idr0036, idr0033
cpg0016-jump 116,000+ compounds and 16+ genes (CRISPR knockout and overexpression) profiled in U2OS cells. Over 8 million images (>126 TB), over 1.5 billion cells of numerical data (>126TB), for over 250 TB data in total. 9
cpg0017-rohban-pathways 323 genes overexpressed in U2OS cells. Original images re-profiled in 2023 11
cpg0019-moshkov-deepprofiler 8.3 million single cells from 232 plates, across 488 treatments from 5 public datasets, used for learning representations 10

Downloading from Cell Painting Gallery

See Folder Structure for a complete description of data organization in Cell Painting gallery. Note that for each dataset you can download just images, just extracted features and metadata, or both. Note also that many datasets contain separate batches and you may want a subset of available batches.

If you'd like to just browse the data, it's a lot easier to do so using a storage browser.

Publications using datasets in Cell Painting Gallery

First Author
Title
Year
Publication URL
Dataset Name in Gallery
1 Wawer Toward performance-diverse small-molecule libraries for cell-based phenotypic screening using multiplexed high-dimensional profiling 2014 https://doi.org/10.1073/pnas.1410933111 cpg0012-wawer-bioactivecompoundprofiling
2 Bray A dataset of images and morphological profiles of 30 000 small-molecule treatments using the Cell Painting assay 2017 https://doi.org/10.1093/gigascience/giw014 cpg0012-wawer-bioactivecompoundprofiling
3 Chandrasekaran Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations 2022 https://doi.org/10.1101/2022.01.05.475090 cpg0000-jump-pilot
4 Way Morphology and gene expression profiling provide complementary information for mapping cell state 2022 https://doi.org/10.1101/2021.10.21.465335 cpg0004-lincs
5 Haghighi High-Dimensional Gene Expression and Morphology Profiles of Cells across 28,000 Genetic and Chemical Perturbations 2022 https://doi.org/10.1101/2021.09.08.459417 cpg0003-rosetta
6 Cimini Optimizing the Cell Painting assay for image-based profiling 2022 https://doi.org/10.1101/2022.07.13.499171 cpg0001-cellpainting-protocol
7 Jamali Assessing the performance of the Cell Painting assay across different imaging systems 2023 https://doi.org/10.1101/2023.02.15.528711 cpg0002-jump-scope
8 Rohban Capturing single-cell heterogeneity via data fusion improves image-based profiling 2019 https://doi.org/10.1038/s41467-019-10154-8 cpg0015-heterogeneity
9 Chandrasekaran JUMP Cell Painting dataset: morphological impact of 136,000 chemical and genetic perturbations 2023 https://doi.org/10.1101/2023.03.23.534023 cpg0016-jump
10 Moshkov Learning representations for image-based profiling of perturbations 2022 https://doi.org/10.1101/2022.08.12.503783 cpg0019-moshkov-deepprofiler
11 Rohban Systematic morphological profiling of human gene and allele function via Cell Painting 2017 https://doi.org/10.7554/eLife.24060 cpg0017-rohban-pathways

Contributing to Cell Painting Gallery

See Folder Structure for the required folder structure of your data. See Upload for a complete description of how to upload to the Cell Painting gallery bucket.

Any data contributions to Cell Painting Gallery must be accompanied by a pull request to this repository with updates to this README to add your dataset to Available datasets and Publications.

Complementary Datasets

For other sources of publicly available Cell Painting datasets we encourage you to explore:

cellpainting-gallery's People

Contributors

annecarpenter avatar erinweisbart avatar shntnu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

cellpainting-gallery's Issues

2022_08_17_TAORF (cpg0017)

Segmentation/ Feature extraction has been redone by Erin in Cimini Lab.
Profile creation has been redone by Erin in Cimini Lab.
Data can be public in RODA Immediately

Update as generated:
Profile repo for image reprocessing: https://github.com/broadinstitute/cpg0017-rohban-pathways
Publication repo for initial image processing: https://github.com/carpenterlab/2017_rohban_elife
cpg0017-rohban-pathways

  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:
- [ ] Run Distributed-BioFormats2Raw to create .ome.zarr files
- [ ] Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:
- [ ] Make IDR entry public

  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication

Structure for publication-associated data?

Can cpg act as LFS for files generated in the publication of a dataset that are too large to be held in the publication repository itself?
If so, what structure would we want?

I propose that yes, we allow (but not require) cpg to host publication-associated large files that are too large to fit directly into a publication repo. When compared to the size of the rest of any dataset, publication-associated files are unlikely to cause any noticeable change in total size. For any publication, I would expect we would include download instructions for accessing the raw data in cpg anyway, so hosting publication-associated files has the major benefit of simplifying access to those files as well.

I suggest a folder structure of:

└── workspace
           └── publication_data
                      └── YEAR_FIRSTAUTHOR
                                  ├── large_file_example1.csv.gz
                                  ├── large_file_example2.csv.gz
                                  └── large_file_example3.csv.gz

@shntnu thoughts?

How to analyze data directly from s3 bucket via S3 REST API or S3 SDK for python ?

Requirement

Dear Community,

This is Robin from Novo Nordisk Pharmaceutical company.

Firstly, thanks to your contribution to project cellpainting-gallery.

As cellpainting-gallery is a public S3 bucket, We'd like to do some data analysis based the data you have on AWS S3.

I can list datasets with AWS CLI aws s3 ls --no-sign-request s3://cellpainting-gallery/cpg0000-jump-pilot/ .

seems it allows anonymous access the S3 bucket with AWS CLI , but when I access the S3 bucket via browser, it prompts me Access Denied.

http://s3.amazonaws.com/cellpainting-gallery/cpg0000-jump-pilot

<Error>
<Code>AccessDenied</Code>
<Message>Access Denied</Message>
<RequestId>1Z4Z91ER4NJVXQYX</RequestId>
<HostId>0O1/TtCuJTVnxKUfT9tD4hfK78DTNAxLd/wEc4nYyGwhykyaKJzoZQljS+AKHjjYl1IbjRcJOIg=</HostId>
</Error>

We want to do that in a smart way, here we'd like to analyze data directly via s3 bucket instead of download datasets to local to analyze.

Because of limitation of local storage space, I'd like to know if it's possible to access the cellpainting-gallery's datasets via S3 REST API or S3 SDK for python (like boto3).

Let's take dataset cellpainting-gallery/cpg0000-jump-pilot as example, I'd like to get file size recursively through the whole dataset.

What prerequisites do I need to prepare for?

  • Do I need to create an AWS Account?
  • What default AWS S3 profile I need to prepare for accessing bucket cellpainting-gallery? like account and region.
  • Would you please provide code snippet of S3 REST API or S3 SDK for python to retrieve the dataset?

It would be great if you could tell me the detailed steps to implement it?

Best Regards,
Robin

2018_06_05_cmQTL (cpg0022)

Data can be public in RODA Immediately
profile repo: https://github.com/broadinstitute/cmQTL
cpg0022-cmqtl

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication

2018_05_30_ResistanceMechanisms_Kapoor (cpg0028)

Segmentation/ Feature extraction is being performed by Cimini lab (done)
Profile creation is being performed by Carpenter-Singh lab (done)
Data can be public in RODA Immediately

Update as generated:

Link to profile repo (same as publication repo): https://github.com/broadinstitute/profiling-resistance-mechanisms/tree/master/0.generate-profiles/profiles
Link to publication repo: https://github.com/broadinstitute/profiling-resistance-mechanisms
cellpainting-gallery identifier: cpg0028-kelley-resistance

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2022_07_13_CDRP (cpg0012)

Segmentation/ Feature extraction is being performed by Cimini lab (Erin)
Profile creation is being performed by Carpenter-Singh lab (Niranj)
Data can be public in RODA Immediately

Update as generated:
Link to profile repo: https://github.com/broadinstitute/2015_Bray_GigaScience-data
Link to publication repo: https://doi.org/10.1093/gigascience/giw014
cellpainting-gallery identifier: cpg0012-wawer-bioactivecompoundprofiling

  • Metadata completely filled out in Project Profiler Database (entered as batch CDRP re-analysis in 2008 12 04 Imaging CDRP for MLPCN)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:
- [ ] Run Distributed-BioFormats2Raw to create .ome.zarr files
- [ ] Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Data is already available at http://idr.openmicroscopy.org/webclient/?show=screen-1251

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

Note: This is a reprocessing of an existing dataset

Readme updates

Jamali et al should be listed as Jamali and Tromans-Coia et al; DOI is https://doi.org/10.1101/2023.02.15.528711

Looks like the second Chandrasekaran will be 2023 so can probably now not do 2022a, etc.

(Would do these myself but I don't have write access :( )

2020_11_04_CPJUMP1 (cpg0000-jump-pilot)

Hello, I would like to download the dataset cpg0000-jump-pilot to work on the 300+ compounds. My question is where to find the name of the compounds (is there a csv file for that) and is there an information about cell death modality induced by those 300+compounds (such as necrosis, apoptosis)? Many thanks in advance!

Jump-Pilot metadata: source of gene/target information for compounds

Hi there,

My name is Reese and I am a data scientist at ViQi Inc. working to do AI analysis on the jump datasets. I am currently looking at the jump pilot data and I had some questions regarding the JUMP-Target-1_compound_metadata_targets.tsv. What specifically do the genes in the target and target_list columns in this file represent? What source did it come from (pubchem, etc.)? Thank you so much for your time and continued efforts!

Cheers, Reese

2023_05_19_BarcodeCalling (cpg0021)

[Link to publication repo]
cellpainting-gallery identifier = cpg0021-periscope

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

Add info re: where to ask what questions

Formalize what kinds of questions should be asked here and make that info known.

  • Make issue template for "question about dataset" that tells folks where to go instead (suggested add direct link for JUMP datasets and general description that applies to others)
  • Add to README info about where to direct questions

2023_03_21_Dactyloscopy (cpg0025)

Segmentation/ Feature extraction is being performed by (Cimini lab / Carpenter-Singh lab)
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
cellpainting-gallery identifier: cpg0025

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

cpg0014-jump-adipocyte

Segmentation/ Feature extraction is being performed by Erin in Cimini lab
Profile creation is being performed by Erin in Cimini lab
Data can be public in RODA Immediately

Update as generated:
profiler repo = https://github.com/jump-cellpainting/cpg0014-JUMP-adipocyte
[Link to publication repo]
cellpainting-gallery identifier = cpg0014-jump-adipocyte

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication

2022_08_17_SeedSeq (cpg0018)

Segmentation/ Feature extraction has already been performed
Profile creation has already been performed
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
cpg0018

  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2021_09_01_VarChAMP_Pilot (cpg0020)

Segmentation/ Feature extraction is being performed by Cimini lab
Profile creation is being performed by Cimini lab
Data can be public in RODA Immediately - yes

Update as generated:
Link to profile repo
[Link to publication repo]
[cpg0020-varchamp-pilot]

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2021_09_21_MPI (cpg0023)

Segmentation/ Feature extraction is being performed by (Cimini lab / Carpenter-Singh lab)
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
[cellpainting-gallery identifier]

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2022_09_DD_DeepProfiler (cpg0019)

Segmentation/ Feature extraction is being performed by (Cimini lab / Carpenter-Singh lab)
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
https://doi.org/10.1101/2022.08.12.503783
cpg0019-moshkov-deepprofiler

  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:
These are only training images (crops)

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • ~~ Make IDR entry public~~
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2023_04_14_Lacoste-Haghighi

Segmentation/ Feature extraction is being performed by Cimini lab
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
cpg0026-lacoste_haghighi-rare-diseases

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

2023_03_27_ECBL_Pilot

Segmentation/ Feature extraction is being performed by (Cimini lab / Carpenter-Singh lab)
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
[cellpainting-gallery identifier]

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

Could NOT find metadata dir for cpg0016-jump

Hi
Thank you for your great work of Cell painting.
I want to download cpg0016-jump image data for analysis with meta data, howerver, I could not find metadata anymore(aws s3 ls --no-sign-request s3://cellpainting-gallery/cpg0016-jump/source_2/workspace/).
图片
Can I get metadata for any other methods?

Bests

2022_04_14_Lipocyte_Profiler (cpg0011)

Segmentation/ Feature extraction is being performed by Cimini lab
Profile creation is being performed by Cimini lab
Data can be public in RODA Immediately

Update as generated:

Link to profile repo: https://github.com/broadinstitute/2019_08_03_Adipocyte_CellPainting_Claussnitzer
Link to publication: https://www.biorxiv.org/content/10.1101/2021.07.17.452050v1
cellpainting-gallery identifier: cpg0011-lipocyteprofiler

  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

  • Upload data to RODA (is private by default)
  • Run validation script to ensure completion
  • Update cellpainting-gallery/README.md
  • Make RODA entry public

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

Hesam's original top post is below

Hi Shantanu,
We are organizing our LipocyteProfiler data according to the outline. We populated the branches based on our available data, and I was wondering if the following information are sufficient for the cellpainting-gallery? if yes, how can we upload the data, and if no, what are the required information?
Screen Shot 2022-04-14 at 1 23 41 PM
Thanks!

2023_08_31_Chromalive

Segmentation/ Feature extraction is being performed by (Cimini lab / Carpenter-Singh lab)
Profile creation is being performed by (Cimini lab / Carpenter-Singh lab)
Data can be public in RODA Immediately

Update as generated:
[Link to profile repo]
[Link to publication repo]
cellpainting-gallery identifier: cpg0029-chroma-pilot

  • Metadata collection form filled out by collaborator or Imaging Platform member (https://airtable.com/shrVxz9DcoMlDoCBI)
  • Metadata completely filled out in Project Profiler Database (Imaging Platform internal use only)
  • Segmentation/Feature extraction complete
  • Profiling complete

Transfer to CellPainting Gallery:

If data is being published, prepare for publication:

  • Run Distributed-BioFormats2Raw to create .ome.zarr files
  • Upload (meta)data to IDR (images remain hosted in cellpainting-gallery).

Once published:

  • Make IDR entry public
  • Update cellpainting-gallery/README.md and open-data-registry/cellpainting-gallery.yml to reflect publication
  • Move this Issue from cellpainting-gallery-private to cellpainting-gallery. This step can be performed at an earlier point if it needs inputs from an external collaborator.

Citation file?

Would that be a good idea, to make the gallery as a whole easier to cite?

Add dataset size to README

Would be nice to have approximate size of datasets (maybe list image and numerical data sizes separately) in the Available Datasets table in the README so folks wanting to use the datasets have some idea of what they are getting themselves into...

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.