The Single-Cell Pathology Landscape of Breast Cancer

This repository contains all code used to produce the results and figures of the publication "The Single-Cell Pathology Landscape of Breast Cancer". All data, including TIFF images, masks, and single-cell and patient data, are available on Zenodo (DOI: 10.5281/zenodo.3518284).

Matlab scripts:

Image and other early analysis steps were performed using Matlab. Since the single-cell data was extracted using histoCAT, the Matlab scripts assume a data structure as in a loaded histoCAT session. Saved histoCAT sessions can be downloaded from Zenodo (10.5281/zenodo.3518284).

R scripts:

Downstream analysis was performed using R pipelines. The R analysis is divided into two notebooks: one for the analysis of the first TMA of 281 patients from University Hospital Basel, and a second for the comparison and analysis of the second, multi-core cohort from University Hospital Zurich. All input data required to reproduce the figures of this publication are available on Zenodo (10.5281/zenodo.3518284). The BaselTMA and ZurichTMA folders contain the input data for the respective R pipelines.

Data organization on Zenodo:

OMEandSingleCellMasks.zip contains the ome-tiff stacks and the single-cell masks. TumorStroma_masks.zip contains masks for tumor and stromal regions. SingleCell_and_Metadata.zip contains the single-cell and patient data as well as all other input data for the R pipelines provided here.

| Where to find | Subpath |
| --- | --- |
| Patient and core metadata BaselTMA | SingleCell_and_Metadata/BaselTMA/Basel_PatientMetadata.csv |
| Patient and core metadata ZurichTMA | SingleCell_and_Metadata/ZurichTMA/Zuri_PatientMetadata.csv |
| Single-cell data BaselTMA | SingleCell_and_Metadata/BaselTMA/SC_dat.csv |
| Single-cell data ZurichTMA | SingleCell_and_Metadata/ZurichTMA/SC_dat.csv |
| Single-cell segmentation masks both TMAs | OMEandSingleCellMasks/Basel_Zuri_masks/ |
| Image tiffs both TMAs | OMEandSingleCellMasks/ome/ |
| Antibody panel | SingleCell_and_Metadata/Basel_Zuri_StainingPanel.csv |
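
As a convenience, the subpaths listed above can be collected in a small lookup so that scripts do not hard-code them in several places. The following is a minimal Python sketch, not part of this repository: the dictionary keys, the function names, and the idea of a single local root folder holding the extracted Zenodo archives are all assumptions made for illustration. Only the relative subpaths are taken from the listing above.

```python
from pathlib import Path

# Relative subpaths copied from the listing above.
# The dictionary keys are hypothetical names chosen for this sketch.
ZENODO_SUBPATHS = {
    "patient_metadata_basel": "SingleCell_and_Metadata/BaselTMA/Basel_PatientMetadata.csv",
    "patient_metadata_zurich": "SingleCell_and_Metadata/ZurichTMA/Zuri_PatientMetadata.csv",
    "single_cell_basel": "SingleCell_and_Metadata/BaselTMA/SC_dat.csv",
    "single_cell_zurich": "SingleCell_and_Metadata/ZurichTMA/SC_dat.csv",
    "single_cell_masks": "OMEandSingleCellMasks/Basel_Zuri_masks/",
    "image_tiffs": "OMEandSingleCellMasks/ome/",
    "antibody_panel": "SingleCell_and_Metadata/Basel_Zuri_StainingPanel.csv",
}


def resolve_zenodo_path(root, key):
    """Return the full path of one dataset below a local root folder.

    `root` is assumed to be the folder into which the Zenodo archives
    were extracted; it is not defined anywhere in this repository.
    """
    return Path(root) / ZENODO_SUBPATHS[key]


def missing_paths(root):
    """List the keys whose file or folder does not exist below `root`."""
    return [
        key
        for key in ZENODO_SUBPATHS
        if not resolve_zenodo_path(root, key).exists()
    ]
```

Calling `missing_paths` with the local download folder before starting an analysis gives a quick check that all archives were extracted to the expected locations.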

Important notes when working with the data provided on Zenodo:

  • The single-cell data provided for downstream R analysis is already spillover-corrected.
  • The single-cell masks generated using CellProfiler do not always contain strictly sequential single-cell labels. Occasionally an ID is skipped because edge cells were excluded. This can cause issues in histoCAT, so the single cells are automatically relabelled sequentially when they are loaded into histoCAT. We exported the single-cell data from histoCAT for downstream R analysis. The single-cell labels in that data are therefore the newly assigned sequential ones, and they match the labels in the histoCAT sessions. The original mask files provided here, however, still contain the original labels from CellProfiler. To match the single-cell data provided here directly to the masks (e.g. to visualize single-cell features on the image outside of histoCAT), the single-cell labels in the mask need to be relabelled as well, or matched based on their rank.
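
The rank-based matching described in the last note can be sketched in Python with NumPy. This is an illustrative sketch under stated assumptions, not the code used by histoCAT or by this repository: it assumes that label 0 marks background in the mask and that the sequential labels follow the ascending order of the original CellProfiler labels. The function name `relabel_sequential_by_rank` is hypothetical.

```python
import numpy as np


def relabel_sequential_by_rank(mask):
    """Relabel a single-cell mask so that cell labels run 1..n without gaps.

    Assumptions of this sketch: 0 is background, and the new label of a
    cell is the rank of its original label among all labels in the mask.
    Returns the relabelled mask and the sorted original labels, so that
    original_labels[new_label - 1] recovers the original label.
    """
    mask = np.asarray(mask)
    foreground = mask > 0

    # Sorted, unique original labels of all cells (background excluded).
    original_labels = np.unique(mask[foreground])

    # The position of each pixel's label in the sorted list is its rank;
    # adding 1 makes the new labels start at 1 and keeps 0 for background.
    relabelled = np.zeros(mask.shape, dtype=np.int64)
    relabelled[foreground] = np.searchsorted(original_labels, mask[foreground]) + 1
    return relabelled, original_labels


# Toy mask in which the labels 2, 4 and 5 were skipped.
example_mask = np.array([
    [0, 1, 1],
    [3, 3, 0],
    [0, 6, 6],
])
example_relabelled, example_original = relabel_sequential_by_rank(example_mask)
```

In the toy mask, the cells originally labelled 1, 3 and 6 receive the labels 1, 2 and 3. Before relying on such a mapping, it is worth checking for a few cores that the number of cells in the relabelled mask equals the number of cells in the corresponding single-cell data.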
