Code Monkey home page Code Monkey logo

hibaskinlesionsdataset's Introduction

Hospital Italiano de Buenos Aires Skin Lesions Images (2019-2022) Dataset

This repository hosts the Python script to perform the Exploratory Data Analysis (EDA) of the Hospital Italiano de Buenos Aires skin lesion dataset shared through the ISIC Archive. This project was conducted by the Dermatology Department and the Artificial Intelligence and Data Science program of the Health Informatics Department of the institution.

Dataset description

The dataset described here was composed of information collected from 623 patients seen by expert dermatologists at the Hospital Italiano de Buenos Aires (Buenos Aires, Argentina). The collection comprises 1,616 images (1,270 contact-polarized dermoscopy images and 346 clinical images) captured from 1,246 lesions corresponding to the most frequent diagnoses observed at the institution.

image

Access to the image collection

The dataset is registered as a collection in the ISIC Archive. In this sense, it can be downloaded in two ways:

  1. By visiting the ISIC Archive website or accessing the collection DOI https://doi.org/10.34970/587329. image

  2. By resorting to the official command line tool for interacting with the ISIC Archive (for more information visit https://github.com/ImageMarkup/isic-cli)

# Install tool 
pip install isic-cli 

# Find the collection ID
isic collection list # Hospital Italiano de Buenos Aires Skin Lesions (ID 251)

# Download the collection metadata
isic metadata download --collections 251

# Download the collection metadata
isic metadata download --collections 251

# Download the collection images (and save them in the 'images/' folder).
isic image download --collections 251 images/

Exploratory Data Analysis (EDA)

In this Python notebook we performed the download of the collection from the ISIC Archive by resorting to the command line tool, and carried out the exploratory data analysis (EDA). In this regard, we evaluated the distribution of patients, lesions and images, as well as analyzed the characteristics of patients (age, sex, personal and family history of melanoma) and lesions (diagnosis, type of confirmation for diagnosis, location).

Datasets Comparison

In this Python notebook we performed a comparison between the dataset presented here and publicly available datasets.

hibaskinlesionsdataset's People

Contributors

piashiba avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.