Code Monkey home page Code Monkey logo

surgical_scenegraph_generation's Introduction

Learning and Reasoning with the Graph Structure Representation in Robotic Surgery

Mobarakol Islam, lalithkumar Seenivasan, Lim Chwee Ming & Hongliang Ren


[ arXiv ] [ Paper ]

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2020


The paper can be cited by using below bibtex.

@inproceedings{islam2020learning,
  title={Learning and reasoning with the graph structure representation in robotic surgery},
  author={Islam, Mobarakol and Seenivasan, Lalithkumar and Ming, Lim Chwee and Ren, Hongliang},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={627--636},
  year={2020},
  organization={Springer}
}

Abstract

Learning to infer graph representations and performing spatial reasoning in a complex surgical environment can play a vital role in surgical scene understanding in robotic surgery. For this purpose, we develop an approach to generate the scene graph and predict surgical interactions between instruments and surgical region of interest (ROI) during robot-assisted surgery. We design an attention link function and integrate with a graph parsing network to recognize the surgical interactions. To embed each node with corresponding neighbouring node features, we further incorporate SageConv into the network. The scene graph generation and active edge classification mostly depend on the embedding or feature extraction of node and edge features from complex image representation. Here, we empirically demonstrate the feature extraction methods by employing label smoothing weighted loss. Smoothing the hard label can avoid the over-confident prediction of the model and enhances the feature representation learned by the penultimate layer. To obtain the graph scene label, we annotate the bounding box and the instrument-ROI interactions on the robotic scene segmentation challenge 2018 dataset with an experienced clinical expert in robotic surgery and employ it to evaluate our propositions.

Surgical Scene Graph

  • The base interaction model "Graph Parsing Neural Networks" is adopted from repository and integrated with attention to improve the performance.
  • To improve the model calibration, we integrated label smoothing by following this repository

Proposed Architecture

features

Proposed architecture: Given a surgical scene, firstly, label smoothened features F are extracted. The network then outputs a parse graph based on the F. The attention link function predicts the adjacent matrix of the parse graph. The thicker edge indicates possible interaction between the node. The SageConv embeds each node with its neighbouring node features. A brighter node color represents the existence of neighbouring node features within a node. The message function updates the node values based on the sum of interaction features between neighbour nodes. The hidden features of each node are updated by the update node. The readout function predicts the node labels that signify the interaction type between the connected nodes.

Dataset

features

  • Images: Left camera images from 2018 robotic scene segmentation challenge are used in this work.
  • Bounding Box and interaction: Our annotations (Cite this paper when using these annotations.)
  • An interactive Colab notebook can be found on how to read the annotation of bounding box and tool-tissue interaction class Open In Colab

Feature Extraction:

  • To be released!

Download samples of evalution data features.The features of the evaluation data is not from this paper. You can follow our extended work [ICRA+RA-L2022 paper]([ arXiv ] |[ Paper ]|[Github ]) to get information about feature extraction.

Training

python main.py

Evaluation

python evaluation_metrics.py

Our Extended Works

  1. Global-Reasoned Multi-Task Model for Surgical Scene Understanding ([ arXiv ] |[ Paper ]|[Github ])

surgical_scenegraph_generation's People

Contributors

lalithjets avatar mobarakol avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

lalithjets bmemm

surgical_scenegraph_generation's Issues

Feature Extraction

Hi, I'm very interested in your work. Here are some issures when I use this code. The document in instrument_dataset.py {}_edge_features.npy is not found. I want to know how to generate this document. Could you add the Feature Extraction module in detail? Thanks very much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.