tc-seq2seq's Introduction

Temporally Consistent Sequence-to-Sequence Translation of Cataract Surgeries

This is the official code repository for Temporally Consistent Sequence-to-Sequence Translation of Cataract Surgeries.

Setup

  • Create and activate a virtual environment (e.g. with conda or venv)
  • Install the required external packages with pip install -r requirements.txt
  • Install the custom repository package with pip install -e .

Training

  • The training scripts for each method used in the paper can be found in scripts/train/

Inference

  • Evaluation scripts can be found in scripts/eval/

Checkpoints

  • Model checkpoints can be found here.

Example Results

How-to Cite

If you use our research or the resources within this repository, please consider citing our work. Below is the recommended citation:

APA (American Psychological Association) Style:

Frisch, Y., Fuchs, M., & Mukhopadhyay, A. (2023).
Temporally consistent sequence-to-sequence translation of cataract surgeries.
International Journal of Computer Assisted Radiology and Surgery, 1-8.

BibTeX (for LaTeX users):

@article{frisch2023temporally,
  title={Temporally consistent sequence-to-sequence translation of cataract surgeries},
  author={Frisch, Yannik and Fuchs, Moritz and Mukhopadhyay, Anirban},
  journal={International Journal of Computer Assisted Radiology and Surgery},
  pages={1--8},
  year={2023},
  publisher={Springer}
}

tc-seq2seq's Issues

Inquiry on dataset configuration

Hello, @YFrisch!

First of all, thank you for sharing your outstanding research.

I wanted to train on a custom dataset, but I couldn't find information on this in the readme, so I've opened an issue.

  1. From the paper and supplementary material, it seems that datasets of a similar nature need to be included together to some extent. Does this mean that training with an unpaired dataset is difficult?

  2. In the code, I noticed that frames are extracted from video files (.mp4) to form a sequence. Could you tell me approximately how many frames one sequence contains, and whether it is okay to structure the dataset in this manner?

-train/
--domain A/
----seq1/
------seq1_frame001.png
------seq1_frame002.png
------ ...
----seq2/
------seq2_frame001.png
------seq2_frame002.png
------ ...
--domain B/
----seq1/
------seq1_frame001.png
------seq1_frame002.png
------ ...
----seq2/
------seq2_frame001.png
------seq2_frame002.png
------ ...
-val/
--domain A/
----seq3/
------seq3_frame001.png
------seq3_frame002.png
------ ...
----seq4/
------seq4_frame001.png
------seq4_frame002.png
------ ...
--domain B/
----seq3/
------seq3_frame001.png
------seq3_frame002.png
------ ...
----seq4/
------seq4_frame001.png
------seq4_frame002.png
------ ...

  3. Is it possible to construct multiple sequences from a single video? Also, what was the longest sequence in terms of frame count in the dataset you constructed?

I look forward to your response. Thank you once again.

Sincerely,

Jangbi
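
For readers with the same question, the folder layout proposed above and the idea of cutting several sequences from one video can be illustrated with a minimal sketch. This is not the repository's data loader: the function names list_sequences and sliding_windows, the *.png filter, and the length and stride parameters are all hypothetical, and the layout is the one suggested in this issue, not one confirmed by the authors.

```python
from pathlib import Path


def list_sequences(split_dir):
    """Map (domain, sequence) folder names to their sorted frame paths.

    Assumes the hypothetical layout split/domain/sequence/frame.png.
    Sorting is lexicographic, so frame numbers must be zero-padded.
    """
    sequences = {}
    for domain_dir in sorted(p for p in Path(split_dir).iterdir() if p.is_dir()):
        for seq_dir in sorted(p for p in domain_dir.iterdir() if p.is_dir()):
            frames = sorted(seq_dir.glob("*.png"))
            if frames:
                sequences[(domain_dir.name, seq_dir.name)] = frames
    return sequences


def sliding_windows(frames, length, stride=1):
    """Cut one ordered frame list into fixed-length, possibly overlapping clips.

    Returns an empty list when the sequence is shorter than `length`.
    """
    if length < 1 or stride < 1:
        raise ValueError("length and stride must be positive")
    last_start = len(frames) - length
    return [frames[i:i + length] for i in range(0, last_start + 1, stride)]
```

Under these assumptions, calling sliding_windows on the frame list of a single folder yields several training clips from one video; a stride smaller than the clip length makes the clips overlap. The clip length itself is not stated in this document and would have to come from the paper or the training scripts.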
