Code Monkey home page Code Monkey logo

yc.bio's Introduction

YC.Bio

Tools for metagenomic assembly processing.

Build status:

Branch .NET Mono
Master Build status Build Status

yc.bio's People

Contributors

gorohovart avatar gsvgit avatar luninapolina avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

yc.bio's Issues

16s classification based on GLL GSS nodes distribution

Result of 16s parsing is a GSS. Seems that distribution of GSS nodes may be a good feature for filtering (of false positive) and classification.

  • Choose method for distribution analysis
  • Choose method for classification
  • Apply to linear sequence parsing
  • Apply to metagenomic analysis

16s images classification

Secondary structure of some sequences may contains information for classification, but stable metric for secondary structure of sequences with mutation is a problem. Images processing deals with noisy data and we want to use image classification techniques for 16s classification.

We can draw secondary structure of 16s, but images are very specific (presented in attachment). It is necessary to choose "the best" method for such images classification.
For basic experiments:

Examples of pictures:
ciib01000023 4139 7040
cmon01000010 50534 53692

Features extraction

Try to extract and visualize features from the trained network and map it back to sequence.

  • Implement a dense network visualization tool for visual investigation of features
  • Classical folded rna structure visualization (use existing tools for rna visualization)
  • Feynman diagram (use existing tools for rna visualization)

Replace FASTA reading/writing with functions from BioFSharp

Replace FASTA reading/writing with functions from BioFSharp. If we have some useful functionality, then move it to BioFSharp.

  • Propose to owners of BioFSharp to publish NuGet package. (Post an issue). Propose to help to create packaging scripts.
  • Replace FASTA processing with functions from BioFSharp package. If necessary then move some functions from our project to BioFSharp.

Improve repo

Improve this repo.

  • ProjectScaffold
  • CI build
    • Mono
    • .NET
  • Tests
  • Documentation (gh-pages)
  • Examples

Тестовая задача

Дана размеченная выборка изображений:

Провести эксперимент по классификации данных изображений (метод любой, инстументы любые). Составить отчёт об эксперименте. Ссылки на текст отчёта (предпочтительный формат -- pdf) и репозиторий с кодом, необходимым для воспроизведения эксперимента оставлять в комментариях к этой таске. Вопросы задавать там же.

Web service for tRNA classification

Create demo for our solution based on parsing and artificial neural networks.

  • Design and develop UI which provide an ability
    • to specify a sequence and get results of it classification
    • to specify a batch of sequences in the FASTA format and get results of classification
  • Develop server component for deployment in a cloud. Core of functionality is DNN (model is ready) which can classify sequences.
  • Create infrastructure for development automation: ci-build, testing, deployment

Тестовая задача. Весна 2019

Тестовая задача для проекта #10 .

В зависимости от подзадачи, на которую Вы претендуете, предоставьте ссылку на репозиторий GitHub (или любой другой), содержащий

  • Реализованный Вами web-сервис, развёрнутый в одном из облаков (Google, Microsoft, Amazon)
  • Реализованный Вами web-ui
  • Какой либо проект, при условии, что у данного репозитория идеальная инфраструктура.

Ваши ссылки можно оставлять здесь в комментариях. Там же можно задавать вопросы.

Experiments on chimeric processing

  • Create function for parametric chimeric sequences generation. The function should generate a file in FASTA format. Use SILVA or Greengenes as a source. Save metadata for each chimeric sequence.
  • Try to apply trained network to chimeric sequences.
  • If the quality of the previous step is low, then try to use chimeric sequences for training.

Тестовая задача

Опишите, пожалуйста, какие особенности вторичной структуры задаёт данная грамматика.

Предложите конструктивное, с точки зрения вторичной структуры, изменение грамматики. Представте изменённую грамматику в документе, опишите изменения в терминах вторичной структуры.

Описание языка, на котором написана грамматика, можно найти здесь.

Ответ оформлять в виде pdf-файла, ссылку на файл оставлять в коментариях к этой задаче.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.