We use kth order Markov Chain to solve sequence prediction problem in bioinformatics
genomes
contains ten reference genomestest.fa
contains several labeled sequence which can be used for model training or validation. The annotation file isseq_id.map
reads.fa
is the unlabeled short sequences used for prediction