Hello, Dr. Xun Chen,
I have a question about the INFO column. According to the description, the "INFOR" field stands for "NAME,START,END,LEN,DIRECTION,STATUS". However, "START" is not always 1. Here are some examples from my results:
#CHROM POS ID REF ALT QUAL FILTER INFO
chr01 629412 . T <INS_MEI:TE_00011020_LTR#ClassII_DNA_Mutator_nMITE> . PASS CR=114;GR=0,0.656,0.745,0.633,0.659,0.444,0.333;GTF=YES;INFOR=TE_00011020_LTR#DNA/Mutator,571,839,269,+,5;SR=8
chr01 951226 . G <INS_MEI:TE_00005384#ClassII_DNA_hAT_MITE> . PASS CR=23;GR=0,0.581,0.359;GTF=YES;INFOR=TE_00005384#DNA/hAT,3526,6274,2749,+,5;SR=9
chr01 951675 . T <INS_MEI:TE_00003559_INT#ClassI_LTR_Copia> . PASS CR=5;GR=0,0.556,0.467;GTF=YES;INFOR=TE_00003559_INT#LTR/Copia,1836,1908,73,-,5;SR=7
chr01 986317 . A <INS_MEI:TE_00006433_LTR#ClassI_LTR_Gypsy> . PASS CR=9;GR=0,1,0.444,0.5;GTF=YES;INFOR=TE_00006433_LTR#LTR/Gypsy,6,2539,2534,+,5;SR=6
chr01 990068 . A <INS_MEI:TE_00011590_LTR#ClassI_LTR_Gypsy> . PASS CR=8;GR=0,0.611;GTF=YES;INFOR=TE_00011590_LTR#LTR/Gypsy,2024,2437,414,+,5;SR=3
I'm wondering how to determine the range of a TE insertion. For example, in the first record, POS is chr01:629412, while the "START" of the "INFOR" field is 571. Does that mean the actual start position of the insertion is 629412 + 571 and the end position is 629412 + 839? Or is the start position 629412 itself? In my current study, I need to obtain the start and end positions of TE insertions. I'm confused about this; could you help me clarify it?
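To make the field layout concrete, here is a minimal Python sketch of how I read the "INFOR" field, assuming the six comma-separated parts are in the order given in the description. The names `Infor` and `parse_infor` are my own and hypothetical, not part of the tool. The sketch makes no assumption about which coordinate system START and END refer to; it only checks that LEN equals END - START + 1, which holds for all five records above.

```python
from typing import NamedTuple


class Infor(NamedTuple):
    """The six parts of INFOR, in the order given by the field description."""
    name: str
    start: int
    end: int
    length: int
    direction: str
    status: str


def parse_infor(info: str) -> Infor:
    """Pull the INFOR entry out of a VCF INFO string and split it into its parts."""
    # INFO is a ';'-separated list of KEY=VALUE pairs.
    fields = dict(item.split("=", 1) for item in info.split(";") if "=" in item)
    # Split from the right so a comma inside the TE name could not shift the fields.
    name, start, end, length, direction, status = fields["INFOR"].rsplit(",", 5)
    return Infor(name, int(start), int(end), int(length), direction, status)


# INFO value of the first record above (chr01:629412).
info = ("CR=114;GR=0,0.656,0.745,0.633,0.659,0.444,0.333;GTF=YES;"
        "INFOR=TE_00011020_LTR#DNA/Mutator,571,839,269,+,5;SR=8")
rec = parse_infor(info)
print(rec.name, rec.start, rec.end, rec.length, rec.direction, rec.status)
print(rec.end - rec.start + 1 == rec.length)
```

So START, END and LEN are at least consistent with each other; what I cannot tell is how they relate to POS.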
Best wishes,
Rain