Comments (13)
Hi,
Yes, the package came with a carefully selected pHMM file. You can provide a new one but need further modification into the script. So I suggest you use the TEfam.hmm unless you have a good reason to use others.
I have pushed a new release to GitHub, please use this version of LTR_retriever. It comes with more user-friendly status messages which can help to keep track of the program. Just save your paths file and replace everything else. Let me know if you still have difficulties to run it.
Shujun
from ltr_retriever.
Hi, I moved your last comment to a new thread because this is a different issue.
Shujun
from ltr_retriever.
Some additional info:
Some nonempty files were generated.
- ~50% of the input elements ended up in the *defalse file
- ~3% input elements are in the *pass.list files
- the *fa files are empty
from ltr_retriever.
Hi,
Thanks for using LTR_retriever and providing the above messages.
The parse failure is reported by hmmsearch. You can use -v to retain the intermediate files. Please send me the Salvinia_cucullata_v1.1.fa.retriever.scn.extend.fa.aa if possible.
If you see a lot of empty files, something is wrong. i.e., there is not a lot of LTR founded in the genome.
Best,
Shujun
from ltr_retriever.
Hey Shujun, I think I found the problem. For -TEhmm I was using the Dfam profile HMM database of repeats, which has a lot of nice profiles, but they are nucleotide profile HMMs. I am re-running again now with a few Pfam profiles.
from ltr_retriever.
Ok it still aborted without completing I think. This time I used the call:
LTR_retriever \
-genome Salvinia_cucullata_v1.1.fa \
-inharvest input/LTRHarvest.out \
-linelib input/LINEs.viridiplantae.fa \
-dnalib input/DNA_TEs.viridiplantae.fa \
-TEhmm /home/derstudent/data/other/pfam-hmms/retrotransposon/combined.hmm \
-threads 40 \
-v \
1>LTR_retriever.out
2>LTR_retriever.err
and got the error:
Attempt to free unreferenced scalar: SV 0x1a6a310, Perl interpreter: 0x1a667e0.
Use of uninitialized value $list[0] in pattern match (m//) at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 76.
Use of uninitialized value in split at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 79.
Use of uninitialized value in pattern match (m//) at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 79.
Use of uninitialized value $chr_pre in hash element at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 81.
Use of uninitialized value within %genome in length at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 81.
Use of uninitialized value $list[0] in pattern match (m//) at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 76.
Use of uninitialized value in split at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 79.
Use of uninitialized value in pattern match (m//) at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 79.
Use of uninitialized value $chr_pre in hash element at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 81.
Use of uninitialized value within %genome in length at /home/joshd/software/LTR_retriever/bin/call_seq_by_list.pl line 81.
No such file or directory at /home/joshd/software/LTR_retriever/bin/cleanup.pl line 50.
LOC list is empty.
Warning: [blastx] Query is Empty!
LOC list is empty.
No such file or directory at /home/joshd/software/LTR_retriever/bin/cleanup.pl line 50.
perl annotate_gff.pl lib.fa gff > anno.gff
from ltr_retriever.
Hi,
It's not suggestive to replace the TE.hmm file because they are hand-categorized. If you want to use Dfam you need to know which Dfam entry belongs to LTR which are not, and change the annotate_TE.pl script accordingly.
I am updating the program and hopefully it will provide more status messages for users to learn about the process.
Shujun
from ltr_retriever.
Ok thank you! The source of the illegal character error seems to be Dfam.hmm. I don't think I can use any Dfam profile HMMs because they don't have transition probabilities for amino acids. Unless there is a different version of the Dfam database that has protein profile HMMs.
Edit: I didn't realize LTR_retriever came with a profile HMM database (LTR_retriever/database/TEfam.hmm).
from ltr_retriever.
Hi Shujun,
The new release seems to not be recognizing the location of the external dependencies. I think it is this line, 193, but I am not fluent in perl and don't know what the unless -X "string" test does. I added some code and made sure that makeblastdb is in the path held by $blastplus and it is, and makeblastdb can execute from within the code. Do you know the problem? (and could you explain unless -X if you have a minute :-)
die "makeblastdb is not exist in the BLAST+ path $blastplus!\n" unless -X "${blastplus}makeblastdb";
from ltr_retriever.
from ltr_retriever.
Hmm I tried all three of these:
BLAST+=/share/apps/genomics/ncbi-blast-2.2.31+/bin/
BLAST+=/share/apps/genomics/ncbi-blast-2.2.31+/bin
BLAST+= # with makeblastdb in PATH
and each time it gives this error, with the trailing forward slash:
makeblastdb is not exist in the BLAST+ path /share/apps/genomics/ncbi-blast-2.2.31+/bin/!
from ltr_retriever.
Hi,
I fixed the bug and make the path reading module more sturdy. It should work with all three forms now.
Shujun
from ltr_retriever.
OK, the program runs for a little while and now gives this error:
ERROR: No such file or directory at /home/joshd/software/LTR_retriever/bin/cleanup.pl line 50.
ERROR: No such file or directory at /home/joshd/software/LTR_retriever/bin/cleanup.pl line 50.
Fatal Error:
Failed to open the database file
Program halted !!
Can't open Salvinia_cucullata_v1.1.fa.LTRlib.clust: No such file or directory.
ERROR: This script is written to convert fasta files into a prettier format.
Usage: fasta-reformat.pl input-fasta-file number-of-positions-per-line
ERROR: No such file or directory at /home/joshd/software/LTR_retriever/bin/annotate_gff.pl line 12.
Salvinia_cucullata_v1.1.fa.LTRlib.clust doesn't exist, and I ran LTR_retriever with -v
from ltr_retriever.
Related Issues (20)
- Repeatmodeler 、RepeatMasker and LTR_retriever HOT 4
- Serious error ⇒ Dependency checking: Error: The RMblast engine is not installed in RepeatMasker! HOT 3
- Can one use EDTA genome.mod.pass.list and genome.mod.EDTA_TEanno.out output in LAI HOT 3
- No candidate is found in the file(s) you specified HOT 2
- Use of uninitialized value $seq in string eq at call_seq_by_list.pl line 129 HOT 10
- Are LTRs (insertion time of 0) the likely cause of somatic variation? HOT 2
- LTR_retriever warning HOT 1
- Total length annotated by RepeatMasker is longer than LTR_retriever HOT 3
- RE: Masking LTR HOT 2
- A question about LAI in autopolyploid HOT 2
- How to discover "sequencing gaps" in genome assemblies. HOT 3
- ask for help? panLTR HOT 1
- Running Ltr_retriever...Died at /app/RepeatModeler-2.0.4/LTRPipeline line 693. HOT 2
- No tbl file issue HOT 5
- LAI is not applicable on the current genome assembly HOT 3
- conda install errors HOT 1
- I can´t install LTR_retriever via conda HOT 4
- Invalid value for shared scalar HOT 10
- awk: fatal: cannot open file `genome.fa.mod.retriever.scn.extend.fa.rexdb.cls.tsv' for reading (No such file or directory) HOT 1
- awk: cannot open N02.fa.retriever.scn.extend.fa.rexdb.cls.tsv HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ltr_retriever.