Comments (6)
It is likely an issue with the GTF file (e.g., non-Unix line endings). If you can provide it, I'll take a look.
from snpgenie.
Aus004_gtf.zip
Here!
I don't know what's going on. I solved the last issue by changing the CDS as you told me, and then I removed the double quotes. But it is still wrong!
I see; as discussed in the Troubleshooting section of the SNPGenie documentation, the line endings in all your files must be Unix. However, the line endings here are Mac. You'll have to convert them to Unix in a program like TextWrangler (Mac) or Notepad++ (Windows). Let me know.
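If an editor such as TextWrangler or Notepad++ is not at hand, the same conversion can be sketched in a few lines of Python. This is only an illustration, not part of SNPGenie: the function names and file paths are hypothetical, and it assumes the file is small enough to read into memory.

```python
def to_unix_line_endings(data: bytes) -> bytes:
    """Convert Windows (CRLF) and classic Mac (CR) line endings to Unix (LF)."""
    # Replace CRLF first so the CR-only pass does not turn one break into two.
    return data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")


def convert_file(src_path: str, dst_path: str) -> None:
    """Write a copy of src_path to dst_path with Unix line endings."""
    # Binary mode keeps Python from translating newlines on its own.
    with open(src_path, "rb") as src:
        data = src.read()
    with open(dst_path, "wb") as dst:
        dst.write(to_unix_line_endings(data))
```

For example, `convert_file("input.gtf", "input_unix.gtf")` (both paths are placeholders) would leave the original file untouched and write the converted copy next to it.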
Hi Chase, Sorry for the delay!
So I restarted my process and ran gffcompare again to convert my GFF file to a GTF file, and now the GTF file seems to be OK. In my last issue, the program was not recognizing + strands because the word "CDS" was missing. I solved this by opening my file in Excel and replacing the names, but what happened then was the line-ending problem (your last reply in this issue).
So now, after converting my GFF file, none of these problems happened, but now this is the message:
The CDS coordinates for gene gene45 in the gtf file do not yield a set of complete codons,
or are absent from the file. The number of nucleotides must be a multiple of 3.
SNPGenie terminated.
I tried to delete all the CDS lines with this problem, but I am still getting the error. I saw another issue here where the user had the same problem, but in his case the GTF file had multiple transcripts for the same gene. I checked my GTF file, but this does not seem to be the problem.
I'm attaching the GTF file.
Really sorry for so many issues; I really need to run your program.
GTF_Aus0004.zip
Regards,
Stephanie
Hi @StephanieRodrigues! No worries. I checked gene45 and indeed, there are (50502-49545+1)/3 = 319.3333 codons, so the length of this gene is not a multiple of 3. There may be multiple "CDS" lines for the same gene (e.g., protein-coding exons), but the lengths of all "CDS" records for each unique gene name must sum to a multiple of 3.
First, make sure CDS records for the same gene have the same gene name. If there are still problems with some genes, then the problematic (non-multiple of 3) genes should be removed. Let me know.
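To find every problematic gene at once, the check described above can be sketched in Python. This is a rough illustration and not SNPGenie's own code: the function names are hypothetical, and it assumes a tab-delimited GTF with the feature type in column 3, 1-based inclusive start and end in columns 4 and 5, and a `gene_id` attribute (with or without double quotes) in column 9.

```python
import re
from collections import defaultdict

# Matches gene_id "gene45"; as well as the unquoted form gene_id gene45;
GENE_ID = re.compile(r'gene_id\s+"?([^";]+)"?')


def cds_lengths(gtf_lines):
    """Sum the lengths of all CDS records for each gene_id."""
    totals = defaultdict(int)
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.rstrip("\r\n").split("\t")
        if len(fields) < 9 or fields[2] != "CDS":
            continue  # only CDS records count toward codons
        match = GENE_ID.search(fields[8])
        if match is None:
            continue  # no gene_id on this record
        start, end = int(fields[3]), int(fields[4])
        # GTF coordinates are 1-based and inclusive, hence the +1.
        totals[match.group(1).strip()] += end - start + 1
    return dict(totals)


def incomplete_genes(gtf_lines):
    """Return genes whose total CDS length is not a multiple of 3."""
    return {g: n for g, n in cds_lengths(gtf_lines).items() if n % 3 != 0}
```

With the gene45 coordinates quoted above (49545 to 50502), the summed length is 958, which leaves a remainder of 1 when divided by 3, so gene45 would be reported. Passing an open file handle, e.g. `incomplete_genes(open("annotation.gtf"))` with a placeholder path, lists all such genes so they can be corrected or removed in one pass.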
As this issue has been silent for 26 days, I am closing it now. Please feel free to re-open if you have further issues.