Code Monkey home page Code Monkey logo

Comments (6)

singing-scientist avatar singing-scientist commented on May 20, 2024

It is likely an issue with the GTF file (e.g., non-UNIX line endings). I you can provide it, I'll take a look.

from snpgenie.

StephanieRodrigues avatar StephanieRodrigues commented on May 20, 2024

Aus004_gtf.zip
Here!
I don`t know whats going on, I solve the last issue changing the CDS as you told me, then I remove the double quotes. But still wrong!

from snpgenie.

singing-scientist avatar singing-scientist commented on May 20, 2024

I see; as discussed in the Troubleshooting of the SNPGenie documentation, the line endings in all your files must be Unix. However, the line endings here are Mac. You'll have to convert them to Unix in a program like TextWrangler (Mac) or Notepad++ (Windows). Let me know.

from snpgenie.

StephanieRodrigues avatar StephanieRodrigues commented on May 20, 2024

Hi Chase, Sorry for the delay!
So I restart my process, I ran gffcompare again, to convert my gff file in gtf file! And now, the gtf file seems to be ok. In my last issue, the program was not recognizing + strands because the word (CDS is missing). I solved this opening my file in Excel and replacing the names, but what happening is the line ending problem. (Your last reply in this issue here).
So now, after convert my gff file any of this problems happened, but know, this is the message:

The CDS coordinates for gene gene45 in the gtf file do not yield a set of complete codons,

or are absent from the file. The number of nucleotides must be a multiple of 3.

SNPGenie terminated.

I tried to delete all CDS lines with this problems, but I still having error. I saw other issue here that the guy had the same problem, but in his case, his GTF file had multiple transcripts for the same gene. I checked my GTF file but this not seems to be the problem.
I`m attaching you the GTF file.
Really sorry for a bunch of this issues, I really need to run your program.
GTF_Aus0004.zip

Regards,
Stephanie

from snpgenie.

singing-scientist avatar singing-scientist commented on May 20, 2024

Hi @StephanieRodrigues ! No worries. I checked gene45 and indeed, there are (50502-49545+1)/3 = 319.3333 codons, so this gene length is not a multiple of 3. There may be multiple "CDS" lines for the same gene (e.g., protein-coding exons), but alll "CDS" for each unique gene name must sum to a multiple of 3.

First, make sure CDS records for the same gene have the same gene name. If there are still problems with some genes, then the problematic (non-multiple of 3) genes should be removed. Let me know.

from snpgenie.

singing-scientist avatar singing-scientist commented on May 20, 2024

As this issue has been silent 26 days, I am closing it now. Please feel free to re-open if you have further issues.

from snpgenie.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.