Code Monkey home page Code Monkey logo

Comments (9)

cdeboever3 avatar cdeboever3 commented on July 29, 2024

Looking at my older runs of WASP, it seems that reads were output multiple times in the fastq files in the past as well. I guess the bug here may be that this read pair doesn't make it through the filtering step even though it aligns to the same spot. I've added files to the zip that show this.

from wasp.

gmcvicker avatar gmcvicker commented on July 29, 2024

Hi Chris, thanks for the bug report, looking into this now...

from wasp.

gmcvicker avatar gmcvicker commented on July 29, 2024

I think I found the problem and have committed a fix here:
b1e8219

Thanks again for the bug report and let us know if you have any further issues.

from wasp.

cdeboever3 avatar cdeboever3 commented on July 29, 2024

Thanks Graham. It seems the results (e.g. whether a read pair is kept or not) from the mapping pipeline weren't affected by this bug right?

from wasp.

gmcvicker avatar gmcvicker commented on July 29, 2024

I am not 100% certain, but unfortunately I think that it could have
affected which paired end reads are filtered. I think that some PE reads
may have dropped out of the pipeline even though they could have been kept.

The other outstanding issue is that Step #5 (rmdup) does not currently
support PE reads. I am working to fix this now.

On Thu, Jun 25, 2015 at 12:50 PM, Christopher DeBoever <
[email protected]> wrote:

Thanks Graham. It seems the results (e.g. whether a read pair is kept or
not) from the mapping pipeline weren't affected by this bug right?


Reply to this email directly or view it on GitHub
#18 (comment).

from wasp.

gmcvicker avatar gmcvicker commented on July 29, 2024

It turns out the 'fix' I made was not correct and has created some issues with the PE reads. I have reverted to the old version and I am working on fixing the original issue (which was minor by comparison).

from wasp.

cdeboever3 avatar cdeboever3 commented on July 29, 2024

Sounds good, I was actually looking at the code last week although so far
I've mostly just added comments. I'm hoping to start refactoring a bit
tomorrow and adding in some unit tests.

On Mon, Jul 27, 2015 at 2:39 PM, Graham McVicker [email protected]
wrote:

It turns out the 'fix' I made was not correct and has created some issues
with the PE reads. I have reverted to the old version and I am working on
fixing the original issue (which was minor by comparison).


Reply to this email directly or view it on GitHub
#18 (comment).

from wasp.

cdeboever3 avatar cdeboever3 commented on July 29, 2024

I've been able to clean up the code a bit and add a lot of documentation and some tests (0170a01). I actually looked into this bug and it turns out it's not a bug. The two reads both overlap the SNP so the three possible read pairs are output. I added a test for the data I provided initially.

I can make a pull request, but I was also wondering if we could add an option to specify that the input bam file is already coordinate sorted? I can add that in before I make the pull request.

from wasp.

gmcvicker avatar gmcvicker commented on July 29, 2024

Hi Chris,

That changes and test look great. You are welcome to add an option to
indicate that the input bam is already sorted. Once you are ready to make a
pull request we can accept it.

Thanks a lot for your help!

Graham

On Mon, Aug 3, 2015 at 3:17 PM, Christopher DeBoever <
[email protected]> wrote:

I've been able to clean up the code a bit and add a lot of documentation
and some tests (0170a01
0170a01).
I actually looked into this bug and it turns out it's not a bug. The two
reads both overlap the SNP so the three possible read pairs are output. I
added a test for the data I provided initially.

I can make a pull request, but I was also wondering if we could add an
option to specify that the input bam file is already coordinate sorted? I
can add that in before I make the pull request.


Reply to this email directly or view it on GitHub
#18 (comment).

from wasp.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.