Code Monkey home page Code Monkey logo

Comments (3)

uqrmaie1 avatar uqrmaie1 commented on September 2, 2024 1

The admixtools methods don't use any information that is in VCF files but not in plink files. So even if you wouldn't explicitly convert VCF to plink, any information other than the 0/1/2 genotype calls would be discarded anyway if admixtools read VCFs directly. Allele polarization also doesn't matter for admixtools.
If you have VCF files and don't want to use any of that extra information to filter SNPs or genotypes, then you can just use plink or something similar to do the conversion.
If you do want to use that extra information in some way, then that's a task which is outside the scope of what admixtools does.

from admixtools.

uqrmaie1 avatar uqrmaie1 commented on September 2, 2024

No worries, glad you like it!

I'm not sure which issues you are thinking of in going from VCF to plink or eigenstrat formats, but I assume they have to do with the fact that there is not an unambiguous way to convert VCF to plink, with many edge cases that could be handled in several different ways. I doubt that I could implement this conversion any better than existing programs. So I think the best solution would be to use plink or a similar program to convert VCF to plink.

from admixtools.

yassineS avatar yassineS commented on September 2, 2024

Hi Robert,

Many thanks for your quick response. There are a number of issues associated with the conversation from VCF to plink and vice-versa. First of all, there's a huge information loss of all non-genotype data that the user might want to come back to. Also, there are a number of documented issues related to allele polarisation in plink, and datasets merging to list a few.

To read in a vcf, external libraries such as vcfR or PopGenome can be used.

from admixtools.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.