Code Monkey home page Code Monkey logo

mktests_jy's People

Contributors

jayoung avatar

Watchers

 avatar

mktests_jy's Issues

fix plotting when seqs contain Ns

some plots do NOT have anything in gray for the polymorphisms.

it seems like it fails when the alignment contains Ns, but it's fine once the Ns are removed

Add methods tab to output

Add methods tab to spreadsheet to capture parameters and warnings, perhaps also a tab to name the seqs in each group

it could get really cluttered. one methods tab PER analysis? one seqs tab PER analysis? (we already have one sites tab per analysis)

check case-sensitivity

double-check code can handle upper and lower case alignments

should be just making everything uppercase before I even start

add trim option

To help troubleshooting, make a trim function, to consider only some portions of the alignment (and it might be useful in general, e.g. to look only at some domains while retaining original alignment coordinates)

Excel export rounding / NA / numbers stored as text issue

would be nice to round more columns.

I did it for the unpolarized results (asked Excel to SHOW the rounded values, didn't actually round them) but I didn't do it for the polarized results

how am I using the pop1alias and pop2alias values? I thought those would go in colnames of the excel export, but maybe not

clarify outgroup treatment

I wrote this in MKfunctions.R: "for now we are treating all outgroups (e.g. yak, sec) as one big group"

is that true? I thought I allowed the possibility of a hierarchical outgroup list

in any case, clarify in the README.md how to deal with outgroups

add discussion of reconstructing ancestors

add this discussion to README.md

Ching-Ho takes a different approach when polarizing:
use multiple outgroup species (about 5? includes eugracilis), reconstruct the mel-sim ancestor. Use the ancestor instead of sim (or instead of mel) in my R scripts to run the unpolarized code. This is equivalent to getting the mel-branch-only MK result.

he uses MEGA for this: make a tree from the alignment (nucleotide level, not codon level), reconstruct ancestors, and it will spit out the most likely nucleotide at each site. Alternatively, you could also ask it to tell you probabilities, and perhaps replace any very uncertain positions with N.

he's also curious about IQtree to reconstruct ancestors. PAML is another possibility.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.