pcaseq's People
pcaseq's Issues
Make the # of Loci to read in a parameter
Allow the user to control how many loci are read in at the same time (currently set to 5000(
Allow non-Beta Weights
Make the weights parameter take a function and provide the Beta weight function as one built-in option. This means that the beta weight function needs to be added in to the name space, and the functions that take weights need to be modified. Probably also want to include a function to find the MAF as a convenience for users of the package.
The weighting function should take in a MAF and return a numeric value that is not Inf or NaN.
Add Testing for seqGRM and seqPCA
Add testing for these two functions. Currently the functions they call are tested.
NA Triggers Errors in checkMAF function
When the MAF is NA, an error triggers--something is wrong with the logic.
Return the actual SNPs used to calculate the GRM
Currently, seqPCA returns the SNPs in the data set, before filtering. Need to return the actual SNPs used to calculate the GRM. (requires changing the GRM calculation function to return the SNPs actually used).
Make popStruct handle genome data errors
Conditions to check:
- Letters instead of numbers
- Data isn't in 0,1,2 format
- Non-matrix (or at least cannot be converted to matrix)
- Missing genotypes
- Only one subject
- Only one locus
Check that missing values are properly handled if the missingness filter is not used
Looks like missing values are ignored unless they are filtered out---double check this
Improve test cases for grmCalc
Make these test cases more comprehensive.
Add Method to Restrict Dataset to a Subset of the Data Based on Allele Frequency
Add an argument to PopStructure that takes a vector of length 2 that sets the allele frequency of loci to keep in the data set.
Add option to parallelize the GRM and PCA calculations
Look into how this is done in SNPRelate and SeqArray/SeqVarTools
Update Package to be Compatible with SeqArray
Allow both SeqArray data structures and use the functions from SeqArray/SeqVarTools
Automatically Remove Monomorphic and Singleton Loci
Make the code that finds the GRM automatically remove monomorphic and singleton loci to improve computation time.
Add a MAF filter argument to popStruct
Allow the user to select a lower bound on the MAF; primarily for use with EIGENSTRAT method.
Fix SNP filtering functions
Currently, for each block of 5,000 SNPs, the SNP filtering functions throw an error if there is no remaining SNPs after filter. However, it's possible in a data set to get one block that has no remaining SNPs but still have a valid data set. Instead, for each block, there should be a warning to the user and after all the filtering is finished, check once to make sure there is still SNP data (and if not throw an error).
Make seqPCA return the sample id's even when a sample id argument isn't used
currently returns NULL
Draft a test case to verify that missing data is properly ignored in calculations
Verify that if a SNP has missing data, that the final GRM entry is not missing and is properly calculated. Create a test case for this.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.