alcov's People
alcov's Issues
ImportError
Dear,
I tried the installation both suggested ways but I get error:
ImportError: cannot import name 'pywrapinit' from 'ortools.init'
I was thus wondering if this needs to be adapted in the code? Or if I need a particular python/pip version to use your tool?
Thank you
Model, query mutations and threshold questions
Dear Isaac,
Thank you for sharing your your solution to predict Pango lineage composition in a pooled wastewater sample.
Here are a couple of questions and suggestions.
From the code we see that you fit a linear model between selected mutations abundance in the inputs (Y) and mutation profiles defining selected Pango Lineages (X).
- Why only 12 lineages were selected for querying (
B.1.526.1
,B.1.427
,B.1.617.3
,B.1.617.2
,B.1.526
,B.1.429
,B.1.617
,B.1.1.7
,P.1
,B.1.617.1
,B.1.351
,B.1.526.2
,P.2
)? Could you provide logic on the mutations and lineages selection process? Why these were selected and not all ~1200+ Pango Lineages? - It seems that only lineages with
S:N501Y
mutation were selected, was there a reason for this? Why this filter is necessary? - How Pango lineage mutation profiles were defined? I see that you've used
Outbreak.Info
API queries by lineage such as https://api.outbreak.info/genomics/lineage-mutations?pangolin_lineage=B.1.1.7 to pull data. What that the case? - Does the future iteration of the model would consider multiple testing corrections and intrinsic hierarchical structure existing between lineages given rise to increased error rates and inflated beta coefficients?
- The beta coefficients of Panglo lineage terms are presented as abundances, but are just weights of the model terms. Do you plan also to assign p-values to terms and do some randomization (e.g. permutations) to improve reliability and stability of the prediction?
- Do you also plan to update
setup.py
to automatically setup dependenciesnumpy
,scikit-learn
,matplotlib
seaborn
andpysam
during install? - What if a given query position has more than one mutation? Are you only taking into account abundance of the lineage-associated SNV while disregarding other potential mutations?
- When selecting mutations "strongly" associated with Pango lineages
mut_lins[aa_m][l] > 0.5
, why global mutation prevalence of 0.5 was selected as a cutoff and not more higher values such as 0.9 in case you would like to look for "lineage semi-exclusive" mutations? Would you consider adding this as a command-line parameter? - Have you considered training a tree ensemble classifier (XGBoost, RF) or conditional inference forest (CIF) on the filtered GISAID input?
Good luck with this project and we would be glad to exchange ideas and test other approaches on lineages resolution from the mixed wastewater samples.
Kirill
Lineages not accepted
Hello,
I am having an issue where lineages added to the lineages.txt file are not accepted by alcov, for example XBB.1.22 I get the following error:
File "/usr/local/lib/python3.8/dist-packages/alcov/lineages.py", line 375, in
lin_mut_profiles = [[round(mut_lins[mut][lin]) for mut in covered_muts] for lin in lineages]
KeyError: 'Omicron (XBB.1.22-like)'
Thank you.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.