llrs / gseadv Goto Github PK
View Code? Open in Web Editor NEWPackage to analyse gene sets
License: Other
Package to analyse gene sets
License: Other
In this paper "On Entropy and Information in Gene Interaction Networks" they explore perturbations of a network, it has an associated package at ucsd-ccbb/gsia that could be interesting to use or to provide coverage for it or use for other analysis
A user request a method to compare the same GeneSet names of different GeneSetCollections (GSC), by counting the number of genes in each.
Could go on the line of compare(GSC1, GSC2)
and then look for same pathways/GeneSet names and compare the number in each case. Could return the total number of shared names and the differences between them in a data.frame.
Would it be expandable to more than two GSC? Yes if in the names of the data.frame we set the names of the GSC being compared and return NA if the term is not shared between two GSC.
Like: Terms GSC1_GSC2 GSC1_GSC3 GSC2_GSC3
This distribution seems interesting
From the https://github.com/llrs/integration-helper see the correct function (although it would require some tweaking to improve efficiency to the scales of gene sets)
Add to install from biocondutor (from), perhaps install first bioconductor then the other packages
I should suggest or introduce how to use the adjacency for creating networks, using the graph.adjacency
(idea from biostars)
Add the possibility to drop a relationship but without removing completely all the genes or GeneSets. The opposite of add (which really adds a relationship, and if not present it adds a Gene or a GeneSet.)
Idea taken from here (already saved the reference)
https://twitter.com/AchimZeileis/status/1064802740663132160
x <- diag(5) # Or any other matrix
table(x, row(x))
This could come handy for counting size of pathways, or the distribution of genes. Or the centrality of the elements.
This would need a bit more care like adding colnames
to the resulting tables or something alike
See this, to use for an article and as a examples or for tests
Interesting paper on local adjacency https://www.biorxiv.org/content/early/2018/11/04/460964.full.pdf
Add the original object stats with the simulations in Figure 3 of the README.
Also do the simulation with the number of genes.
While from*
creates the new GSC objects the should be a function that calculates how many combinations exists that hold this true.
Many function that simulate a GSC are quite slow (more than 300 iterations ~ 1 min) if they reach a solution in timely manner.
Proposed change: use dynamic programming, create the amount of data and remove when they are picked. It might improve time results.
While estimating it would be better to provide a parameter if we assume that the original object pass the check function or otherwise
This software can be implemented using the methods on this package.
At the moment it is only designed for GeneSetCollections, but I expect that it will be a need to cluster cell lines. As such it would be interesting to have a general set class in bioconductor and then a subclass "GeneSet", and equivalent "SetCollection" and the "GeneSetCollection".
This would require a feature request in https://github.com/bioconductor/GSEABase
For both gene.R and pathway.R it would be nice to introduce a helper, either shared or not.
It should help also on the case of one gene/pathway or multiples
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.