Comments (6)
I can confirm it's not avoiding any collapses, because if I reduce the ptable to omit 1, i.e.
A:1 B:1 0.0 0.0 0.95 0.05
B:1 C:1 0.0 0.0 0.95 0.05
A:1 C:1 0.99 0.0 0.01 0.0
then it correctly finds
B:1 EquivalentTo C:1 (most probable) 0.95
A:1 EquivalentTo B:1 (most probable) 0.95
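For readers unfamiliar with the ptable layout, here is a minimal sketch of how the "most probable" relation per row can be read off. The column order (ProperSubClassOf, ProperSuperClassOf, EquivalentTo, SiblingOf) is an assumption inferred from the values and results quoted in this thread, and `parse_ptable` and `most_probable` are hypothetical helper names, not part of boomer.

```python
# Assumed column order for the four probabilities in each ptable row.
RELATIONS = ("ProperSubClassOf", "ProperSuperClassOf", "EquivalentTo", "SiblingOf")

PTABLE = """\
A:1 B:1 0.0 0.0 0.95 0.05
B:1 C:1 0.0 0.0 0.95 0.05
A:1 C:1 0.99 0.0 0.01 0.0
"""

def parse_ptable(text):
    """Split each row into (left term, right term, list of four probabilities)."""
    rows = []
    for line in text.strip().splitlines():
        left, right, *probs = line.split()
        rows.append((left, right, [float(p) for p in probs]))
    return rows

def most_probable(row):
    """Return (left, relation, right, probability) for the highest-probability column."""
    left, right, probs = row
    best_index = max(range(len(probs)), key=probs.__getitem__)
    return left, RELATIONS[best_index], right, probs[best_index]

best = [most_probable(row) for row in parse_ptable(PTABLE)]
for left, relation, right, prob in best:
    print(left, relation, right, prob)
```

Under that assumed column order, the first two rows favour EquivalentTo at 0.95 and the third favours ProperSubClassOf at 0.99.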
from boomer.
I think the issue here is the high number of "windows" requested (100). Input rows are sorted by their best probability, and the sorted list is then chunked into the requested number of windows. In each independent run, shuffling occurs within each window, but the windows themselves stay in the same overall order, so it will always add A ProperSubClassOf C first. If you use a window value of 1, the rows are completely randomized and it is able to find the best solution.
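The effect described above can be illustrated with a rough sketch. This is not boomer's code: `windowed_order` is a hypothetical function, and the chunking rule (equal-sized slices of the sorted list) is an assumption, so the real tool may form its windows differently.

```python
import math
import random

def windowed_order(rows, window_count, rng):
    """Sort rows by best probability, chunk into windows, shuffle inside each window.

    rows: list of (label, best_probability) pairs.
    The slice size is an assumed rule: ceil(len(rows) / window_count), at least 1.
    """
    ranked = sorted(rows, key=lambda row: row[1], reverse=True)
    size = max(1, math.ceil(len(ranked) / window_count))
    ordered = []
    for start in range(0, len(ranked), size):
        window = ranked[start:start + size]
        rng.shuffle(window)      # order varies only within a window
        ordered.extend(window)   # windows themselves keep their sorted order
    return ordered

rows = [
    ("A:1 EquivalentTo B:1", 0.95),
    ("B:1 EquivalentTo C:1", 0.95),
    ("A:1 ProperSubClassOf C:1", 0.99),
]

# Which axiom gets added first across 200 differently seeded runs?
firsts_100 = {windowed_order(rows, 100, random.Random(seed))[0][0] for seed in range(200)}
firsts_1 = {windowed_order(rows, 1, random.Random(seed))[0][0] for seed in range(200)}
print(firsts_100)
print(firsts_1)
```

With many windows, every run in this sketch starts from the 0.99 row; with a single window, any of the three rows can come first.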
from boomer.
See the logging at the beginning of a run (with 100 windows requested):
2021.02.05 14:32:54:070 [zio-def...] [INFO ] org.monarchinitiative.boomer.Boom.evaluate:30 - Bin size: 1; Most probable: 0.99
2021.02.05 14:32:54:091 [zio-def...] [INFO ] org.monarchinitiative.boomer.Boom.evaluate:30 - Bin size: 2; Most probable: 0.95
2021.02.05 14:32:54:095 [zio-def...] [INFO ] org.monarchinitiative.boomer.Boom.evaluate:33 - Max possible joint probability: -0.11263692462860261
The axioms in the first bin will always be added before proceeding to the next bin. Different runs will just shuffle the order of the two items in the second bin.
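As a quick check on the logged "Max possible joint probability", the figure can be reproduced by summing the logs of the best probability in each row. That the value is a sum of natural logs is an assumption, chosen here because it matches the log line.

```python
import math

# Best probability per row: 0.99 (first bin), then 0.95 twice (second bin).
best_probabilities = [0.99, 0.95, 0.95]

# Joint log probability if every row could take its most probable relation.
max_joint_log = sum(math.log(p) for p in best_probabilities)
print(max_joint_log)  # roughly -0.11264
```

This is an upper bound only: it is reached only if all three most probable axioms can hold together.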
from boomer.
my ticket is in error... more later
from boomer.
I think we cleared this up. "windows" may not be as obvious as they ought to be but I think the UI will continue to evolve.
from boomer.
still an issue
A:1 B:1 0.0 0.0 0.95 0.05
B:1 C:1 0.0 0.0 0.95 0.05
A:1 C:1 0.99 0.0 0.01 0.0
running
boomer -t triangle.ptable.tsv -a triangle.owl -p prefixes.yaml -r 500 -w 1 -e 200 --output-internal-axioms true
yields
## SINGLETONS
Method: singletons
Score: -0.05129329438755058
Estimated probability: 1.0
Confidence: 1.0
Subsequent scores (max 10):
- [B:1](http://purl.obolibrary.org/obo/B_1) EquivalentTo [C:1](http://purl.obolibrary.org/obo/C_1) (most probable) 0.95
and an incoherent output.ofn
from boomer.