Code Monkey home page Code Monkey logo

Comments (8)

jchodera avatar jchodera commented on June 12, 2024

I don't think this is what we want. GPs are great for data in which there is a natural spatial relationship between the collected data, but that relationship must be learned. We are dealing with a very different case---we know what the relationship is, through the dissociation constant equations and mass conservation laws. Utilizing a GP of the sort in those examples would not only "forget" that information, but it doesn't allow us to propagate any uncertainty in which points are outliers into the posterior.

from assaytools.

jchodera avatar jchodera commented on June 12, 2024

Instead, I think we should use an approach like this, where there is a prior on the fraction of outliers and the outlier distribution has a mean and variance that is inferred (and marginalized out) during MCMC sampling:
http://www.astroml.org/book_figures/chapter8/fig_outlier_rejection.html

from assaytools.

jchodera avatar jchodera commented on June 12, 2024

But first, before we even talk about models, we absolutely need to collect some examples of the outliers and look at them to see what it tells us about the nature of the data.

from assaytools.

sonyahanson avatar sonyahanson commented on June 12, 2024

Just making a note here that this is something we should keep at the front of our minds.

from assaytools.

jchodera avatar jchodera commented on June 12, 2024

Agreed! Would be great to compile a list of data with outliers to find a strategy that works!

from assaytools.

sonyahanson avatar sonyahanson commented on June 12, 2024

Here is an example of two almost identical datasets with and without an outlier:

With outlier:
https://github.com/choderalab/fluorescence-assay-manuscript/blob/fig_sketches/analysis/bayes/DMSO-backfill/delG_Bosutinib-AB-2016-07-31%2020:10.png
https://github.com/choderalab/fluorescence-assay-manuscript/blob/fig_sketches/analysis/bayes/DMSO-backfill/Bosutinib-AB-2016-07-31%2020:10.json

Without outlier:
https://github.com/choderalab/fluorescence-assay-manuscript/blob/fig_sketches/analysis/bayes/DMSO-backfill/delG_Bosutinib-IJ-2016-07-31%2020:13.png
https://github.com/choderalab/fluorescence-assay-manuscript/blob/fig_sketches/analysis/bayes/DMSO-backfill/Bosutinib-IJ-2016-07-31%2020:13.json

from assaytools.

jchodera avatar jchodera commented on June 12, 2024

Awesome! This is exactly what we need to make this work! Thanks!

from assaytools.

sonyahanson avatar sonyahanson commented on June 12, 2024

@jchodera has an idea about Bayesian outlier detection that he is interested in implementing.

from assaytools.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.