Comments (4)
@nsevilla great, looking forward to your PR.
I think it would be better that this test is a standalone test in DESCQA. For one, as you mentioned, there are potential validation data sets and criteria that we can implement, if at a later time we deem that it is important that the catalogs have certain reasonable color-color distribution. In addition, the DESCQA framework allows running several tests at once, so there's no need to put many tests into one.
from descqa.
@nsevilla : I would say that you're guaranteed a horrible chi-squared. The errors in the distribution should be dominated by counting statistics and N is very large.
Your proposed statistic would also be very sensitive to small shifts in the overall colors of objects in data vs. simulations (as that will shift objects from bin-to-bin) as well as the choice of binning. I would suggest that we would want to do something that is unbinned and that applies shifts to align distributions before comparison. In 1D, that is all well-defined, but that is not true in 2D (e.g. there is not a true 2D analog of the K-S test).
from descqa.
Sounds good to get me started @yymao . It runs under the descqa already in my version at NERSC as I said in the hack day. Will tidy it up and make a PR.
Concerning a possible test (I know it's not required any more), a possibility would be binning the SDSS 2D data, creating some jackknife errors in each bin, and check statistical compatibility... I think it is going to be far from a good chi2 in general, but nonetheless, maybe to check quantitatively the impact of any changes in the catalogs (this process can be a bit computationally expensive though but could be optional).
But, if only plots are required, wouldn't this be better part of the general suite of tests done by readiness_test.py for instance?
from descqa.
closed by #88
from descqa.
Related Issues (20)
- Update CheckColors test to be compatible with DM outputs HOT 5
- Tree ring test
- validate instance catalogs to filter out offending AGN HOT 3
- number counts test updates HOT 6
- Update DESCQA web app's landing page HOT 4
- Ability to "tag" a run after the run is complete
- `sklearn.cluster.k_means` not working when `n_jobs=-1` is set HOT 1
- README is pointing to jupyter-dev
- Segmentation fault running some correlation function tests HOT 2
- Shear Test fails due to OSError: libgfortran.so.3: cannot open shared object file: No such file or directory HOT 7
- shear_test fails due to camb attribute error
- Move from project/projecta to cfs
- Python Environment Name Change stack => desc HOT 1
- Need to update versions of gsl and cray-fftw in run_master.sh HOT 2
- Set HDF5_USE_FILE_LOCKING=FALSE in run_master.sh
- Make DESCQA compatible with new desc-python environment HOT 6
- Is there a way to pass `external_data_dir` as an argument? HOT 5
- Revert to using the desc-python env at NERSC
- Experiment with generalizing tests beyond GCRCatalogs HOT 3
- Versioning of releases and plans any plans for an updated release? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from descqa.