Comments (2)
We ultimately don't care about the typical strict architecture comparison found in other ML benchmarks. We care about measuring how good ML (any form of ML) is at OOD materials stability prediction. If some models (interatomic potentials) are trained on forces and therefore can leverage more of the maximum training set released with our benchmark (the entirety of the MP v2022.10.28 database release) then that's a genuine advantage of force-full models for the real-world application we care about and we want our benchmark to reflect that.
In short, we want to provide a walled garden for asking system level questions which a traditional ML benchmark is too rigid to answer. I believe we succeeded at that. Matbench Discovery clearly demonstrated that universal interatomic potentials emulating DFT relaxation are the winning methodology for high-throughput OOD materials stability prediction.
from matbench-discovery.
Also, training set size and overfitting are two different concepts which you seem to be conflating. What's more, the empirical evidence suggests overfitting is a non-issue with large models.
from matbench-discovery.
Related Issues (20)
- test_plots.py and test_preds.py failing HOT 4
- Package uses non-standard site-package paths for resources HOT 4
- df_summary.index contains nan values HOT 1
- fetch_process_wbm_dataset.py: data/wbm/2022-10-19-wbm-init-structs.json.bz2 does not exist
- compute_struct_fingerprints.py: cannot insert material_id, already exists
- fetch_process_wbm_dataset.py: Generating Aflow labels raised exception=KeyError('wyckoff_spglib') HOT 1
- Location of site-stats.json.gz
- Benchmark design questions HOT 15
- Obtain E_above_hull predictions HOT 10
- Reference: Critical examination of robustness and generalizability HOT 2
- Importing CSV with pd.read_json() HOT 3
- Simplified user interface HOT 1
- Pytorch module and virtual environment usage HOT 6
- dead link in contributing HOT 1
- fetch_process_wbm_dataset.py: bad JSON file checksum HOT 1
- Mismatching fingerprint paths HOT 1
- MIssing `"direct_url.json"` causes `JSONDecodeError`: Expecting value: line 1 column 1 (char 0)
- df_wbm has wrong index column name type for wandb.Table HOT 2
- Inconsistency in GNoME's F1 Scores on Matbench HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from matbench-discovery.