This repository contains the code supplement to "Benchmark AFLOW Data Sets for Machine Learning" by CL Clement, SK Kauwe, and TD Sparks, published on 27 May 2020 in Integrating Materials and Manufacturing Innovation.
The data sets can be downloaded together as a ZIP file at https://doi.org/10.6084/m9.figshare.11954742
The code and downloadable data sets for this project are provided "as-is" under the MIT License, which allows for personal, public, and commercial use and modification. However, we ask that you please respect the license of the original data source (aflowlib.org) and refrain from using the data sets for proprietary or commercial purposes.
Functions for downloading compound property and crystal structure data via the AFLOW API. Property values are encoded as JSON files, and structure data as crystallographic information files (CIFs).
A script to split property CSV files into separate training, validation, and testing data sets.
A script for iterating over each compound, collecting data values by property, and grouping them into CSV files.
A single-column CSV listing the queryable material properties.
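
The grouping and splitting steps described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual scripts: the function names (`group_by_property`, `rows_to_csv`, `split_rows`), the in-memory record layout, the `compound`/`value` column headers, and the default split fractions are all assumptions made for the example.

```python
import csv
import io
import random
from collections import defaultdict


def group_by_property(compounds):
    """Collect (compound, value) pairs under each property name.

    `compounds` maps a compound identifier to a dict of property values.
    This layout is an assumption for illustration only.
    """
    grouped = defaultdict(list)
    for compound, properties in compounds.items():
        for name, value in properties.items():
            if value is not None:  # skip properties with no reported value
                grouped[name].append((compound, value))
    return dict(grouped)


def rows_to_csv(rows, header=("compound", "value")):
    """Serialize the rows for one property as CSV text (hypothetical header)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


def split_rows(rows, val_fraction=0.15, test_fraction=0.15, seed=0):
    """Shuffle rows with a fixed seed and split into train, validation, test."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    n_val = int(len(shuffled) * val_fraction)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test
```

Seeding the shuffle keeps the split reproducible across runs, which matters when the resulting sets are meant to serve as shared benchmarks.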