TY - JOUR
T1 - Machine Learning Prediction on Properties of Nanoporous Materials Utilizing Pore Geometry Barcodes
AU - Zhang, Xiangyu
AU - Cui, Jing
AU - Zhang, Kexin
AU - Wu, Jiasheng
AU - Lee, Yongjin
N1 - Publisher Copyright:
Copyright © 2019 American Chemical Society.
PY - 2019/11/25
Y1 - 2019/11/25
N2 - In this work, we propose a computational framework for machine learning prediction on structural and performance properties of nanoporous materials for methane storage application. For our machine learning prediction, two descriptors based on pore geometry barcodes were developed; one descriptor is a set of distances from a structure to the most diverse set in barcode space, and the second descriptor extracts and uses the most important features from the barcodes. First, to identify the optimal condition for machine learning prediction, the effects of training set preparation method, training set size, and machine learning models were investigated. Our analysis showed that kernel ridge regression provides the highest prediction accuracy, and randomly selected 5% structures of the entire set would work well as a training set. Our results showed that both descriptors accurately predicted performance and even structural properties of zeolites. Furthermore, we demonstrated that our approach predicts accurately properties of metal-organic frameworks, which might indicate the possibility of this approach to be easily applied to predict the properties of other types of nanoporous materials.
AB - In this work, we propose a computational framework for machine learning prediction on structural and performance properties of nanoporous materials for methane storage application. For our machine learning prediction, two descriptors based on pore geometry barcodes were developed; one descriptor is a set of distances from a structure to the most diverse set in barcode space, and the second descriptor extracts and uses the most important features from the barcodes. First, to identify the optimal condition for machine learning prediction, the effects of training set preparation method, training set size, and machine learning models were investigated. Our analysis showed that kernel ridge regression provides the highest prediction accuracy, and randomly selected 5% structures of the entire set would work well as a training set. Our results showed that both descriptors accurately predicted performance and even structural properties of zeolites. Furthermore, we demonstrated that our approach predicts accurately properties of metal-organic frameworks, which might indicate the possibility of this approach to be easily applied to predict the properties of other types of nanoporous materials.
UR - http://www.scopus.com/inward/record.url?scp=85075145259&partnerID=8YFLogxK
U2 - 10.1021/acs.jcim.9b00623
DO - 10.1021/acs.jcim.9b00623
M3 - Article
C2 - 31661958
AN - SCOPUS:85075145259
SN - 1549-9596
VL - 59
SP - 4636
EP - 4644
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
IS - 11
ER -