Abstract
Rapidly predicting enzyme properties for catalyzing specific substrates is essential for identifying potential enzymes for industrial transformations. The demand for sustainable production of valuable industry chemicals utilizing biological resources raised a pressing need to speed up biocatalyst screening using machine learning techniques. In this research, we developed an all-purpose deep-learning-based multiple-toolkit (ALDELE) workflow for screening enzyme catalysts. ALDELE incorporates both structural and sequence representations of proteins, alongside representations of ligands by subgraphs and overall physicochemical properties. Comprehensive evaluation demonstrated that ALDELE can predict the catalytic activities of enzymes, and particularly, it identifies residue-based hotspots to guide enzyme engineering and generates substrate heat maps to explore the substrate scope for a given biocatalyst. Moreover, our models notably match empirical data, reinforcing the practicality and reliability of our approach through the alignment with confirmed mutation sites. ALDELE offers a facile and comprehensive solution by integrating different toolkits tailored for different purposes at affordable computational cost and therefore would be valuable to speed up the discovery of new functional enzymes for their exploitation by the industry.
Original language | English |
---|---|
Pages (from-to) | 3123–3139 |
Number of pages | 17 |
Journal | Journal of Chemical Information and Modeling |
Volume | 64 |
Issue number | 8 |
Early online date | 04 Apr 2024 |
DOIs | |
Publication status | Published - 22 Apr 2024 |
Fingerprint
Dive into the research topics of 'ALDELE: all-purpose deep learning toolkits for predicting the biocatalytic activities of enzymes'. Together they form a unique fingerprint.Datasets
-
ALDELE: All-Purpose Deep Learning Toolkits for Predicting the Biocatalytic Activities of Enzymes
Huang, M. (Owner), Queen's University Belfast, Mar 2024
http://Github.com/Xiangwen-Wang/ALDELE
Dataset