TY - JOUR
T1 - When B2is Not Enough
T2 - Evaluating Simple Metrics for Predicting Phase Separation of Intrinsically Disordered Proteins
AU - Oliver, Wesley W.
AU - Jacobs, William M.
AU - Webb, Michael A.
N1 - Publisher Copyright:
© 2025 American Chemical Society
PY - 2025/9/18
Y1 - 2025/9/18
N2 - Understanding and predicting the phase behavior of intrinsically disordered proteins (IDPs) is of significant interest due to their role in many biological processes. However, effectively characterizing phase behavior and its complex dependence on protein primary sequence remains challenging. In this study, we evaluate the efficacy of several simple computational metrics to quantify the propensity of single-component IDP solutions to phase separate; specific metrics considered include the single-chain radius of gyration, the second virial coefficient, and a newly proposed quantity termed the expenditure density. Each metric is computed using coarse-grained molecular dynamics simulations for 2,034 IDP sequences. Using machine learning, we analyze this data to understand how sequence features correlate with the predictive performance of each metric and to develop insight into their respective strengths and limitations. The expenditure density is determined to be a broadly useful metric that combines simplicity, low computational cost, and accuracy; it also provides a continuous measure that remains informative across both phase-separating and non-phase-separating sequences. Additionally, this metric shows promise in its ability to improve predictions of other properties for IDP systems. This work extends existing literature by advancing beyond binary classification, which can be useful for rapidly screening phase behavior or predicting other properties of IDP-related systems.
AB - Understanding and predicting the phase behavior of intrinsically disordered proteins (IDPs) is of significant interest due to their role in many biological processes. However, effectively characterizing phase behavior and its complex dependence on protein primary sequence remains challenging. In this study, we evaluate the efficacy of several simple computational metrics to quantify the propensity of single-component IDP solutions to phase separate; specific metrics considered include the single-chain radius of gyration, the second virial coefficient, and a newly proposed quantity termed the expenditure density. Each metric is computed using coarse-grained molecular dynamics simulations for 2,034 IDP sequences. Using machine learning, we analyze this data to understand how sequence features correlate with the predictive performance of each metric and to develop insight into their respective strengths and limitations. The expenditure density is determined to be a broadly useful metric that combines simplicity, low computational cost, and accuracy; it also provides a continuous measure that remains informative across both phase-separating and non-phase-separating sequences. Additionally, this metric shows promise in its ability to improve predictions of other properties for IDP systems. This work extends existing literature by advancing beyond binary classification, which can be useful for rapidly screening phase behavior or predicting other properties of IDP-related systems.
UR - https://www.scopus.com/pages/publications/105016549682
UR - https://www.scopus.com/pages/publications/105016549682#tab=citedBy
U2 - 10.1021/acs.jpcb.5c04955
DO - 10.1021/acs.jpcb.5c04955
M3 - Article
C2 - 40911789
AN - SCOPUS:105016549682
SN - 1520-6106
VL - 129
SP - 9551
EP - 9565
JO - Journal of Physical Chemistry B
JF - Journal of Physical Chemistry B
IS - 37
ER -