TY - JOUR
T1 - Parallelization of Change Point Detection
AU - Song, Nancy
AU - Yang, Haw
N1 - Publisher Copyright:
© 2017 American Chemical Society.
PY - 2017/7/13
Y1 - 2017/7/13
N2 - The change point detection method (Watkins, L. P.; Yang, H. J. Phys. Chem. B 2005, 109, 617) allows the objective identification and isolation of abrupt changes along a data series. Because this method is grounded in statistical tests, it is particularly powerful for probing complex and noisy signals without artificially imposing a kinetics model. The original algorithm, however, has a time complexity of O(N2), where N is the size of the data and is, therefore, limited in its scalability. This paper puts forth a parallelization of change point detection to address these time and memory constraints. This parallelization method was evaluated by applying it to changes in the mean of Gaussian-distributed data and found that time decreases superlinearly with respect to the number of processes (i.e., parallelization with two processes takes less than half of the time of one process). Moreover, there was minimal reduction in detection power. These results suggest that our parallelization algorithm is a viable scheme that can be implemented for other change point detection methods.
AB - The change point detection method (Watkins, L. P.; Yang, H. J. Phys. Chem. B 2005, 109, 617) allows the objective identification and isolation of abrupt changes along a data series. Because this method is grounded in statistical tests, it is particularly powerful for probing complex and noisy signals without artificially imposing a kinetics model. The original algorithm, however, has a time complexity of O(N2), where N is the size of the data and is, therefore, limited in its scalability. This paper puts forth a parallelization of change point detection to address these time and memory constraints. This parallelization method was evaluated by applying it to changes in the mean of Gaussian-distributed data and found that time decreases superlinearly with respect to the number of processes (i.e., parallelization with two processes takes less than half of the time of one process). Moreover, there was minimal reduction in detection power. These results suggest that our parallelization algorithm is a viable scheme that can be implemented for other change point detection methods.
UR - http://www.scopus.com/inward/record.url?scp=85024928001&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85024928001&partnerID=8YFLogxK
U2 - 10.1021/acs.jpca.7b04378
DO - 10.1021/acs.jpca.7b04378
M3 - Article
C2 - 28616980
AN - SCOPUS:85024928001
SN - 1089-5639
VL - 121
SP - 5100
EP - 5109
JO - Journal of Physical Chemistry A
JF - Journal of Physical Chemistry A
IS - 27
ER -