TY - JOUR
T1 - Divergence estimation of continuous distributions based on data-dependent partitions
AU - Wang, Qing
AU - Kulkarni, Sanjeev R.
AU - Verdú, Sergio
N1 - Funding Information:
Manuscript received November 11, 2004; revised April 27, 2005. This work was supported in part by ARL MURI under Grant DAAD19-00-1-0466, by Draper Laboratory under IR&D 6002 Grant DL-H-546263, and by the National Science Foundation under Grant CCR-0312413.
PY - 2005/9
Y1 - 2005/9
N2 - We present a universal estimator of the divergence D(P ∥ Q) for two arbitrary continuous distributions P and Q satisfying certain regularity conditions. This algorithm, which observes independent and identically distributed (i.i.d.) samples from both P and Q, is based on the estimation of the Radon-Nikodym derivative dP/dQ via a data-dependent partition of the observation space. Strong convergence of this estimator is proved with an empirically equivalent segmentation of the space. This basic estimator is further improved by adaptive partitioning schemes and by bias correction. The application of the algorithms to data with memory is also investigated. In the simulations, we compare our estimators with the direct plug-in estimator and estimators based on other partitioning approaches. Experimental results show that our methods achieve the best convergence performance in most of the tested cases.
AB - We present a universal estimator of the divergence D(P ∥ Q) for two arbitrary continuous distributions P and Q satisfying certain regularity conditions. This algorithm, which observes independent and identically distributed (i.i.d.) samples from both P and Q, is based on the estimation of the Radon-Nikodym derivative dP/dQ via a data-dependent partition of the observation space. Strong convergence of this estimator is proved with an empirically equivalent segmentation of the space. This basic estimator is further improved by adaptive partitioning schemes and by bias correction. The application of the algorithms to data with memory is also investigated. In the simulations, we compare our estimators with the direct plug-in estimator and estimators based on other partitioning approaches. Experimental results show that our methods achieve the best convergence performance in most of the tested cases.
KW - Bias correction
KW - Data-dependent partition
KW - Divergence
KW - Radon-Nikodym derivative
KW - Stationary and ergodic data
KW - Universal estimation of information measures
UR - http://www.scopus.com/inward/record.url?scp=26444495559&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=26444495559&partnerID=8YFLogxK
U2 - 10.1109/TIT.2005.853314
DO - 10.1109/TIT.2005.853314
M3 - Article
AN - SCOPUS:26444495559
SN - 0018-9448
VL - 51
SP - 3064
EP - 3074
JO - IEEE Transactions on Information Theory
JF - IEEE Transactions on Information Theory
IS - 9
ER -