TY - JOUR
T1 - Do We Exploit all Information for Counterfactual Analysis? Benefits of Factor Models and Idiosyncratic Correction
AU - Fan, Jianqing
AU - Masini, Ricardo
AU - Medeiros, Marcelo C.
N1 - Funding Information:
Fan’s research was supported by NSF grants DMS-1712591, DMS-2052926, DMS-2053832 and ONR grant N00014-19-1-2120. Masini’s and Medeiros’ research was partially supported by CNPq and CAPES. We are also in debt with Thiago Milagres for helping us with the dataset and all the team from the D-LAB@PUC-Rio for providing a superb research environment. The authors thank an associate editor and three anonymous referees for very insightful comments.
Publisher Copyright:
© 2022 American Statistical Association.
PY - 2022
Y1 - 2022
N2 - Optimal pricing, that is determining the price level that maximizes profit or revenue of a given product, is a vital task for the retail industry. To select such a quantity, one needs first to estimate the price elasticity from the product demand. Regression methods usually fail to recover such elasticities due to confounding effects and price endogeneity. Therefore, randomized experiments are typically required. However, elasticities can be highly heterogeneous depending on the location of stores, for example. As the randomization frequently occurs at the municipal level, standard difference-in-differences methods may also fail. Possible solutions are based on methodologies to measure the effects of treatments on a single (or just a few) treated unit(s) based on counterfactuals constructed from artificial controls. For example, for each city in the treatment group, a counterfactual may be constructed from the untreated locations. In this article, we apply a novel high-dimensional statistical method to measure the effects of price changes on daily sales from a major retailer in Brazil. The proposed methodology combines principal components (factors) and sparse regressions, resulting in a method called Factor-Adjusted Regularized Method for Treatment evaluation (FarmTreat). The data consist of daily sales and prices of five different products over more than 400 municipalities. The products considered belong to the sweet and candies category and experiments have been conducted over the years of 2016 and 2017. Our results confirm the hypothesis of a high degree of heterogeneity yielding very different pricing strategies over distinct municipalities. Supplementary materials for this article are available online.
AB - Optimal pricing, that is determining the price level that maximizes profit or revenue of a given product, is a vital task for the retail industry. To select such a quantity, one needs first to estimate the price elasticity from the product demand. Regression methods usually fail to recover such elasticities due to confounding effects and price endogeneity. Therefore, randomized experiments are typically required. However, elasticities can be highly heterogeneous depending on the location of stores, for example. As the randomization frequently occurs at the municipal level, standard difference-in-differences methods may also fail. Possible solutions are based on methodologies to measure the effects of treatments on a single (or just a few) treated unit(s) based on counterfactuals constructed from artificial controls. For example, for each city in the treatment group, a counterfactual may be constructed from the untreated locations. In this article, we apply a novel high-dimensional statistical method to measure the effects of price changes on daily sales from a major retailer in Brazil. The proposed methodology combines principal components (factors) and sparse regressions, resulting in a method called Factor-Adjusted Regularized Method for Treatment evaluation (FarmTreat). The data consist of daily sales and prices of five different products over more than 400 municipalities. The products considered belong to the sweet and candies category and experiments have been conducted over the years of 2016 and 2017. Our results confirm the hypothesis of a high degree of heterogeneity yielding very different pricing strategies over distinct municipalities. Supplementary materials for this article are available online.
KW - Counterfactual
KW - Demand estimation
KW - Factor models
KW - High-dimensional testing
KW - Optimal pricing
UR - http://www.scopus.com/inward/record.url?scp=85124122392&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124122392&partnerID=8YFLogxK
U2 - 10.1080/01621459.2021.2004895
DO - 10.1080/01621459.2021.2004895
M3 - Article
AN - SCOPUS:85124122392
SN - 0162-1459
VL - 117
SP - 574
EP - 590
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 538
ER -