TY - JOUR
T1 - How to model the weather-migration link
T2 - a machine-learning approach to variable selection in the Mexico-U.S. context
AU - Molina, Mario D.
AU - Chau, Nancy
AU - Rodewald, Amanda D.
AU - Garip, Filiz
N1 - Publisher Copyright:
© 2022 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2023
Y1 - 2023
N2 - A growing body of research investigates how changes in weather shape individual choices about migration, yet highly variable results continue to challenge our understanding of the weather-migration nexus. We use a data-driven approach to identify which weather variables best predicted migration decisions of 54,986 individuals originating in Mexico between 1989 and 2016. Using supervised machine learning, we fit random forests to model migration choices based on individual, household, and community attributes in training data (three-fourths of the sample) from the Mexican Migration Project. We aggregated 36 annual weather variables at the community level and applied k-fold cross-validation to evaluate which models best predicted migration decisions. The top performing models were then applied to the test data (one-fourth of our sample). Three weather variables consistently out-performed others across models: minimum temperature during day, maximum temperature at night, and ‘growing degree days’–the number of days with optimal growth temperatures for corn (the major crop for most communities). Our results demonstrate that weather is related to individual choices about migration and illustrate the utility of using principled variable selection which revealed that both customized (growing degree days for a particular crop) and generic (max-min temperatures) metrics can be predictive of migration behaviors.
AB - A growing body of research investigates how changes in weather shape individual choices about migration, yet highly variable results continue to challenge our understanding of the weather-migration nexus. We use a data-driven approach to identify which weather variables best predicted migration decisions of 54,986 individuals originating in Mexico between 1989 and 2016. Using supervised machine learning, we fit random forests to model migration choices based on individual, household, and community attributes in training data (three-fourths of the sample) from the Mexican Migration Project. We aggregated 36 annual weather variables at the community level and applied k-fold cross-validation to evaluate which models best predicted migration decisions. The top performing models were then applied to the test data (one-fourth of our sample). Three weather variables consistently out-performed others across models: minimum temperature during day, maximum temperature at night, and ‘growing degree days’–the number of days with optimal growth temperatures for corn (the major crop for most communities). Our results demonstrate that weather is related to individual choices about migration and illustrate the utility of using principled variable selection which revealed that both customized (growing degree days for a particular crop) and generic (max-min temperatures) metrics can be predictive of migration behaviors.
KW - Mexico
KW - Migration
KW - climate change
KW - random forests
KW - supervised machine learning
UR - http://www.scopus.com/inward/record.url?scp=85123330689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123330689&partnerID=8YFLogxK
U2 - 10.1080/1369183X.2022.2100549
DO - 10.1080/1369183X.2022.2100549
M3 - Article
AN - SCOPUS:85123330689
SN - 1369-183X
VL - 49
SP - 465
EP - 491
JO - Journal of Ethnic and Migration Studies
JF - Journal of Ethnic and Migration Studies
IS - 2
ER -