TY - JOUR
T1 - The Matsu Wheel
T2 - a reanalysis framework for Earth satellite imagery in data commons
AU - Patterson, Maria T.
AU - Anderson, Nikolas
AU - Bennett, Collin
AU - Bruggemann, Jacob
AU - Grossman, Robert L.
AU - Handy, Matthew
AU - Ly, Vuong
AU - Mandl, Daniel J.
AU - Pederson, Shane
AU - Pivarski, James
AU - Powell, Ray
AU - Spring, Jonathan
AU - Wells, Walt
AU - Xia, John
N1 - Funding Information:
Project Matsu is an Open Commons Consortium (OCC)-sponsored project supported by the Open Science Data Cloud. The source code and documentation are made available on GitHub at ( https://github.com/LabAdvComp/matsu-project ). This work was supported in part by grants from Gordon and Betty Moore Foundation and the National Science Foundation (Grant OISE 1129076 and CISE 1127316). The Earth Observing-1 satellite image is courtesy of the Earth Observing-1 project team at NASA Goddard Space Flight Center. The EarthExplorer cloud coverage calculations are available from the US Geological Survey on earthexplorer.usgs.gov.
Funding Information:
Project Matsu is an Open Commons Consortium (OCC)-sponsored project supported by the Open Science Data Cloud. The source code and documentation are made available on GitHub at (https://github.com/LabAdvComp/matsu-project). This work was supported in part by grants from Gordon and Betty Moore Foundation and the National Science Foundation (Grant OISE 1129076 and CISE 1127316). The Earth Observing-1 satellite image is courtesy of the Earth Observing-1 project team at NASA Goddard Space Flight Center. The EarthExplorer cloud coverage calculations are available from the US Geological Survey on earthexplorer.usgs.gov.
Publisher Copyright:
© 2017, The Author(s).
PY - 2017/12/1
Y1 - 2017/12/1
N2 - Project Matsu is a collaboration between the Open Commons Consortium and NASA focused on developing open source technology for the cloud-based processing of Earth satellite imagery and for detecting fires and floods to help support natural disaster detection and relief. We describe a framework for efficient analysis and reanalysis of large amounts of data called the Matsu “Wheel” and the analytics used to process hyperspectral data produced daily by NASA’s Earth Observing-1 (EO-1) satellite. The wheel is designed to be able to support scanning queries using cloud computing applications, such as Hadoop and Accumulo. A scanning query processes all, or most, of the data in a database or data repository. In contrast, standard queries typically process a relatively small percentage of the data. The wheel is a framework in which multiple scanning queries are grouped together and processed in turn, over chunks of data from the database or repository. Over time, the framework brings all data to each group of scanning queries. With this approach, contention and the overall time to process all scanning queries can be reduced. We describe our Wheel analytics, including an anomaly detector for rare spectral signatures or anomalies in hyperspectral data and a land cover classifier that can be used for water and flood detection. The resultant products of the analytics are made accessible through an API for further distribution. The Matsu Wheel allows many shared data services to be performed together to efficiently use resources for processing hyperspectral satellite image data and other, e.g., large environmental datasets that may be analyzed for many purposes.
AB - Project Matsu is a collaboration between the Open Commons Consortium and NASA focused on developing open source technology for the cloud-based processing of Earth satellite imagery and for detecting fires and floods to help support natural disaster detection and relief. We describe a framework for efficient analysis and reanalysis of large amounts of data called the Matsu “Wheel” and the analytics used to process hyperspectral data produced daily by NASA’s Earth Observing-1 (EO-1) satellite. The wheel is designed to be able to support scanning queries using cloud computing applications, such as Hadoop and Accumulo. A scanning query processes all, or most, of the data in a database or data repository. In contrast, standard queries typically process a relatively small percentage of the data. The wheel is a framework in which multiple scanning queries are grouped together and processed in turn, over chunks of data from the database or repository. Over time, the framework brings all data to each group of scanning queries. With this approach, contention and the overall time to process all scanning queries can be reduced. We describe our Wheel analytics, including an anomaly detector for rare spectral signatures or anomalies in hyperspectral data and a land cover classifier that can be used for water and flood detection. The resultant products of the analytics are made accessible through an API for further distribution. The Matsu Wheel allows many shared data services to be performed together to efficiently use resources for processing hyperspectral satellite image data and other, e.g., large environmental datasets that may be analyzed for many purposes.
KW - Data commons
KW - Earth satellite data
KW - Reanalysis framework
UR - http://www.scopus.com/inward/record.url?scp=85075507098&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85075507098&partnerID=8YFLogxK
U2 - 10.1007/s41060-017-0052-3
DO - 10.1007/s41060-017-0052-3
M3 - Article
AN - SCOPUS:85075507098
SN - 2364-415X
VL - 4
SP - 251
EP - 264
JO - International Journal of Data Science and Analytics
JF - International Journal of Data Science and Analytics
IS - 4
ER -