Principal Investigator (PI): Sara Graves, University of Alabama Huntsville
This project proposes to create a suite of specialized deployable data mining services designed specifically for NASA data, leveraging the Algorithm Development and Mining (ADaM) toolkit as the basis. The ADaM toolkit is a robust, mature and freely available science data mining toolkit that is being used by different research organizations and educational institutions worldwide.
These deployable services will give the scientific community a powerful and versatile data mining capability that can be used to create higher order products such as thematic maps from current and future NASA satellite data records with methods that are not currently available. The specialized data mining, pattern recognition, image processing and data preparation algorithms in ADaM are specifically geared towards satellite imagery, making these tools a perfect fit for NASA satellite data.
In addition to providing specialized deployable data mining services, the suite will also include the Earth Science Markup Language (ESML) giving users the power to handle a variety of heterogeneous data formats seamlessly. ESML is another proven technology that, like ADaM, is in use by organizations worldwide. The deployable package of mining and related services will be developed using web services standards so that community based measurement processing systems can access and interoperate with them.
The maturation of web services standards and technology sets the stage for a distributed Service-Oriented Architecture for NASA's next generation science data processing.
This architecture will allow members of the scientific community to create and combine persistent distributed data processing services and make them available to other users over the internet.
Two operational sites will be used for demonstrations of the proposed mining services in an SOA: the Distributed Information Services for Climate and Ocean Products and Visualizations for Earth Research (DISCOVER) passive microwave data pool at NASA's Global Hydrometeorology Resource Center Distributed Active Archive Center (GHRC DAAC) and Goddard Earth Sciences Data and Information Services Center (GES DISC). The ADaM web services will also be deployed for use in the SERVIR data repository and decision support system in the third year, if funding is available.