Publication

Mapping uncharted waters: exploratory analysis, visualization, and clustering of oceanographic data.

Source:

International Conference on Machine Learning Applications (2008)

Abstract:

In this paper we describe an interdisciplinary collaboration between researchers in machine learning and oceanog- raphy. The collaboration was formed to study the problem of open ocean biome classification. Biomes are regions on Earth with similar climate (e.g., temperature and rainfall) and vegetation structure (e.g., grasslands, coniferous forests, and deserts). To discover biomes in the open ocean, we apply leading methods in high dimensional data anal- ysis, clustering, and visualization to oceanographic measurements culled from multiple existing databases. We com- pare traditional approaches, such as k-means clustering and principal component analysis, to newer approaches such as Isomap and maximum variance unfolding. Our work provides the first quantitative classification of open ocean biomes from an automated statistical analysis of multivariate data. It also provides a valuable case study in the use (and misuse) of recently developed algorithms for high dimensional data analysis.