Evaluating Indices

An evaluation of the dashboard data revealed several interesting trends in the acoustic indices. Highlights are indicated below, and a more comprehensive summary can be found by downloading the Exploratory Summary.

Relationship with Environmental Data

We examined categories of acoustic indices to assess whether any consistent trends emerged by region, particularly focusing on whether similar habitats exhibited comparable correlations with water classes. However, our results did not reveal a clear pattern of consistency across similar environments. For instance, given the similarity in diel patterns between Biscayne Bay and May River, we initially hypothesized that these regions would exhibit comparable correlations with Seascapes water classes. Contrary to our expectations, Table 2 demonstrates that each region displayed distinctly different correlation profiles. Biscayne Bay showed strong positive correlations among many of the complexity indices, whereas May River presented variable, low to moderate correlations, suggesting that even within seemingly similar shallow water habitats, underlying environmental or ecological differences influence acoustic index relationships differently.

Table 2: Pearson correlation coefficient values for Biscayne Bay and May River for water class 15. Extreme green cells indicates large positive correlation, and a large negative value indicates a strong negative correlation.

To explore this further, we shifted our focus to per-index correlation measures per water class, aiming to determine whether at least specific acoustic index measurements maintained consistent correlations within particular water classes. This analysis involved plotting the distribution of mean index values, calculated across eight-day intervals to align with the temporal resolution of the remotely sensed water class data, and comparing these to the observed correlation coefficients. For example, Water Class 12, which comprises a substantial portion of the ONC-MEF and OOI-HYDBBA106 datasets (Figure 4), exhibited opposite extreme correlation coefficients between these datasets. We visualized the mean index values for the SNRt, HpairedShannon, and EVNtMean indices to interpret these trends. Our analysis revealed that increased index values were associated with lower extreme correlation values for both EVNtMean and SNRt (Figure 5). However, this pattern did not hold for HpairedShannon, which showed reduced index measures at the ONC-MEP site, despite a similarly strong negative correlation coefficient. Importantly, while specific trends within single water classes were occasionally evident, these associations did not translate consistently across other water classes. For instance, in Water Class 15 (Figure 6), high values of HpairedShannon did not correlate with a drastic decrease in correlation coefficients, demonstrating the complex and potentially site-specific nature of these relationships.

Figure 4: Composition of water class data for the period of March through April 2019 to give context in the representation of water classes in a region. Only February is incorporated in subsequent analyses. Water class 12 is only found in the OOI-HYDBBA106 and ONC-MEF datasets.

Figure 5: Plots of the distribution of mean index values associated with water class 12. Only datasets with profiles consistent of 5 or more cells of the specified water class are reported. Pearson correlation coefficients are reported for comparison to index measurements. Water class 12 is only represented in ONC-MEF and OOI-HYDBBA106. This figure displays measurements for EVNtMean (A), H_pairedShannon (B), and SNRt (C).

Figure 6: Distribution of mean H_pairedShannon measurements by dataset and associated correlation coefficient for water class 15. No relationship between correlation coefficients and measurements is noted across sites.

Initial Exploration of Multivariate Analyses

As a cursory investigation into the potential use of a combined index approach for evaluating the unique natures of these datasets, we applied two advanced analytical techniques to explore how a multi-index framework might enhance our understanding of marine soundscapes. This preliminary analysis aimed to determine whether combining multiple acoustic indices could reveal latent patterns and relationships not evident when indices are considered in isolation. The analytical methods employed included:

  • Principal Component Analysis (PCA): Performed on 10 select presentative acoustic indices from each index category (e.g., per site to reduce dimensionality and highlight key patterns.

  • K-Means Clustering: Classified sites into 8 groups based on their acoustic profiles, providing insights into site relationships and ecological significance.

We explored the natural structure of the acoustic indices by combining two complementary analytical techniques. We applied Principal Component Analysis (PCA) to the normalized indices to show that the multidimensional relationships could be mapped onto two principal dimensions (Figure 7). The dimensional reduction allowed us to identify the main gradients of variation. Each feature’s contribution to these principal components was carefully measured through loading coefficients, revealing which variables were most influential. In parallel, we introduced k-means clustering (k=8) to identify whether the natural groupings that emerge coincided with the geographical origins of the measured indices.

Figure 7: Principal component contributions of acoustic index to PC1 and PC2.

Our analysis revealed that k-means clustering patterns did not significantly correlate with geographical locations (Figure 8A). Interestingly, however, location groupings formed distinct clusters within the principal component space (Figure 8B). We observed that geographically proximate regions often displayed similar acoustic profiles, suggesting a spatial gradient in acoustic properties. This spatial coherence provides promising evidence that acoustic indices may effectively characterize different soundscapes, potentially offering a quantitative method for distinguishing between acoustic environments.

Figure 8: PCA plots by k-means clusters (A) and location clusters (B).

Visual representations of the Principal Component Analysis (PCA) and K-Means clustering results were generated to support the interpretation of clustering and site associations. These visualizations allow for an interactive exploration of the multivariate analysis, enabling users to compare K-Means clusters and site-specific clusters for the set of 10 indices utilized in the cursory review of multivariate methods. To provide an interactive perspective, these visualizations are available online at: Exploring BioSound Data.