Research Document 2023/050
A preliminary review of the efficacy of several acoustic autodetection algorithms to identify North Atlantic right whale calls, and recommendations for next steps to further assess and optimize these algorithms
By Lawson, J.
Abstract
Automated detection and classification of the vocalizations of North Atlantic right whale (NARW) and other marine mammals is a highly desirable goal for researchers and managers seeking to monitor areas for whale presence as the basis to implement mitigation measures. Such automated acoustic processing is particularly important for real-time monitoring approaches where there are large-scale acoustic data inputs.
All of the Detection and Classification Systems (DCSs) used by Fisheries and Oceans Canada (DFO) are expected to perform similarly well, given the metric (e.g., hours with calls/day) used to present NARW occurrence time-series. Previously, this was demonstrated by comparing performances of a variety of detectors during studies in 2004, 2013, and 2017. Spectroplotter (a commercial programme) and Low-Frequency Detection and Classification System (LFDCS), which are the two systems that have been used to analyse acoustic data in Newfoundland and Labrador (NL) and Maritimes regions, perform well; although in one small study the LFDCS detected more actual NARW upcalls than Spectroplotter, but also generated more false positives.
DCS performance is influenced by multiple factors, including the ambient noise levels relative to the characteristics of the NARW upcalls, the location of the hydrophone, the characteristics of the recorder instrumentation, software settings and thresholds, and other contextual features, such as the presence of other species. The next generation of DCSs will incorporate context into their logic (e.g., presence of other marine mammals or abiotic sound sources and signal-to-noise ratio [SNR]).
Algorithm comparisons are less crucial in the historic NARW analyses as the metrics in which the present detection results are presented at a large enough scale (“has there been NARW detected at this recorder location today?”) that slight differences in algorithm performance would be subsumed in the amalgamation and summation process.
At smaller spatial and temporal sampling scales, differences in algorithm performance become more apparent. Thorough testing of the different DCSs being used in Atlantic Canada would require a series of manually validated acoustic datasets from a representative set of locations, time frames, seasons, and recording hardware. Such a DCS comparison would be a useful activity but would require agreed upon performance metrices and thresholds for the DCS.
Accessibility Notice
This document is available in PDF format. If the document is not accessible to you, please contact the Secretariat to obtain another appropriate format, such as regular print, large print, Braille or audio version.
- Date modified: