[HTML][HTML] Computational bioacoustics with deep learning: a review and roadmap
D Stowell�- PeerJ, 2022 - peerj.com
Animal vocalisations and natural soundscapes are fascinating objects of study, and contain
valuable evidence about animal behaviours, populations and ecosystems. They are studied�…
valuable evidence about animal behaviours, populations and ecosystems. They are studied�…
A review of automatic recognition technology for bird vocalizations in the deep learning era
Birds are considered critical indicators of ecosystem condition. Automatic recording devices
have emerged as a trending tool to assist field observations, contributing to biodiversity�…
have emerged as a trending tool to assist field observations, contributing to biodiversity�…
[PDF][PDF] STC antispoofing systems for the ASVspoof2021 challenge
A Tomilov, A Svishchev, M Volkova…�- Proc. ASVspoof 2021�…, 2021 - isca-archive.org
Abstract This paper describes Speech Technology Center (STC) antispoofing systems
submitted to the ASVspoof 2021 challenge in three tracks: logical access (LA), physical�…
submitted to the ASVspoof 2021 challenge in three tracks: logical access (LA), physical�…
Hear: Holistic evaluation of audio representations
What audio embedding approach generalizes best to a wide range of downstream tasks
across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark�…
across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark�…
Audio deepfake detection: A survey
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the�…
aimed to study deepfake detection algorithms and achieved effective performance, the�…
In search for a generalizable method for source free domain adaptation
Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-
shelf model to a new domain using only unlabelled data. In this work, we apply existing�…
shelf model to a new domain using only unlabelled data. In this work, we apply existing�…
UALF: A learnable front-end for intelligent underwater acoustic classification system
J Ren, Y Xie, X Zhang, J Xu�- Ocean Engineering, 2022 - Elsevier
In practical ocean engineering application, variable target characteristics and inevitable
environmental noise will decrease the recognition accuracy of underwater acoustic�…
environmental noise will decrease the recognition accuracy of underwater acoustic�…
Audio splicing detection and localization based on acquisition device traces
In recent years, the multimedia forensic community has put a great effort in developing
solutions to assess the integrity and authenticity of multimedia objects, focusing especially�…
solutions to assess the integrity and authenticity of multimedia objects, focusing especially�…
BAT: Block and token self-attention for speech emotion recognition
J Lei, X Zhu, Y Wang�- Neural Networks, 2022 - Elsevier
Transformers have achieved great success in many artificial intelligence fields, such as
computer vision (CV), audio processing and natural language processing (NLP). In speech�…
computer vision (CV), audio processing and natural language processing (NLP). In speech�…
Multimodal self-supervised learning of general audio representations
We present a multimodal framework to learn general audio representations from videos.
Existing contrastive audio representation learning methods mainly focus on using the audio�…
Existing contrastive audio representation learning methods mainly focus on using the audio�…