[HTML][HTML] Computational bioacoustics with deep learning: a review and roadmap

D Stowell�- PeerJ, 2022 - peerj.com
Animal vocalisations and natural soundscapes are fascinating objects of study, and contain
valuable evidence about animal behaviours, populations and ecosystems. They are studied�…

A review of automatic recognition technology for bird vocalizations in the deep learning era

J Xie, Y Zhong, J Zhang, S Liu, C Ding…�- Ecological�…, 2023 - Elsevier
Birds are considered critical indicators of ecosystem condition. Automatic recording devices
have emerged as a trending tool to assist field observations, contributing to biodiversity�…

[PDF][PDF] STC antispoofing systems for the ASVspoof2021 challenge

A Tomilov, A Svishchev, M Volkova…�- Proc. ASVspoof 2021�…, 2021 - isca-archive.org
Abstract This paper describes Speech Technology Center (STC) antispoofing systems
submitted to the ASVspoof 2021 challenge in three tracks: logical access (LA), physical�…

Hear: Holistic evaluation of audio representations

J Turian, J Shier, HR Khan, B Raj…�- NeurIPS 2021�…, 2022 - proceedings.mlr.press
What audio embedding approach generalizes best to a wide range of downstream tasks
across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark�…

Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang…�- arXiv preprint arXiv�…, 2023 - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the�…

In search for a generalizable method for source free domain adaptation

M Boudiaf, T Denton…�- International�…, 2023 - proceedings.mlr.press
Source-free domain adaptation (SFDA) is compelling because it allows adapting an off-the-
shelf model to a new domain using only unlabelled data. In this work, we apply existing�…

UALF: A learnable front-end for intelligent underwater acoustic classification system

J Ren, Y Xie, X Zhang, J Xu�- Ocean Engineering, 2022 - Elsevier
In practical ocean engineering application, variable target characteristics and inevitable
environmental noise will decrease the recognition accuracy of underwater acoustic�…

Audio splicing detection and localization based on acquisition device traces

DU Leonzio, L Cuccovillo, P Bestagini…�- IEEE Transactions�…, 2023 - ieeexplore.ieee.org
In recent years, the multimedia forensic community has put a great effort in developing
solutions to assess the integrity and authenticity of multimedia objects, focusing especially�…

BAT: Block and token self-attention for speech emotion recognition

J Lei, X Zhu, Y Wang�- Neural Networks, 2022 - Elsevier
Transformers have achieved great success in many artificial intelligence fields, such as
computer vision (CV), audio processing and natural language processing (NLP). In speech�…

Multimodal self-supervised learning of general audio representations

L Wang, P Luc, A Recasens, JB Alayrac…�- arXiv preprint arXiv�…, 2021 - arxiv.org
We present a multimodal framework to learn general audio representations from videos.
Existing contrastive audio representation learning methods mainly focus on using the audio�…