Google Scholar

D Stowell - PeerJ, 2022 - peerj.com

Animal vocalisations and natural soundscapes are fascinating objects of study, and contain
valuable evidence about animal behaviours, populations and ecosystems. They are studied …

Cited by 195

A review of automatic recognition technology for bird vocalizations in the deep learning era

J Xie, Y Zhong, J Zhang, S Liu, C Ding… - Ecological …, 2023 - Elsevier

Birds are considered critical indicators of ecosystem condition. Automatic recording devices
have emerged as a trending tool to assist field observations, contributing to biodiversity …

Cited by 49

[PDF] STC antispoofing systems for the ASVspoof2021 challenge

A Tomilov, A Svishchev, M Volkova… - Proc. ASVspoof 2021 …, 2021 - isca-archive.org

Abstract This paper describes Speech Technology Center (STC) antispoofing systems
submitted to the ASVspoof 2021 challenge in three tracks: logical access (LA), physical …

Cited by 82

HEAR: Holistic evaluation of audio representations

J Turian, J Shier, HR Khan, B Raj… - NeurIPS 2021 …, 2022 - proceedings.mlr.press

What audio embedding approach generalizes best to a wide range of downstream tasks
across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark …

Cited by 89

Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

Cited by 18

In search for a generalizable method for source free domain adaptation

M Boudiaf, T Denton… - International …, 2023 - proceedings.mlr.press

Source-free domain adaptation (SFDA) is compelling because it allows adapting an
off-the-shelf model to a new domain using only unlabelled data. In this work, we apply existing …

Cited by 12

UALF: A learnable front-end for intelligent underwater acoustic classification system

J Ren, Y Xie, X Zhang, J Xu - Ocean Engineering, 2022 - Elsevier

In practical ocean engineering application, variable target characteristics and inevitable
environmental noise will decrease the recognition accuracy of underwater acoustic …

Cited by 31

Audio splicing detection and localization based on acquisition device traces

DU Leonzio, L Cuccovillo, P Bestagini… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

In recent years, the multimedia forensic community has put a great effort in developing
solutions to assess the integrity and authenticity of multimedia objects, focusing especially …

Cited by 10

BAT: Block and token self-attention for speech emotion recognition

J Lei, X Zhu, Y Wang - Neural Networks, 2022 - Elsevier

Transformers have achieved great success in many artificial intelligence fields, such as
computer vision (CV), audio processing and natural language processing (NLP). In speech …

Cited by 20

Multimodal self-supervised learning of general audio representations

L Wang, P Luc, A Recasens, JB Alayrac… - arXiv preprint arXiv …, 2021 - arxiv.org

We present a multimodal framework to learn general audio representations from videos.
Existing contrastive audio representation learning methods mainly focus on using the audio …

Cited by 45

LEAF: A learnable frontend for audio classification

[HTML] Computational bioacoustics with deep learning: a review and roadmap