[HTML][HTML] An interpretable deep learning model for automatic sound classification

P Zinemanas, M Rocamora, M Miron, F Font, X Serra�- Electronics, 2021 - mdpi.com
Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the�…

Esresnet: Environmental sound classification based on visual domain models

A Guzhov, F Raue, J Hees…�- 2020 25th international�…, 2021 - ieeexplore.ieee.org
Environmental Sound Classification (ESC) is an active research area in the audio domain
and has seen a lot of progress in the past years. However, many of the existing approaches�…

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org
In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…

Deep convolutional neural networks and data augmentation for environmental sound classification

J Salamon, JP Bello�- IEEE Signal processing letters, 2017 - ieeexplore.ieee.org
The ability of deep convolutional neural networks (CNNs) to learn discriminative spectro-
temporal patterns makes them well suited to environmental sound classification. However�…

Spectrogram based multi-task audio classification

Y Zeng, H Mao, D Peng, Z Yi�- Multimedia Tools and Applications, 2019 - Springer
Audio classification is regarded as a great challenge in pattern recognition. Although audio
classification tasks are always treated as independent tasks, tasks are essentially related to�…

Training neural audio classifiers with few data

J Pons, J Serr�, X Serra�- ICASSP 2019-2019 IEEE�…, 2019 - ieeexplore.ieee.org
We investigate supervised learning strategies that improve the training of neural network
audio classifiers on small annotated collections. In particular, we study whether (i) a naive�…

Investigating multi-feature selection and ensembling for audio classification

M Turab, T Kumar, M Bendechache, T Saber�- arXiv preprint arXiv�…, 2022 - arxiv.org
Deep Learning (DL) algorithms have shown impressive performance in diverse domains.
Among them, audio has attracted many researchers over the last couple of decades due to�…

Cmkd: Cnn/transformer-based cross-model knowledge distillation for audio classification

Y Gong, S Khurana, A Rouditchenko…�- arXiv preprint arXiv�…, 2022 - arxiv.org
Audio classification is an active research area with a wide range of applications. Over the
past decade, convolutional neural networks (CNNs) have been the de-facto standard�…

End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network

A Gazneli, G Zimerman, T Ridnik, G Sharir…�- arXiv preprint arXiv�…, 2022 - arxiv.org
While efficient architectures and a plethora of augmentations for end-to-end image
classification tasks have been suggested and heavily investigated, state-of-the-art�…

Panns: Large-scale pretrained audio neural networks for audio pattern recognition

Q Kong, Y Cao, T Iqbal, Y Wang…�- …�on Audio, Speech�…, 2020 - ieeexplore.ieee.org
Audio pattern recognition is an important research topic in the machine learning area, and
includes several tasks such as audio tagging, acoustic scene classification, music�…