Raw waveform-based audio classification using sample-level CNN architectures
Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…
because of their different signal characteristics. However, as the image domain grows�…
Comparison and analysis of SampleCNN architectures for audio classification
End-to-end learning with convolutional neural networks (CNNs) has become a standard
approach in image classification. However, in audio classification, CNN-based models that�…
approach in image classification. However, in audio classification, CNN-based models that�…
Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions
P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org
Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…
perception and cognition, learning hierarchical organizations of features. Analogous to�…
Rethinking CNN models for audio classification
In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…
as strong baseline networks for audio classification. Even though there is a significant�…
Learning environmental sounds with multi-scale convolutional neural network
B Zhu, C Wang, F Liu, J Lei, Z Huang…�- …�joint conference on�…, 2018 - ieeexplore.ieee.org
Deep learning has dramatically improved the performance of sounds recognition. However,
learning acoustic models directly from the raw waveform is still challenging. Current�…
learning acoustic models directly from the raw waveform is still challenging. Current�…
Aclnet: efficient end-to-end audio classification cnn
JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org
We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…
audio classification. When trained with our data augmentation and regularization, we�…
[HTML][HTML] An interpretable deep learning model for automatic sound classification
Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the�…
but their black-box structure makes it difficult to understand their inner workings and the�…
Multi-stream network with temporal attention for environmental sound classification
Environmental sound classification systems often do not perform robustly across different
sound classification tasks and audio signals of varying temporal structures. We introduce a�…
sound classification tasks and audio signals of varying temporal structures. We introduce a�…
Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices
Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network
While efficient architectures and a plethora of augmentations for end-to-end image
classification tasks have been suggested and heavily investigated, state-of-the-art�…
classification tasks have been suggested and heavily investigated, state-of-the-art�…