Aclnet: efficient end-to-end audio classification cnn
JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org
We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…
audio classification. When trained with our data augmentation and regularization, we�…
Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices
Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
Rethinking CNN models for audio classification
In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…
as strong baseline networks for audio classification. Even though there is a significant�…
Ast: Audio spectrogram transformer
In the past decade, convolutional neural networks (CNNs) have been widely adopted as the
main building block for end-to-end audio classification models, which aim to learn a direct�…
main building block for end-to-end audio classification models, which aim to learn a direct�…
LEAF: A learnable frontend for audio classification
Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their�…
have been used through the history of audio understanding up to today. However, their�…
[HTML][HTML] An ensemble of convolutional neural networks for audio classification
Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it�…
recognition. One important area in this field is environmental sound recognition, whether it�…
A comparison of deep learning inference engines for embedded real-time audio classification
Recent advancements in deep learning have shown great potential for audio applications,
improving the accuracy of previous solutions for tasks such as music transcription, beat�…
improving the accuracy of previous solutions for tasks such as music transcription, beat�…
Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks
K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org
In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…
Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions
P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org
Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…
perception and cognition, learning hierarchical organizations of features. Analogous to�…
Raw waveform-based audio classification using sample-level CNN architectures
Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…
because of their different signal characteristics. However, as the image domain grows�…