Comparison and analysis of SampleCNN architectures for audio classification
End-to-end learning with convolutional neural networks (CNNs) has become a standard
approach in image classification. However, in audio classification, CNN-based models that�…
approach in image classification. However, in audio classification, CNN-based models that�…
Raw waveform-based audio classification using sample-level CNN architectures
Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…
because of their different signal characteristics. However, as the image domain grows�…
[HTML][HTML] SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw�…
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw�…
Rethinking CNN models for audio classification
In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…
as strong baseline networks for audio classification. Even though there is a significant�…
Randomly weighted cnns for (music) audio classification
The computer vision literature shows that randomly weighted neural networks perform
reasonably as feature extractors. Following this idea, we study how non-trained (randomly�…
reasonably as feature extractors. Following this idea, we study how non-trained (randomly�…
[HTML][HTML] Masked conditional neural networks for sound classification
F Medhat, D Chesmore, J Robinson�- Applied Soft Computing, 2020 - Elsevier
The remarkable success of deep convolutional neural networks in image-related
applications has led to their adoption also for sound processing. Typically the input is a time�…
applications has led to their adoption also for sound processing. Typically the input is a time�…
Aclnet: efficient end-to-end audio classification cnn
JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org
We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…
audio classification. When trained with our data augmentation and regularization, we�…
A survey of audio classification using deep learning
Deep learning can be used for audio signal classification in a variety of ways. It can be used
to detect and classify various types of audio signals such as speech, music, and�…
to detect and classify various types of audio signals such as speech, music, and�…
Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks
K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org
In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…
Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions
P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org
Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…
perception and cognition, learning hierarchical organizations of features. Analogous to�…