Comparison and analysis of SampleCNN architectures for audio classification

T Kim, J Lee, J Nam�- IEEE Journal of Selected Topics in Signal�…, 2019 - ieeexplore.ieee.org
End-to-end learning with convolutional neural networks (CNNs) has become a standard
approach in image classification. However, in audio classification, CNN-based models that�…

Raw waveform-based audio classification using sample-level CNN architectures

J Lee, T Kim, J Park, J Nam�- arXiv preprint arXiv:1712.00866, 2017 - arxiv.org
Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…

[HTML][HTML] SampleCNN: End-to-end deep convolutional neural networks using very small filters for music classification

J Lee, J Park, KL Kim, J Nam�- Applied Sciences, 2018 - mdpi.com
Convolutional Neural Networks (CNN) have been applied to diverse machine learning tasks
for different modalities of raw data in an end-to-end fashion. In the audio domain, a raw�…

Rethinking CNN models for audio classification

K Palanisamy, D Singhania, A Yao�- arXiv preprint arXiv:2007.11154, 2020 - arxiv.org
In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…

Randomly weighted cnns for (music) audio classification

J Pons, X Serra�- …�2019-2019 IEEE international conference on�…, 2019 - ieeexplore.ieee.org
The computer vision literature shows that randomly weighted neural networks perform
reasonably as feature extractors. Following this idea, we study how non-trained (randomly�…

[HTML][HTML] Masked conditional neural networks for sound classification

F Medhat, D Chesmore, J Robinson�- Applied Soft Computing, 2020 - Elsevier
The remarkable success of deep convolutional neural networks in image-related
applications has led to their adoption also for sound processing. Typically the input is a time�…

Aclnet: efficient end-to-end audio classification cnn

JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org
We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…

A survey of audio classification using deep learning

K Zaman, M Sah, C Direkoglu, M Unoki�- IEEE Access, 2023 - ieeexplore.ieee.org
Deep learning can be used for audio signal classification in a variety of ways. It can be used
to detect and classify various types of audio signals such as speech, music, and�…

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org
In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org
Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…