Google Scholar

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org

Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…

Save Cite Cited by 63 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Raw waveform-based audio classification using sample-level CNN architectures

J Lee, T Kim, J Park, J Nam�- arXiv preprint arXiv:1712.00866, 2017 - arxiv.org

Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…

Save Cite Cited by 96 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Rethinking CNN models for audio classification

K Palanisamy, D Singhania, A Yao�- arXiv preprint arXiv:2007.11154, 2020 - arxiv.org

In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…

Save Cite Cited by 193 Related articles All 4 versions View as HTML

[PDF] arxiv.org

End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network

A Gazneli, G Zimerman, T Ridnik, G Sharir…�- arXiv preprint arXiv�…, 2022 - arxiv.org

While efficient architectures and a plethora of augmentations for end-to-end image
classification tasks have been suggested and heavily investigated, state-of-the-art�…

Save Cite Cited by 32 Related articles All 2 versions View as HTML

[PDF] arxiv.org

Aclnet: efficient end-to-end audio classification cnn

JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org

We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…

Save Cite Cited by 64 Related articles All 2 versions View as HTML

[PDF] mit.edu

Ast: Audio spectrogram transformer

Y Gong, YA Chung, J Glass�- arXiv preprint arXiv:2104.01778, 2021 - arxiv.org

In the past decade, convolutional neural networks (CNNs) have been widely adopted as the
main building block for end-to-end audio classification models, which aim to learn a direct�…

Save Cite Cited by 799 Related articles All 9 versions View as HTML

[PDF] arxiv.org

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org

In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…

Save Cite Cited by 51 Related articles All 4 versions

[PDF] arxiv.org

Tiny transformers for environmental sound classification at the edge

D Elliott, CE Otero, S Wyatt, E Martino�- arXiv preprint arXiv:2103.12157, 2021 - arxiv.org

With the growth of the Internet of Things and the rise of Big Data, data processing and
machine learning applications are being moved to cheap and low size, weight, and power�…

Save Cite Cited by 24 Related articles All 2 versions View as HTML

[PDF] arxiv.org

Randomly weighted cnns for (music) audio classification

J Pons, X Serra�- …�2019-2019 IEEE international conference on�…, 2019 - ieeexplore.ieee.org

The computer vision literature shows that randomly weighted neural networks perform
reasonably as feature extractors. Following this idea, we study how non-trained (randomly�…

Save Cite Cited by 116 Related articles All 4 versions

Comparison and analysis of SampleCNN architectures for audio classification

T Kim, J Lee, J Nam�- IEEE Journal of Selected Topics in Signal�…, 2019 - ieeexplore.ieee.org

End-to-end learning with convolutional neural networks (CNNs) has become a standard
approach in image classification. However, in audio classification, CNN-based models that�…

Save Cite Cited by 96 Related articles All 3 versions

Cite

Advanced search

Saved to My library

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

Raw waveform-based audio classification using sample-level CNN architectures

Rethinking CNN models for audio classification

End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network

Aclnet: efficient end-to-end audio classification cnn

Ast: Audio spectrogram transformer

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

Tiny transformers for environmental sound classification at the edge

Randomly weighted cnns for (music) audio classification

Comparison and analysis of SampleCNN architectures for audio classification

Related searches