Google Scholar

Aclnet: efficient end-to-end audio classification cnn

JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org

We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…

Save Cite Cited by 64 Related articles All 2 versions View as HTML

[PDF] ieee.org

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

M Mohaimenuzzaman, C Bergmeir, B Meyer�- IEEE Access, 2022 - ieeexplore.ieee.org

Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…

Save Cite Cited by 19 Related articles All 8 versions

[PDF] arxiv.org

Rethinking CNN models for audio classification

K Palanisamy, D Singhania, A Yao�- arXiv preprint arXiv:2007.11154, 2020 - arxiv.org

In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…

Save Cite Cited by 193 Related articles All 4 versions View as HTML

[PDF] mit.edu

Ast: Audio spectrogram transformer

Y Gong, YA Chung, J Glass�- arXiv preprint arXiv:2104.01778, 2021 - arxiv.org

In the past decade, convolutional neural networks (CNNs) have been widely adopted as the
main building block for end-to-end audio classification models, which aim to learn a direct�…

Save Cite Cited by 799 Related articles All 9 versions View as HTML

[PDF] arxiv.org

LEAF: A learnable frontend for audio classification

N Zeghidour, O Teboul, FDC Quitry…�- arXiv preprint arXiv�…, 2021 - arxiv.org

Mel-filterbanks are fixed, engineered audio features which emulate human perception and
have been used through the history of audio understanding up to today. However, their�…

Save Cite Cited by 151 Related articles All 3 versions View as HTML

[HTML] mdpi.com

[HTML][HTML] An ensemble of convolutional neural networks for audio classification

L Nanni, G Maguolo, S Brahnam, M Paci�- Applied Sciences, 2021 - mdpi.com

Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it�…

Save Cite Cited by 98 Related articles All 10 versions Cached

[PDF] unitn.it

A comparison of deep learning inference engines for embedded real-time audio classification

D Stefani, S Peroni, L Turchet�- Proceedings of the International�…, 2022 - iris.unitn.it

Recent advancements in deep learning have shown great potential for audio applications,
improving the accuracy of previous solutions for tasks such as music transcription, beat�…

Save Cite Cited by 15 Related articles All 8 versions View as HTML

[PDF] arxiv.org

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh…�- IEEE/ACM Transactions�…, 2021 - ieeexplore.ieee.org

In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive�…

Save Cite Cited by 51 Related articles All 4 versions

[PDF] arxiv.org

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org

Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…

Save Cite Cited by 63 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Raw waveform-based audio classification using sample-level CNN architectures

J Lee, T Kim, J Park, J Nam�- arXiv preprint arXiv:1712.00866, 2017 - arxiv.org

Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…

Save Cite Cited by 96 Related articles All 4 versions View as HTML

Cite

Advanced search

Saved to My library

Aclnet: efficient end-to-end audio classification cnn

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

Rethinking CNN models for audio classification

Ast: Audio spectrogram transformer

LEAF: A learnable frontend for audio classification

[HTML][HTML] An ensemble of convolutional neural networks for audio classification

A comparison of deep learning inference engines for embedded real-time audio classification

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

Raw waveform-based audio classification using sample-level CNN architectures

Related searches