Google Scholar

Raw waveform-based audio classification using sample-level CNN architectures

J Lee, T Kim, J Park, J Nam�- arXiv preprint arXiv:1712.00866, 2017 - arxiv.org

Music, speech, and acoustic scene sound are often handled separately in the audio domain
because of their different signal characteristics. However, as the image domain grows�…

Save Cite Cited by 96 Related articles All 4 versions View as HTML

Comparison and analysis of SampleCNN architectures for audio classification

T Kim, J Lee, J Nam�- IEEE Journal of Selected Topics in Signal�…, 2019 - ieeexplore.ieee.org

End-to-end learning with convolutional neural networks (CNNs) has become a standard
approach in image classification. However, in audio classification, CNN-based models that�…

Save Cite Cited by 96 Related articles All 3 versions

[PDF] arxiv.org

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

P Verma, J Berger�- arXiv preprint arXiv:2105.00335, 2021 - arxiv.org

Over the past two decades, CNN architectures have produced compelling models of sound
perception and cognition, learning hierarchical organizations of features. Analogous to�…

Save Cite Cited by 63 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Rethinking CNN models for audio classification

K Palanisamy, D Singhania, A Yao�- arXiv preprint arXiv:2007.11154, 2020 - arxiv.org

In this paper, we show that ImageNet-Pretrained standard deep CNN models can be used
as strong baseline networks for audio classification. Even though there is a significant�…

Save Cite Cited by 193 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Learning environmental sounds with multi-scale convolutional neural network

B Zhu, C Wang, F Liu, J Lei, Z Huang…�- …�joint conference on�…, 2018 - ieeexplore.ieee.org

Deep learning has dramatically improved the performance of sounds recognition. However,
learning acoustic models directly from the raw waveform is still challenging. Current�…

Save Cite Cited by 83 Related articles All 4 versions

[PDF] arxiv.org

Aclnet: efficient end-to-end audio classification cnn

JJ Huang, JJA Leanos�- arXiv preprint arXiv:1811.06669, 2018 - arxiv.org

We propose an efficient end-to-end convolutional neural network architecture, AclNet, for
audio classification. When trained with our data augmentation and regularization, we�…

Save Cite Cited by 64 Related articles All 2 versions View as HTML

[HTML] mdpi.com

[HTML][HTML] An interpretable deep learning model for automatic sound classification

P Zinemanas, M Rocamora, M Miron, F Font, X Serra�- Electronics, 2021 - mdpi.com

Deep learning models have improved cutting-edge technologies in many research areas,
but their black-box structure makes it difficult to understand their inner workings and the�…

Save Cite Cited by 47 Related articles All 6 versions Cached

[PDF] arxiv.org

Multi-stream network with temporal attention for environmental sound classification

X Li, V Chebiyyam, K Kirchhoff�- arXiv preprint arXiv:1901.08608, 2019 - arxiv.org

Environmental sound classification systems often do not perform robustly across different
sound classification tasks and audio signals of varying temporal structures. We introduce a�…

Save Cite Cited by 84 Related articles All 8 versions View as HTML

[PDF] ieee.org

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

M Mohaimenuzzaman, C Bergmeir, B Meyer�- IEEE Access, 2022 - ieeexplore.ieee.org

Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…

Save Cite Cited by 19 Related articles All 8 versions

[PDF] arxiv.org

End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network

A Gazneli, G Zimerman, T Ridnik, G Sharir…�- arXiv preprint arXiv�…, 2022 - arxiv.org

While efficient architectures and a plethora of augmentations for end-to-end image
classification tasks have been suggested and heavily investigated, state-of-the-art�…

Save Cite Cited by 32 Related articles All 2 versions View as HTML

Cite

Advanced search

Saved to My library

Raw waveform-based audio classification using sample-level CNN architectures

Comparison and analysis of SampleCNN architectures for audio classification

Audio transformers: Transformer architectures for large scale audio understanding. adieu convolutions

Rethinking CNN models for audio classification

Learning environmental sounds with multi-scale convolutional neural network

Aclnet: efficient end-to-end audio classification cnn

[HTML][HTML] An interpretable deep learning model for automatic sound classification

Multi-stream network with temporal attention for environmental sound classification

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

End-to-end audio strikes back: Boosting augmentations towards an efficient audio classification network

Related searches