[HTML][HTML] An ensemble of convolutional neural networks for audio classification

L Nanni, G Maguolo, S Brahnam, M Paci�- Applied Sciences, 2021 - mdpi.com
Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it�…

Listen2cough: Leveraging end-to-end deep learning cough detection model to enhance lung health assessment using passively sensed audio

X Xu, E Nemati, K Vatanparvar, V Nathan…�- Proceedings of the�…, 2021 - dl.acm.org
The prevalence of ubiquitous computing enables new opportunities for lung health
monitoring and assessment. In the past few years, there have been extensive studies on�…

CNN-based learnable gammatone filterbank and equal-loudness normalization for environmental sound classification

H Park, CD Yoo�- IEEE Signal Processing Letters, 2020 - ieeexplore.ieee.org
For environmental sound classification (ESC), this letter presents a learnable auditory
filterbank based on a one-dimensional (1D) convolutional neural network with strong�…

Efficient end-to-end audio embeddings generation for audio classification on target applications

P Lopez-Meyer, JA del Hoyo Ontiveros…�- ICASSP 2021-2021�…, 2021 - ieeexplore.ieee.org
We describe a general-purpose end-to-end audio embeddings generator that can be easily
adapted to various acoustic scene and event classification applications. In contrast to many�…

Cat: Causal audio transformer for audio classification

X Liu, H Lu, J Yuan, X Li�- ICASSP 2023-2023 IEEE�…, 2023 - ieeexplore.ieee.org
The attention-based Transformers have been increasingly applied to audio classification
because of their global receptive field and ability to handle long-term dependency. However�…

Acoustic scene classification using deep learning-based ensemble averaging

J Huang, H Lu, P Lopez Meyer, H Cordourier… - 2019 - archive.nyu.edu
In our submission to the DCASE 2019 Task 1a, we have explored the use of four different
deep learning based neural networks architectures: Vgg12, ResNet50, AclNet, and�…

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

M Mohaimenuzzaman, C Bergmeir, B Meyer�- IEEE Access, 2022 - ieeexplore.ieee.org
Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…

A stethoscope for drones: Transformers-based methods for UAVs acoustic anomaly detection

OH Anidjar, A Barak, B Ben-Moshe, E Hagai…�- IEEE�…, 2023 - ieeexplore.ieee.org
Unmanned Aerial Vehicles and the increasing variety of their applications are raising in
popularity. The growing number of UAVs, emphasizes the significance of drones' reliability�…

Spectrogram transformers for audio classification

Y Zhang, B Li, H Fang, Q Meng�- 2022 IEEE International�…, 2022 - ieeexplore.ieee.org
Audio classification is an important task in the machine learning field with a wide range of
applications. Since the last decade, deep learning based methods have been widely used�…

Multi-modal anomaly detection by using audio and visual cues

AU Rehman, HS Ullah, H Farooq, MS Khan…�- IEEE�…, 2021 - ieeexplore.ieee.org
This paper considers the problem of anomaly detection in an outdoor environment where
surveillance cameras are usually installed to monitor activities of general public. A novel�…