[HTML][HTML] An ensemble of convolutional neural networks for audio classification
Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it�…
recognition. One important area in this field is environmental sound recognition, whether it�…
Listen2cough: Leveraging end-to-end deep learning cough detection model to enhance lung health assessment using passively sensed audio
The prevalence of ubiquitous computing enables new opportunities for lung health
monitoring and assessment. In the past few years, there have been extensive studies on�…
monitoring and assessment. In the past few years, there have been extensive studies on�…
CNN-based learnable gammatone filterbank and equal-loudness normalization for environmental sound classification
For environmental sound classification (ESC), this letter presents a learnable auditory
filterbank based on a one-dimensional (1D) convolutional neural network with strong�…
filterbank based on a one-dimensional (1D) convolutional neural network with strong�…
Efficient end-to-end audio embeddings generation for audio classification on target applications
P Lopez-Meyer, JA del Hoyo Ontiveros…�- ICASSP 2021-2021�…, 2021 - ieeexplore.ieee.org
We describe a general-purpose end-to-end audio embeddings generator that can be easily
adapted to various acoustic scene and event classification applications. In contrast to many�…
adapted to various acoustic scene and event classification applications. In contrast to many�…
Cat: Causal audio transformer for audio classification
The attention-based Transformers have been increasingly applied to audio classification
because of their global receptive field and ability to handle long-term dependency. However�…
because of their global receptive field and ability to handle long-term dependency. However�…
Acoustic scene classification using deep learning-based ensemble averaging
In our submission to the DCASE 2019 Task 1a, we have explored the use of four different
deep learning based neural networks architectures: Vgg12, ResNet50, AclNet, and�…
deep learning based neural networks architectures: Vgg12, ResNet50, AclNet, and�…
Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices
Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
to the Internet of Things (IoT), such as computer vision and machine listening. These�…
A stethoscope for drones: Transformers-based methods for UAVs acoustic anomaly detection
Unmanned Aerial Vehicles and the increasing variety of their applications are raising in
popularity. The growing number of UAVs, emphasizes the significance of drones' reliability�…
popularity. The growing number of UAVs, emphasizes the significance of drones' reliability�…
Spectrogram transformers for audio classification
Audio classification is an important task in the machine learning field with a wide range of
applications. Since the last decade, deep learning based methods have been widely used�…
applications. Since the last decade, deep learning based methods have been widely used�…
Multi-modal anomaly detection by using audio and visual cues
This paper considers the problem of anomaly detection in an outdoor environment where
surveillance cameras are usually installed to monitor activities of general public. A novel�…
surveillance cameras are usually installed to monitor activities of general public. A novel�…