Skip to main content

Showing 1–50 of 66 results for author: Pham, C

  1. arXiv:2407.14726  [pdf, other

    cs.CV cs.LG

    MetaAug: Meta-Data Augmentation for Post-Training Quantization

    Authors: Cuong Pham, Hoang Anh Dung, Cuong C. Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Post-Training Quantization (PTQ) has received significant attention because it requires only a small set of calibration data to quantize a full-precision model, which is more practical in real-world applications in which full access to a large training set is not available. However, it often leads to overfitting on the small calibration dataset. Several methods have been proposed to address this i… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  2. arXiv:2407.08792  [pdf, other

    cs.CR

    ProxyGPT: Enabling Anonymous Queries in AI Chatbots with (Un)Trustworthy Browser Proxies

    Authors: Dzung Pham, Jade Sheffey, Chau Minh Pham, Amir Houmansadr

    Abstract: AI-powered chatbots (ChatGPT, Claude, etc.) require users to create an account using their email and phone number, thereby linking their personally identifiable information to their conversational data and usage patterns. As these chatbots are increasingly being used for tasks involving sensitive information, privacy concerns have been raised about how chatbot providers handle user data. To addres… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  3. arXiv:2407.02721  [pdf, ps, other

    cs.LG cs.CV

    Model and Feature Diversity for Bayesian Neural Networks in Mutual Learning

    Authors: Cuong Pham, Cuong C. Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Bayesian Neural Networks (BNNs) offer probability distributions for model parameters, enabling uncertainty quantification in predictions. However, they often underperform compared to deterministic neural networks. Utilizing mutual learning can effectively enhance the performance of peer BNNs. In this paper, we propose a novel approach to improve BNNs performance through deep mutual learning. The p… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to NeurIPS 2023

  4. arXiv:2406.19928  [pdf, other

    cs.CL cs.HC cs.IR

    Interactive Topic Models with Optimal Transport

    Authors: Garima Dhanania, Sheshera Mysore, Chau Minh Pham, Mohit Iyyer, Hamed Zamani, Andrew McCallum

    Abstract: Topic models are widely used to analyze document collections. While they are valuable for discovering latent topics in a corpus when analysts are unfamiliar with the corpus, analysts also commonly start with an understanding of the content present in a corpus. This may be through categories obtained from an initial pass over the corpus or a desire to analyze the corpus through a predefined set of… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Pre-print; Work in progress

  5. arXiv:2406.19371  [pdf, other

    cs.CL

    Suri: Multi-constraint Instruction Following for Long-form Text Generation

    Authors: Chau Minh Pham, Simeng Sun, Mohit Iyyer

    Abstract: Existing research on instruction following largely focuses on tasks with simple instructions and short responses. In this work, we explore multi-constraint instruction following for generating long-form text. We create Suri, a dataset with 20K human-written long-form texts paired with LLM-generated backtranslated instructions that contain multiple complex constraints. Because of prohibitive challe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  6. arXiv:2406.06608  [pdf, other

    cs.CL cs.AI

    The Prompt Report: A Systematic Survey of Prompting Techniques

    Authors: Sander Schulhoff, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, Aayush Gupta, HyoJung Han, Sevien Schulhoff, Pranav Sandeep Dulepet, Saurav Vidyadhara, Dayeon Ki, Sweta Agrawal, Chau Pham, Gerson Kroiz, Feileen Li, Hudson Tao, Ashay Srivastava, Hevander Da Costa, Saloni Gupta, Megan L. Rogers, Inna Goncearenco, Giuseppe Sarli, Igor Galynker , et al. (6 additional authors not shown)

    Abstract: Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p… ▽ More

    Submitted 14 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2406.04569  [pdf, other

    cs.CV

    Camera-Pose Robust Crater Detection from Chang'e 5

    Authors: Matthew Rodda, Sofia McLeod, Ky Cuong Pham, Tat-Jun Chin

    Abstract: As space missions aim to explore increasingly hazardous terrain, accurate and timely position estimates are required to ensure safe navigation. Vision-based navigation achieves this goal through correlating impact craters visible through onboard imagery with a known database to estimate a craft's pose. However, existing literature has not sufficiently evaluated crater-detection algorithm (CDA) per… ▽ More

    Submitted 12 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  8. arXiv:2405.16419  [pdf, other

    cs.CV cs.AI

    Enhancing Feature Diversity Boosts Channel-Adaptive Vision Transformers

    Authors: Chau Pham, Bryan A. Plummer

    Abstract: Multi-Channel Imaging (MCI) contains an array of challenges for encoding useful feature representations not present in traditional images. For example, images from two different satellites may both contain RGB channels, but the remaining channels can be different for each imaging source. Thus, MCI models must support a variety of channel configurations at test time. Recent work has extended tradit… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  9. arXiv:2405.12252  [pdf, ps, other

    cs.DS cs.AI

    Enhanced Deterministic Approximation Algorithm for Non-monotone Submodular Maximization under Knapsack Constraint with Linear Query Complexity

    Authors: Canh V. Pham

    Abstract: In this work, we consider the Submodular Maximization under Knapsack (SMK) constraint problem over the ground set of size $n$. The problem recently attracted a lot of attention due to its applications in various domains of combination optimization, artificial intelligence, and machine learning. We improve the approximation factor of the fastest deterministic algorithm from $6+ε$ to $5+ε$ while kee… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  10. arXiv:2404.07122  [pdf, other

    cs.CV

    Driver Attention Tracking and Analysis

    Authors: Dat Viet Thanh Nguyen, Anh Tran, Hoai Nam Vu, Cuong Pham, Minh Hoai

    Abstract: We propose a novel method to estimate a driver's points-of-gaze using a pair of ordinary cameras mounted on the windshield and dashboard of a car. This is a challenging problem due to the dynamics of traffic environments with 3D scenes of unknown depths. This problem is further complicated by the volatile distance between the driver and the camera system. To tackle these challenges, we develop a n… ▽ More

    Submitted 11 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  11. arXiv:2403.18605  [pdf, other

    cs.CV

    FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing

    Authors: Trong-Tung Nguyen, Duc-Anh Nguyen, Anh Tran, Cuong Pham

    Abstract: Our work addresses limitations seen in previous approaches for object-centric editing problems, such as unrealistic results due to shape discrepancies and limited control in object replacement or insertion. To this end, we introduce FlexEdit, a flexible and controllable editing framework for objects where we iteratively adjust latents at each denoising step using our FlexEdit block. Initially, we… ▽ More

    Submitted 27 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Our project page: https://flex-edit.github.io/

  12. arXiv:2403.16205  [pdf, other

    cs.CV

    Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains

    Authors: Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai

    Abstract: This paper presents an innovative framework designed to train an image deblurring algorithm tailored to a specific camera device. This algorithm works by transforming a blurry input image, which is challenging to deblur, into another blurry image that is more amenable to deblurring. The transformation process, from one blurry state to another, leverages unpaired data consisting of sharp and blurry… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  13. arXiv:2403.05894  [pdf, other

    cs.CV

    Frequency Attention for Knowledge Distillation

    Authors: Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do

    Abstract: Knowledge distillation is an attractive approach for learning compact deep neural networks, which learns a lightweight student model by distilling knowledge from a complex teacher model. Attention-based knowledge distillation is a specific form of intermediate feature-based knowledge distillation that uses attention mechanisms to encourage the student to better mimic the teacher. However, most of… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Appear to WACV 2024

  14. arXiv:2402.15321  [pdf, other

    cs.CV cs.AI cs.LG

    OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

    Authors: Francis Engelmann, Ayca Takmaz, Jonas Schult, Elisabetta Fedele, Johanna Wald, Songyou Peng, Xi Wang, Or Litany, Siyu Tang, Federico Tombari, Marc Pollefeys, Leonidas Guibas, Hongbo Tian, Chunjie Wang, Xiaosheng Yan, Bingwen Wang, Xuanyang Zhang, Xiao Liu, Phuc Nguyen, Khoi Nguyen, Anh Tran, Cuong Pham, Zhening Huang, Xiaoyang Wu, Xi Chen , et al. (3 additional authors not shown)

    Abstract: This report provides an overview of the challenge hosted at the OpenSUN3D Workshop on Open-Vocabulary 3D Scene Understanding held in conjunction with ICCV 2023. The goal of this workshop series is to provide a platform for exploration and discussion of open-vocabulary 3D scene understanding tasks, including but not limited to segmentation, detection and mapping. We provide an overview of the chall… ▽ More

    Submitted 17 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Our OpenSUN3D workshop website for ICCV 2023: https://opensun3d.github.io/index_iccv23.html

  15. arXiv:2312.17330  [pdf, other

    cs.CV cs.AI

    Count What You Want: Exemplar Identification and Few-shot Counting of Human Actions in the Wild

    Authors: Yifeng Huang, Duc Duy Nguyen, Lam Nguyen, Cuong Pham, Minh Hoai

    Abstract: This paper addresses the task of counting human actions of interest using sensor data from wearable devices. We propose a novel exemplar-based framework, allowing users to provide exemplars of the actions they want to count by vocalizing predefined sounds ''one'', ''two'', and ''three''. Our method first localizes temporal positions of these utterances from the audio sequence. These positions serv… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  16. arXiv:2312.17205  [pdf, other

    cs.CV

    EFHQ: Multi-purpose ExtremePose-Face-HQ dataset

    Authors: Trung Tuan Dao, Duc Hong Vu, Cuong Pham, Anh Tran

    Abstract: The existing facial datasets, while having plentiful images at near frontal views, lack images with extreme head poses, leading to the downgraded performance of deep learning models when dealing with profile or pitched faces. This work aims to address this gap by introducing a novel dataset named Extreme Pose Face High-Quality Dataset (EFHQ), which includes a maximum of 450k high-quality images of… ▽ More

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Project Page: https://bomcon123456.github.io/efhq/

  17. arXiv:2312.10671  [pdf, other

    cs.CV

    Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance

    Authors: Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tran, Cuong Pham, Khoi Nguyen

    Abstract: We introduce Open3DIS, a novel solution designed to tackle the problem of Open-Vocabulary Instance Segmentation within 3D scenes. Objects within 3D environments exhibit diverse shapes, scales, and colors, making precise instance-level identification a challenging task. Recent advancements in Open-Vocabulary scene understanding have made significant strides in this area by employing class-agnostic… ▽ More

    Submitted 5 April, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project page: https://open3dis.github.io/

  18. arXiv:2312.02185  [pdf, other

    cs.LG cs.AI cs.CV

    Virtual Fusion with Contrastive Learning for Single Sensor-based Activity Recognition

    Authors: Duc-Anh Nguyen, Cuong Pham, Nhien-An Le-Khac

    Abstract: Various types of sensors can be used for Human Activity Recognition (HAR), and each of them has different strengths and weaknesses. Sometimes a single sensor cannot fully observe the user's motions from its perspective, which causes wrong predictions. While sensor fusion provides more information for HAR, it comes with many inherent drawbacks like user privacy and acceptance, costly set-up, operat… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  19. arXiv:2312.01284  [pdf, other

    cs.CV

    Stable Messenger: Steganography for Message-Concealed Image Generation

    Authors: Quang Nguyen, Truong Vu, Cuong Pham, Anh Tran, Khoi Nguyen

    Abstract: In the ever-expanding digital landscape, safeguarding sensitive information remains paramount. This paper delves deep into digital protection, specifically focusing on steganography. While prior research predominantly fixated on individual bit decoding, we address this limitation by introducing ``message accuracy'', a novel metric evaluating the entirety of decoded messages for a more holistic eva… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  20. arXiv:2312.00827  [pdf, other

    cs.CV

    A Unified Framework for Connecting Noise Modeling to Boost Noise Detection

    Authors: Siqi Wang, Chau Pham, Bryan A. Plummer

    Abstract: Noisy labels can impair model performance, making the study of learning with noisy labels an important topic. Two conventional approaches are noise modeling and noise detection. However, these two methods are typically studied independently, and there has been limited work on their collaboration. In this work, we explore the integration of these two approaches, proposing an interconnected structur… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  21. arXiv:2311.04251  [pdf, other

    cs.LG cs.AI cs.CV

    MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters

    Authors: Chau Pham, Piotr Teterwak, Soren Nelson, Bryan A. Plummer

    Abstract: Most deep neural networks are trained under fixed network architectures and require retraining when the architecture changes. If expanding the network's size is needed, it is necessary to retrain from scratch, which is expensive. To avoid this, one can grow from a small network by adding random weights over time to gradually achieve the target network size. However, this naive approach falls short… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE Winter Conference on Applications of Computer Vision (WACV) 2024

  22. arXiv:2311.01449  [pdf, other

    cs.CL

    TopicGPT: A Prompt-based Topic Modeling Framework

    Authors: Chau Minh Pham, Alexander Hoyle, Simeng Sun, Philip Resnik, Mohit Iyyer

    Abstract: Topic modeling is a well-established technique for exploring text corpora. Conventional topic models (e.g., LDA) represent topics as bags of words that often require "reading the tea leaves" to interpret; additionally, they offer users minimal control over the formatting and specificity of resulting topics. To tackle these issues, we introduce TopicGPT, a prompt-based framework that uses large lan… ▽ More

    Submitted 1 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 (Main conference)

  23. arXiv:2310.19224  [pdf, other

    cs.CV

    CHAMMI: A benchmark for channel-adaptive models in microscopy imaging

    Authors: Zitong Chen, Chau Pham, Siqi Wang, Michael Doron, Nikita Moshkov, Bryan A. Plummer, Juan C. Caicedo

    Abstract: Most neural networks assume that input images have a fixed number of channels (three for RGB images). However, there are many settings where the number of channels may vary, such as microscopy images where the number of channels changes depending on instruments and experimental goals. Yet, there has not been a systemic attempt to create and evaluate neural networks that are invariant to the number… ▽ More

    Submitted 16 January, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS Track on Datasets and Benchmarks, 2023

  24. arXiv:2310.17109  [pdf, other

    cs.CV

    LP-OVOD: Open-Vocabulary Object Detection by Linear Probing

    Authors: Chau Pham, Truong Vu, Khoi Nguyen

    Abstract: This paper addresses the challenging problem of open-vocabulary object detection (OVOD) where an object detector must identify both seen and unseen classes in test images without labeled examples of the unseen classes in training. A typical approach for OVOD is to use joint text-image embeddings of CLIP to assign box proposals to their closest text label. However, this method has a critical issue:… ▽ More

    Submitted 2 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

  25. arXiv:2310.06272  [pdf, other

    cs.CL cs.AI cs.LG

    Let Models Speak Ciphers: Multiagent Debate through Embeddings

    Authors: Chau Pham, Boyi Liu, Yingxiang Yang, Zhengyu Chen, Tianyi Liu, Jianbo Yuan, Bryan A. Plummer, Zhaoran Wang, Hongxia Yang

    Abstract: Discussion and debate among Large Language Models (LLMs) have gained considerable attention due to their potential to enhance the reasoning ability of LLMs. Although natural language is an obvious choice for communication due to LLM's language understanding capability, the token sampling step needed when generating natural language poses a potential risk of information loss, as it uses only one to… ▽ More

    Submitted 26 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  26. arXiv:2309.12025  [pdf, other

    cs.DS cs.CC cs.LG math.CO

    Robust Approximation Algorithms for Non-monotone $k$-Submodular Maximization under a Knapsack Constraint

    Authors: Dung T. K. Ha, Canh V. Pham, Tan D. Tran, Huan X. Hoang

    Abstract: The problem of non-monotone $k$-submodular maximization under a knapsack constraint ($\kSMK$) over the ground set size $n$ has been raised in many applications in machine learning, such as data summarization, information propagation, etc. However, existing algorithms for the problem are facing questioning of how to overcome the non-monotone case and how to fast return a good solution in case of th… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 12 pages

    Report number: KSE-ID38

  27. arXiv:2309.01078  [pdf, other

    cs.CV cs.AI

    UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance

    Authors: Son Tran, Cong Tran, Anh Tran, Cuong Pham

    Abstract: Object detection has long been a topic of high interest in computer vision literature. Motivated by the fact that annotating data for the multi-object tracking (MOT) problem is immensely expensive, recent studies have turned their attention to the unsupervised learning setting. In this paper, we push forward the state-of-the-art performance of unsupervised MOT methods by proposing UnsMOT, a novel… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  28. arXiv:2309.01076  [pdf, other

    cs.LG cs.SD eess.AS

    Federated Few-shot Learning for Cough Classification with Edge Devices

    Authors: Ngan Dao Hoang, Dat Tran-Anh, Manh Luong, Cong Tran, Cuong Pham

    Abstract: Automatically classifying cough sounds is one of the most critical tasks for the diagnosis and treatment of respiratory diseases. However, collecting a huge amount of labeled cough dataset is challenging mainly due to high laborious expenses, data scarcity, and privacy concerns. In this work, our aim is to develop a framework that can effectively perform cough classification even in situations whe… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 21 pages, 5 figures

  29. arXiv:2306.03280  [pdf, other

    cs.HC

    AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms

    Authors: Zana Buçinca, Chau Minh Pham, Maurice Jakesch, Marco Tulio Ribeiro, Alexandra Olteanu, Saleema Amershi

    Abstract: While demands for change and accountability for harmful AI consequences mount, foreseeing the downstream effects of deploying AI systems remains a challenging task. We developed AHA! (Anticipating Harms of AI), a generative framework to assist AI practitioners and decision-makers in anticipating potential harms and unintended consequences of AI systems prior to development or deployment. Given an… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  30. arXiv:2305.10292  [pdf, other

    cs.DS cs.AI

    Linear Query Approximation Algorithms for Non-monotone Submodular Maximization under Knapsack Constraint

    Authors: Canh V. Pham, Tan D. Tran, Dung T. K. Ha, My T. Thai

    Abstract: This work, for the first time, introduces two constant factor approximation algorithms with linear query complexity for non-monotone submodular maximization over a ground set of size $n$ subject to a knapsack constraint, $\mathsf{DLA}$ and $\mathsf{RLA}$. $\mathsf{DLA}$ is a deterministic algorithm that provides an approximation factor of $6+ε$ while $\mathsf{RLA}$ is a randomized algorithm with a… ▽ More

    Submitted 10 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  31. arXiv:2305.00627  [pdf

    eess.IV cs.CV

    CNN-based fully automatic mitral valve extraction using CT images and existence probability maps

    Authors: Yukiteru Masuda, Ryo Ishikawa, Toru Tanaka, Gakuto Aoyama, Keitaro Kawashima, James V. Chapman, Masahiko Asami, Michael Huy Cuong Pham, Klaus Fuglsang Kofoed, Takuya Sakaguchi, Kiyohide Satoh

    Abstract: Accurate extraction of mitral valve shape from clinical tomographic images acquired in patients has proven useful for planning surgical and interventional mitral valve treatments. However, manual extraction of the mitral valve shape is laborious, and the existing automatic extraction methods have not been sufficiently accurate. In this paper, we propose a fully automated method of extracting mitra… ▽ More

    Submitted 18 May, 2023; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: 15 pages, 6 figure, 3 table. changed title, modified taipo

  32. arXiv:2304.01686  [pdf, other

    cs.CV cs.AI

    HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering

    Authors: Bang-Dang Pham, Phong Tran, Anh Tran, Cuong Pham, Rang Nguyen, Minh Hoai

    Abstract: We consider the challenging task of training models for image-to-video deblurring, which aims to recover a sequence of sharp images corresponding to a given blurry image input. A critical issue disturbing the training of an image-to-video model is the ambiguity of the frame ordering since both the forward and backward sequences are plausible solutions. This paper proposes an effective self-supervi… ▽ More

    Submitted 5 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023

  33. arXiv:2212.00981  [pdf, other

    cs.CV cs.AI

    QC-StyleGAN -- Quality Controllable Image Generation and Manipulation

    Authors: Dat Viet Thanh Nguyen, Phong Tran The, Tan M. Dinh, Cuong Pham, Anh Tuan Tran

    Abstract: The introduction of high-quality image generation models, particularly the StyleGAN family, provides a powerful tool to synthesize and manipulate images. However, existing models are built upon high-quality (HQ) data as desired outputs, making them unfit for in-the-wild low-quality (LQ) images, which are common inputs for manipulation. In this work, we bridge this gap by proposing a novel GAN stru… ▽ More

    Submitted 7 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS 2022; The code is available at https://github.com/VinAIResearch/QC-StyleGAN

  34. arXiv:2211.02681  [pdf, ps, other

    cs.LG cs.DS

    Deep Distance Sensitivity Oracles

    Authors: Davin Jeong, Allison Gunby-Mann, Sarel Cohen, Maximilian Katzmann, Chau Pham, Arnav Bhakta, Tobias Friedrich, Sang Chin

    Abstract: One of the most fundamental graph problems is finding a shortest path from a source to a target node. While in its basic forms the problem has been studied extensively and efficient algorithms are known, it becomes significantly harder as soon as parts of the graph are susceptible to failure. Although one can recompute a shortest replacement path after every outage, this is rather inefficient both… ▽ More

    Submitted 18 October, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2007.11495 by other authors

  35. arXiv:2210.16103  [pdf, other

    cs.CV

    Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks

    Authors: Cuong Pham, Tuan Hoang, Thanh-Toan Do

    Abstract: Knowledge distillation which learns a lightweight student model by distilling knowledge from a cumbersome teacher model is an attractive approach for learning compact deep neural networks (DNNs). Recent works further improve student network performance by leveraging multiple teacher networks. However, most of the existing knowledge distillation-based multi-teacher methods use separately pretrained… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  36. SoccerNet 2022 Challenges Results

    Authors: Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao , et al. (69 additional authors not shown)

    Abstract: The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team. In 2022, the challenges were composed of 6 vision-based tasks: (1) action spotting, focusing on retrieving action timestamps in long untrimmed videos, (2) replay grounding, focusing on retrieving the live moment of an action shown in a replay, (3) pitch localization, focusing on det… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM MMSports 2022

  37. arXiv:2209.14264  [pdf, other

    cs.LG

    A Multi-scale Graph Signature for Persistence Diagrams based on Return Probabilities of Random Walks

    Authors: Chau Pham, Trung Dang, Peter Chin

    Abstract: Persistence diagrams (PDs), often characterized as sets of death and birth of homology class, have been known for providing a topological representation of a graph structure, which is often useful in machine learning tasks. Prior works rely on a single graph signature to construct PDs. In this paper, we explore the use of a family of multi-scale graph signatures to enhance the robustness of topolo… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  38. KaliCalib: A Framework for Basketball Court Registration

    Authors: Adrien Maglo, Astrid Orcesi, Quoc Cuong Pham

    Abstract: Tracking the players and the ball in team sports is key to analyse the performance or to enhance the game watching experience with augmented reality. When the only sources for this data are broadcast videos, sports-field registration systems are required to estimate the homography and re-project the ball or the players from the image space to the field space. This paper describes a new basketball… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

    Comments: Accepted at ACM MMSports 2022 (5th International ACM Workshop on Multimedia Content Analysis in Sports)

  39. arXiv:2207.10988  [pdf, other

    cs.CV

    Few-shot Object Counting and Detection

    Authors: Thanh Nguyen, Chau Pham, Khoi Nguyen, Minh Hoai

    Abstract: We tackle a new task of few-shot object counting and detection. Given a few exemplar bounding boxes of a target object class, we seek to count and detect all objects of the target class. This task shares the same supervision as the few-shot object counting but additionally outputs the object bounding boxes along with the total object count. To address this challenging problem, we introduce a novel… ▽ More

    Submitted 28 July, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022; The first two authors contribute equally

  40. arXiv:2110.15941  [pdf, other

    cs.LG cs.SD eess.AS

    Personalized breath based biometric authentication with wearable multimodality

    Authors: Manh-Ha Bui, Viet-Anh Tran, Cuong Pham

    Abstract: Breath with nose sound features has been shown as a potential biometric in personal identification and verification. In this paper, we show that information that comes from other modalities captured by motion sensors on the chest in addition to audio features could further improve the performance. Our work is composed of three main contributions: hardware creation, dataset publication, and propose… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: 7 pages (2 columns), 5 tables, 7 figures, submitted to ACM Multimedia 2020

  41. arXiv:2109.08863   

    cs.DS cs.GT

    Streaming algorithms for Budgeted $k$-Submodular Maximization problem

    Authors: Canh V. Pham, Quang C. Vu, Dung K. T. Ha, Tai T. Nguyen

    Abstract: Stimulated by practical applications arising from viral marketing. This paper investigates a novel Budgeted $k$-Submodular Maximization problem defined as follows: Given a finite set $V$, a budget $B$ and a $k$-submodular function $f: (k+1)^V \mapsto \mathbb{R}_+$, the problem asks to find a solution $\s=(S_1, S_2, \ldots, S_k)$, each element $e \in V$ has a cost $c_i(e)$ to be put into $i$-th set… ▽ More

    Submitted 22 October, 2021; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: There are some results of the article that need to be corrected

  42. arXiv:2109.08860   

    cs.GT

    Groups Influence with Minimum Cost in Social Networks

    Authors: Phuong N. H. Pham, Canh V. Pham, Hieu V. Duong, Thanh T. Nguyen, My T. Thai

    Abstract: This paper studies a Group Influence with Minimum cost which aims to find a seed set with smallest cost that can influence all target groups, where each user is associated with a cost and a group is influenced if the total score of the influenced users belonging to the group is at least a certain threshold. As the group-influence function is neither submodular nor supermodular, theoretical bounds… ▽ More

    Submitted 14 December, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: The paper contains some errors

  43. arXiv:2109.07592  [pdf, other

    cs.CV

    UCP-Net: Unstructured Contour Points for Instance Segmentation

    Authors: Camille Dupont, Yanis Ouakrim, Quoc Cuong Pham

    Abstract: The goal of interactive segmentation is to assist users in producing segmentation masks as fast and as accurately as possible. Interactions have to be simple and intuitive and the number of interactions required to produce a satisfactory segmentation mask should be as low as possible. In this paper, we propose a novel approach to interactive segmentation based on unconstrained contour clicks for i… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  44. arXiv:2109.02917  [pdf, other

    cs.CV

    Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene

    Authors: Huy Q. Vo, Tuong Do, Vi C. Pham, Duy Nguyen, An T. Duong, Quang D. Tran

    Abstract: This paper contributes a new high-quality dataset for hand gesture recognition in hand hygiene systems, named "MFH". Generally, current datasets are not focused on: (i) fine-grained actions; and (ii) data mismatch between different viewpoints, which are available under realistic settings. To address the aforementioned issues, the MFH dataset is proposed to contain a total of 731147 samples obtaine… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: 6 pages, accepted for oral in IEEE SMC 2021

  45. arXiv:2108.05796  [pdf

    stat.AP cs.LG

    Goal scoring in Premier League with Poisson regression

    Authors: Cuong Pham, Tung Le

    Abstract: Premier League is known as one of the most competitive football league in the world, hence there are many goals are scored here every match. Which are the factors that affect to the number of goal scored in each match? We use Poisson regression to find out the relation between many factors as shots on target, corners, red cards, to the goals home team can score in their match.

    Submitted 10 July, 2021; originally announced August 2021.

    Comments: 13 pages, in Vietnamese language, 14 figures

  46. arXiv:2107.11020  [pdf, other

    cs.CL cs.CY

    Emotion analysis and detection during COVID-19

    Authors: Tiberiu Sosea, Chau Pham, Alexander Tekle, Cornelia Caragea, Junyi Jessy Li

    Abstract: Crises such as natural disasters, global pandemics, and social unrest continuously threaten our world and emotionally affect millions of people worldwide in distinct ways. Understanding emotions that people express during large-scale crises helps inform policy makers and first responders about the emotional states of the population as well as provide emotional support to those who need such suppor… ▽ More

    Submitted 20 July, 2022; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: LREC 2022

  47. arXiv:2105.10014  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Evaluating Robustness over High Level Driving Instruction for Autonomous Driving

    Authors: Florence Carton, David Filliat, Jaonary Rabarisoa, Quoc Cuong Pham

    Abstract: In recent years, we have witnessed increasingly high performance in the field of autonomous end-to-end driving. In particular, more and more research is being done on driving in urban environments, where the car has to follow high level commands to navigate. However, few evaluations are made on the ability of these agents to react in an unexpected situation. Specifically, no evaluations are conduc… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted to IV21, 32nd IEEE Intelligent Vehicles Symposium

  48. arXiv:2101.02471  [pdf, other

    cs.CV

    PandaNet : Anchor-Based Single-Shot Multi-Person 3D Pose Estimation

    Authors: Abdallah Benzine, Florian Chabot, Bertrand Luvison, Quoc Cong Pham, Cahterine Achrd

    Abstract: Recently, several deep learning models have been proposed for 3D human pose estimation. Nevertheless, most of these approaches only focus on the single-person case or estimate 3D pose of a few people at high resolution. Furthermore, many applications such as autonomous driving or crowd analysis require pose estimation of a large number of people possibly at low-resolution. In this work, we present… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

  49. arXiv:2010.15021  [pdf, other

    cs.CV

    Road Damage Detection and Classification with Detectron2 and Faster R-CNN

    Authors: Vung Pham, Chau Pham, Tommy Dang

    Abstract: The road is vital for many aspects of life, and road maintenance is crucial for human safety. One of the critical tasks to allow timely repair of road damages is to quickly and efficiently detect and classify them. This work details the strategies and experiments evaluated for these tasks. Specifically, we evaluate Detectron2's implementation of Faster R-CNN using different base models and configu… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Under review for Global Road Damage Detection Challenge 2020, A Track in the IEEE Big Data 2020 Big Data Cup Challenge

  50. arXiv:2006.07827  [pdf, other

    cs.CV

    PCAAE: Principal Component Analysis Autoencoder for organising the latent space of generative networks

    Authors: Chi-Hieu Pham, Saïd Ladjal, Alasdair Newson

    Abstract: Autoencoders and generative models produce some of the most spectacular deep learning results to date. However, understanding and controlling the latent space of these models presents a considerable challenge. Drawing inspiration from principal component analysis and autoencoder, we propose the Principal Component Analysis Autoencoder (PCAAE). This is a novel autoencoder whose latent space verifie… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

    Comments: Preprint with Appendix