Skip to main content

Showing 1–50 of 69 results for author: Tang, A

  1. arXiv:2407.05643  [pdf, other

    cs.IT eess.SP

    Spatial Non-Stationary Dual-Wideband Channel Estimation for XL-MIMO Systems

    Authors: Anzheng Tang, Jun-Bo Wang, Yijin Pan, Tuo Wu, Chuanwen Chang, Yijian Chen, Hongkang Yu, Maged Elkashlan

    Abstract: In this paper, we investigate the channel estimation problem for extremely large-scale multi-input and multi-output (XL-MIMO) systems, considering the spherical wavefront effect, spatially non-stationary (SnS) property, and dual-wideband effects. To accurately characterize the XL-MIMO channel, we first derive a novel spatial-and-frequency-domain channel model for XL-MIMO systems and carefully exam… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: This paper has been submitted to IEEE journal for possible publication

  2. arXiv:2407.02666  [pdf, other

    cs.RO cs.AI

    Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models

    Authors: Annie S. Chen, Alec M. Lessing, Andy Tang, Govind Chada, Laura Smith, Sergey Levine, Chelsea Finn

    Abstract: Legged robots are physically capable of navigating a diverse variety of environments and overcoming a wide range of obstructions. For example, in a search and rescue mission, a legged robot could climb over debris, crawl through gaps, and navigate out of dead ends. However, the robot's controller needs to respond intelligently to such varied obstacles, and this requires handling unexpected and unu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 27 pages

  3. arXiv:2406.09770  [pdf, other

    cs.LG cs.AI

    Towards Efficient Pareto Set Approximation via Mixture of Experts Based Model Fusion

    Authors: Anke Tang, Li Shen, Yong Luo, Shiwei Liu, Han Hu, Bo Du

    Abstract: Solving multi-objective optimization problems for large deep neural networks is a challenging task due to the complexity of the loss landscape and the expensive computational cost of training and evaluating models. Efficient Pareto front approximation of large models enables multi-objective optimization for various tasks such as multi-task learning and trade-off analysis. Existing algorithms for l… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: code is available at https://github.com/tanganke/pareto_set_learning

  4. arXiv:2406.03280  [pdf, other

    cs.LG cs.AI cs.CL

    FusionBench: A Comprehensive Benchmark of Deep Model Fusion

    Authors: Anke Tang, Li Shen, Yong Luo, Han Hu, Bo Du, Dacheng Tao

    Abstract: Deep model fusion is an emerging technique that unifies the predictions or parameters of several deep neural networks into a single model in a cost-effective and data-efficient manner. This enables the unified model to take advantage of the original models' strengths, potentially exceeding their performance. Although a variety of deep model fusion techniques have been introduced, their evaluations… ▽ More

    Submitted 14 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Project homepage: https://github.com/tanganke/fusion_bench

  5. arXiv:2403.14525  [pdf, other

    eess.SY cs.NI

    Optimizing queues with deadlines under infrequent monitoring

    Authors: Faraz Farahvash, Ao Tang

    Abstract: In this paper, we aim to improve the percentage of packets meeting their deadline in discrete-time M/M/1 queues with infrequent monitoring. More specifically, we look into policies that only monitor the system (and subsequently take actions) after a packet arrival. We model the system as an MDP and provide the optimal policy for some special cases. Furthermore, we introduce a heuristic algorithm c… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures

  6. arXiv:2403.02633  [pdf, other

    cs.IT eess.SP

    Spatially Non-Stationary XL-MIMO Channel Estimation: A Three-Layer Generalized Approximate Message Passing Method

    Authors: Anzheng Tang, Jun-Bo Wang, Yijin Pan, Wence Zhang, Xiaodan Zhang, Yijian Chen, Hongkang Yu, Rodrigo C. de Lamare

    Abstract: In this paper, channel estimation problem for extremely large-scale multi-input multi-output (XL-MIMO) systems is investigated with the considerations of the spherical wavefront effect and the spatially non-stationary (SnS) property. Due to the diversities of SnS characteristics among different propagation paths, the concurrent channel estimation of multiple paths becomes intractable. To address t… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: This manuscript has been submitted to the IEEE journal for possible pubilcation

  7. arXiv:2402.19173  [pdf, other

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo , et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  8. How People Prompt to Create Interactive VR Scenes

    Authors: Setareh Aghel Manesh, Tianyi Zhang, Yuki Onishi, Kotaro Hara, Scott Bateman, Jiannan Li, Anthony Tang

    Abstract: Generative AI tools can provide people with the ability to create virtual environments and scenes with natural language prompts. Yet, how people will formulate such prompts is unclear -- particularly when they inhabit the environment that they are designing. For instance, it is likely that a person might say, "Put a chair here", while pointing at a location. If such linguistic features are common… ▽ More

    Submitted 29 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at ACM 2024 Designing Interactive Systems (DIS)

  9. arXiv:2402.04958  [pdf, other

    cs.CV

    Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation

    Authors: Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky

    Abstract: Deep neural networks have useful applications in many different tasks, however their performance can be severely affected by changes in the data distribution. For example, in the biomedical field, their performance can be affected by changes in the data (different machines, populations) between training and test datasets. To ensure robustness and generalization to real-world scenarios, test-time a… ▽ More

    Submitted 29 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at the Conference on Lifelong Learning Agents (CoLLAs) 2024

  10. arXiv:2402.01380  [pdf, other

    cs.CV eess.IV

    Efficient Dynamic-NeRF Based Volumetric Video Coding with Rate Distortion Optimization

    Authors: Zhiyu Zhang, Guo Lu, Huanxiong Liang, Anni Tang, Qiang Hu, Li Song

    Abstract: Volumetric videos, benefiting from immersive 3D realism and interactivity, hold vast potential for various applications, while the tremendous data volume poses significant challenges for compression. Recently, NeRF has demonstrated remarkable potential in volumetric video compression thanks to its simple representation and powerful 3D modeling capabilities, where a notable work is ReRF. However, R… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  11. arXiv:2402.00433  [pdf, other

    cs.LG cs.CV

    Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

    Authors: Anke Tang, Li Shen, Yong Luo, Nan Yin, Lefei Zhang, Dacheng Tao

    Abstract: Merging various task-specific Transformer-based models trained on different tasks into a single unified model can execute all the tasks concurrently. Previous methods, exemplified by task arithmetic, have been proven to be both effective and scalable. Existing methods have primarily focused on seeking a static optimal solution within the original model parameter space. A notable challenge is mitig… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  12. arXiv:2312.06173  [pdf, other

    cs.LG

    Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion

    Authors: Anke Tang, Li Shen, Yong Luo, Liang Ding, Han Hu, Bo Du, Dacheng Tao

    Abstract: Merging models fine-tuned from a common, extensively pre-trained large model but specialized for different tasks has been demonstrated as a cheap and scalable strategy to construct a multi-task model that performs well across diverse tasks. Recent research, exemplified by task arithmetic, highlights that this multi-task model can be derived through arithmetic operations on task vectors. Neverthele… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  13. arXiv:2312.04586  [pdf, other

    cs.CR cs.LO

    Automated SELinux RBAC Policy Verification Using SMT

    Authors: Divyam Pahuja, Alvin Tang, Klim Tsoutsman

    Abstract: Security-Enhanced Linux (SELinux) is a Linux kernel module that allows for a role-based access control (RBAC) mechanism. It provides a fine-grained security framework enabling system administrators to define security policies at the system and application level. Whilst SELinux offers robust security features through a customisable, powerful RBAC model, its manual policy management is prone to erro… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 10 pages (excluding appendices), 2 figures, 3 appendices

    ACM Class: F.4.1; D.4.6

  14. arXiv:2311.10305  [pdf, other

    eess.IV cs.CV

    Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction

    Authors: Mohamed El Amine Elforaici, Emmanuel Montagnon, Francisco Perdigon Romero, William Trung Le, Feryel Azzi, Dominique Trudel, Bich Nguyen, Simon Turcotte, An Tang, Samuel Kadoury

    Abstract: Colorectal liver metastases (CLM) significantly impact colon cancer patients, influencing survival based on systemic chemotherapy response. Traditional methods like tumor grading scores (e.g., tumor regression grade - TRG) for prognosis suffer from subjectivity, time constraints, and expertise demands. Current machine learning approaches often focus on radiological data, yet the relevance of histo… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 16 pages, 7 figures and 7 tables. Submitted to Medical Journal Analysis (MedIA) journal

  15. arXiv:2311.09490  [pdf, other

    cs.IT eess.SP

    Joint Visibility Region and Channel Estimation for Extremely Large-scale MIMO Systems

    Authors: Anzheng Tang, Jun-bo Wang, Yijin Pan, Wence Zhang, Yijian Chen, Xiaodan Zhang, Hongkang Yu, Rodrigo C. de Lamare

    Abstract: In this work, we investigate the joint visibility region (VR) detection and channel estimation (CE) problem for extremely large-scale multiple-input-multiple-output (XL-MIMO) systems considering both the spherical wavefront effect and spatial non-stationary (SnS) property. Unlike existing SnS CE methods that rely on the statistical characteristics of channels in the spatial or delay domain, we pro… ▽ More

    Submitted 30 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: A major revision version has been submitted to IEEE journal

  16. arXiv:2311.01049  [pdf

    cs.CL cs.AI

    Multi-dimensional data refining strategy for effective fine-tuning LLMs

    Authors: Thanh Nguyen Ngoc, Quang Nhat Tran, Arthur Tang, Bao Nguyen, Thuy Nguyen, Thanh Pham

    Abstract: Data is a cornerstone for fine-tuning large language models, yet acquiring suitable data remains challenging. Challenges encompassed data scarcity, linguistic diversity, and domain-specific content. This paper presents lessons learned while crawling and refining data tailored for fine-tuning Vietnamese language models. Crafting such a dataset, while accounting for linguistic intricacies and striki… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  17. arXiv:2311.01048  [pdf

    cs.CY cs.AI

    AI-assisted Learning for Electronic Engineering Courses in High Education

    Authors: Thanh Nguyen Ngoc, Quang Nhat Tran, Arthur Tang, Bao Nguyen, Thuy Nguyen, Thanh Pham

    Abstract: This study evaluates the efficacy of ChatGPT as an AI teaching and learning support tool in an integrated circuit systems course at a higher education institution in an Asian country. Various question types were completed, and ChatGPT responses were assessed to gain valuable insights for further investigation. The objective is to assess ChatGPT's ability to provide insights, personalized support,… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  18. arXiv:2310.20227  [pdf, ps, other

    cs.IT

    Achieving Scalable Capacity in Wireless Mesh Networks

    Authors: Lei Lei, Aimin Tang, Xudong Wang

    Abstract: Wireless mesh networks play a critical role in enabling key networking scenarios in beyond-5G (B5G) and 6G networks, including integrated access and backhaul (IAB), multi-hop sidelinks, and V2X. However, it still poses a challenge to deliver scalable per-node throughput via mesh networking, which significantly limits the potential of large-scale deployment of wireless mesh networks. Existing resea… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: ~12pages, 4 figures, submitted to IEEE TIT, part of this work has been published in IEEE MASS 2022

  19. arXiv:2310.18646  [pdf

    cs.AI cs.LG

    Predicting Agricultural Commodities Prices with Machine Learning: A Review of Current Research

    Authors: Nhat-Quang Tran, Anna Felipe, Thanh Nguyen Ngoc, Tom Huynh, Quang Tran, Arthur Tang, Thuy Nguyen

    Abstract: Agricultural price prediction is crucial for farmers, policymakers, and other stakeholders in the agricultural sector. However, it is a challenging task due to the complex and dynamic nature of agricultural markets. Machine learning algorithms have the potential to revolutionize agricultural price prediction by improving accuracy, real-time prediction, customization, and integration. This paper re… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  20. arXiv:2310.08184  [pdf, other

    cs.AI cs.LG

    Learn From Model Beyond Fine-Tuning: A Survey

    Authors: Hongling Zheng, Li Shen, Anke Tang, Yong Luo, Han Hu, Bo Du, Dacheng Tao

    Abstract: Foundation models (FM) have demonstrated remarkable performance across a wide range of tasks (especially in the fields of natural language processing and computer vision), primarily attributed to their ability to comprehend instructions and access extensive, high-quality data. This not only showcases their current effectiveness but also sets a promising trajectory towards the development of artifi… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 20 pages, 9 figures

  21. arXiv:2310.04742  [pdf, other

    cs.LG

    Parameter Efficient Multi-task Model Fusion with Partial Linearization

    Authors: Anke Tang, Li Shen, Yong Luo, Yibing Zhan, Han Hu, Bo Du, Yixin Chen, Dacheng Tao

    Abstract: Large pre-trained models have enabled significant advances in machine learning and served as foundation components. Model fusion methods, such as task arithmetic, have been proven to be powerful and scalable to incorporate fine-tuned weights from different tasks into a multi-task model. However, efficiently fine-tuning large pre-trained models on multiple downstream tasks remains challenging, lead… ▽ More

    Submitted 11 March, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  22. arXiv:2309.14225  [pdf, other

    cs.RO

    HumanMimic: Learning Natural Locomotion and Transitions for Humanoid Robot via Wasserstein Adversarial Imitation

    Authors: Annan Tang, Takuma Hiraoka, Naoki Hiraoka, Fan Shi, Kento Kawaharazuka, Kunio Kojima, Kei Okada, Masayuki Inaba

    Abstract: Transferring human motion skills to humanoid robots remains a significant challenge. In this study, we introduce a Wasserstein adversarial imitation learning system, allowing humanoid robots to replicate natural whole-body locomotion patterns and execute seamless transitions by mimicking human motions. First, we present a unified primitive-skeleton motion retargeting to mitigate morphological diff… ▽ More

    Submitted 23 April, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

  23. arXiv:2308.12983  [pdf, ps, other

    cs.LO cs.CR

    Implementation of Formal Semantics and the Potential of Non-Classical Logic Systems for the Enhancement of Access Control Models: A Literature Review

    Authors: Alvin Tang

    Abstract: This literature review discovers an implementation of formal logic systems in cyber security by enhancing access control models. We explore the characteristics of the existing access control theories, their limitations and how classical logic is used therein. We then delve into the possibility of utilising non-classical logic systems for improving the models. In particular, we explore how classica… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 10 pages

    ACM Class: F.4.1; D.4.6

  24. arXiv:2307.05609  [pdf, other

    cs.NI

    Virtual Network Embedding without Explicit Virtual Network Specification

    Authors: Jiangnan Cheng, Yingjie Bi, Ao Tang

    Abstract: Network virtualization enables Internet service providers to run multiple heterogeneous and dedicated network architectures for different customers on a shared substrate. In existing works on virtual network embedding (VNE), each customer formulates a virtual network request (VNR) where a virtual network (VN) is required. Motivated by a concrete example where VN is not a proper VNR formulation to… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  25. arXiv:2307.04945  [pdf, other

    cs.NI cs.PL

    What do LLMs need to Synthesize Correct Router Configurations?

    Authors: Rajdeep Mondal, Alan Tang, Ryan Beckett, Todd Millstein, George Varghese

    Abstract: We investigate whether Large Language Models (e.g., GPT-4) can synthesize correct router configurations with reduced manual effort. We find GPT-4 works very badly by itself, producing promising draft configurations but with egregious errors in topology, syntax, and semantics. Our strategy, that we call Verified Prompt Programming, is to combine GPT-4 with verifiers, and use localized feedback from… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  26. arXiv:2305.13871  [pdf, other

    cs.LG

    Improving Heterogeneous Model Reuse by Density Estimation

    Authors: Anke Tang, Yong Luo, Han Hu, Fengxiang He, Kehua Su, Bo Du, Yixin Chen, Dacheng Tao

    Abstract: This paper studies multiparty learning, aiming to learn a model using the private data of different participants. Model reuse is a promising solution for multiparty learning, assuming that a local model has been trained for each party. Considering the potential sample selection bias among different parties, some heterogeneous model reuse approaches have been developed. However, although pre-traine… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figues. Accepted by IJCAI 2023

  27. arXiv:2305.01543  [pdf, other

    q-fin.GN cs.CY

    NFT Wash Trading Detection

    Authors: Derek Liu, Francesco Piccoli, Katie Chen, Adrina Tang, Victor Fang

    Abstract: Wash trading is a form of market manipulation where the same entity sells an asset to themselves to drive up market prices, launder money under the cover of a legitimate transaction, or claim a tax loss without losing ownership of an asset. Although the practice is illegal with traditional assets, lack of supervision in the non-fungible token market enables criminals to wash trade and scam unsuspe… ▽ More

    Submitted 7 February, 2023; originally announced May 2023.

  28. Stargazer: An Interactive Camera Robot for Capturing How-To Videos Based on Subtle Instructor Cues

    Authors: Jiannan Li, Mauricio Sousa, Karthik Mahadevan, Bryan Wang, Paula Akemi Aoyaui, Nicole Yu, Angela Yang, Ravin Balakrishnan, Anthony Tang, Tovi Grossman

    Abstract: Live and pre-recorded video tutorials are an effective means for teaching physical skills such as cooking or prototyping electronics. A dedicated cameraperson following an instructor's activities can improve production quality. However, instructors who do not have access to a cameraperson's help often have to work within the constraints of static cameras. We present Stargazer, a novel approach for… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: To appear in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), April 23--28, 2023, Hamburg, Germany

  29. Memories are One-to-Many Mapping Alleviators in Talking Face Generation

    Authors: Anni Tang, Tianyu He, Xu Tan, Jun Ling, Li Song

    Abstract: Talking face generation aims at generating photo-realistic video portraits of a target person driven by input audio. Due to its nature of one-to-many mapping from the input audio to the output video (e.g., one speech content may have multiple feasible visual appearances), learning a deterministic mapping like previous works brings ambiguity during training, and thus causes inferior visual results.… ▽ More

    Submitted 5 March, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Project page: see https://memoryface.github.io

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2024)

  30. arXiv:2211.02012  [pdf, other

    eess.SP cs.IT

    Optimal Compression for Minimizing Classification Error Probability: an Information-Theoretic Approach

    Authors: Jingchao Gao, Ao Tang, Weiyu Xu

    Abstract: We formulate the problem of performing optimal data compression under the constraints that compressed data can be used for accurate classification in machine learning. We show that this translates to a problem of minimizing the mutual information between data and its compressed version under the constraint on error probability of classification is small when using the compressed data for machine l… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: This work was done in Summer 2021

  31. arXiv:2209.02866  [pdf, ps, other

    cs.CY

    Algorithmic Learning Foundations for Common Law

    Authors: Jason D. Hartline, Daniel W. Linna Jr., Liren Shan, Alex Tang

    Abstract: This paper looks at a common law legal system as a learning algorithm, models specific features of legal proceedings, and asks whether this system learns efficiently. A particular feature of our model is explicitly viewing various aspects of court proceedings as learning algorithms. This viewpoint enables directly pointing out that when the costs of going to court are not commensurate with the ben… ▽ More

    Submitted 8 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

  32. arXiv:2208.02711  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Agnostic Learning of General ReLU Activation Using Gradient Descent

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We provide a convergence analysis of gradient descent for the problem of agnostically learning a single ReLU function under Gaussian distributions. Unlike prior work that studies the setting of zero bias, we consider the more challenging scenario when the bias of the ReLU function is non-zero. Our main result establishes that starting from random initialization, in a polynomial number of iteration… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 28 oages

  33. arXiv:2206.08853  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

    Authors: Linxi Fan, Guanzhi Wang, Yunfan Jiang, Ajay Mandlekar, Yuncong Yang, Haoyi Zhu, Andrew Tang, De-An Huang, Yuke Zhu, Anima Anandkumar

    Abstract: Autonomous agents have made great strides in specialist domains like Atari games and Go. However, they typically learn tabula rasa in isolated environments with limited and manually conceived objectives, thus failing to generalize across a wide spectrum of tasks and capabilities. Inspired by how humans continually learn and adapt in the open world, we advocate a trinity of ingredients for building… ▽ More

    Submitted 22 November, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Outstanding Paper Award at NeurIPS 2022. Project website: https://minedojo.org

  34. LIGHTYEAR: Using Modularity to Scale BGP Control Plane Verification

    Authors: Alan Tang, Ryan Beckett, Steven Benaloh, Karthick Jayaraman, Tejas Patil, Todd Millstein, George Varghese

    Abstract: Current network control plane verification tools cannot scale to large networks, because of the complexity of jointly reasoning about the behaviors of all nodes in the network. In this paper we present a modular approach to control plane verification, whereby end-to-end network properties are verified via a set of purely local checks on individual nodes and edges. The approach targets the verifica… ▽ More

    Submitted 20 September, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: 12 pages (+ 2 pages references), 3 figures, Accepted at SIGCOMM '23

    Journal ref: In Proceedings of the ACM SIGCOMM 2023 Conference (ACM SIGCOMM '23). Association for Computing Machinery, New York, NY, USA, 94-107

  35. arXiv:2201.11917  [pdf, other

    cs.IT cs.LG

    Task-Aware Network Coding Over Butterfly Network

    Authors: Jiangnan Cheng, Sandeep Chinchali, Ao Tang

    Abstract: Network coding allows distributed information sources such as sensors to efficiently compress and transmit data to distributed receivers across a bandwidth-limited network. Classical network coding is largely task-agnostic -- the coding schemes mainly aim to faithfully reconstruct data at the receivers, regardless of what ultimate task the received data is used for. In this paper, we analyze a new… ▽ More

    Submitted 31 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

  36. arXiv:2112.02955  [pdf

    cs.CL cs.AI cs.LG q-bio.QM

    Does constituency analysis enhance domain-specific pre-trained BERT models for relation extraction?

    Authors: Anfu Tang, Louise Deléger, Robert Bossy, Pierre Zweigenbaum, Claire Nédellec

    Abstract: Recently many studies have been conducted on the topic of relation extraction. The DrugProt track at BioCreative VII provides a manually-annotated corpus for the purpose of the development and evaluation of relation extraction systems, in which interactions between chemicals and genes are studied. We describe the ensemble system that we used for our submission, which combines predictions of fine-t… ▽ More

    Submitted 25 November, 2021; originally announced December 2021.

    Journal ref: BioCreative VII Challenge Evaluation Workshop, Nov 2021, on-line, Spain

  37. arXiv:2112.02097  [pdf

    q-bio.OT cs.CL cs.LG q-bio.QM

    Global alignment for relation extraction in Microbiology

    Authors: Anfu Tang, Claire Nédellec, Pierre Zweigenbaum, Louise Deléger, Robert Bossy

    Abstract: We investigate a method to extract relations from texts based on global alignment and syntactic information. Combined with SVM, this method is shown to have a performance comparable or even better than LSTM on two RE tasks.

    Submitted 25 November, 2021; originally announced December 2021.

    Journal ref: Junior Conference on Data Science and Engineering, Feb 2021, Orsay, France

  38. arXiv:2110.02329  [pdf, other

    cs.CR cs.LG

    Task-aware Privacy Preservation for Multi-dimensional Data

    Authors: Jiangnan Cheng, Ao Tang, Sandeep Chinchali

    Abstract: Local differential privacy (LDP) can be adopted to anonymize richer user data attributes that will be input to sophisticated machine learning (ML) tasks. However, today's LDP approaches are largely task-agnostic and often lead to severe performance loss -- they simply inject noise to all data attributes according to a given privacy budget, regardless of what features are most relevant for the ulti… ▽ More

    Submitted 7 August, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted by 39th International Conference on Machine Learning (ICML 2022)

  39. arXiv:2109.14675  [pdf, other

    cs.LG

    Data Sharing and Compression for Cooperative Networked Control

    Authors: Jiangnan Cheng, Marco Pavone, Sachin Katti, Sandeep Chinchali, Ao Tang

    Abstract: Sharing forecasts of network timeseries data, such as cellular or electricity load patterns, can improve independent control applications ranging from traffic scheduling to power generation. Typically, forecasts are designed without knowledge of a downstream controller's task objective, and thus simply optimize for mean prediction error. However, such task-agnostic representations are often too la… ▽ More

    Submitted 5 October, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted by 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  40. arXiv:2107.10209  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We present polynomial time and sample efficient algorithms for learning an unknown depth-2 feedforward neural network with general ReLU activations, under mild non-degeneracy assumptions. In particular, we consider learning an unknown network of the form $f(x) = {a}^{\mathsf{T}}σ({W}^\mathsf{T}x+b)$, where $x$ is drawn from the Gaussian distribution, and $σ(t) := \max(t,0)$ is the ReLU activation.… ▽ More

    Submitted 1 August, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: 45 pages (including appendix). This version fixes an error in the previous version of the paper

  41. arXiv:2107.01446  [pdf

    cs.SE

    Architecture Information Communication in Two OSS Projects: the Why, Who, When, and What

    Authors: Tingting Bi, Wei Ding, Peng Liang, Antony Tang

    Abstract: Architecture information is vital for Open Source Software (OSS) development, and mailing list is one of the widely used channels for developers to share and communicate architecture information. This work investigates the nature of architecture information communication (i.e., why, who, when, and what) by OSS developers via developer mailing lists. We employed a multiple case study approach to ex… ▽ More

    Submitted 3 July, 2021; originally announced July 2021.

    Comments: Preprint accepted for publication in Journal of Systems and Software, 2021

  42. Mining Architecture Tactics and Quality Attributes Knowledge in Stack Overflow

    Authors: Tingting Bi, Peng Liang, Antony Tang, Xin Xia

    Abstract: Context: Architecture Tactics (ATs) are architectural building blocks that provide general architectural solutions for addressing Quality Attributes (QAs) issues. Mining and analyzing QA-AT knowledge can help the software architecture community better understand architecture design. However, manually capturing and mining this knowledge is labor-intensive and difficult. Objective: Using Stack Overf… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Preprint accepted for publication in Journal of Systems and Software, 2021

  43. arXiv:2102.00581  [pdf

    cs.RO cs.HC

    "Grip-that-there": An Investigation of Explicit and Implicit Task Allocation Techniques for Human-Robot Collaboration

    Authors: Karthik Mahadevan, Maurício Sousa, Anthony Tang, Tovi Grossman

    Abstract: In ad-hoc human-robot collaboration (HRC), humans and robots work on a task without pre-planning the robot's actions prior to execution; instead, task allocation occurs in real-time. However, prior research has largely focused on task allocations that are pre-planned - there has not been a comprehensive exploration or evaluation of techniques where task allocation is adjusted in real-time. Inspire… ▽ More

    Submitted 2 February, 2021; v1 submitted 31 January, 2021; originally announced February 2021.

    Comments: To be published in Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems

    ACM Class: H.5.2

  44. arXiv:2012.08483  [pdf, other

    cs.LG

    Amazon SageMaker Autopilot: a white box AutoML solution at scale

    Authors: Piali Das, Valerio Perrone, Nikita Ivkin, Tanya Bansal, Zohar Karnin, Huibin Shen, Iaroslav Shcherbatyi, Yotam Elor, Wilton Wu, Aida Zolic, Thibaut Lienart, Alex Tang, Amr Ahmed, Jean Baptiste Faddoul, Rodolphe Jenatton, Fela Winkelmolen, Philip Gautier, Leo Dirac, Andre Perunicic, Miroslav Miladinovic, Giovanni Zappella, Cédric Archambeau, Matthias Seeger, Bhaskar Dutt, Laurence Rouesnel

    Abstract: AutoML systems provide a black-box solution to machine learning problems by selecting the right way of processing features, choosing an algorithm and tuning the hyperparameters of the entire pipeline. Although these systems perform well on many datasets, there is still a non-negligible number of datasets for which the one-shot solution produced by each particular system would provide sub-par perfo… ▽ More

    Submitted 16 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  45. Activity River: Visualizing Planned and Logged Personal Activities for Reflection

    Authors: Bon Adriel Aseniero, Charles Perin, Wesley Willett, Anthony Tang, Sheelagh Carpendale

    Abstract: We present Activity River, a personal visualization tool which enables individuals to plan, log, and reflect on their self-defined activities. We are interested in supporting this type of reflective practice as prior work has shown that reflection can help people plan and manage their time effectively. Hence, we designed Activity River based on five design goals (visualize historical and contextua… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 9 pages, 6 figures, AVI '20, September 28-October 2, 2020, Salerno, Italy 2020 Association for Computing Machinery

  46. arXiv:2001.03021  [pdf

    cs.HC

    TanGi: Tangible Proxies for Embodied Object Exploration and Manipulation in Virtual Reality

    Authors: Martin Feick, Scott Bateman, Anthony Tang, André Miede, Nicolai Marquardt

    Abstract: Exploring and manipulating complex virtual objects is challenging due to limitations of conventional controllers and free-hand interaction techniques. We present the TanGi toolkit which enables novices to rapidly build physical proxy objects using Composable Shape Primitives. TanGi also provides Manipulators allowing users to build objects including movable parts, making them suitable for rich obj… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: 10 pages, 11 figures

    ACM Class: H.5.2

  47. arXiv:1909.06026  [pdf

    cond-mat.dis-nn cond-mat.mes-hall cs.ET

    Magnetic domain wall based synaptic and activation function generator for neuromorphic accelerators

    Authors: Saima A Siddiqui, Sumit Dutta, Astera Tang, Luqiao Liu, Caroline A Ross, Marc A Baldo

    Abstract: Magnetic domain walls are information tokens in both logic and memory devices, and hold particular interest in applications such as neuromorphic accelerators that combine logic in memory. Here, we show that devices based on the electrical manipulation of magnetic domain walls are capable of implementing linear, as well as programmable nonlinear, functions. Unlike other approaches, domain-wall-base… ▽ More

    Submitted 7 September, 2019; originally announced September 2019.

    Comments: 24 pages, 5 figures

  48. arXiv:1901.09483  [pdf, other

    cs.CV

    End-to-End Discriminative Deep Network for Liver Lesion Classification

    Authors: Francisco Perdigon Romero, Andre Diler, Gabriel Bisson-Gregoire, Simon Turcotte, Real Lapointe, Franck Vandenbroucke-Menu, An Tang, Samuel Kadoury

    Abstract: Colorectal liver metastasis is one of most aggressive liver malignancies. While the definition of lesion type based on CT images determines the diagnosis and therapeutic strategy, the discrimination between cancerous and non-cancerous lesions are critical and requires highly skilled expertise, experience and time. In the present work we introduce an end-to-end deep learning approach to assist in t… ▽ More

    Submitted 27 January, 2019; originally announced January 2019.

  49. The Liver Tumor Segmentation Benchmark (LiTS)

    Authors: Patrick Bilic, Patrick Christ, Hongwei Bran Li, Eugene Vorontsov, Avi Ben-Cohen, Georgios Kaissis, Adi Szeskin, Colin Jacobs, Gabriel Efrain Humpire Mamani, Gabriel Chartrand, Fabian Lohöfer, Julian Walter Holch, Wieland Sommer, Felix Hofmann, Alexandre Hostettler, Naama Lev-Cohain, Michal Drozdzal, Michal Marianne Amitai, Refael Vivantik, Jacob Sosna, Ivan Ezhov, Anjany Sekuboyina, Fernando Navarro, Florian Kofler, Johannes C. Paetzold , et al. (84 additional authors not shown)

    Abstract: In this work, we report the set-up and results of the Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI) 2017 and the International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2017 and 2018. The image dataset is diverse and contains primary and secondary tumors with… ▽ More

    Submitted 25 November, 2022; v1 submitted 13 January, 2019; originally announced January 2019.

    Comments: Patrick Bilic, Patrick Christ, Hongwei Bran Li, and Eugene Vorontsov made equal contributions to this work. Published in Medical Image Analysis

    Journal ref: Medical Image Analysis (2022) Pg. 102680

  50. Multi-Level Batch Normalization In Deep Networks For Invasive Ductal Carcinoma Cell Discrimination In Histopathology Images

    Authors: Francisco Perdigon Romero, An Tang, Samuel Kadoury

    Abstract: Breast cancer is the most diagnosed cancer and the most predominant cause of death in women worldwide. Imaging techniques such as the breast cancer pathology helps in the diagnosis and monitoring of the disease. However identification of malignant cells can be challenging given the high heterogeneity in tissue absorbotion from staining agents. In this work, we present a novel approach for Invasive… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 4 pages, 5 figures