-
UVIS: Unsupervised Video Instance Segmentation
Authors:
Shuaiyi Huang,
Saksham Suri,
Kamal Gupta,
Sai Saketh Rambhatla,
Ser-nam Lim,
Abhinav Shrivastava
Abstract:
Video instance segmentation requires classifying, segmenting, and tracking every object across video frames. Unlike existing approaches that rely on masks, boxes, or category labels, we propose UVIS, a novel Unsupervised Video Instance Segmentation (UVIS) framework that can perform video instance segmentation without any video annotations or dense label-based pretraining. Our key insight comes fro…
▽ More
Video instance segmentation requires classifying, segmenting, and tracking every object across video frames. Unlike existing approaches that rely on masks, boxes, or category labels, we propose UVIS, a novel Unsupervised Video Instance Segmentation (UVIS) framework that can perform video instance segmentation without any video annotations or dense label-based pretraining. Our key insight comes from leveraging the dense shape prior from the self-supervised vision foundation model DINO and the openset recognition ability from the image-caption supervised vision-language model CLIP. Our UVIS framework consists of three essential steps: frame-level pseudo-label generation, transformer-based VIS model training, and query-based tracking. To improve the quality of VIS predictions in the unsupervised setup, we introduce a dual-memory design. This design includes a semantic memory bank for generating accurate pseudo-labels and a tracking memory bank for maintaining temporal consistency in object tracks. We evaluate our approach on three standard VIS benchmarks, namely YoutubeVIS-2019, YoutubeVIS-2021, and Occluded VIS. Our UVIS achieves 21.1 AP on YoutubeVIS-2019 without any video annotations or dense pretraining, demonstrating the potential of our unsupervised VIS framework.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Emergence of high-mass stars in complex fiber networks (EMERGE) II. The need for data combination in ALMA observations
Authors:
F. Bonanomi,
A. Hacar,
A. Socci,
D. Petry,
S. Suri
Abstract:
ALMA's high-resolution images allow to resolve the filamentary structure of the ISM down to few thousand au at kpc distances. We aim to systematically quantify the impact of the interferometric response and the effects of the short-spacing information during the characterization of the ISM structure using ALMA observations. We create a series of continuum ALMA synthetic observations to test the re…
▽ More
ALMA's high-resolution images allow to resolve the filamentary structure of the ISM down to few thousand au at kpc distances. We aim to systematically quantify the impact of the interferometric response and the effects of the short-spacing information during the characterization of the ISM structure using ALMA observations. We create a series of continuum ALMA synthetic observations to test the recovery of the observational properties of dense cores and filaments (i.e. intensity peak, radial profile, and width) at different scales. We compare the results obtained with and without different data combination techniques using different ALMA arrays and SD telescopes in simulated data and real observations. Our analysis illustrates the severity of interferometric filtering effects. ALMA-12m alone observations show significant scale-dependent flux losses systematically corrupting (>30%error) all the physical properties inferred in cores and filaments (i.e. column density, mass, and size) before the maximum recoverable scale of the interferometer. These effects are only partially mitigated by the addition of the ALMA ACA-7m array although degrading the telescope PSF. Our results demonstrate only the addition of the ALMA Total Power information allows to recover the true sky emission down to few times the ALMA beamsize with satisfactory accuracy (<10% error). Additional tests demonstrate the emission recovery at all scales is further improved if the 7mTP data are replaced by maps obtained by a larger SD telescope (e.g., IRAM-30m), even if the latter are noisier than expected. These observational biases particularly affect partially resolved targets, becoming critical especially for studies in nearby regions such as Taurus or Orion. Our results demonstrate the need for the use of data combination techniques to accurately characterize the complex physical structure of the ISM in the ALMA era.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras
Authors:
James Tang,
Shashwat Suri,
Daniel Ajisafe,
Bastian Wandt,
Helge Rhodin
Abstract:
It is now possible to estimate 3D human pose from monocular images with off-the-shelf 3D pose estimators. However, many practical applications require fine-grained absolute pose information for which multi-view cues and camera calibration are necessary. Such multi-view recordings are laborious because they require manual calibration, and are expensive when using dedicated hardware. Our goal is ful…
▽ More
It is now possible to estimate 3D human pose from monocular images with off-the-shelf 3D pose estimators. However, many practical applications require fine-grained absolute pose information for which multi-view cues and camera calibration are necessary. Such multi-view recordings are laborious because they require manual calibration, and are expensive when using dedicated hardware. Our goal is full automation, which includes temporal synchronization, as well as intrinsic and extrinsic camera calibration. This is done by using persons in the scene as the calibration objects. Existing methods either address only synchronization or calibration, assume one of the former as input, or have significant limitations. A common limitation is that they only consider single persons, which eases correspondence finding. We attain this generality by partitioning the high-dimensional time and calibration space into a cascade of subspaces and introduce tailored algorithms to optimize each efficiently and robustly. The outcome is an easy-to-use, flexible, and robust motion capture toolbox that we release to enable scientific applications, which we demonstrate on diverse multi-view benchmarks. Project website: https://github.com/jamestang1998/CasCalib.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
The Use of Generative Search Engines for Knowledge Work and Complex Tasks
Authors:
Siddharth Suri,
Scott Counts,
Leijie Wang,
Chacha Chen,
Mengting Wan,
Tara Safavi,
Jennifer Neville,
Chirag Shah,
Ryen W. White,
Reid Andersen,
Georg Buscher,
Sathish Manivannan,
Nagu Rangan,
Longqi Yang
Abstract:
Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine.…
▽ More
Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine. Through the empirical analysis of Bing Copilot (Bing Chat), one of the first publicly available generative search engines, we analyze the types and complexity of tasks that people use Bing Copilot for compared to Bing Search. Findings indicate that people use the generative search engine for more knowledge work tasks that are higher in cognitive complexity than were commonly done with a traditional search engine.
△ Less
Submitted 19 March, 2024;
originally announced April 2024.
-
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Authors:
Saksham Suri,
Matthew Walmer,
Kamal Gupta,
Abhinav Shrivastava
Abstract:
We present a simple self-supervised method to enhance the performance of ViT features for dense downstream tasks. Our Lightweight Feature Transform (LiFT) is a straightforward and compact postprocessing network that can be applied to enhance the features of any pre-trained ViT backbone. LiFT is fast and easy to train with a self-supervised objective, and it boosts the density of ViT features for m…
▽ More
We present a simple self-supervised method to enhance the performance of ViT features for dense downstream tasks. Our Lightweight Feature Transform (LiFT) is a straightforward and compact postprocessing network that can be applied to enhance the features of any pre-trained ViT backbone. LiFT is fast and easy to train with a self-supervised objective, and it boosts the density of ViT features for minimal extra inference cost. Furthermore, we demonstrate that LiFT can be applied with approaches that use additional task-specific downstream modules, as we integrate LiFT with ViTDet for COCO detection and segmentation. Despite the simplicity of LiFT, we find that it is not simply learning a more complex version of bilinear interpolation. Instead, our LiFT training protocol leads to several desirable emergent properties that benefit ViT features in dense downstream tasks. This includes greater scale invariance for features, and better object boundary maps. By simply training LiFT for a few epochs, we show improved performance on keypoint correspondence, detection, segmentation, and object discovery tasks. Overall, LiFT provides an easy way to unlock the benefits of denser feature arrays for a fraction of the computational cost. For more details, refer to our project page at https://www.cs.umd.edu/~sakshams/LiFT/.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models
Authors:
Ying-Chun Lin,
Jennifer Neville,
Jack W. Stokes,
Longqi Yang,
Tara Safavi,
Mengting Wan,
Scott Counts,
Siddharth Suri,
Reid Andersen,
Xiaofeng Xu,
Deepak Gupta,
Sujay Kumar Jauhar,
Xia Song,
Georg Buscher,
Saurabh Tiwary,
Brent Hecht,
Jaime Teevan
Abstract:
Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur…
▽ More
Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featurized ML models or text embeddings fall short in extracting generalizable patterns and are hard to interpret. In this work, we show that LLMs can extract interpretable signals of user satisfaction from their natural language utterances more effectively than embedding-based approaches. Moreover, an LLM can be tailored for USE via an iterative prompting framework using supervision from labeled examples. The resulting method, Supervised Prompting for User satisfaction Rubrics (SPUR), not only has higher accuracy but is more interpretable as it scores user satisfaction via learned rubrics with a detailed breakdown.
△ Less
Submitted 8 June, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
TnT-LLM: Text Mining at Scale with Large Language Models
Authors:
Mengting Wan,
Tara Safavi,
Sujay Kumar Jauhar,
Yujin Kim,
Scott Counts,
Jennifer Neville,
Siddharth Suri,
Chirag Shah,
Ryen W White,
Longqi Yang,
Reid Andersen,
Georg Buscher,
Dhruv Joshi,
Nagu Rangan
Abstract:
Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. Thi…
▽ More
Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. This is particularly challenging when the label space is under-specified and large-scale data annotations are unavailable. In this paper, we address these challenges with Large Language Models (LLMs), whose prompt-based interface facilitates the induction and use of large-scale pseudo labels. We propose TnT-LLM, a two-phase framework that employs LLMs to automate the process of end-to-end label generation and assignment with minimal human effort for any given use-case. In the first phase, we introduce a zero-shot, multi-stage reasoning approach which enables LLMs to produce and refine a label taxonomy iteratively. In the second phase, LLMs are used as data labelers that yield training samples so that lightweight supervised classifiers can be reliably built, deployed, and served at scale. We apply TnT-LLM to the analysis of user intent and conversational domain for Bing Copilot (formerly Bing Chat), an open-domain chat-based search engine. Extensive experiments using both human and automatic evaluation metrics demonstrate that TnT-LLM generates more accurate and relevant label taxonomies when compared against state-of-the-art baselines, and achieves a favorable balance between accuracy and efficiency for classification at scale. We also share our practical experiences and insights on the challenges and opportunities of using LLMs for large-scale text mining in real-world applications.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
The impact of generative artificial intelligence on socioeconomic inequalities and policy making
Authors:
Valerio Capraro,
Austin Lentsch,
Daron Acemoglu,
Selin Akgun,
Aisel Akhmedova,
Ennio Bilancini,
Jean-François Bonnefon,
Pablo Brañas-Garza,
Luigi Butera,
Karen M. Douglas,
Jim A. C. Everett,
Gerd Gigerenzer,
Christine Greenhow,
Daniel A. Hashimoto,
Julianne Holt-Lunstad,
Jolanda Jetten,
Simon Johnson,
Chiara Longoni,
Pete Lunn,
Simone Natale,
Iyad Rahwan,
Neil Selwyn,
Vivek Singh,
Siddharth Suri,
Jennifer Sutcliffe
, et al. (6 additional authors not shown)
Abstract:
Generative artificial intelligence has the potential to both exacerbate and ameliorate existing socioeconomic inequalities. In this article, we provide a state-of-the-art interdisciplinary overview of the potential impacts of generative AI on (mis)information and three information-intensive domains: work, education, and healthcare. Our goal is to highlight how generative AI could worsen existing i…
▽ More
Generative artificial intelligence has the potential to both exacerbate and ameliorate existing socioeconomic inequalities. In this article, we provide a state-of-the-art interdisciplinary overview of the potential impacts of generative AI on (mis)information and three information-intensive domains: work, education, and healthcare. Our goal is to highlight how generative AI could worsen existing inequalities while illuminating how AI may help mitigate pervasive social problems. In the information domain, generative AI can democratize content creation and access, but may dramatically expand the production and proliferation of misinformation. In the workplace, it can boost productivity and create new jobs, but the benefits will likely be distributed unevenly. In education, it offers personalized learning, but may widen the digital divide. In healthcare, it might improve diagnostics and accessibility, but could deepen pre-existing inequalities. In each section we cover a specific topic, evaluate existing research, identify critical gaps, and recommend research directions, including explicit trade-offs that complicate the derivation of a priori hypotheses. We conclude with a section highlighting the role of policymaking to maximize generative AI's potential to reduce inequalities while mitigating its harmful effects. We discuss strengths and weaknesses of existing policy frameworks in the European Union, the United States, and the United Kingdom, observing that each fails to fully confront the socioeconomic challenges we have identified. We propose several concrete policies that could promote shared prosperity through the advancement of generative AI. This article emphasizes the need for interdisciplinary collaborations to understand and address the complex challenges of generative AI.
△ Less
Submitted 6 May, 2024; v1 submitted 16 December, 2023;
originally announced January 2024.
-
GMF G214.5-1.8 as traced by CO: I -- cloud-scale CO freeze-out as a result of a low cosmic-ray ionisation rate
Authors:
S. D. Clarke,
V. A. Makeev,
Á. Sánchez-Monge,
G. M. Williams,
Y. -W. Tang,
S. Walch,
R. Higgins,
P. C. Nürnberger,
S. Suri
Abstract:
We present an analysis of the outer Galaxy giant molecular filament (GMF) G214.5-1.8 (G214.5) using IRAM 30m data of $^{12}$CO, $^{13}$CO and C$^{18}$O. We find that the $^{12}$CO (1-0) and (2-1) derived excitation temperatures are near identical and are very low, with a median of 8.2 K, showing that the gas is extremely cold across the whole cloud. Investigating the abundance of $^{13}$CO across…
▽ More
We present an analysis of the outer Galaxy giant molecular filament (GMF) G214.5-1.8 (G214.5) using IRAM 30m data of $^{12}$CO, $^{13}$CO and C$^{18}$O. We find that the $^{12}$CO (1-0) and (2-1) derived excitation temperatures are near identical and are very low, with a median of 8.2 K, showing that the gas is extremely cold across the whole cloud. Investigating the abundance of $^{13}$CO across G214.5, we find that there is a significantly lower abundance along the entire 13 pc spine of the filament, extending out to a radius of $\sim 0.8$ pc, corresponding to $A_v \gtrsim 2$ mag and $T_{dust} \lesssim 13.5$ K. Due to this, we attribute the decrease in abundance to CO freeze-out, making G214.5 the largest scale example of freeze-out yet. We construct an axisymmetric model of G214.5's $^{13}$CO volume density considering freeze-out and find that to reproduce the observed profile significant depletion is required beginning at low volume densities, $n\gtrsim2000$ cm$^{-3}$. Freeze-out at this low number density is possible only if the cosmic-ray ionisation rate is $\sim 1.9 \times 10^{-18}$ s$^{-1}$, an order of magnitude below the typical value. Using timescale arguments, we posit that such a low ionisation rate may lead to ambipolar diffusion being an important physical process along G214.5's entire spine. We suggest that if low cosmic-ray ionisation rates are more common in the outer Galaxy, and other quiescent regions, cloud-scale CO freeze-out occurring at low column and number densities may also be more prevalent, having consequences for CO observations and their interpretation.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Gen2Det: Generate to Detect
Authors:
Saksham Suri,
Fanyi Xiao,
Animesh Sinha,
Sean Chang Culatana,
Raghuraman Krishnamoorthi,
Chenchen Zhu,
Abhinav Shrivastava
Abstract:
Recently diffusion models have shown improvement in synthetic image quality as well as better control in generation. We motivate and present Gen2Det, a simple modular pipeline to create synthetic training data for object detection for free by leveraging state-of-the-art grounded image generation methods. Unlike existing works which generate individual object instances, require identifying foregrou…
▽ More
Recently diffusion models have shown improvement in synthetic image quality as well as better control in generation. We motivate and present Gen2Det, a simple modular pipeline to create synthetic training data for object detection for free by leveraging state-of-the-art grounded image generation methods. Unlike existing works which generate individual object instances, require identifying foreground followed by pasting on other images, we simplify to directly generating scene-centric images. In addition to the synthetic data, Gen2Det also proposes a suite of techniques to best utilize the generated data, including image-level filtering, instance-level filtering, and better training recipe to account for imperfections in the generation. Using Gen2Det, we show healthy improvements on object detection and segmentation tasks under various settings and agnostic to detection methods. In the long-tailed detection setting on LVIS, Gen2Det improves the performance on rare categories by a large margin while also significantly improving the performance on other categories, e.g. we see an improvement of 2.13 Box AP and 1.84 Mask AP over just training on real data on LVIS with Mask R-CNN. In the low-data regime setting on COCO, Gen2Det consistently improves both Box and Mask AP by 2.27 and 1.85 points. In the most general detection setting, Gen2Det still demonstrates robust performance gains, e.g. it improves the Box and Mask AP on COCO by 0.45 and 0.32 points.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies
Authors:
Chirag Shah,
Ryen W. White,
Reid Andersen,
Georg Buscher,
Scott Counts,
Sarkar Snigdha Sarathi Das,
Ali Montazer,
Sathish Manivannan,
Jennifer Neville,
Xiaochuan Ni,
Nagu Rangan,
Tara Safavi,
Siddharth Suri,
Mengting Wan,
Leijie Wang,
Longqi Yang
Abstract:
Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.…
▽ More
Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics. Existing methods rely on manual or machine-learned labeling, which are either expensive or inflexible for large and dynamic datasets. We propose a novel solution using large language models (LLMs), which can generate rich and relevant concepts, descriptions, and examples for user intents. However, using LLMs to generate a user intent taxonomy and apply it for log analysis can be problematic for two main reasons: (1) such a taxonomy is not externally validated; and (2) there may be an undesirable feedback loop. To address this, we propose a new methodology with human experts and assessors to verify the quality of the LLM-generated taxonomy. We also present an end-to-end pipeline that uses an LLM with human-in-the-loop to produce, refine, and apply labels for user intent analysis in log data. We demonstrate its effectiveness by uncovering new insights into user intents from search and chat logs from the Microsoft Bing commercial search engine. The proposed work's novelty stems from the method for generating purpose-driven user intent taxonomies with strong validation. This method not only helps remove methodological and practical bottlenecks from intent-focused research, but also provides a new framework for generating, validating, and applying other kinds of taxonomies in a scalable and adaptable way with reasonable human effort.
△ Less
Submitted 9 May, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
Authors:
Soumik Mukhopadhyay,
Saksham Suri,
Ravi Teja Gadde,
Abhinav Shrivastava
Abstract:
The task of lip synchronization (lip-sync) seeks to match the lips of human faces with different audio. It has various applications in the film industry as well as for creating virtual avatars and for video conferencing. This is a challenging problem as one needs to simultaneously introduce detailed, realistic lip movements while preserving the identity, pose, emotions, and image quality. Many of…
▽ More
The task of lip synchronization (lip-sync) seeks to match the lips of human faces with different audio. It has various applications in the film industry as well as for creating virtual avatars and for video conferencing. This is a challenging problem as one needs to simultaneously introduce detailed, realistic lip movements while preserving the identity, pose, emotions, and image quality. Many of the previous methods trying to solve this problem suffer from image quality degradation due to a lack of complete contextual information. In this paper, we present Diff2Lip, an audio-conditioned diffusion-based model which is able to do lip synchronization in-the-wild while preserving these qualities. We train our model on Voxceleb2, a video dataset containing in-the-wild talking face videos. Extensive studies show that our method outperforms popular methods like Wav2Lip and PC-AVS in Fréchet inception distance (FID) metric and Mean Opinion Scores (MOS) of the users. We show results on both reconstruction (same audio-video inputs) as well as cross (different audio-video inputs) settings on Voxceleb2 and LRW datasets. Video results and code can be accessed from our project page ( https://soumik-kanad.github.io/diff2lip ).
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Fault Tolerance in Euclidean Committee Selection
Authors:
Chinmay Sonar,
Subhash Suri,
Jie Xue
Abstract:
In the committee selection problem, the goal is to choose a subset of size $k$ from a set of candidates $C$ that collectively gives the best representation to a set of voters. We consider this problem in Euclidean $d$-space where each voter/candidate is a point and voters' preferences are implicitly represented by Euclidean distances to candidates. We explore fault-tolerance in committee selection…
▽ More
In the committee selection problem, the goal is to choose a subset of size $k$ from a set of candidates $C$ that collectively gives the best representation to a set of voters. We consider this problem in Euclidean $d$-space where each voter/candidate is a point and voters' preferences are implicitly represented by Euclidean distances to candidates. We explore fault-tolerance in committee selection and study the following three variants: (1) given a committee and a set of $f$ failing candidates, find their optimal replacement; (2) compute the worst-case replacement score for a given committee under failure of $f$ candidates; and (3) design a committee with the best replacement score under worst-case failures. The score of a committee is determined using the well-known (min-max) Chamberlin-Courant rule: minimize the maximum distance between any voter and its closest candidate in the committee. Our main results include the following: (1) in one dimension, all three problems can be solved in polynomial time; (2) in dimension $d \geq 2$, all three problems are NP-hard; and (3) all three problems admit a constant-factor approximation in any fixed dimension, and the optimal committee problem has an FPT bicriterion approximation.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Functional equivariance and modified vector fields
Authors:
Ari Stern,
Sanah Suri
Abstract:
This paper examines functional equivariance, recently introduced by McLachlan and Stern [Found. Comput. Math. (2022)], from the perspective of backward error analysis. We characterize the evolution of certain classes of observables (especially affine and quadratic) by structure-preserving numerical integrators in terms of their modified vector fields. Several results on invariant preservation and…
▽ More
This paper examines functional equivariance, recently introduced by McLachlan and Stern [Found. Comput. Math. (2022)], from the perspective of backward error analysis. We characterize the evolution of certain classes of observables (especially affine and quadratic) by structure-preserving numerical integrators in terms of their modified vector fields. Several results on invariant preservation and symplecticity of modified vector fields are thereby generalized to describe the numerical evolution of non-invariant observables.
△ Less
Submitted 15 November, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning
Authors:
Evan Zheran Liu,
Sahaana Suri,
Tong Mu,
Allan Zhou,
Chelsea Finn
Abstract:
Whereas machine learning models typically learn language by directly training on language tasks (e.g., next-word prediction), language emerges in human children as a byproduct of solving non-language tasks (e.g., acquiring food). Motivated by this observation, we ask: can embodied reinforcement learning (RL) agents also indirectly learn language from non-language tasks? Learning to associate langu…
▽ More
Whereas machine learning models typically learn language by directly training on language tasks (e.g., next-word prediction), language emerges in human children as a byproduct of solving non-language tasks (e.g., acquiring food). Motivated by this observation, we ask: can embodied reinforcement learning (RL) agents also indirectly learn language from non-language tasks? Learning to associate language with its meaning requires a dynamic environment with varied language. Therefore, we investigate this question in a multi-task environment with language that varies across the different tasks. Specifically, we design an office navigation environment, where the agent's goal is to find a particular office, and office locations differ in different buildings (i.e., tasks). Each building includes a floor plan with a simple language description of the goal office's location, which can be visually read as an RGB image when visited. We find RL agents indeed are able to indirectly learn language. Agents trained with current meta-RL algorithms successfully generalize to reading floor plans with held-out layouts and language phrases, and quickly navigate to the correct office, despite receiving no direct language supervision.
△ Less
Submitted 14 June, 2023;
originally announced June 2023.
-
Kinematics and stability of high-mass protostellar disk candidates at sub-arcsecond resolution -- Insights from the IRAM NOEMA large program CORE
Authors:
Aida Ahmadi,
H. Beuther,
F. Bosco,
C. Gieser,
S. Suri,
J. C. Mottram,
R. Kuiper,
Th. Henning,
Á. Sánchez-Monge,
H. Linz,
R. E. Pudritz,
D. Semenov,
J. M. Winters,
T. Möller,
M. T. Beltrán,
T. Csengeri,
R. Galván-Madrid,
K. G. Johnston,
E. Keto,
P. D. Klaassen,
S. Leurini,
S. N. Longmore,
S. L. Lumsden,
L. T. Maud,
L. Moscadelli
, et al. (6 additional authors not shown)
Abstract:
The fragmentation mode of high-mass molecular clumps and the accretion processes that form the most massive stars ($M\gtrsim 8M_\odot$) are still not well understood. To this end, we have undertaken a large observational program (CORE) making use of interferometric observations from the Northern Extended Millimetre Array (NOEMA) for a sample of 20 luminous ($L>10^4L_\odot$) protostellar objects in…
▽ More
The fragmentation mode of high-mass molecular clumps and the accretion processes that form the most massive stars ($M\gtrsim 8M_\odot$) are still not well understood. To this end, we have undertaken a large observational program (CORE) making use of interferometric observations from the Northern Extended Millimetre Array (NOEMA) for a sample of 20 luminous ($L>10^4L_\odot$) protostellar objects in the 1.37 mm wavelength regime in both continuum and line emission, reaching $\sim$0.4" resolution (800 au at 2 kpc). Using the dense gas tracer CH$_3$CN, we find velocity gradients across 13 cores perpendicular to the directions of bipolar molecular outflows, making them excellent disk candidates. Specific angular momentum ($j$) radial profiles are on average $\sim10^{-3}$ km /s pc and follow $j \propto r^{1.7}$, consistent with a poorly resolved rotating and infalling envelope/disk model. Fitting the velocity profiles with a Keplerian model, we find protostellar masses in the range of $\sim 10-25$ $M_\odot$. Modelling the level population of CH$_3$CN lines, we present temperature maps and find median gas temperatures in the range $70-210$ K. We create Toomre $Q$ maps to study the stability of the disks and find almost all (11 of 13) disk candidates to be prone to fragmentation due to gravitational instabilities at the scales probed by our observations. In particular, disks with masses greater than $\sim10-20\%$ of the mass of their host (proto)stars are Toomre unstable, and more luminous protostellar objects tend to have disks that are more massive and hence more prone to fragmentation. Our finings show that most disks around high-mass protostars are prone to disk fragmentation early in their formation due to their high disk to stellar mass ratio. This impacts the accretion evolution of high-mass protostars which will have significant implications for the formation of the most massive stars.
△ Less
Submitted 3 May, 2023; v1 submitted 28 April, 2023;
originally announced May 2023.
-
OpenAssistant Conversations -- Democratizing Large Language Model Alignment
Authors:
Andreas Köpf,
Yannic Kilcher,
Dimitri von Rütte,
Sotiris Anagnostidis,
Zhi-Rui Tam,
Keith Stevens,
Abdullah Barhoum,
Nguyen Minh Duc,
Oliver Stanley,
Richárd Nagyfi,
Shahul ES,
Sameer Suri,
David Glushkov,
Arnav Dantuluri,
Andrew Maguire,
Christoph Schuhmann,
Huu Nguyen,
Alexander Mattick
Abstract:
Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their acce…
▽ More
Aligning large language models (LLMs) with human preferences has proven to drastically improve usability and has driven rapid adoption as demonstrated by ChatGPT. Alignment techniques such as supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) greatly reduce the required skill and domain knowledge to effectively harness the capabilities of LLMs, increasing their accessibility and utility across various domains. However, state-of-the-art alignment techniques like RLHF rely on high-quality human feedback data, which is expensive to create and often remains proprietary. In an effort to democratize research on large-scale alignment, we release OpenAssistant Conversations, a human-generated, human-annotated assistant-style conversation corpus consisting of 161,443 messages in 35 different languages, annotated with 461,292 quality ratings, resulting in over 10,000 complete and fully annotated conversation trees. The corpus is a product of a worldwide crowd-sourcing effort involving over 13,500 volunteers. Models trained on OpenAssistant Conversations show consistent improvements on standard benchmarks over respective base models. We release our code and data under a fully permissive licence.
△ Less
Submitted 31 October, 2023; v1 submitted 14 April, 2023;
originally announced April 2023.
-
Data Combination: Interferometry and Single-dish Imaging in Radio Astronomy
Authors:
Adele Plunkett,
Alvaro Hacar,
Lydia Moser-Fischer,
Dirk Petry,
Peter Teuben,
Nickolas Pingel,
Devaky Kunneriath,
Toshinobu Takagi,
Yusuke Miyamoto,
Emily Moravec,
Sumeyye Suri,
Kelley M. Hess,
Melissa Hoffman,
Brian Mason
Abstract:
Modern interferometers routinely provide radio-astronomical images down to subarcsecond resolution. However, interferometers filter out spatial scales larger than those sampled by the shortest baselines, which affects the measurement of both spatial and spectral features. Complementary single-dish data are vital for recovering the true flux distribution of spatially resolved astronomical sources w…
▽ More
Modern interferometers routinely provide radio-astronomical images down to subarcsecond resolution. However, interferometers filter out spatial scales larger than those sampled by the shortest baselines, which affects the measurement of both spatial and spectral features. Complementary single-dish data are vital for recovering the true flux distribution of spatially resolved astronomical sources with such extended emission. In this work, we provide an overview of the prominent available methods to combine single-dish and interferometric observations. We test each of these methods in the framework of the CASA data analysis software package on both synthetic continuum and observed spectral data sets. We develop a set of new assessment tools that are generally applicable to all radio-astronomical cases of data combination. Applying these new assessment diagnostics, we evaluate the methods' performance and demonstrate the significant improvement of the combined results in comparison to purely interferometric reductions. We provide combination and assessment scripts as add-on material. Our results highlight the advantage of using data combination to ensure high-quality science images of spatially resolved objects.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Teaching Matters: Investigating the Role of Supervision in Vision Transformers
Authors:
Matthew Walmer,
Saksham Suri,
Kamal Gupta,
Abhinav Shrivastava
Abstract:
Vision Transformers (ViTs) have gained significant popularity in recent years and have proliferated into many applications. However, their behavior under different learning paradigms is not well explored. We compare ViTs trained through different methods of supervision, and show that they learn a diverse range of behaviors in terms of their attention, representations, and downstream performance. W…
▽ More
Vision Transformers (ViTs) have gained significant popularity in recent years and have proliferated into many applications. However, their behavior under different learning paradigms is not well explored. We compare ViTs trained through different methods of supervision, and show that they learn a diverse range of behaviors in terms of their attention, representations, and downstream performance. We also discover ViT behaviors that are consistent across supervision, including the emergence of Offset Local Attention Heads. These are self-attention heads that attend to a token adjacent to the current token with a fixed directional offset, a phenomenon that to the best of our knowledge has not been highlighted in any prior work. Our analysis shows that ViTs are highly flexible and learn to process local and global information in different orders depending on their training method. We find that contrastive self-supervised methods learn features that are competitive with explicitly supervised features, and they can even be superior for part-level tasks. We also find that the representations of reconstruction-based models show non-trivial similarity to contrastive self-supervised models. Project website (https://www.cs.umd.edu/~sakshams/vit_analysis) and code (https://www.github.com/mwalmer-umd/vit_analysis) are publicly available.
△ Less
Submitted 5 April, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
ALMA Fragmented Source Catalogue in Orion (FraSCO) I. Outflow interaction within an embedded cluster in OMC-2/FIR3, FIR4, and FIR5
Authors:
Asako Sato,
Satoko Takahashi,
Shun Ishii,
Paul T. P. Ho,
Masahiro N. Machida,
John Carpenter,
Luis A. Zapata,
Paula Stella Teixeira,
Sümeyye Suri
Abstract:
We present a high angular resolution ($\sim1"$) and wide-field ($2'.9 \times 1'.9$) image of the 1.3-mm continuum, CO($J$ = 2--1) line, and SiO($J$ = 5--4) line emissions toward an embedded protocluster, FIR3, FIR4, and FIR5, in the Orion Molecular Cloud 2 obtained from the Atacama Large Millimeter/submillimeter Array (ALMA). We identify 51 continuum sources, 36 of which are newly identified in th…
▽ More
We present a high angular resolution ($\sim1"$) and wide-field ($2'.9 \times 1'.9$) image of the 1.3-mm continuum, CO($J$ = 2--1) line, and SiO($J$ = 5--4) line emissions toward an embedded protocluster, FIR3, FIR4, and FIR5, in the Orion Molecular Cloud 2 obtained from the Atacama Large Millimeter/submillimeter Array (ALMA). We identify 51 continuum sources, 36 of which are newly identified in this study. Their dust masses, projected sizes, and $\mathrm{H_2}$ gas number densities are estimated to be $3.8 \times 10^{-5}$--$ 1.1 \times 10^{-2} \mathrm{M_{\odot}}$, 290--2000 au, and $6.4 \times 10^{6}$--$3.3 \times 10^{8}\,\mathrm{cm^{-3}}$, respectively. The results of a Jeans analysis show that $\sim80\,\%$ of the protostellar sources and $\sim15\,\%$ of the prestellar sources are gravitationally bound. We identify 12 molecular outflows traced in the CO($J$ = 2--1) emission, six of which are newly detected. We spatially resolve shocked gas structures traced by the SiO($J$ = 5--4) emission in this region for the first time. We identify shocked gas originating from outflows and other shocked regions. These results provide direct evidence of an interaction between a dust condensation, FIR4, and an energetic outflow driven by HOPS-370 located within FIR3. A comparison of the outflow dynamical timescales, fragmentation timescales, and protostellar ages shows that the previously proposed triggered star-formation scenario in FIR4 is not strongly supported. We also discuss the spatial distribution of filaments identified in our continuum image by comparing it with a previously identified hub-fiber system in the $\mathrm{N_2H^+}$ line.
△ Less
Submitted 23 November, 2022; v1 submitted 22 November, 2022;
originally announced November 2022.
-
The Cygnus Allscale Survey of Chemistry and Dynamical Environments: CASCADE: Overview and first results toward DR20 from the Max Planck IRAM Observatory program (MIOP)
Authors:
H. Beuther,
F. Wyrowski,
K. M. Menten,
J. M. Winters,
S. Suri,
W. -J. Kim,
L. Bouscasse,
C. Gieser,
M. Sawczuck,
I. B. Christensen,
I. M. Skretas
Abstract:
Context: While star formation on large molecular cloud scales and on small core and disk scales has been investigated intensely over the past decades, the connection of the large-scale interstellar material with the densest small-scale cores has been a largely neglected field.
Methods: Using NOEMA and the IRAM 30\,m telescope, we mapped large areas (640\,arcmin$^2$) of the archetypical star form…
▽ More
Context: While star formation on large molecular cloud scales and on small core and disk scales has been investigated intensely over the past decades, the connection of the large-scale interstellar material with the densest small-scale cores has been a largely neglected field.
Methods: Using NOEMA and the IRAM 30\,m telescope, we mapped large areas (640\,arcmin$^2$) of the archetypical star formation complex Cygnus X at 3.6\,mm wavelengths in line and continuum emission.
Results: The scope and outline of The Cygnus Allscale Survey of Chemistry and Dynamical Environments (CASCADE) is presented. We then focus on the first observed subregion in Cygnus X, namely the DR20 star formation site, which comprises sources in a range of evolutionary stages from cold pristine gas clumps to more evolved ultracompact H{\sc ii} regions. The data covering cloud to cores scales at a linear spatial resolution of $<5000$\,au reveal several kinematic cloud components that are likely part of several large-scale flows onto the central cores. The temperature structure of the region is investigated by means of the HCN/HNC intensity ratio and compared to dust-derived temperatures. We find that the deuterated DCO$^+$ emission is almost exclusively located toward regions at low temperatures below 20\,K. Investigating the slopes of spatial power spectra of dense gas tracer intensity distributions (HCO$^+$, H$^{13}$CO$^+$, and N$_2$H$^+$), we find comparatively flat slopes between $-2.9$ and $-2.6$, consistent with high Mach numbers and/or active star formation in DR20.
△ Less
Submitted 22 July, 2022;
originally announced July 2022.
-
Multiwinner Elections under Minimax Chamberlin-Courant Rule in Euclidean Space
Authors:
Chinmay Sonar,
Subhash Suri,
Jie Xue
Abstract:
We consider multiwinner elections in Euclidean space using the minimax Chamberlin-Courant rule. In this setting, voters and candidates are embedded in a $d$-dimensional Euclidean space, and the goal is to choose a committee of $k$ candidates so that the rank of any voter's most preferred candidate in the committee is minimized. (The problem is also equivalent to the ordinal version of the classica…
▽ More
We consider multiwinner elections in Euclidean space using the minimax Chamberlin-Courant rule. In this setting, voters and candidates are embedded in a $d$-dimensional Euclidean space, and the goal is to choose a committee of $k$ candidates so that the rank of any voter's most preferred candidate in the committee is minimized. (The problem is also equivalent to the ordinal version of the classical $k$-center problem.) We show that the problem is NP-hard in any dimension $d \geq 2$, and also provably hard to approximate. Our main results are three polynomial-time approximation schemes, each of which finds a committee with provably good minimax score. In all cases, we show that our approximation bounds are tight or close to tight. We mainly focus on the $1$-Borda rule but some of our results also hold for the more general $r$-Borda.
△ Less
Submitted 26 May, 2022;
originally announced May 2022.
-
Point Separation and Obstacle Removal by Finding and Hitting Odd Cycles
Authors:
Neeraj Kumar,
Daniel Lokshtanov,
Saket Saurabh,
Subhash Suri,
Jie Xue
Abstract:
Suppose we are given a pair of points $s, t$ and a set $S$ of $n$ geometric objects in the plane, called obstacles. We show that in polynomial time one can construct an auxiliary (multi-)graph $G$ with vertex set $S$ and every edge labeled from $\{0, 1\}$, such that a set $S_d \subseteq S$ of obstacles separates $s$ from $t$ if and only if $G[S_d]$ contains a cycle whose sum of labels is odd. Usin…
▽ More
Suppose we are given a pair of points $s, t$ and a set $S$ of $n$ geometric objects in the plane, called obstacles. We show that in polynomial time one can construct an auxiliary (multi-)graph $G$ with vertex set $S$ and every edge labeled from $\{0, 1\}$, such that a set $S_d \subseteq S$ of obstacles separates $s$ from $t$ if and only if $G[S_d]$ contains a cycle whose sum of labels is odd. Using this structural characterization of separating sets of obstacles we obtain the following algorithmic results.
In the Obstacle-Removal problem the task is to find a curve in the plane connecting s to t intersecting at most q obstacles. We give a $2.3146^qn^{O(1)}$ algorithm for Obstacle-Removal, significantly improving upon the previously best known $q^{O(q^3)} n^{O(1)}$ algorithm of Eiben and Lokshtanov (SoCG'20). We also obtain an alternative proof of a constant factor approximation algorithm for Obstacle-Removal, substantially simplifying the arguments of Kumar et al. (SODA'21).
In the Generalized Points-Separation problem, the input consists of the set S of obstacles, a point set A of k points and p pairs $(s_1, t_1),... (s_p, t_p)$ of points from A. The task is to find a minimum subset $S_r \subseteq S$ such that for every $i$, every curve from $s_i$ to $t_i$ intersects at least one obstacle in $S_r$. We obtain $2^{O(p)} n^{O(k)}$-time algorithm for Generalized Points-Separation problem. This resolves an open problem of Cabello and Giannopoulos (SoCG'13), who asked about the existence of such an algorithm for the special case where $(s_1, t_1), ... (s_p, t_p)$ contains all the pairs of points in A. Finally, we improve the running time of our algorithm to $f(p,k) n^{O(\sqrt{k})}$ when the obstacles are unit disks, where $f(p,k) = 2^O(p) k^{O(k)}$, and show that, assuming the Exponential Time Hypothesis (ETH), the running time dependence on $k$ of our algorithms is essentially optimal.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
FEEDBACK from the NGC7538 HII region
Authors:
H. Beuther,
N. Schneider,
R. Simon,
S. Suri,
V. Ossenkopf-Okada,
S. Kabanovic,
M. Roellig,
C. Guevara,
A. G. G. M. Tielens,
G. Sandell,
C. Buchbender,
O. Ricken,
R. Guesten
Abstract:
Context: How do expanding HII regions interact with their environmental cloud? This is one of the central questions driving the SOFIA legacy program FEEDBACK. Here, we present a case study toward the prototypical H{\sc ii} region NGC7538. Methods: With SOFIA we mapped an area of ~210'^2 around NGC7538 in the [CII] line at 1.9THz. Complementary observed atomic carbon [CI] and high-J CO(8-7) data as…
▽ More
Context: How do expanding HII regions interact with their environmental cloud? This is one of the central questions driving the SOFIA legacy program FEEDBACK. Here, we present a case study toward the prototypical H{\sc ii} region NGC7538. Methods: With SOFIA we mapped an area of ~210'^2 around NGC7538 in the [CII] line at 1.9THz. Complementary observed atomic carbon [CI] and high-J CO(8-7) data as well as archival NIR/FIR, cm continuum, CO(3-2) and HI data are folded into the analysis. Results: While the overall [CII] morphology follows the general ionized gas, the channel maps show multiple bubble-like structures with sizes on the order of ~80-100" (~1.0-1.28pc). While at least one of them may be an individual feedback bubble driven by the main exciting sources of the region, the other bubble-morphologies may also be due to the intrinsically porous structure of the HII region. An analysis of the expansion velocities around 10km s^{-1} indicates that thermal expansion is not sufficient but that wind-driving from the central O-stars is required. The most blue-shifted [CII] component has barely any molecular or atomic counterparts. At the interface to the molecular cloud, we find a typical photon-dominated region (PDR) with a bar-shape. Ionized, atomic and molecular carbon show a layered structure in this PDR. The carbon in the PDR is dominated by its ionized form with atomic and molecular masses of ~0.45+-0.1M_{\odot} and ~1.2+-0.1M_{\odot}, respectively, compared to the ionized carbon in the range of 3.6-9.7M_{\odot}. Conclusions: The NGC7538 HII region exhibits a diverse set of sub-structures that interact with each other as well as with the adjacent cloud. Compared to other recent [CII] observations of HII regions (e.g., Orion Veil, RCW120, RCW49), bubble-shape morphologies revealed in [CII] emission, indicative of expanding shells, are recurring structures of PDRs.
△ Less
Submitted 14 January, 2022;
originally announced January 2022.
-
SparseDet: Improving Sparsely Annotated Object Detection with Pseudo-positive Mining
Authors:
Saksham Suri,
Sai Saketh Rambhatla,
Rama Chellappa,
Abhinav Shrivastava
Abstract:
Training with sparse annotations is known to reduce the performance of object detectors. Previous methods have focused on proxies for missing ground truth annotations in the form of pseudo-labels for unlabeled boxes. We observe that existing methods suffer at higher levels of sparsity in the data due to noisy pseudo-labels. To prevent this, we propose an end-to-end system that learns to separate t…
▽ More
Training with sparse annotations is known to reduce the performance of object detectors. Previous methods have focused on proxies for missing ground truth annotations in the form of pseudo-labels for unlabeled boxes. We observe that existing methods suffer at higher levels of sparsity in the data due to noisy pseudo-labels. To prevent this, we propose an end-to-end system that learns to separate the proposals into labeled and unlabeled regions using Pseudo-positive mining. While the labeled regions are processed as usual, self-supervised learning is used to process the unlabeled regions thereby preventing the negative effects of noisy pseudo-labels. This novel approach has multiple advantages such as improved robustness to higher sparsity when compared to existing methods. We conduct exhaustive experiments on five splits on the PASCAL-VOC and COCO datasets achieving state-of-the-art performance. We also unify various splits used across literature for this task and present a standardized benchmark. On average, we improve by $2.6$, $3.9$ and $9.6$ mAP over previous state-of-the-art methods on three splits of increasing sparsity on COCO. Our project is publicly available at https://www.cs.umd.edu/~sakshams/SparseDet.
△ Less
Submitted 26 August, 2023; v1 submitted 12 January, 2022;
originally announced January 2022.
-
Dynamic Geometric Set Cover, Revisited
Authors:
Timothy M. Chan,
Qizheng He,
Subhash Suri,
Jie Xue
Abstract:
Geometric set cover is a classical problem in computational geometry, which has been extensively studied in the past. In the dynamic version of the problem, points and ranges may be inserted and deleted, and our goal is to efficiently maintain a set cover solution (satisfying certain quality requirement). In this paper, we give a plethora of new dynamic geometric set cover data structures in 1D an…
▽ More
Geometric set cover is a classical problem in computational geometry, which has been extensively studied in the past. In the dynamic version of the problem, points and ranges may be inserted and deleted, and our goal is to efficiently maintain a set cover solution (satisfying certain quality requirement). In this paper, we give a plethora of new dynamic geometric set cover data structures in 1D and 2D, which significantly improve and extend the previous results:
1. The first data structure for $(1+\varepsilon)$-approximate dynamic interval set cover with polylogarithmic amortized update time. Specifically, we achieve an update time of $O(\log^3 n/\varepsilon)$, improving the $O(n^δ/\varepsilon)$ bound of Agarwal et al. [SoCG'20], where $δ>0$ denotes an arbitrarily small constant.
2. A data structure for $O(1)$-approximate dynamic unit-square set cover with $2^{O(\sqrt{\log n})}$ amortized update time, substantially improving the $O(n^{1/2+δ})$ update time of Agarwal et al. [SoCG'20].
3. A data structure for $O(1)$-approximate dynamic square set cover with $O(n^{1/2+δ})$ randomized amortized update time, improving the $O(n^{2/3+δ})$ update time of Chan and He [SoCG'21].
4. A data structure for $O(1)$-approximate dynamic 2D halfplane set cover with $O(n^{17/23+δ})$ randomized amortized update time. The previous solution for halfplane set cover by Chan and He [SoCG'21] is slower and can only report the size of the approximate solution.
5. The first sublinear results for the \textit{weighted} version of dynamic geometric set cover. Specifically, we give a data structure for $(3+o(1))$-approximate dynamic weighted interval set cover with $2^{O(\sqrt{\log n \log\log n})}$ amortized update time and a data structure for $O(1)$-approximate dynamic weighted unit-square set cover with $O(n^δ)$ amortized update time.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
The "Maggie" filament: Physical properties of a giant atomic cloud
Authors:
J. Syed,
J. D. Soler,
H. Beuther,
Y. Wang,
S. Suri,
J. D. Henshaw,
M. Riener,
S. Bialy,
S. Rezaei Kh.,
J. M. Stil,
P. F. Goldsmith,
M. R. Rugel,
S. C. O. Glover,
R. S. Klessen,
J. Kerp,
J. S. Urquhart,
J. Ott,
N. Roy,
N. Schneider,
R. J. Smith,
S. N. Longmore,
H. Linz
Abstract:
The atomic phase of the interstellar medium plays a key role in the formation process of molecular clouds. Due to the line-of-sight confusion in the Galactic plane that is associated with its ubiquity, atomic hydrogen emission has been challenging to study. Employing the high-angular resolution data from the THOR survey, we identify one of the largest, coherent, mostly atomic HI filaments in the M…
▽ More
The atomic phase of the interstellar medium plays a key role in the formation process of molecular clouds. Due to the line-of-sight confusion in the Galactic plane that is associated with its ubiquity, atomic hydrogen emission has been challenging to study. Employing the high-angular resolution data from the THOR survey, we identify one of the largest, coherent, mostly atomic HI filaments in the Milky Way at the line-of-sight velocities around -54 km/s. The giant atomic filament "Maggie", with a total length of 1.2 kpc, is not detected in most other tracers, and does not show signs of active star formation. At a kinematic distance of 17 kpc, Maggie is situated below (by 500 pc) but parallel to the Galactic HI disk and is trailing the predicted location of the Outer Arm by 5-10 km/s in longitude-velocity space. The centroid velocity exhibits a smooth gradient of less than $\pm$3 km/s /10 pc and a coherent structure to within $\pm$6 km/s. The line widths of 10 km/s along the spine of the filament are dominated by non-thermal effects. After correcting for optical depth effects, the mass of Maggie's dense spine is estimated to be $7.2\times10^5\,M_{\odot}$. The mean number density of the filament is 4$\rm\,cm^{-3}$, which is best explained by the filament being a mix of cold and warm neutral gas. In contrast to molecular filaments, the turbulent Mach number and velocity structure function suggest that Maggie is driven by transonic to moderately supersonic velocities that are likely associated with the Galactic potential rather than being subject to the effects of self-gravity or stellar feedback. The column density PDF displays a log-normal shape around a mean of $N_{\rm HI} = 4.8\times 10^{20}\rm\,cm^{-2}$, thus reflecting the absence of dominating effects of gravitational contraction.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
Clustered star formation at early evolutionary stages. Physical and chemical analysis of the young star-forming regions ISOSS J22478+6357 and ISOSS J23053+5953
Authors:
C. Gieser,
H. Beuther,
D. Semenov,
S. Suri,
J. D. Soler,
H. Linz,
J. Syed,
Th. Henning,
S. Feng,
T. Möller,
A. Palau,
J. M. Winters,
M. T. Beltrán,
R. Kuiper,
L. Moscadelli,
P. Klaassen,
J. S. Urquhart,
T. Peters,
S. N. Longmore,
Á. Sánchez-Monge,
R. Galván-Madrid,
R. E. Pudritz,
K. G. Johnston
Abstract:
We aim to characterize the physical and chemical properties of fragmented cores during the earliest evolutionary stages in the very young star-forming regions ISOSS J22478+6357 and ISOSS J23053+5953. NOEMA 1.3 mm data are used in combination with archival mid- and far-infrared observations to construct and fit the SEDs of individual fragmented cores. The radial density profiles are inferred from t…
▽ More
We aim to characterize the physical and chemical properties of fragmented cores during the earliest evolutionary stages in the very young star-forming regions ISOSS J22478+6357 and ISOSS J23053+5953. NOEMA 1.3 mm data are used in combination with archival mid- and far-infrared observations to construct and fit the SEDs of individual fragmented cores. The radial density profiles are inferred from the 1.3 mm continuum visibility profiles and the radial temperature profiles are estimated from H2CO rotation temperature maps. Molecular column densities are derived with the line fitting tool XCLASS. The physical and chemical properties are combined by applying the physical-chemical model MUSCLE in order to constrain the chemical timescales of a few line-rich cores. The morphology and spatial correlations of the molecular emission are analyzed using the HOG method. The mid-infrared data show that both regions contain a cluster of young stellar objects. Bipolar molecular outflows are observed in the CO 2-1 transition toward the strong mm cores indicating protostellar activity. We find strong molecular emission of SO, SiO, H2CO, and CH3OH in locations which are not associated with the mm cores. These shocked knots can be either associated with the bipolar outflows or, in the case of ISOSS J23053+5953, with a colliding flow that creates a large shocked region between the mm cores. The mean chemical timescale of the cores is lower (20 000 yr) compared to that of the sources of the more evolved CORE sample (60 000 yr). With the HOG method, we find that the spatial emission of species tracing the extended emission and of shock-tracing molecules are well correlated within transitions of these groups.
△ Less
Submitted 14 October, 2021; v1 submitted 5 October, 2021;
originally announced October 2021.
-
Quantifying the Invisible Labor in Crowd Work
Authors:
Carlos Toxtli,
Siddharth Suri,
Saiph Savage
Abstract:
Crowdsourcing markets provide workers with a centralized place to find paid work. What may not be obvious at first glance is that, in addition to the work they do for pay, crowd workers also have to shoulder a variety of unpaid invisible labor in these markets, which ultimately reduces workers' hourly wages. Invisible labor includes finding good tasks, messaging requesters, or managing payments. H…
▽ More
Crowdsourcing markets provide workers with a centralized place to find paid work. What may not be obvious at first glance is that, in addition to the work they do for pay, crowd workers also have to shoulder a variety of unpaid invisible labor in these markets, which ultimately reduces workers' hourly wages. Invisible labor includes finding good tasks, messaging requesters, or managing payments. However, we currently know little about how much time crowd workers actually spend on invisible labor or how much it costs them economically. To ensure a fair and equitable future for crowd work, we need to be certain that workers are being paid fairly for all of the work they do. In this paper, we conduct a field study to quantify the invisible labor in crowd work. We build a plugin to record the amount of time that 100 workers on Amazon Mechanical Turk dedicate to invisible labor while completing 40,903 tasks. If we ignore the time workers spent on invisible labor, workers' median hourly wage was $3.76. But, we estimated that crowd workers in our study spent 33% of their time daily on invisible labor, dropping their median hourly wage to $2.83. We found that the invisible labor differentially impacts workers depending on their skill level and workers' demographics. The invisible labor category that took the most time and that was also the most common revolved around workers having to manage their payments. The second most time-consuming invisible labor category involved hyper-vigilance, where workers vigilantly watched over requesters' profiles for newly posted work or vigilantly searched for labor. We hope that through our paper, the invisible labor in crowdsourcing becomes more visible, and our results help to reveal the larger implications of the continuing invisibility of labor in crowdsourcing.
△ Less
Submitted 30 September, 2021;
originally announced October 2021.
-
Disk fragmentation in high-mass star formation. High-resolution observations towards AFGL 2591-VLA 3
Authors:
S. Suri,
H. Beuther,
C. Gieser,
A. Ahmadi,
Á. Sánchez-Monge,
J. M. Winters,
H. Linz,
Th. Henning,
M. T. Beltrán,
F. Bosco,
R. Cesaroni,
T. Csengeri,
S. Feng,
M. G. Hoare,
K. G. Johnston,
P. Klaasen,
R. Kuiper,
S. Leurini,
S. Longmore,
S. Lumsden,
L. Maud,
L. Moscadelli,
T. Möller,
A. Palau,
T. Peters
, et al. (7 additional authors not shown)
Abstract:
Increasing evidence suggests that, similar to their low-mass counterparts, high-mass stars form through a disk-mediated accretion process. At the same time, formation of high-mass stars still necessitates high accretion rates, and hence, high gas densities, which in turn can cause disks to become unstable against gravitational fragmentation. We study the kinematics and fragmentation of the disk ar…
▽ More
Increasing evidence suggests that, similar to their low-mass counterparts, high-mass stars form through a disk-mediated accretion process. At the same time, formation of high-mass stars still necessitates high accretion rates, and hence, high gas densities, which in turn can cause disks to become unstable against gravitational fragmentation. We study the kinematics and fragmentation of the disk around the high-mass star forming region AFGL 2591-VLA 3 which was hypothesized to be fragmenting based on the observations that show multiple outflow directions. We use a new set of high-resolution (0.19 arcsec) IRAM/NOEMA observations at 843 micron towards VLA 3 which allow us to resolve its disk, characterize the fragmentation, and study its kinematics. In addition to the 843 micron continuum emission, our spectral setup targets warm dense gas and outflow tracers such as HCN, HC$_3$N and SO$_2$, as well as vibrationally excited HCN lines. The high resolution continuum and line emission maps reveal multiple fragments with subsolar masses within the inner 1000 AU of VLA 3. Furthermore, the velocity field of the inner disk observed at 843 micron shows a similar behavior to that of the larger scale velocity field studied in the CORE project at 1.37 mm. We present the first observational evidence for disk fragmentation towards AFGL 2591-VLA 3, a source that was thought to be a single high-mass core. While the fragments themselves are low-mass, the rotation of the disk is dominated by the protostar with a mass of 10.3$\pm 1.8~M_{\odot}$. These data also show that NOEMA Band 4 can obtain the highest currently achievable spatial resolution at (sub-)mm wavelengths in observations of strong northern sources.
△ Less
Submitted 10 September, 2021;
originally announced September 2021.
-
Observation and calibration strategies for large-scale multi-beam velocity-resolved mapping of the [CII] emission in the Orion molecular cloud
Authors:
R. Higgins,
S. Kabanovic,
C. Pabst,
D. Teyssier,
J. R. Goicoechea,
O. Berne,
E. Chambers,
M. Wolfire,
S. Suri,
C. Buchbender,
Y. Okada,
M. Mertens,
A. Parikka,
R. Aladro,
H. Richter,
R. Güsten,
J. Stutzki,
A. G. G. M. Tielens
Abstract:
Context. The [CII] 158micron far-infrared fine-structure line is one of the dominant cooling lines of the star-forming interstellar medium (ISM). Hence [CII] emission originates in and thus can be used to trace a range of ISM processes. Velocity-resolved large-scale mapping of [CII] in star-forming regions provides a unique perspective of the kinematics of these regions and their interactions with…
▽ More
Context. The [CII] 158micron far-infrared fine-structure line is one of the dominant cooling lines of the star-forming interstellar medium (ISM). Hence [CII] emission originates in and thus can be used to trace a range of ISM processes. Velocity-resolved large-scale mapping of [CII] in star-forming regions provides a unique perspective of the kinematics of these regions and their interactions with the exciting source of radiation.
Aims. We explore the scientific applications of large-scale mapping of velocity-resolved [CII] observations. With the [CII] observations, we investigate the effect of stellar feedback on the ISM. We present the details of observation, calibration, and data reduction using a heterodyne array receiver mounted on an airborne observatory.
Results. A square-degree [CII] map with a spectral resolution of 0.3 km/s is presented. The scientific potential of this data is summarized with discussion of mechanical and radiative stellar feedback, filament tracing using [CII], [CII] opacity effects, [CII] and carbon recombination lines, and [CII] interaction with the large molecular cloud. The data quality and calibration is discussed in detail, and new techniques are presented to mitigate the effects of unavoidable instrument deficiencies (e.g. baseline stability) and thus to improve the data quality. A comparison with a smaller [CII] map taken with the Herschel/Heterodyne Instrument for the Far-Infrared (HIFI) spectrometer is presented.
△ Less
Submitted 1 July, 2021; v1 submitted 29 June, 2021;
originally announced June 2021.
-
Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins
Authors:
Sahaana Suri,
Ihab F. Ilyas,
Christopher Ré,
Theodoros Rekatsinas
Abstract:
Structured data, or data that adheres to a pre-defined schema, can suffer from fragmented context: information describing a single entity can be scattered across multiple datasets or tables tailored for specific business needs, with no explicit linking keys (e.g., primary key-foreign key relationships or heuristic functions). Context enrichment, or rebuilding fragmented context, using keyless join…
▽ More
Structured data, or data that adheres to a pre-defined schema, can suffer from fragmented context: information describing a single entity can be scattered across multiple datasets or tables tailored for specific business needs, with no explicit linking keys (e.g., primary key-foreign key relationships or heuristic functions). Context enrichment, or rebuilding fragmented context, using keyless joins is an implicit or explicit step in machine learning (ML) pipelines over structured data sources. This process is tedious, domain-specific, and lacks support in now-prevalent no-code ML systems that let users create ML pipelines using just input data and high-level configuration files. In response, we propose Ember, a system that abstracts and automates keyless joins to generalize context enrichment. Our key insight is that Ember can enable a general keyless join operator by constructing an index populated with task-specific embeddings. Ember learns these embeddings by leveraging Transformer-based representation learning techniques. We describe our core architectural principles and operators when developing Ember, and empirically demonstrate that Ember allows users to develop no-code pipelines for five domains, including search, recommendation and question answering, and can exceed alternatives by up to 39% recall, with as little as a single line configuration change.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
Towards Discovery and Attribution of Open-world GAN Generated Images
Authors:
Sharath Girish,
Saksham Suri,
Saketh Rambhatla,
Abhinav Shrivastava
Abstract:
With the recent progress in Generative Adversarial Networks (GANs), it is imperative for media and visual forensics to develop detectors which can identify and attribute images to the model generating them. Existing works have shown to attribute images to their corresponding GAN sources with high accuracy. However, these works are limited to a closed set scenario, failing to generalize to GANs uns…
▽ More
With the recent progress in Generative Adversarial Networks (GANs), it is imperative for media and visual forensics to develop detectors which can identify and attribute images to the model generating them. Existing works have shown to attribute images to their corresponding GAN sources with high accuracy. However, these works are limited to a closed set scenario, failing to generalize to GANs unseen during train time and are therefore, not scalable with a steady influx of new GANs. We present an iterative algorithm for discovering images generated from previously unseen GANs by exploiting the fact that all GANs leave distinct fingerprints on their generated images. Our algorithm consists of multiple components including network training, out-of-distribution detection, clustering, merge and refine steps. Through extensive experiments, we show that our algorithm discovers unseen GANs with high accuracy and also generalizes to GANs trained on unseen real datasets. We additionally apply our algorithm to attribution and discovery of GANs in an online fashion as well as to the more standard task of real/fake detection. Our experiments demonstrate the effectiveness of our approach to discover new GANs and can be used in an open-world setup.
△ Less
Submitted 20 September, 2021; v1 submitted 10 May, 2021;
originally announced May 2021.
-
Learned Spatial Representations for Few-shot Talking-Head Synthesis
Authors:
Moustafa Meshry,
Saksham Suri,
Larry S. Davis,
Abhinav Shrivastava
Abstract:
We propose a novel approach for few-shot talking-head synthesis. While recent works in neural talking heads have produced promising results, they can still produce images that do not preserve the identity of the subject in source images. We posit this is a result of the entangled representation of each subject in a single latent code that models 3D shape information, identity cues, colors, lightin…
▽ More
We propose a novel approach for few-shot talking-head synthesis. While recent works in neural talking heads have produced promising results, they can still produce images that do not preserve the identity of the subject in source images. We posit this is a result of the entangled representation of each subject in a single latent code that models 3D shape information, identity cues, colors, lighting and even background details. In contrast, we propose to factorize the representation of a subject into its spatial and style components. Our method generates a target frame in two steps. First, it predicts a dense spatial layout for the target image. Second, an image generator utilizes the predicted layout for spatial denormalization and synthesizes the target frame. We experimentally show that this disentangled representation leads to a significant improvement over previous methods, both quantitatively and qualitatively.
△ Less
Submitted 29 April, 2021;
originally announced April 2021.
-
Fragmentation and kinematics in high-mass star formation: CORE-extension targeting two very young high-mass star-forming regions
Authors:
H. Beuther,
C. Gieser,
S. Suri,
H. Linz,
P. Klaassen,
D. Semenov,
J. M. Winters,
Th. Henning,
J. D. Soler,
J. S. Urquhart,
J. Syed,
S . Feng,
T. Moeller,
M. T. Beltran,
A. Sanchez-Monge,
S. N. Longmore,
T. Peters,
J. Ballesteros-Paredes,
P. Schilke,
L. Moscadelli,
A. Palau,
R. Cesaroni,
S. Lumsden,
R. Pudritz,
F. Wyrowski
, et al. (2 additional authors not shown)
Abstract:
Context: The formation of high-mass star-forming regions from their parental gas cloud and the subsequent fragmentation processes lie at the heart of star formation research. Aims: We aim to study the dynamical and fragmentation properties at very early evolutionary stages of high-mass star formation. Methods: Employing the NOrthern Extended Millimeter Array (NOEMA) and the IRAM 30m telescope, we…
▽ More
Context: The formation of high-mass star-forming regions from their parental gas cloud and the subsequent fragmentation processes lie at the heart of star formation research. Aims: We aim to study the dynamical and fragmentation properties at very early evolutionary stages of high-mass star formation. Methods: Employing the NOrthern Extended Millimeter Array (NOEMA) and the IRAM 30m telescope, we observed two young high-mass star-forming regions, ISOSS22478 and ISOSS23053, in the 1.3mm continuum and spectral line emission at a high angular resolution (~0.8''). Results: We resolved 29 cores that are mostly located along filament-like structures. Depending on the temperature assumption, these cores follow a mass-size relation of approximately M~r^2.0, corresponding to constant mean column densities. However, with different temperature assumptions, a steeper mass-size relation up to M~r^3.0, which would be more likely to correspond to constant mean volume densities, cannot be ruled out. The correlation of the core masses with their nearest neighbor separations is consistent with thermal Jeans fragmentation. We found hardly any core separations at the spatial resolution limit, indicating that the data resolve the large-scale fragmentation well. Although the kinematics of the two regions appear very different at first sight - multiple velocity components along filaments in ISOSS22478 versus a steep velocity gradient of more than 50km/s/pc in ISOSS23053 - the findings can be explained within the framework of a dynamical cloud collapse scenario. Conclusions: While our data are consistent with a dynamical cloud collapse scenario and subsequent thermal Jeans fragmentation, the importance of additional environmental properties, such as the magnetization of the gas or external shocks triggering converging gas flows, is nonetheless not as well constrained and would require future investigation.
△ Less
Submitted 6 April, 2021;
originally announced April 2021.
-
High-resolution CARMA Observation of Molecular Gas in the North America and Pelican Nebulae
Authors:
Shuo Kong,
Héctor G. Arce,
John M. Carpenter,
John Bally,
Volker Ossenkopf-Okada,
Álvaro Sánchez-Monge,
Anneila I. Sargent,
Sümeyye Suri,
Peregrine McGehee,
Dariusz C. Lis,
Ralf Klessen,
Steve Mairs,
Catherine Zucker,
Rowan J. Smith,
Fumitaka Nakamura,
Thushara G. S. Pillai,
Jens Kauffmann,
Shaobo Zhang
Abstract:
We present the first results from a CARMA high-resolution $^{12}$CO(1-0), $^{13}$CO(1-0), and C$^{18}$O(1-0) molecular line survey of the North America and Pelican (NAP) Nebulae. CARMA observations have been combined with single-dish data from the Purple Mountain 13.7m telescope to add short spacings and produce high-dynamic-range images. We find that the molecular gas is predominantly shaped by t…
▽ More
We present the first results from a CARMA high-resolution $^{12}$CO(1-0), $^{13}$CO(1-0), and C$^{18}$O(1-0) molecular line survey of the North America and Pelican (NAP) Nebulae. CARMA observations have been combined with single-dish data from the Purple Mountain 13.7m telescope to add short spacings and produce high-dynamic-range images. We find that the molecular gas is predominantly shaped by the W80 HII bubble that is driven by an O star. Several bright rims are probably remnant molecular clouds heated and stripped by the massive star. Matching these rims in molecular lines and optical images, we construct a model of the three-dimensional structure of the NAP complex. Two groups of molecular clumps/filaments are on the near side of the bubble, one being pushed toward us, whereas the other is moving toward the bubble. Another group is on the far side of the bubble and moving away. The young stellar objects in the Gulf region reside in three different clusters, each hosted by a cloud from one of the three molecular clump groups. Although all gas content in the NAP is impacted by feedback from the central O star, some regions show no signs of star formation, while other areas clearly exhibit star formation activity. Other molecular gas being carved by feedback includes the cometary structures in the Pelican Head region and the boomerang features at the boundary of the Gulf region. The results show that the NAP complex is an ideal place for the study of feedback effects on star formation.
△ Less
Submitted 7 March, 2021;
originally announced March 2021.
-
The physical and chemical structure of high-mass star-forming regions. Unraveling chemical complexity with the NOEMA large program "CORE"
Authors:
C. Gieser,
H. Beuther,
D. Semenov,
A. Ahmadi,
S. Suri,
T. Möller,
M. T. Beltran,
P. Klaassen,
Q. Zhang,
J. S. Urquhart,
Th. Henning,
S. Feng,
R. Galván-Madrid,
V. de Souza Magalhães,
L. Moscadelli,
S. Longmore,
S. Leurini,
R. Kuiper,
T. Peters,
K. M. Menten,
T. Csengeri,
G. Fuller,
F. Wyrowski,
S. Lumsden,
Á. Sánchez-Monge
, et al. (8 additional authors not shown)
Abstract:
We use sub-arcsecond resolution ($\sim$0.4$''$) observations with NOEMA at 1.37 mm to study the dust emission and molecular gas of 18 high-mass star-forming regions. We combine the derived physical and chemical properties of individual cores in these regions to estimate their ages. The temperature structure of these regions are determined by fitting H2CO and CH3CN line emission. The density profil…
▽ More
We use sub-arcsecond resolution ($\sim$0.4$''$) observations with NOEMA at 1.37 mm to study the dust emission and molecular gas of 18 high-mass star-forming regions. We combine the derived physical and chemical properties of individual cores in these regions to estimate their ages. The temperature structure of these regions are determined by fitting H2CO and CH3CN line emission. The density profiles are inferred from the 1.37 mm continuum visibilities. The column densities of 11 different species are determined by fitting the emission lines with XCLASS. Within the 18 observed regions, we identify 22 individual cores with associated 1.37 mm continuum emission and with a radially decreasing temperature profile. We find an average temperature power-law index of q = 0.4$\pm$0.1 and an average density power-law index of p = 2.0$\pm$0.2 on scales on the order of several 1 000 au. Comparing these results with values of p derived in the literature suggest that the density profiles remain unchanged from clump to core scales. The column densities relative to N(C18O) between pairs of dense gas tracers show tight correlations. We apply the physical-chemical model MUSCLE to the derived column densities of each core and find a mean chemical age of $\sim$60 000 yrs and an age spread of 20 000-100 000 yrs. With this paper we release all data products of the CORE project available at https://www.mpia.de/core. The CORE sample reveals well constrained density and temperature power-law distributions. Furthermore, we characterize a large variety in molecular richness that can be explained by an age spread confirmed by our physical-chemical modeling. The hot molecular cores show the most emission lines, but we also find evolved cores at an evolutionary stage, in which most molecules are destroyed and thus the spectra appear line-poor again.
△ Less
Submitted 23 February, 2021;
originally announced February 2021.
-
The Maximum Exposure Problem
Authors:
Neeraj Kumar,
Stavros Sintos,
Subhash Suri
Abstract:
Given a set of points $P$ and axis-aligned rectangles $\mathcal{R}$ in the plane, a point $p \in P$ is called \emph{exposed} if it lies outside all rectangles in $\mathcal{R}$. In the \emph{max-exposure problem}, given an integer parameter $k$, we want to delete $k$ rectangles from $\mathcal{R}$ so as to maximize the number of exposed points. We show that the problem is NP-hard and assuming plausi…
▽ More
Given a set of points $P$ and axis-aligned rectangles $\mathcal{R}$ in the plane, a point $p \in P$ is called \emph{exposed} if it lies outside all rectangles in $\mathcal{R}$. In the \emph{max-exposure problem}, given an integer parameter $k$, we want to delete $k$ rectangles from $\mathcal{R}$ so as to maximize the number of exposed points. We show that the problem is NP-hard and assuming plausible complexity conjectures is also hard to approximate even when rectangles in $\mathcal{R}$ are translates of two fixed rectangles. However, if $\mathcal{R}$ only consists of translates of a single rectangle, we present a polynomial-time approximation scheme. For range space defined by general rectangles, we present a simple $O(k)$ bicriteria approximation algorithm; that is by deleting $O(k^2)$ rectangles, we can expose at least $Ω(1/k)$ of the optimal number of points.
△ Less
Submitted 5 February, 2021;
originally announced February 2021.
-
The SEDIGISM survey: first data release and overview of the Galactic structure
Authors:
F. Schuller,
J. S. Urquhart,
T. Csengeri,
D. Colombo,
A. Duarte-Cabral,
M. Mattern,
A. Ginsburg,
A. R. Pettitt,
F. Wyrowski,
L. Anderson,
F. Azagra,
P. Barnes,
M. Beltran,
H. Beuther,
S. Billington,
L. Bronfman,
R. Cesaroni,
C. Dobbs,
D. Eden,
M. -Y. Lee,
S. -N. X. Medina,
K. M. Menten,
T. Moore,
F. M. Montenegro-Montes,
S. Ragan
, et al. (35 additional authors not shown)
Abstract:
The SEDIGISM (Structure, Excitation and Dynamics of the Inner Galactic Interstellar Medium) survey used the APEX telescope to map 84 deg^2 of the Galactic plane between l = -60 deg and l = +31 deg in several molecular transitions, including 13CO(2-1) and C18O(2-1), thus probing the moderately dense (~10^3 cm^-3) component of the interstellar medium. With an angular resolution of 30'' and a typical…
▽ More
The SEDIGISM (Structure, Excitation and Dynamics of the Inner Galactic Interstellar Medium) survey used the APEX telescope to map 84 deg^2 of the Galactic plane between l = -60 deg and l = +31 deg in several molecular transitions, including 13CO(2-1) and C18O(2-1), thus probing the moderately dense (~10^3 cm^-3) component of the interstellar medium. With an angular resolution of 30'' and a typical 1-sigma sensitivity of 0.8-1.0 K at 0.25 km/s velocity resolution, it gives access to a wide range of structures, from individual star-forming clumps to giant molecular clouds and complexes. The coverage includes a good fraction of the first and fourth Galactic quadrants, allowing us to constrain the large scale distribution of cold molecular gas in the inner Galaxy. In this paper we provide an updated overview of the full survey and the data reduction procedures used. We also assess the quality of these data and describe the data products that are being made publicly available as part of this first data release (DR1). We present integrated maps and position-velocity maps of the molecular gas and use these to investigate the correlation between the molecular gas and the large scale structural features of the Milky Way such as the spiral arms, Galactic bar and Galactic centre. We find that approximately 60 per cent of the molecular gas is associated with the spiral arms and these appear as strong intensity peaks in the derived Galactocentric distribution. We also find strong peaks in intensity at specific longitudes that correspond to the Galactic centre and well known star forming complexes, revealing that the 13CO emission is concentrated in a small number of complexes rather than evenly distributed along spiral arms.
△ Less
Submitted 2 December, 2020;
originally announced December 2020.
-
The SEDIGISM survey: Molecular clouds in the inner Galaxy
Authors:
A. Duarte-Cabral,
D. Colombo,
J. S. Urquhart,
A. Ginsburg,
D. Russeil,
F. Schuller,
L. D. Anderson,
P. J. Barnes,
M. T. Beltran,
H. Beuther,
S. Bontemps,
L. Bronfman,
T. Csengeri,
C. L. Dobbs,
D. Eden,
A. Giannetti,
J. Kauffmann,
M. Mattern,
S. -N. X. Medina,
K. M. Menten,
M. -Y. Lee,
A. R. Pettitt,
M. Riener,
A. J. Rigby,
A. Trafficante
, et al. (35 additional authors not shown)
Abstract:
We use the 13CO(2-1) emission from the SEDIGISM high-resolution spectral-line survey of the inner Galaxy, to extract the molecular cloud population with a large dynamic range in spatial scales, using the SCIMES algorithm. This work compiles a cloud catalogue with a total of 10663 molecular clouds, 10300 of which we were able to assign distances and compute physical properties. We study some of the…
▽ More
We use the 13CO(2-1) emission from the SEDIGISM high-resolution spectral-line survey of the inner Galaxy, to extract the molecular cloud population with a large dynamic range in spatial scales, using the SCIMES algorithm. This work compiles a cloud catalogue with a total of 10663 molecular clouds, 10300 of which we were able to assign distances and compute physical properties. We study some of the global properties of clouds using a science sample, consisting of 6664 well resolved sources and for which the distance estimates are reliable. In particular, we compare the scaling relations retrieved from SEDIGISM to those of other surveys, and we explore the properties of clouds with and without high-mass star formation. Our results suggest that there is no single global property of a cloud that determines its ability to form massive stars, although we find combined trends of increasing mass, size, surface density and velocity dispersion for the sub-sample of clouds with ongoing high-mass star formation. We then isolate the most extreme clouds in the SEDIGISM sample (i.e. clouds in the tails of the distributions) to look at their overall Galactic distribution, in search for hints of environmental effects. We find that, for most properties, the Galactic distribution of the most extreme clouds is only marginally different to that of the global cloud population. The Galactic distribution of the largest clouds, the turbulent clouds and the high-mass star-forming clouds are those that deviate most significantly from the global cloud population. We also find that the least dynamically active clouds (with low velocity dispersion or low virial parameter) are situated further afield, mostly in the least populated areas. However, we suspect that part of these trends may be affected by some observational biases, and thus require further follow up work in order to be confirmed.
△ Less
Submitted 2 December, 2020;
originally announced December 2020.
-
A Constant Factor Approximation for Navigating Through Connected Obstacles in the Plane
Authors:
Neeraj Kumar,
Daniel Lokshtanov,
Saket Saurabh,
Subhash Suri
Abstract:
Given two points s and t in the plane and a set of obstacles defined by closed curves, what is the minimum number of obstacles touched by a path connecting s and t? This is a fundamental and well-studied problem arising naturally in computational geometry, graph theory (under the names Min-Color Path and Minimum Label Path), wireless sensor networks (Barrier Resilience) and motion planning (Minimu…
▽ More
Given two points s and t in the plane and a set of obstacles defined by closed curves, what is the minimum number of obstacles touched by a path connecting s and t? This is a fundamental and well-studied problem arising naturally in computational geometry, graph theory (under the names Min-Color Path and Minimum Label Path), wireless sensor networks (Barrier Resilience) and motion planning (Minimum Constraint Removal). It remains NP-hard even for very simple-shaped obstacles such as unit-length line segments. In this paper we give the first constant factor approximation algorithm for this problem, resolving an open problem of [Chan and Kirkpatrick, TCS, 2014] and [Bandyapadhyay et al., CGTA, 2020]. We also obtain a constant factor approximation for the Minimum Color Prize Collecting Steiner Forest where the goal is to connect multiple request pairs (s1, t1), . . . ,(sk, tk) while minimizing the number of obstacles touched by any (si, ti) path plus a fixed cost of wi for each pair (si, ti) left disconnected. This generalizes the classic Steiner Forest and Prize-Collecting Steiner Forest problems on planar graphs, for which intricate PTASes are known. In contrast, no PTAS is possible for Min-Color Path even on planar graphs since the problem is known to be APXhard [Eiben and Kanj, TALG, 2020]. Additionally, we show that generalizations of the problem to disconnected obstacles
△ Less
Submitted 29 November, 2020;
originally announced November 2020.
-
The CARMA-NRO Orion Survey: Filament Formation via Collision-Induced Magnetic Reconnection -- The Stick in Orion A
Authors:
Shuo Kong,
Volker Ossenkopf-Okada,
Héctor G. Arce,
John Bally,
Álvaro Sánchez-Monge,
Peregrine McGehee,
Sümeyye Suri,
Ralf S. Klessen,
John M. Carpenter,
Dariusz C. Lis,
Fumitaka Nakamura,
Peter Schilke,
Rowan J. Smith,
Steve Mairs,
Alyssa Goodman,
María José Maureira
Abstract:
A unique filament is identified in the {\it Herschel} maps of the Orion A giant molecular cloud. The filament, which, we name the Stick, is ruler-straight and at an early evolutionary stage. Transverse position-velocity diagrams show two velocity components closing in on the Stick. The filament shows consecutive rings/forks in C$^{18}$O(1-0) channel maps, which is reminiscent of structures generat…
▽ More
A unique filament is identified in the {\it Herschel} maps of the Orion A giant molecular cloud. The filament, which, we name the Stick, is ruler-straight and at an early evolutionary stage. Transverse position-velocity diagrams show two velocity components closing in on the Stick. The filament shows consecutive rings/forks in C$^{18}$O(1-0) channel maps, which is reminiscent of structures generated by magnetic reconnection. We propose that the Stick formed via collision-induced magnetic reconnection (CMR). We use the magnetohydrodynamics (MHD) code Athena++ to simulate the collision between two diffuse molecular clumps, each carrying an anti-parallel magnetic field. The clump collision produces a narrow, straight, dense filament with a factor of $>$200 increase in density. The production of the dense gas is seven times faster than free-fall collapse. The dense filament shows ring/fork-like structures in radiative transfer maps. Cores in the filament are confined by surface magnetic pressure. CMR can be an important dense-gas-producing mechanism in the Galaxy and beyond.
△ Less
Submitted 31 October, 2020;
originally announced November 2020.
-
Leveraging Organizational Resources to Adapt Models to New Data Modalities
Authors:
Sahaana Suri,
Raghuveer Chanda,
Neslihan Bulut,
Pradyumna Narayana,
Yemao Zeng,
Peter Bailis,
Sugato Basu,
Girija Narlikar,
Christopher Re,
Abishek Sethi
Abstract:
As applications in large organizations evolve, the machine learning (ML) models that power them must adapt the same predictive tasks to newly arising data modalities (e.g., a new video content launch in a social media application requires existing text or image models to extend to video). To solve this problem, organizations typically create ML pipelines from scratch. However, this fails to utiliz…
▽ More
As applications in large organizations evolve, the machine learning (ML) models that power them must adapt the same predictive tasks to newly arising data modalities (e.g., a new video content launch in a social media application requires existing text or image models to extend to video). To solve this problem, organizations typically create ML pipelines from scratch. However, this fails to utilize the domain expertise and data they have cultivated from developing tasks for existing modalities. We demonstrate how organizational resources, in the form of aggregate statistics, knowledge bases, and existing services that operate over related tasks, enable teams to construct a common feature space that connects new and existing data modalities. This allows teams to apply methods for training data curation (e.g., weak supervision and label propagation) and model training (e.g., forms of multi-modal learning) across these different data modalities. We study how this use of organizational resources composes at production scale in over 5 classification tasks at Google, and demonstrate how it reduces the time needed to develop models for new modalities from months to weeks to days.
△ Less
Submitted 23 August, 2020;
originally announced August 2020.
-
How Work From Home Affects Collaboration: A Large-Scale Study of Information Workers in a Natural Experiment During COVID-19
Authors:
Longqi Yang,
Sonia Jaffe,
David Holtz,
Siddharth Suri,
Shilpi Sinha,
Jeffrey Weston,
Connor Joyce,
Neha Shah,
Kevin Sherman,
CJ Lee,
Brent Hecht,
Jaime Teevan
Abstract:
The COVID-19 pandemic has had a wide-ranging impact on information workers such as higher stress levels, increased workloads, new workstreams, and more caregiving responsibilities during lockdown. COVID-19 also caused the overwhelming majority of information workers to rapidly shift to working from home (WFH). The central question this work addresses is: can we isolate the effects of WFH on inform…
▽ More
The COVID-19 pandemic has had a wide-ranging impact on information workers such as higher stress levels, increased workloads, new workstreams, and more caregiving responsibilities during lockdown. COVID-19 also caused the overwhelming majority of information workers to rapidly shift to working from home (WFH). The central question this work addresses is: can we isolate the effects of WFH on information workers' collaboration activities from all other factors, especially the other effects of COVID-19? This is important because in the future, WFH will likely to be more common than it was prior to the pandemic.
We use difference-in-differences (DiD), a causal identification strategy commonly used in the social sciences, to control for unobserved confounding factors and estimate the causal effect of WFH. Our analysis relies on measuring the difference in changes between those who WFH prior to COVID-19 and those who did not. Our preliminary results suggest that on average, people spent more time on collaboration in April (Post WFH mandate) than in February (Pre WFH mandate), but this is primarily due to factors other than WFH, such as lockdowns during the pandemic. The change attributable to WFH specifically is in the opposite direction: less time on collaboration and more focus time. This reversal shows the importance of using causal inference: a simple analysis would have resulted in the wrong conclusion. We further find that the effect of WFH is moderated by individual remote collaboration experience prior to WFH. Meanwhile, the medium for collaboration has also shifted due to WFH: instant messages were used more, whereas scheduled meetings were used less. We discuss design implications -- how future WFH may affect focused work, collaborative work, and creative work.
△ Less
Submitted 30 July, 2020;
originally announced July 2020.
-
Pseudo Rehearsal using non photo-realistic images
Authors:
Bhasker Sri Harsha Suri,
Kalidas Yeturu
Abstract:
Deep Neural networks forget previously learnt tasks when they are faced with learning new tasks. This is called catastrophic forgetting. Rehearsing the neural network with the training data of the previous task can protect the network from catastrophic forgetting. Since rehearsing requires the storage of entire previous data, Pseudo rehearsal was proposed, where samples belonging to the previous d…
▽ More
Deep Neural networks forget previously learnt tasks when they are faced with learning new tasks. This is called catastrophic forgetting. Rehearsing the neural network with the training data of the previous task can protect the network from catastrophic forgetting. Since rehearsing requires the storage of entire previous data, Pseudo rehearsal was proposed, where samples belonging to the previous data are generated synthetically for rehearsal. In an image classification setting, while current techniques try to generate synthetic data that is photo-realistic, we demonstrated that Neural networks can be rehearsed on data that is not photo-realistic and still achieve good retention of the previous task. We also demonstrated that forgoing the constraint of having photo realism in the generated data can result in a significant reduction in the consumption of computational and memory resources for pseudo rehearsal.
△ Less
Submitted 28 April, 2020;
originally announced April 2020.
-
Molecular globules in the Veil bubble of Orion. IRAM 30m 12CO, 13CO, and C18O 2-1 expanded maps of Orion A
Authors:
J. R. Goicoechea,
C. H. M. Pabst,
S. Kabanovic,
M. G. Santa-Maria,
N. Marcelino,
A. G. G. M. Tielens,
A. Hacar,
O. Berne,
C. Buchbender,
S. Cuadrado,
R. Higgins,
C. Kramer,
J. Stutzki,
S. Suri,
D. Teyssier,
M. Wolfire
Abstract:
Strong winds and ultraviolet (UV) radiation from O-type stars disrupt and ionize their molecular core birthplaces, sweeping up material into parsec-size shells. Owing to dissociation by starlight, the thinnest shells are expected to host low molecular abundances and therefore little star formation. Here, we expand previous maps taken with the IRAM 30m telescope and present square-degree 12CO and 1…
▽ More
Strong winds and ultraviolet (UV) radiation from O-type stars disrupt and ionize their molecular core birthplaces, sweeping up material into parsec-size shells. Owing to dissociation by starlight, the thinnest shells are expected to host low molecular abundances and therefore little star formation. Here, we expand previous maps taken with the IRAM 30m telescope and present square-degree 12CO and 13CO (J=2-1) maps of the wind-driven "Veil bubble'' that surrounds the Trapezium cluster and its natal Orion molecular core (OMC). Although widespread and extended CO emission is largely absent from the Veil, we show that several CO "globules'' exist and are embedded in the [CII]158um-bright shell that confines the bubble. This includes the first detection of quiescent CO at negative LSR velocities in Orion. Given the harsh UV irradiation conditions in this translucent material, the detection of CO globules is surprising. These globules are small (R=7,100 AU), not massive (M=0.3M_Sun), and are moderately dense: n_ H=4x10^4 cm^-3 (median values). They are confined by the external pressure of the shell, P_ext/k~10^7 cm^-3 K, and are likely magnetically supported. They are either transient objects formed by instabilities or have detached from pre-existing molecular structures, sculpted by the passing shock associated with the expanding shell and by UV radiation from the Trapezium. Some represent the first stages in the formation of small pillars, others of isolated small globules. Although their masses do not suggest they will form stars, one globule matches the position of a known YSO. The lack of extended CO in the "Veil shell'' demonstrates that feedback from massive stars expels, agitates, and reprocesses most of the disrupted molecular cloud gas, thereby limiting the star-formation rate in the region. The presence of globules is a result of this feedback.
△ Less
Submitted 5 May, 2020; v1 submitted 27 April, 2020;
originally announced April 2020.
-
The CARMA-NRO Orion Survey: Protostellar Outflows, Energetics, and Filamentary Alignment
Authors:
Jesse R. Feddersen,
Héctor G. Arce,
Shuo Kong,
Sümeyye Suri,
Álvaro Sánchez-Monge,
Volker Ossenkopf-Okada,
Michael M. Dunham,
Fumitaka Nakamura,
Yoshito Shimajiri,
John Bally
Abstract:
We identify 45 protostellar outflows in CO maps of the Orion A giant molecular cloud from the CARMA-NRO Orion survey. Our sample includes 11 newly detected outflows. We measure the mass and energetics of the outflows, including material at low-velocities by correcting for cloud contributions. The total momentum and kinetic energy injection rates of outflows is comparable to the turbulent dissipati…
▽ More
We identify 45 protostellar outflows in CO maps of the Orion A giant molecular cloud from the CARMA-NRO Orion survey. Our sample includes 11 newly detected outflows. We measure the mass and energetics of the outflows, including material at low-velocities by correcting for cloud contributions. The total momentum and kinetic energy injection rates of outflows is comparable to the turbulent dissipation rate of the cloud. We also compare the outflow position angles to the orientation of C$^{18}$O filaments. We find that the full sample of outflows is consistent with being randomly oriented with respect to the filaments. A subsample of the most reliable measurements shows a moderately perpendicular outflow-filament alignment which may reflect accretion of mass across filaments and onto the protostellar cores.
△ Less
Submitted 7 April, 2020;
originally announced April 2020.
-
Dynamic geometric set cover and hitting set
Authors:
Pankaj K. Agarwal,
Hsien-Chih Chang,
Subhash Suri,
Allen Xiao,
Jie Xue
Abstract:
We investigate dynamic versions of geometric set cover and hitting set where points and ranges may be inserted or deleted, and we want to efficiently maintain an (approximately) optimal solution for the current problem instance. While their static versions have been extensively studied in the past, surprisingly little is known about dynamic geometric set cover and hitting set. For instance, even f…
▽ More
We investigate dynamic versions of geometric set cover and hitting set where points and ranges may be inserted or deleted, and we want to efficiently maintain an (approximately) optimal solution for the current problem instance. While their static versions have been extensively studied in the past, surprisingly little is known about dynamic geometric set cover and hitting set. For instance, even for the most basic case of one-dimensional interval set cover and hitting set, no nontrivial results were known. The main contribution of our paper are two frameworks that lead to efficient data structures for dynamically maintaining set covers and hitting sets in $\mathbb{R}^1$ and $\mathbb{R}^2$. The first framework uses bootstrapping and gives a $(1+\varepsilon)$-approximate data structure for dynamic interval set cover in $\mathbb{R}^1$ with $O(n^α/\varepsilon)$ amortized update time for any constant $α> 0$; in $\mathbb{R}^2$, this method gives $O(1)$-approximate data structures for unit-square (and quadrant) set cover and hitting set with $O(n^{1/2+α})$ amortized update time. The second framework uses local modification, and leads to a $(1+\varepsilon)$-approximate data structure for dynamic interval hitting set in $\mathbb{R}^1$ with $\widetilde{O}(1/\varepsilon)$ amortized update time; in $\mathbb{R}^2$, it gives $O(1)$-approximate data structures for unit-square (and quadrant) set cover and hitting set in the \textit{partially} dynamic settings with $\widetilde{O}(1)$ amortized update time.
△ Less
Submitted 29 February, 2020;
originally announced March 2020.
-
Chemical complexity in high-mass star formation: An observational and modeling case study of the AFGL 2591 VLA 3 hot core
Authors:
C. Gieser,
D. Semenov,
H. Beuther,
A. Ahmadi,
J. C. Mottram,
Th. Henning,
M. Beltran,
L. T. Maud,
F. Bosco,
S. Leurini,
T. Peters,
P. Klaassen,
R. Kuiper,
S. Feng,
J. S. Urquhart,
L. Moscadelli,
T. Csengeri,
S. Lumsden,
J. M. Winters,
S. Suri,
Q. Zhang,
R. Pudritz,
A. Palau,
K. M. Menten,
R. Galvan-Madrid
, et al. (8 additional authors not shown)
Abstract:
We present a detailed observational and modeling study of the hot core VLA 3 in the high-mass star-forming region AFGL 2591, which is a target region of the NOrthern Extended Millimeter Array (NOEMA) large program CORE. Using NOEMA observations at 1.37 mm with an angular resolution of ~0."42 (1 400 au at 3.33 kpc), we derived the physical and chemical structure of the source. We modeled the observ…
▽ More
We present a detailed observational and modeling study of the hot core VLA 3 in the high-mass star-forming region AFGL 2591, which is a target region of the NOrthern Extended Millimeter Array (NOEMA) large program CORE. Using NOEMA observations at 1.37 mm with an angular resolution of ~0."42 (1 400 au at 3.33 kpc), we derived the physical and chemical structure of the source. We modeled the observed molecular abundances with the chemical evolution code MUSCLE (MUlti Stage ChemicaL codE). Results. With the kinetic temperature tracers CH3CN and H2CO we observe a temperature distribution with a power-law index of q = 0.41+-0.08. Using the visibilities of the continuum emission we derive a density structure with a power-law index of p = 1.7+-0.1. The hot core spectra reveal high molecular abundances and a rich diversity in complex molecules. The majority of the molecules have an asymmetric spatial distribution around the forming protostar(s), which indicates a complex physical structure on scales < 1 400 au. Using MUSCLE, we are able to explain the observed molecular abundance of 10 out of 14 modeled species at an estimated hot core chemical age of ~21 100 years. In contrast to the observational analysis, our chemical modeling predicts a lower density power-law index of p < 1.4. Reasons for this discrepancy are discussed. Conclusions. Combining high spatial resolution observations with detailed chemical modeling allows us to derive a concise picture of the physical and chemical structure of the famous AFGL 2591 hot core. The next steps are to conduct a similar analysis for the whole CORE sample, and then use this analysis to constrain the chemical diversity in high-mass star formation to a much greater depth.
△ Less
Submitted 11 October, 2019;
originally announced October 2019.
-
What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring
Authors:
Andi Peng,
Besmira Nushi,
Emre Kiciman,
Kori Inkpen,
Siddharth Suri,
Ece Kamar
Abstract:
Although systematic biases in decision-making are widely documented, the ways in which they emerge from different sources is less understood. We present a controlled experimental platform to study gender bias in hiring by decoupling the effect of world distribution (the gender breakdown of candidates in a specific profession) from bias in human decision-making. We explore the effectiveness of \tex…
▽ More
Although systematic biases in decision-making are widely documented, the ways in which they emerge from different sources is less understood. We present a controlled experimental platform to study gender bias in hiring by decoupling the effect of world distribution (the gender breakdown of candidates in a specific profession) from bias in human decision-making. We explore the effectiveness of \textit{representation criteria}, fixed proportional display of candidates, as an intervention strategy for mitigation of gender bias by conducting experiments measuring human decision-makers' rankings for who they would recommend as potential hires. Experiments across professions with varying gender proportions show that balancing gender representation in candidate slates can correct biases for some professions where the world distribution is skewed, although doing so has no impact on other professions where human persistent preferences are at play. We show that the gender of the decision-maker, complexity of the decision-making task and over- and under-representation of genders in the candidate slate can all impact the final decision. By decoupling sources of bias, we can better isolate strategies for bias mitigation in human-in-the-loop systems.
△ Less
Submitted 8 September, 2019;
originally announced September 2019.