subscribe to arXiv mailings

doi 10.1021/acscentsci.1c00599

Twisting of 2D kagomé sheets in layered intermetallics

Authors: Mekhola Sinha, Hector K. Vivanco, Cheng Wan, Maxime A. Siegler, Veronica J. Stewart, Elizabeth A. Pogue, Lucas A. Pressley, Tanya Berry, Ziqian Wang, Isaac Johnson, Mingwei Chen, Thao T. Tran, W. Adam Phelan, Tyrel M. McQueen

Abstract: Chemical bonding in 2D layered materials and van der Waals solids is central to understanding and harnessing their unique electronic, magnetic, optical, thermal and superconducting properties. Here we report the discovery of spontaneous, bidirectional, bilayer twisting (twist angle ~ 4.5°) in the metallic kagomé MgCo6Ge6 at T = 100(2) K via X-ray diffraction measure-ments, enabled by the preparati… ▽ More Chemical bonding in 2D layered materials and van der Waals solids is central to understanding and harnessing their unique electronic, magnetic, optical, thermal and superconducting properties. Here we report the discovery of spontaneous, bidirectional, bilayer twisting (twist angle ~ 4.5°) in the metallic kagomé MgCo6Ge6 at T = 100(2) K via X-ray diffraction measure-ments, enabled by the preparation of single crystals by the Laser Bridgman method. Despite the appearance of static twisting on cooling from T ~ 300 K to 100 K, no evidence for a phase transition was found in physical properties measurements. Combined with the presence of an Einstein phonon mode contribution in the specific heat, this implies that the twisting exists at all temperatures but is thermally fluctuating at room temperature. Crystal Orbital Hamilton Population analysis demonstrates that the cooperative twisting between layers stabilizes the Co-kagomé network when coupled to strongly bonded and rigid (Ge2) dimers that connect adjacent layers. Further modelling of the displacive disorder in the crystal structure shows the presence of second, Mg-deficient, stacking sequence. This alternative stacking sequence also exhibits inter-layer twisting, but with a different pattern, consistent with the change in electron count due to removal of Mg. Magnetization, resistivity, and low-temperature specific heat measurements are all consistent with a Pauli paramagnetic, strongly correlated metal. Our results provide crucial insight into how chemical concepts lead to interesting electronic structures and behaviors in layered materials. △ Less

Submitted 6 August, 2021; v1 submitted 6 February, 2021; originally announced February 2021.

arXiv:2102.03489 [pdf, ps, other]

Communications using Sparse Signals

Authors: Madhusudan Kumar Sinha, Arun Pachai Kannu

Abstract: Inspired by compressive sensing principles, we propose novel error control coding techniques for communication systems. The information bits are encoded in the support and the non-zero entries of a sparse signal. By selecting a dictionary matrix with suitable dimensions, the codeword for transmission is obtained by multiplying the dictionary matrix with the sparse signal. Specifically, the codewor… ▽ More Inspired by compressive sensing principles, we propose novel error control coding techniques for communication systems. The information bits are encoded in the support and the non-zero entries of a sparse signal. By selecting a dictionary matrix with suitable dimensions, the codeword for transmission is obtained by multiplying the dictionary matrix with the sparse signal. Specifically, the codewords are obtained from the sparse linear combinations of the columns of the dictionary matrix. At the decoder, we employ variations of greedy sparse signal recovery algorithms. Using Gold code sequences and mutually unbiased bases from quantum information theory as dictionary matrices, we study the block error rate (BLER) performance of the proposed scheme in the AWGN channel. Our results show that the proposed scheme has a comparable and competitive performance with respect to the several widely used linear codes, for very small to moderate block lengths. In addition, our coding scheme extends straightforwardly to multi-user scenarios such as multiple access channel, broadcast channel, and interference channel. In these multi-user channels, if the users are grouped such that they have similar channel gains and noise levels, the overall BLER performance of our proposed scheme will coincide with an equivalent single-user scenario. △ Less

Submitted 5 February, 2021; originally announced February 2021.

Comments: 11 pages, 8 figures

arXiv:2012.13306 [pdf, ps, other]

Majorizing Measures for the Optimizer

Authors: Sander Borst, Daniel Dadush, Neil Olver, Makrand Sinha

Abstract: The theory of majorizing measures, extensively developed by Fernique, Talagrand and many others, provides one of the most general frameworks for controlling the behavior of stochastic processes. In particular, it can be applied to derive quantitative bounds on the expected suprema and the degree of continuity of sample paths for many processes. One of the crowning achievements of the theory is T… ▽ More The theory of majorizing measures, extensively developed by Fernique, Talagrand and many others, provides one of the most general frameworks for controlling the behavior of stochastic processes. In particular, it can be applied to derive quantitative bounds on the expected suprema and the degree of continuity of sample paths for many processes. One of the crowning achievements of the theory is Talagrand's tight alternative characterization of the suprema of Gaussian processes in terms of majorizing measures. The proof of this theorem was difficult, and thus considerable effort was put into the task of developing both shorter and easier to understand proofs. A major reason for this difficulty was considered to be theory of majorizing measures itself, which had the reputation of being opaque and mysterious. As a consequence, most recent treatments of the theory (including by Talagrand himself) have eschewed the use of majorizing measures in favor of a purely combinatorial approach (the generic chaining) where objects based on sequences of partitions provide roughly matching upper and lower bounds on the desired expected supremum. In this paper, we return to majorizing measures as a primary object of study, and give a viewpoint that we think is natural and clarifying from an optimization perspective. As our main contribution, we give an algorithmic proof of the majorizing measures theorem based on two parts: (1) We make the simple (but apparently new) observation that finding the best majorizing measure can be cast as a convex program. This also allows for efficiently computing the measure using off-the-shelf methods from convex optimization. (2) We obtain tree-based upper and lower bound certificates by rounding, in a series of steps, the primal and dual solutions to this convex program. [...] △ Less

Submitted 24 December, 2020; originally announced December 2020.

Comments: 37 pages. Extended Abstract to appear in ITCS 2021

MSC Class: 60G15; 68Q87 ACM Class: G.3

arXiv:2011.08276 [pdf, other]

doi 10.3847/1538-4357/abca9f

Void Galaxies Follow a Distinct Evolutionary Path in the Environmental COntext Catalog

Authors: Jonathan Florez, Andreas A. Berlind, Sheila J. Kannappan, David V. Stark, Kathleen D. Eckert, Victor F. Calderon, Amanda J. Moffett, Duncan Campbell, Manodeep Sinha

Abstract: We measure the environmental dependence, where environment is defined by the distance to the third nearest neighbor, of multiple galaxy properties inside the Environmental COntext (ECO) catalog. We focus primarily on void galaxies, which we define as the $10 \%$ of galaxies having the lowest local density. We compare the properties of void and non-void galaxies: baryonic mass, color, fractional st… ▽ More We measure the environmental dependence, where environment is defined by the distance to the third nearest neighbor, of multiple galaxy properties inside the Environmental COntext (ECO) catalog. We focus primarily on void galaxies, which we define as the $10 \%$ of galaxies having the lowest local density. We compare the properties of void and non-void galaxies: baryonic mass, color, fractional stellar mass growth rate (FSMGR), morphology, and gas-to-stellar-mass ratio (estimated from a combination of HI data and photometric gas fractions calibrated with the RESOLVE survey). Our void galaxies typically have lower baryonic masses than galaxies in denser environments, and they display the properties expected of a lower mass population: they have more late-types, are bluer, have higher FSMGR, and are more gas rich. We control for baryonic mass and investigate the extent to which void galaxies are different at fixed mass. Void galaxies are bluer, more gas-rich, and more star forming at fixed mass than non-void galaxies, which is a possible signature of galaxy assembly bias. Furthermore, we show that these trends persist even at fixed mass and morphology, and we find that voids host a distinct population of early-types that are bluer and more star-forming than the typical red and quenched early-types. In addition to these empirical observational results, we also present theoretical results from mock catalogs with built-in galaxy assembly bias. We show that a simple matching of galaxy properties to (sub)halo properties, such as mass and age, can recover the observed environmental trends in ECO galaxies. △ Less

Submitted 16 November, 2020; originally announced November 2020.

Comments: 18 pages, 11 figures. Accepted for publication in ApJ

arXiv:2011.06440 [pdf, other]

doi 10.1103/PhysRevD.102.123007

Dense matter equation of state of a massive neutron star with anti-kaon condensation

Authors: Vivek Baruah Thapa, Monika Sinha

Abstract: Recent measurements of neutron star mass from several candidates (PSR J$1614-2230$, PSR J$0348+0432$, MSP J$0740+6620$) set the lower bound on the maximum possible mass for this class of compact objects $\sim 2$ M$_\odot$. Existence of stars with high mass brings the possibility of existence of exotic matter (hyperons, meson condensates) at the core region of the objects. In this work, we investig… ▽ More Recent measurements of neutron star mass from several candidates (PSR J$1614-2230$, PSR J$0348+0432$, MSP J$0740+6620$) set the lower bound on the maximum possible mass for this class of compact objects $\sim 2$ M$_\odot$. Existence of stars with high mass brings the possibility of existence of exotic matter (hyperons, meson condensates) at the core region of the objects. In this work, we investigate the (anti)kaon ($K^-, \bar{K}^0$) condensation in $β-$equilibrated nuclear matter within the framework of covariant density functional theory. The functionals in the kaonic sector are constrained by the experimental studies on $K^-$ atomic, kaon-nucleon scattering data fits. We find that the equation of state softens with the inclusion of (anti)kaon condensates, which lowers the maximum mass of neutron star. In one of the density-independent coupling cases, the $K^-$ condensation is through a first-order phase transition type, which produces a $2$ M$_\odot$ neutron star. The first-order phase transition results in mixed phase region in the inner core of the stars. While $\bar{K}^0$ condensation appears via second-order phase transition for all the models we consider here. △ Less

Submitted 3 December, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

Comments: 14 pages, 13 figures, corrected some typos, matches with the published version

Journal ref: Phys. Rev. D 102 (2020), 123007

arXiv:2010.10776 [pdf, ps, other]

Ambipolar diffusion velocity and magnetic field evolution in magnetar core: Generalised theoretical approach

Authors: Monika Sinha, Manoj Ghosh

Abstract: The magnetic field associated with neutron stars is generally believed to be threaded inside the star. In the presence of a magnetic field, the plasma present in the interior of the star goes through several processes that lead to magnetic field evolution. It is thought that magnetar activities are mainly due to field decay. The most important process of field decay inside the core of the star is… ▽ More The magnetic field associated with neutron stars is generally believed to be threaded inside the star. In the presence of a magnetic field, the plasma present in the interior of the star goes through several processes that lead to magnetic field evolution. It is thought that magnetar activities are mainly due to field decay. The most important process of field decay inside the core of the star is the ambipolar diffusion of the charged particles present in the interior plasma. The decay rate due to ambipolar diffusion is directly connected to the ambipolar velocity of the charged particles under the influence of the present magnetic field. The ambipolar velocity of the charged particles depends on the internal dynamics of the particles. We outline a general method to solve the particle dynamics in the presence of a magnetic field using a magnetohydrodynamic equation for ambipolar velocity. The equation is general and applies to all possible surrounding conditions \eg temperature, and matter states like normal or superfluid. △ Less

Submitted 29 September, 2023; v1 submitted 21 October, 2020; originally announced October 2020.

Comments: 15 pages, Calculation modified extensively, Appendix added

arXiv:2010.00981 [pdf, other]

doi 10.3390/particles3040043

Equation of state of strongly magnetized matter with hyperons and $Δ$-resonances

Authors: Vivek Baruah Thapa, Monika Sinha, Jia-Jie Li, Armen Sedrakian

Abstract: We construct a new equation of state for the baryonic matter under an intense magnetic field within the framework of covariant density functional theory. The composition of matter includes hyperons as well as $ Δ$-resonances. The extension of the nucleonic functional to the hypernuclear sector is constrained by the experimental data on $Λ$ and $Ξ$-hypernuclei. We find that the equation of state st… ▽ More We construct a new equation of state for the baryonic matter under an intense magnetic field within the framework of covariant density functional theory. The composition of matter includes hyperons as well as $ Δ$-resonances. The extension of the nucleonic functional to the hypernuclear sector is constrained by the experimental data on $Λ$ and $Ξ$-hypernuclei. We find that the equation of state stiffens with the inclusion of the magnetic field, which increases the maximum mass of neutron star compared to the non-magnetic case. In addition, the strangeness fraction in the matter is enhanced. Several observables, like the Dirac effective mass, particle abundances, etc show typical oscillatory behavior as a function of the magnetic field and/or density which is traced back to the occupation pattern of Landau levels. △ Less

Submitted 16 October, 2020; v1 submitted 2 October, 2020; originally announced October 2020.

Comments: 16 pages, 8 figures, Accepted in journal "Particles", Corrected some typos. Matches with published version

Journal ref: Particles 3(4), 660-675 (2020)

arXiv:2009.09649 [pdf, ps, other]

doi 10.1142/S0217732321501443

Ambipolar decay of magnetic field in magnetars and the observed magnetar activities

Authors: Badal Bhalla, Monika Sinha

Abstract: Magnetars are comparatively young neutron stars with ultra-strong surface magnetic field in the range $10^{14-16}$ G. The old neutron stars have surface magnetic field some what less $\sim 10^8$ G which clearly indicates the decay of field with time. One possible way of magnetic field decay is by ambipolar diffusion. We describe the general procedure to solve for the ambipolar velocity inside the… ▽ More Magnetars are comparatively young neutron stars with ultra-strong surface magnetic field in the range $10^{14-16}$ G. The old neutron stars have surface magnetic field some what less $\sim 10^8$ G which clearly indicates the decay of field with time. One possible way of magnetic field decay is by ambipolar diffusion. We describe the general procedure to solve for the ambipolar velocity inside the star core without any approximation. With a realistic model of neutron star we determine the ambipolar velocity configuration inside the neutron star core and hence find the ambipolar decay rate and time scale which is consistent with the magnetar observations. △ Less

Submitted 21 June, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

Comments: 12 pages, 4 figures, Accepted for publication in MPLA

Journal ref: Vol. 36, No. 20, 2150144 (2021)

arXiv:2008.07003 [pdf, other]

$k$-Forrelation Optimally Separates Quantum and Classical Query Complexity

Authors: Nikhil Bansal, Makrand Sinha

Abstract: Aaronson and Ambainis (SICOMP `18) showed that any partial function on $N$ bits that can be computed with an advantage $δ$ over a random guess by making $q$ quantum queries, can also be computed classically with an advantage $δ/2$ by a randomized decision tree making ${O}_q(N^{1-\frac{1}{2q}}δ^{-2})$ queries. Moreover, they conjectured the $k$-Forrelation problem -- a partial function that can be… ▽ More Aaronson and Ambainis (SICOMP `18) showed that any partial function on $N$ bits that can be computed with an advantage $δ$ over a random guess by making $q$ quantum queries, can also be computed classically with an advantage $δ/2$ by a randomized decision tree making ${O}_q(N^{1-\frac{1}{2q}}δ^{-2})$ queries. Moreover, they conjectured the $k$-Forrelation problem -- a partial function that can be computed with $q = \lceil k/2 \rceil$ quantum queries -- to be a suitable candidate for exhibiting such an extremal separation. We prove their conjecture by showing a tight lower bound of $\widetildeΩ(N^{1-1/k})$ for the randomized query complexity of $k$-Forrelation, where the advantage $δ= 2^{-O(k)}$. By standard amplification arguments, this gives an explicit partial function that exhibits an $O_ε(1)$ vs $Ω(N^{1-ε})$ separation between bounded-error quantum and randomized query complexities, where $ε>0$ can be made arbitrarily small. Our proof also gives the same bound for the closely related but non-explicit $k$-Rorrelation function introduced by Tal (FOCS `20). Our techniques rely on classical Gaussian tools, in particular, Gaussian interpolation and Gaussian integration by parts, and in fact, give a more general statement. We show that to prove lower bounds for $k$-Forrelation against a family of functions, it suffices to bound the $\ell_1$-weight of the Fourier coefficients between levels $k$ and $(k-1)k$. We also prove new interpolation and integration by parts identities that might be of independent interest in the context of rounding high-dimensional Gaussian vectors. △ Less

Submitted 17 November, 2020; v1 submitted 16 August, 2020; originally announced August 2020.

Comments: 40 pages, 2 figures. Change from v1 to v2: Updated figures to fix an Adobe Acrobat specific issue. Change from v0 to v1: Improved the advantage $δ$ to $2^{-O(k)}$ strengthening the main conclusions. Added a reference to the independent work of Sherstov, Storozhenko and Wu (arxiv:2008.10223) who obtained a similar lower bound for the randomized query complexity of $k$-Rorrelation

arXiv:2008.05001 [pdf, other]

doi 10.1093/mnras/stab867

Machine Learning the Fates of Dark Matter Subhalos: A Fuzzy Crystal Ball

Authors: Abigail Petulante, Andreas A. Berlind, J. Kelly Holley-Bockelmann, Manodeep Sinha

Abstract: The evolution of a dark matter halo in a dark matter only simulation is governed purely byNewtonian gravity, making a clean testbed to determine what halo properties drive its fate.Using machine learning, we predict the survival, mass loss, final position, and merging time of subhalos within a cosmological N-body simulation, focusing on what instantaneous initial features of the halo, interaction,… ▽ More The evolution of a dark matter halo in a dark matter only simulation is governed purely byNewtonian gravity, making a clean testbed to determine what halo properties drive its fate.Using machine learning, we predict the survival, mass loss, final position, and merging time of subhalos within a cosmological N-body simulation, focusing on what instantaneous initial features of the halo, interaction, and environment matter most. Survival is well predicted, with our model achieving 96.5% accuracy using only 3 model inputs from the initial interaction.However, the mass loss, final location, and merging times are much more stochastic processes, with significant margins of error between the true and predicted quantities for much of our sample. The redshift, impact angle, relative velocity, and the masses of the host and subhalo are the only relevant initial inputs for determining subhalo evolution. In general, subhalos that enter their hosts at a mid-range of redshifts (typically z = 0.67-0.43) are the most challenging to make predictions for, across all of our final outcomes. Subhalo orbits that come in more perpendicular to the host are also easier to predict, except for in the case of predicting disruption, where the opposite appears to be true. We conclude that the detailed evolution of individual subhalos within N-body simulations is quite difficult to predict, pointing to a stochasticity in the merging process. We discuss implications for both simulations and observations △ Less

Submitted 11 August, 2020; originally announced August 2020.

Comments: 19 pages, 11 figures

arXiv:2007.14720 [pdf, other]

doi 10.1093/mnras/stab1755

The Uchuu Simulations: Data Release 1 and Dark Matter Halo Concentrations

Authors: Tomoaki Ishiyama, Francisco Prada, Anatoly A. Klypin, Manodeep Sinha, R. Benton Metcalf, Eric Jullo, Bruno Altieri, Sofía A. Cora, Darren Croton, Sylvain de la Torre, David E. Millán-Calero, Taira Oogi, José Ruedas, Cristian A. Vega-Martínez

Abstract: We introduce the Uchuu suite of large high-resolution cosmological $N$-body simulations. The largest simulation, named Uchuu, consists of 2.1 trillion ($12800^3$) dark matter particles in a box of side-length 2.0 Gpc/h, with particle mass $3.27 \times 10^{8}$ Msun/h. The highest resolution simulation, Shin-Uchuu, consists of 262 billion ($6400^3$) particles in a box of side-length 140 Mpc/h, with… ▽ More We introduce the Uchuu suite of large high-resolution cosmological $N$-body simulations. The largest simulation, named Uchuu, consists of 2.1 trillion ($12800^3$) dark matter particles in a box of side-length 2.0 Gpc/h, with particle mass $3.27 \times 10^{8}$ Msun/h. The highest resolution simulation, Shin-Uchuu, consists of 262 billion ($6400^3$) particles in a box of side-length 140 Mpc/h, with particle mass $8.97 \times 10^{5}$ Msun/h. Combining these simulations we can follow the evolution of dark matter halos and subhalos spanning those hosting dwarf galaxies to massive galaxy clusters across an unprecedented volume. In this first paper, we present basic statistics, dark matter power spectra, and the halo and subhalo mass functions, which demonstrate the wide dynamic range and superb statistics of the Uchuu suite. From an analysis of the evolution of the power spectra we conclude that our simulations remain accurate from the Baryon Acoustic Oscillation scale down to the very small. We also provide parameters of a mass-concentration model, which describes the evolution of halo concentration and reproduces our simulation data to within 5 per cent for halos with masses spanning nearly eight orders of magnitude at redshift 0<z<14. There is an upturn in the mass-concentration relation for the population of all halos and of relaxed halos at z>0.5, whereas no upturn is detected at z<0.5. We make publicly available various $N$-body products as part of Uchuu Data Release 1 on the Skies & Universes site. Future releases will include gravitational lensing maps and mock galaxy, X-ray cluster, and active galactic nuclei catalogues. △ Less

Submitted 13 July, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

Comments: 22 pages, 14 figures, accepted by MNRAS. We release various $N$-body products as data release 1 on http://skiesanduniverses.org/ such as subsets of simulation particles, matter power spectra, halo/subhalo catalogues, and their merger trees

arXiv:2007.10622 [pdf, other]

Online Discrepancy Minimization for Stochastic Arrivals

Authors: Nikhil Bansal, Haotian Jiang, Raghu Meka, Sahil Singla, Makrand Sinha

Abstract: In the stochastic online vector balancing problem, vectors $v_1,v_2,\ldots,v_T$ chosen independently from an arbitrary distribution in $\mathbb{R}^n$ arrive one-by-one and must be immediately given a $\pm$ sign. The goal is to keep the norm of the discrepancy vector, i.e., the signed prefix-sum, as small as possible for a given target norm. We consider some of the most well-known problems in dis… ▽ More In the stochastic online vector balancing problem, vectors $v_1,v_2,\ldots,v_T$ chosen independently from an arbitrary distribution in $\mathbb{R}^n$ arrive one-by-one and must be immediately given a $\pm$ sign. The goal is to keep the norm of the discrepancy vector, i.e., the signed prefix-sum, as small as possible for a given target norm. We consider some of the most well-known problems in discrepancy theory in the above online stochastic setting, and give algorithms that match the known offline bounds up to $\mathsf{polylog}(nT)$ factors. This substantially generalizes and improves upon the previous results of Bansal, Jiang, Singla, and Sinha (STOC' 20). In particular, for the Komlós problem where $\|v_t\|_2\leq 1$ for each $t$, our algorithm achieves $\tilde{O}(1)$ discrepancy with high probability, improving upon the previous $\tilde{O}(n^{3/2})$ bound. For Tusnády's problem of minimizing the discrepancy of axis-aligned boxes, we obtain an $O(\log^{d+4} T)$ bound for arbitrary distribution over points. Previous techniques only worked for product distributions and gave a weaker $O(\log^{2d+1} T)$ bound. We also consider the Banaszczyk setting, where given a symmetric convex body $K$ with Gaussian measure at least $1/2$, our algorithm achieves $\tilde{O}(1)$ discrepancy with respect to the norm given by $K$ for input distributions with sub-exponential tails. Our key idea is to introduce a potential that also enforces constraints on how the discrepancy vector evolves, allowing us to maintain certain anti-concentration properties. For the Banaszczyk setting, we further enhance this potential by combining it with ideas from generic chaining. Finally, we also extend these results to the setting of online multi-color discrepancy. △ Less

Submitted 21 July, 2020; originally announced July 2020.

arXiv:2006.09582 [pdf, other]

doi 10.1017/S1743921319002813

Mentari: A pipeline to model the galaxy SED using semi analytic models

Authors: Dian Triani, Darren Croton, Manodeep Sinha

Abstract: We build a theoretical picture of how the light from galaxies evolves across cosmic time. In particular, we predict the evolution of the galaxy spectral energy distribution (SED) by carefully integrating the star formation and metal enrichment histories of semi-analytic model (SAM) galaxies and combining these with stellar population synthesis models which we call mentari. Our SAM combines prescri… ▽ More We build a theoretical picture of how the light from galaxies evolves across cosmic time. In particular, we predict the evolution of the galaxy spectral energy distribution (SED) by carefully integrating the star formation and metal enrichment histories of semi-analytic model (SAM) galaxies and combining these with stellar population synthesis models which we call mentari. Our SAM combines prescriptions to model the interplay between gas accretion, star formation, feedback process, and chemical enrichment in galaxy evolution. From this, the SED of any simulated galaxy at any point in its history can be constructed and compared with telescope data to reverse engineer the various physical processes that may have led to a particular set of observations. The synthetic SEDs of millions of simulated galaxies from mentari can cover wavelengths from the far UV to infrared, and thus can tell a near complete story of the history of galaxy evolution. \keywords{galaxies: evolution - galaxies: stellar content - galaxies.} △ Less

Submitted 16 June, 2020; originally announced June 2020.

Comments: 5 pages, 3 figures, Proceedings of IAU symposium #341: Challenges in Panchromatic Modelling with Next Generation Facilities

arXiv:2006.00591 [pdf, other]

Efficient Deployment of Conversational Natural Language Interfaces over Databases

Authors: Anthony Colas, Trung Bui, Franck Dernoncourt, Moumita Sinha, Doo Soon Kim

Abstract: Many users communicate with chatbots and AI assistants in order to help them with various tasks. A key component of the assistant is the ability to understand and answer a user's natural language questions for question-answering (QA). Because data can be usually stored in a structured manner, an essential step involves turning a natural language question into its corresponding query language. Howe… ▽ More Many users communicate with chatbots and AI assistants in order to help them with various tasks. A key component of the assistant is the ability to understand and answer a user's natural language questions for question-answering (QA). Because data can be usually stored in a structured manner, an essential step involves turning a natural language question into its corresponding query language. However, in order to train most natural language-to-query-language state-of-the-art models, a large amount of training data is needed first. In most domains, this data is not available and collecting such datasets for various domains can be tedious and time-consuming. In this work, we propose a novel method for accelerating the training dataset collection for developing the natural language-to-query-language machine learning models. Our system allows one to generate conversational multi-term data, where multiple turns define a dialogue session, enabling one to better utilize chatbot interfaces. We train two current state-of-the-art NL-to-QL models, on both an SQL and SPARQL-based datasets in order to showcase the adaptability and efficacy of our created data. △ Less

Submitted 4 June, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

Comments: Accepted at ACL-NLI 2020

arXiv:2004.12388 [pdf, other]

The Impact of Presentation Style on Human-In-The-Loop Detection of Algorithmic Bias

Authors: Po-Ming Law, Sana Malik, Fan Du, Moumita Sinha

Abstract: While decision makers have begun to employ machine learning, machine learning models may make predictions that bias against certain demographic groups. Semi-automated bias detection tools often present reports of automatically-detected biases using a recommendation list or visual cues. However, there is a lack of guidance concerning which presentation style to use in what scenarios. We conducted a… ▽ More While decision makers have begun to employ machine learning, machine learning models may make predictions that bias against certain demographic groups. Semi-automated bias detection tools often present reports of automatically-detected biases using a recommendation list or visual cues. However, there is a lack of guidance concerning which presentation style to use in what scenarios. We conducted a small lab study with 16 participants to investigate how presentation style might affect user behaviors in reviewing bias reports. Participants used both a prototype with a recommendation list and a prototype with visual cues for bias detection. We found that participants often wanted to investigate the performance measures that were not automatically detected as biases. Yet, when using the prototype with a recommendation list, they tended to give less consideration to such measures. Grounded in the findings, we propose information load and comprehensiveness as two axes for characterizing bias detection tasks and illustrate how the two axes could be adopted to reason about when to use a recommendation list or visual cues. △ Less

Submitted 9 May, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

Comments: Published at Graphics Interface 2020 (GI 2020)

arXiv:2004.09900 [pdf, other]

An RNN-Survival Model to Decide Email Send Times

Authors: Harvineet Singh, Moumita Sinha, Atanu R. Sinha, Sahil Garg, Neha Banerjee

Abstract: Email communications are ubiquitous. Firms control send times of emails and thereby the instants at which emails reach recipients (it is assumed email is received instantaneously from the send time). However, they do not control the duration it takes for recipients to open emails, labeled as time-to-open. Importantly, among emails that are opened, most occur within a short window from their send t… ▽ More Email communications are ubiquitous. Firms control send times of emails and thereby the instants at which emails reach recipients (it is assumed email is received instantaneously from the send time). However, they do not control the duration it takes for recipients to open emails, labeled as time-to-open. Importantly, among emails that are opened, most occur within a short window from their send times. We posit that emails are likely to be opened sooner when send times are convenient for recipients, while for other send times, emails can get ignored. Thus, to compute appropriate send times it is important to predict times-to-open accurately. We propose a recurrent neural network (RNN) in a survival model framework to predict times-to-open, for each recipient. Using that we compute appropriate send times. We experiment on a data set of emails sent to a million customers over five months. The sequence of emails received by a person from a sender is a result of interactions with past emails from the sender, and hence contain useful signal that inform our model. This sequential dependence affords our proposed RNN-Survival (RNN-S) approach to outperform survival analysis approaches in predicting times-to-open. We show that best times to send emails can be computed accurately from predicted times-to-open. This approach allows a firm to tune send times of emails, which is in its control, to favorably influence open rates and engagement. △ Less

Submitted 21 April, 2020; originally announced April 2020.

Comments: 11 pages, 3 figures, 2 tables

arXiv:2003.07680 [pdf]

Designing Tools for Semi-Automated Detection of Machine Learning Biases: An Interview Study

Authors: Po-Ming Law, Sana Malik, Fan Du, Moumita Sinha

Abstract: Machine learning models often make predictions that bias against certain subgroups of input data. When undetected, machine learning biases can constitute significant financial and ethical implications. Semi-automated tools that involve humans in the loop could facilitate bias detection. Yet, little is known about the considerations involved in their design. In this paper, we report on an interview… ▽ More Machine learning models often make predictions that bias against certain subgroups of input data. When undetected, machine learning biases can constitute significant financial and ethical implications. Semi-automated tools that involve humans in the loop could facilitate bias detection. Yet, little is known about the considerations involved in their design. In this paper, we report on an interview study with 11 machine learning practitioners for investigating the needs surrounding semi-automated bias detection tools. Based on the findings, we highlight four considerations in designing to guide system designers who aim to create future tools for bias detection. △ Less

Submitted 17 March, 2020; v1 submitted 12 March, 2020; originally announced March 2020.

Comments: Proceedings of the CHI 2020 Workshop on Detection and Design for Cognitive Biases in People and Computing Systems

arXiv:2002.07062 [pdf]

An optimal scheduling architecture for accelerating batch algorithms on Neural Network processor architectures

Authors: Phani Kumar Nyshadham, Mohit Sinha, Biswajit Mishra, H S Vijay

Abstract: In neural network topologies, algorithms are running on batches of data tensors. The batches of data are typically scheduled onto the computing cores which execute in parallel. For the algorithms running on batches of data, an optimal batch scheduling architecture is very much needed by suitably utilizing hardware resources - thereby resulting in significant reduction training and inference time.… ▽ More In neural network topologies, algorithms are running on batches of data tensors. The batches of data are typically scheduled onto the computing cores which execute in parallel. For the algorithms running on batches of data, an optimal batch scheduling architecture is very much needed by suitably utilizing hardware resources - thereby resulting in significant reduction training and inference time. In this paper, we propose to accelerate the batch algorithms for neural networks through a scheduling architecture enabling optimal compute power utilization. The proposed optimal scheduling architecture can be built into HW or can be implemented in SW alone which can be leveraged for accelerating batch algorithms. The results demonstrate that the proposed architecture speeds up the batch algorithms compared to the previous solutions. The proposed idea applies to any HPC architecture meant for neural networks. △ Less

Submitted 14 February, 2020; originally announced February 2020.

Comments: 9 pages, page 7 contains the proposed example

arXiv:2002.05343 [pdf, other]

doi 10.1093/mnras/staa446

The origin of dust in galaxies across cosmic time

Authors: Dian P. Triani, Manodeep Sinha, Darren J. Croton, Camilla Pacifici, Eli Dwek

Abstract: We study the dust evolution in galaxies by implementing a detailed dust prescription in the SAGE semi-analytical model for galaxy formation. The new model, called Dusty SAGE, follows the condensation of dust in the ejecta of type II supernovae and asymptotic giant branch (AGB) stars, grain growth in the dense molecular clouds, destruction by supernovae shocks, and the removal of dust from the ISM… ▽ More We study the dust evolution in galaxies by implementing a detailed dust prescription in the SAGE semi-analytical model for galaxy formation. The new model, called Dusty SAGE, follows the condensation of dust in the ejecta of type II supernovae and asymptotic giant branch (AGB) stars, grain growth in the dense molecular clouds, destruction by supernovae shocks, and the removal of dust from the ISM by star formation, reheating, inflows and outflows. Our model successfully reproduces the observed dust mass function at redshift z = 0 and the observed scaling relations for dust across a wide range of redshifts. We find that the dust mass content in the present Universe is mainly produced via grain growth in the interstellar medium (ISM). By contrast, in the early Universe, the primary production mechanism for dust is the condensation in stellar ejecta. The shift of the significant production channel for dust characterises the scaling relations of dust-to-gas (DTG) and dust-to-metal (DTM) ratios. In galaxies where the grain growth dominates, we find positive correlations for DTG and DTM ratios with both metallicity and stellar mass. On the other hand, in galaxies where dust is produced primarily via condensation, we find negative or no correlation for DTM and DTG ratios with either metallicity or stellar mass. In agreement with observation showing that the circumgalactic medium (CGM) contains more dust than the ISM, our model also shows the same trend for z < 4. Our semi-analytic model is publicly available at https: //github.com/dptriani/dusty-sage. △ Less

Submitted 12 February, 2020; originally announced February 2020.

Comments: 19 pages, 14 figures, accepted for publication in MNRAS

Journal ref: 2020MNRAS.tmp..415T

arXiv:2001.00235 [pdf]

doi 10.1103/PhysRevMaterials.3.125002

Introduction of spin centers in single crystals of Ba$_2$CaWO$_{6-δ}$

Authors: Mekhola Sinha, Tyler J. Pearson, Tim R. Reeder, Hector K. Vivanco, Danna E. Freedman, W. Adam Phelan, Tyrel M. McQueen

Abstract: Developing the field of quantum information science (QIS) hinges upon designing viable qubits, the smallest unit in quantum computing. One approach to creating qubits is introducing paramagnetic defects into semiconductors or insulators. This class of qubits has seen success in the form of nitrogen-vacancy centers in diamond, divacancy defects in SiC, and P doped into Si. These materials feature p… ▽ More Developing the field of quantum information science (QIS) hinges upon designing viable qubits, the smallest unit in quantum computing. One approach to creating qubits is introducing paramagnetic defects into semiconductors or insulators. This class of qubits has seen success in the form of nitrogen-vacancy centers in diamond, divacancy defects in SiC, and P doped into Si. These materials feature paramagnetic defects in a low nuclear spin environment to reduce the impact of nuclear spin on electronic spin coherence. In this work, we report single crystal growth of Ba$_2$CaWO$_{6-δ}$, and the coherence properties of controllably introduced W$^{5+}$ spin centers generated by oxygen vacancies. Ba$_2$CaWO$_{6-δ}$ ($δ$ = 0) is a B-site ordered double perovskite with a temperature-dependent octahedral tilting wherein oxygen vacancies generate W$^{5+}$ (d$^1$), $S = \frac{1}{2}, I$ = 0, centers. We characterized these defects by measuring the spin-lattice ($T_1$) and spin-spin relaxation ($T_2$) times from T = 5 to 150 K. At T = 5 K, $T_1$ = 310 ms and $T_2$ = 4 $μ$s, establishing the viability of these qubit candidates. With increasing temperature, $T_2$ remains constant up to T = 60 K and then decreases to $T_2$ $\approx$ 1 $μ$s at T = 90 K, and remains roughly constant until T = 150 K, demonstrating the remarkable stability of $T_2$ with increasing temperature. Together, these results demonstrate that controlled defect generation in double perovskite structures can generate viable paramagnetic point centers for quantum applications and expand the field of potential materials for QIS. △ Less

Submitted 1 January, 2020; originally announced January 2020.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. Mater. 3, 125002 (2019)

arXiv:1912.03350 [pdf, other]

Online Vector Balancing and Geometric Discrepancy

Authors: Nikhil Bansal, Haotian Jiang, Sahil Singla, Makrand Sinha

Abstract: We consider an online vector balancing question where $T$ vectors, chosen from an arbitrary distribution over $[-1,1]^n$, arrive one-by-one and must be immediately given a $\pm$ sign. The goal is to keep the discrepancy small as possible. A concrete example is the online interval discrepancy problem where T points are sampled uniformly in [0,1], and the goal is to immediately color them $\pm$ such… ▽ More We consider an online vector balancing question where $T$ vectors, chosen from an arbitrary distribution over $[-1,1]^n$, arrive one-by-one and must be immediately given a $\pm$ sign. The goal is to keep the discrepancy small as possible. A concrete example is the online interval discrepancy problem where T points are sampled uniformly in [0,1], and the goal is to immediately color them $\pm$ such that every sub-interval remains nearly balanced. As random coloring incurs $Ω(T^{1/2})$ discrepancy, while the offline bounds are $Θ(\sqrt{n \log (T/n)})$ for vector balancing and $1$ for interval balancing, a natural question is whether one can (nearly) match the offline bounds in the online setting for these problems. One must utilize the stochasticity as in the worst-case scenario it is known that discrepancy is $Ω(T^{1/2})$ for any online algorithm. Bansal and Spencer recently show an $O(\sqrt{n}\log T)$ bound when each coordinate is independent. When there are dependencies among the coordinates, the problem becomes much more challenging, as evidenced by a recent work of Jiang, Kulkarni, and Singla that gives a non-trivial $O(T^{1/\log\log T})$ bound for online interval discrepancy. Although this beats random coloring, it is still far from the offline bound. In this work, we introduce a new framework for online vector balancing when the input distribution has dependencies across coordinates. This lets us obtain a $poly(n, \log T)$ bound for online vector balancing under arbitrary input distributions, and a $poly(\log T)$ bound for online interval discrepancy. Our framework is powerful enough to capture other well-studied geometric discrepancy problems; e.g., a $poly(\log^d (T))$ bound for the online $d$-dimensional Tusnády's problem. A key new technical ingredient is an {anti-concentration} inequality for sums of pairwise uncorrelated random variables. △ Less

Submitted 12 April, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

Comments: Appears in STOC 2020

arXiv:1911.08275 [pdf, other]

doi 10.1007/978-981-13-7729-7_1

Corrfunc: Blazing fast correlation functions with AVX512F SIMD Intrinsics

Authors: Manodeep Sinha, Lehman H. Garrison

Abstract: Correlation functions are widely used in extra-galactic astrophysics to extract insights into how galaxies occupy dark matter halos and in cosmology to place stringent constraints on cosmological parameters. A correlation function fundamentally requires computing pair-wise separations between two sets of points and then computing a histogram of the separations. Corrfunc is an existing open-source,… ▽ More Correlation functions are widely used in extra-galactic astrophysics to extract insights into how galaxies occupy dark matter halos and in cosmology to place stringent constraints on cosmological parameters. A correlation function fundamentally requires computing pair-wise separations between two sets of points and then computing a histogram of the separations. Corrfunc is an existing open-source, high-performance software package for efficiently computing a multitude of correlation functions. In this paper, we will discuss the SIMD AVX512F kernels within Corrfunc, capable of processing 16 floats or 8 doubles at a time. The latest manually implemented Corrfunc AVX512F kernels show a speedup of up to $\sim 4\times$ relative to compiler-generated code for double-precision calculations. The AVX512F kernels show $\sim 1.6\times$ speedup relative to the AVX kernels and compare favorably to a theoretical maximum of $2\times$. In addition, by pruning pairs with too large of a minimum possible separation, we achieve a $\sim 5-10\%$ speedup across all the SIMD kernels. Such speedups highlight the importance of programming explicitly with SIMD vector intrinsics for complex calculations that can not be efficiently vectorized by compilers. Corrfunc is publicly available at https://github.com/manodeep/Corrfunc/. △ Less

Submitted 15 November, 2019; originally announced November 2019.

Comments: Paper II for the Corrfunc software package, paper I is on arXiv here: arXiv:1911.03545. Appeared in the refereed proceedings for the "Second Workshop on Software Challenges to Exascale Computing"

arXiv:1911.07688 [pdf, other]

doi 10.21105/joss.01864

emcee v3: A Python ensemble sampling toolkit for affine-invariant MCMC

Authors: Daniel Foreman-Mackey, Will M. Farr, Manodeep Sinha, Anne M. Archibald, David W. Hogg, Jeremy S. Sanders, Joe Zuntz, Peter K. G. Williams, Andrew R. J. Nelson, Miguel de Val-Borro, Tobias Erhardt, Ilya Pashchenko, Oriol Abril Pla

Abstract: emcee is a Python library implementing a class of affine-invariant ensemble samplers for Markov chain Monte Carlo (MCMC). This package has been widely applied to probabilistic modeling problems in astrophysics where it was originally published, with some applications in other fields. When it was first released in 2012, the interface implemented in emcee was fundamentally different from the MCMC li… ▽ More emcee is a Python library implementing a class of affine-invariant ensemble samplers for Markov chain Monte Carlo (MCMC). This package has been widely applied to probabilistic modeling problems in astrophysics where it was originally published, with some applications in other fields. When it was first released in 2012, the interface implemented in emcee was fundamentally different from the MCMC libraries that were popular at the time, such as PyMC, because it was specifically designed to work with "black box" models instead of structured graphical models. This has been a popular interface for applications in astrophysics because it is often non-trivial to implement realistic physics within the modeling frameworks required by other libraries. Since emcee's release, other libraries have been developed with similar interfaces, such as dynesty (Speagle 2019). The version 3.0 release of emcee is the first major release of the library in about 6 years and it includes a full re-write of the computational backend, several commonly requested features, and a set of new "move" implementations. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: Published in the Journal for Open Source Software

Journal ref: Journal of Open Source Software, 2 4(43), 1864 (2019)

arXiv:1911.03545 [pdf, other]

doi 10.1093/mnras/stz3157

Corrfunc --- A Suite of Blazing Fast Correlation Functions on the CPU

Authors: Manodeep Sinha, Lehman H. Garrison

Abstract: The two-point correlation function (2PCF) is the most widely used tool for quantifying the spatial distribution of galaxies. Since the distribution of galaxies is determined by galaxy formation physics as well as the underlying cosmology, fitting an observed correlation function yields valuable insights into both. The calculation for a 2PCF involves computing pair-wise separations and consequently… ▽ More The two-point correlation function (2PCF) is the most widely used tool for quantifying the spatial distribution of galaxies. Since the distribution of galaxies is determined by galaxy formation physics as well as the underlying cosmology, fitting an observed correlation function yields valuable insights into both. The calculation for a 2PCF involves computing pair-wise separations and consequently, the computing time scales quadratically with the number of galaxies. The next-generation galaxy surveys are slated to observe many millions of galaxies, and computing the 2PCF for such surveys would be prohibitively time-consuming. Additionally, modern modelling techniques require the 2PCF to be calculated thousands of times on simulated galaxy catalogues of {\em at least} equal size to the data and would be completely unfeasible for the next generation surveys. Thus, calculating the 2PCF forms a substantial bottleneck in improving our understanding of the fundamental physics of the universe, and we need high-performance software to compute the correlation function. In this paper, we present Corrfunc --- a suite of highly optimised, OpenMP parallel clustering codes. The improved performance of Corrfunc arises from both efficient algorithms as well as software design that suits the underlying hardware of modern CPUs. Corrfunc can compute a wide range of 2-D and 3-D correlation functions in either simulation (Cartesian) space or on-sky coordinates. Corrfunc runs efficiently in both single- and multi-threaded modes and can compute a typical 2-point projected correlation function ($w_p(r_p)$) for ~1 million galaxies within a few seconds on a single thread. Corrfunc is designed to be both user-friendly and fast and is publicly available at https://github.com/manodeep/Corrfunc. △ Less

Submitted 8 November, 2019; originally announced November 2019.

Comments: Accepted for publication to MNRAS

arXiv:1910.08376 [pdf]

The Growing Importance of a Tech Savvy Astronomy and Astrophysics Workforce

Authors: Dara Norman, Kelle Cruz, Vandana Desai, Britt Lundgren, Eric Bellm, Frossie Economou, Arfon Smith, Amanda Bauer, Brian Nord, Chad Schafer, Gautham Narayan, Ting Li, Erik Tollerud, Brigitta Sipocz, Heloise Stevance, Timothy Pickering, Manodeep Sinha, Joseph Harrington, Jeyhan Kartaltepe, Dany Vohl, Adrian Price-Whelan, Brian Cherinka, Chi-kwan Chan, Benjamin Weiner, Maryam Modjaz , et al. (4 additional authors not shown)

Abstract: Fundamental coding and software development skills are increasingly necessary for success in nearly every aspect of astronomical and astrophysical research as large surveys and high resolution simulations become the norm. However, professional training in these skills is inaccessible or impractical for many members of our community. Students and professionals alike have been expected to acquire th… ▽ More Fundamental coding and software development skills are increasingly necessary for success in nearly every aspect of astronomical and astrophysical research as large surveys and high resolution simulations become the norm. However, professional training in these skills is inaccessible or impractical for many members of our community. Students and professionals alike have been expected to acquire these skills on their own, apart from formal classroom curriculum or on-the-job training. Despite the recognized importance of these skills, there is little opportunity to develop them - even for interested researchers. To ensure a workforce capable of taking advantage of the computational resources and the large volumes of data coming in the next decade, we must identify and support ways to make software development training widely accessible to community members, regardless of affiliation or career level. To develop and sustain a technology capable astronomical and astrophysical workforce, we recommend that agencies make funding and other resources available in order to encourage, support and, in some cases, require progress on necessary training, infrastructure and policies. In this white paper, we focus on recommendations for how funding agencies can lead in the promotion of activities to support the astronomy and astrophysical workforce in the 2020s. △ Less

Submitted 17 October, 2019; originally announced October 2019.

Comments: Submitted as a ASTRO2020 Decadal Survey APC position paper. arXiv admin note: substantial text overlap with arXiv:1905.05116

arXiv:1909.12125 [pdf, ps, other]

doi 10.1016/j.nuclphysb.2019.114914

Appearance of branched motifs in the spectra of $BC_N$ type Polychronakos spin chains

Authors: Bireswar Basu-Mallick, Madhurima Sinha

Abstract: As is well known, energy levels appearing in the highly degenerate spectra of the $A_{N-1}$ type of Haldane-Shastry and Polychronakos spin chains can be classified through the motifs, which are characterized by some sequences of the binary digits like `0' and `1'. In a similar way, at present we classify all energy levels appearing in the spectra of the $BC_N$ type of Polychronakos spin chains wit… ▽ More As is well known, energy levels appearing in the highly degenerate spectra of the $A_{N-1}$ type of Haldane-Shastry and Polychronakos spin chains can be classified through the motifs, which are characterized by some sequences of the binary digits like `0' and `1'. In a similar way, at present we classify all energy levels appearing in the spectra of the $BC_N$ type of Polychronakos spin chains with Hamiltonians containing supersymmetric analogue of polarized spin reversal operators. To this end, we show that the $BC_N$ type of multivariate super Rogers-Szegö (SRS) polynomials, which at a certain limit reduce to the partition functions of the later type of Polychronakos spin chains, satisfy some recursion relation involving a $q$-deformation of the elementary supersymmetric polynomials. Subsequently, we use a Jacobi-Trudi like formula to define the corresponding $q$-deformed super Schur polynomials and derive a novel expression for the $BC_N$ type of multivariate SRS polynomials as suitable linear combinations of the $q$-deformed super Schur polynomials. Such an expression for SRS polynomials leads to a complete classification of all energy levels appearing in the spectra of the $BC_N$ type of Polychronakos spin chains through the `branched' motifs, which are characterized by some sequences of integers of the form $(δ_1, δ_2,..., δ_{N-1}|l)$, where $δ_i \in \{ 0,1 \}$ and $ l \in \{ 0,1,...,N \}$. Finally, we derive an extended boson-fermion duality relation among the restricted super Schur polynomials and show that the partition functions of the $BC_N$ type of Polychronakos spin chains also exhibit similar type of duality relation. △ Less

Submitted 26 September, 2019; originally announced September 2019.

Comments: 40 pages, 3 figures, dedicated to Artemio González-López on the occasion of his 60th birthday

arXiv:1908.06512 [pdf, other]

doi 10.1145/3159652.3159683

Modeling Time to Open of Emails with a Latent State for User Engagement Level

Authors: Moumita Sinha, Vishwa Vinay, Harvineet Singh

Abstract: Email messages have been an important mode of communication, not only for work, but also for social interactions and marketing. When messages have time sensitive information, it becomes relevant for the sender to know what is the expected time within which the email will be read by the recipient. In this paper we use a survival analysis framework to predict the time to open an email once it has be… ▽ More Email messages have been an important mode of communication, not only for work, but also for social interactions and marketing. When messages have time sensitive information, it becomes relevant for the sender to know what is the expected time within which the email will be read by the recipient. In this paper we use a survival analysis framework to predict the time to open an email once it has been received. We use the Cox Proportional Hazards (CoxPH) model that offers a way to combine various features that might affect the event of opening an email. As an extension, we also apply a mixture model (MM) approach to CoxPH that distinguishes between recipients, based on a latent state of how prone to opening the messages each individual is. We compare our approach with standard classification and regression models. While the classification model provides predictions on the likelihood of an email being opened, the regression model provides prediction of the real-valued time to open. The use of survival analysis based methods allows us to jointly model both the open event as well as the time-to-open. We experimented on a large real-world dataset of marketing emails sent in a 3-month time duration. The mixture model achieves the best accuracy on our data where a high proportion of email messages go unopened. △ Less

Submitted 18 August, 2019; originally announced August 2019.

Comments: 9 pages, 5 figures, WSDM'18, February 5-9, 2018, Marina Del Rey, CA, USA, https://dl.acm.org/citation.cfm?id=3159683

Journal ref: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining (WSDM 2018). ACM, New York, NY, USA, 531-539

arXiv:1907.06981 [pdf]

Astro2020 APC White Paper: Elevating the Role of Software as a Product of the Research Enterprise

Authors: Arfon M. Smith, Dara Norman, Kelle Cruz, Vandana Desai, Eric Bellm, Britt Lundgren, Frossie Economou, Brian D. Nord, Chad Schafer, Gautham Narayan, Joseph Harrington, Erik Tollerud, Brigitta Sipőcz, Timothy Pickering, Molly S. Peeples, Bruce Berriman, Peter Teuben, David Rodriguez, Andre Gradvohl, Lior Shamir, Alice Allen, Joel R. Brownstein, Adam Ginsburg, Manodeep Sinha, Cameron Hummels , et al. (20 additional authors not shown)

Abstract: Software is a critical part of modern research, and yet there are insufficient mechanisms in the scholarly ecosystem to acknowledge, cite, and measure the impact of research software. The majority of academic fields rely on a one-dimensional credit model whereby academic articles (and their associated citations) are the dominant factor in the success of a researcher's career. In the petabyte era o… ▽ More Software is a critical part of modern research, and yet there are insufficient mechanisms in the scholarly ecosystem to acknowledge, cite, and measure the impact of research software. The majority of academic fields rely on a one-dimensional credit model whereby academic articles (and their associated citations) are the dominant factor in the success of a researcher's career. In the petabyte era of astronomical science, citing software and measuring its impact enables academia to retain and reward researchers that make significant software contributions. These highly skilled researchers must be retained to maximize the scientific return from petabyte-scale datasets. Evolving beyond the one-dimensional credit model requires overcoming several key challenges, including the current scholarly ecosystem and scientific culture issues. This white paper will present these challenges and suggest practical solutions for elevating the role of software as a product of the research enterprise. △ Less

Submitted 14 July, 2019; originally announced July 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1905.05116

arXiv:1907.04342 [pdf, other]

doi 10.1093/mnras/stz3139

The 21cm bispectrum during reionization: a tracer of the ionization topology

Authors: Anne Hutter, Catherine A. Watkinson, Jacob Seiler, Pratika Dayal, Manodeep Sinha, Darren J. Croton

Abstract: We compute the bispectra of the 21cm signal during the Epoch of Reionization for three different reionization scenarios that are based on a dark matter N-body simulation combined with a self-consistent, semi-numerical model of galaxy evolution and reionization. Our reionization scenarios differ in their trends of ionizing escape fractions ($f_\mathrm{esc}$) with the underlying galaxy properties an… ▽ More We compute the bispectra of the 21cm signal during the Epoch of Reionization for three different reionization scenarios that are based on a dark matter N-body simulation combined with a self-consistent, semi-numerical model of galaxy evolution and reionization. Our reionization scenarios differ in their trends of ionizing escape fractions ($f_\mathrm{esc}$) with the underlying galaxy properties and cover the physically plausible range, i.e. $f_\mathrm{esc}$ effectively decreasing, being constant, or increasing with halo mass. We find the 21cm bispectrum to be sensitive to the resulting ionization topologies that significantly differ in their size distribution of ionized and neutral regions throughout reionization. From squeezed to stretched triangles, the 21cm bispectra features a change of sign from negative to positive values, with ionized and neutral regions representing below-average and above-average concentrations contributing negatively and positively, respectively. The position of the change of sign provides a tracer of the size distribution of the ionized and neutral regions, and allows us to identify three major regimes that the 21cm bispectrum undergoes during reionization. In particular the regime during the early stages of reionization, where the 21cm bispectrum tracks the peak of the size distribution of the ionized regions, provides exciting prospects for pinning down reionization with the forthcoming Square Kilometre Array. △ Less

Submitted 18 November, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

Comments: 16 pages, 7 figures. Accepted for publication in MNRAS

arXiv:1902.01611 [pdf, other]

doi 10.1093/mnras/stz1663

The Escape Fraction of Ionizing Photons During the Epoch of Reionization: observability with the Square Kilometre Array

Authors: Jacob Seiler, Anne Hutter, Manodeep Sinha, Darren Croton

Abstract: One of the most important parameters in characterizing the Epoch of Reionization, the escape fraction of ionizing photons, $f_\mathrm{esc}$, remains unconstrained both observationally and theoretically. With recent work highlighting the impact of galaxy-scale feedback on the instantaneous value of $f_\mathrm{esc}$, it is important to develop a model in which reionization is self-consistently coupl… ▽ More One of the most important parameters in characterizing the Epoch of Reionization, the escape fraction of ionizing photons, $f_\mathrm{esc}$, remains unconstrained both observationally and theoretically. With recent work highlighting the impact of galaxy-scale feedback on the instantaneous value of $f_\mathrm{esc}$, it is important to develop a model in which reionization is self-consistently coupled to galaxy evolution. In this work, we present such a model and explore how physically motivated functional forms of $f_\mathrm{esc}$ affect the evolution of ionized hydrogen within the intergalactic medium. Using the $21$cm power spectrum evolution, we investigate the likelihood of observationally distinguishing between a constant $f_\mathrm{esc}$ and other models that depend upon different forms of galaxy feedback. We find that changing the underlying connection between $f_\mathrm{esc}$ and galaxy feedback drastically alters the large-scale $21$cm power. The upcoming Square Kilometre Array Low Frequency instrument possesses the sensitivity to differentiate between our models at a fixed optical depth, requiring only $200$ hours of integration time focused on redshifts $z = 7.5-8.5$. Generalizing these results to account for a varying optical depth will require multiple $800$ hour observations spanning redshifts $z = 7-10$. This presents an exciting opportunity to observationally constrain one of the most elusive parameters during the Epoch of Reionization. △ Less

Submitted 15 July, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

Comments: 14 pages, 8 figures, 2 tables

Journal ref: MNRAS 487 (2019) 5739-5752

arXiv:1901.08725 [pdf, other]

doi 10.3847/1538-4365/ab1f7d

Model dispersion with PRISM; an alternative to MCMC for rapid analysis of models

Authors: Ellert van der Velden, Alan R. Duffy, Darren Croton, Simon J. Mutch, Manodeep Sinha

Abstract: We have built PRISM, a "Probabilistic Regression Instrument for Simulating Models". PRISM uses the Bayes linear approach and history matching to construct an approximation ('emulator') of any given model, by combining limited model evaluations with advanced regression techniques, covariances and probability calculations. It is designed to easily facilitate and enhance existing Markov chain Monte C… ▽ More We have built PRISM, a "Probabilistic Regression Instrument for Simulating Models". PRISM uses the Bayes linear approach and history matching to construct an approximation ('emulator') of any given model, by combining limited model evaluations with advanced regression techniques, covariances and probability calculations. It is designed to easily facilitate and enhance existing Markov chain Monte Carlo (MCMC) methods by restricting plausible regions and exploring parameter space efficiently. However, PRISM can additionally be used as a standalone alternative to MCMC for model analysis, providing insight into the behavior of complex scientific models. With PRISM, the time spent on evaluating a model is minimized, providing developers with an advanced model analysis for a fraction of the time required by more traditional methods. This paper provides an overview of the different techniques and algorithms that are used within PRISM. We demonstrate the advantage of using the Bayes linear approach over a full Bayesian analysis when analyzing complex models. Our results show how much information can be captured by PRISM and how one can combine it with MCMC methods to significantly speed up calibration processes (>15 times faster). PRISM is an open-source Python package that is available under the BSD 3-Clause License (BSD-3) at https://github.com/1313e/PRISM and hosted at https://prism-tool.readthedocs.io. PRISM has also been reviewed by "The Journal of Open Source Software" (https://doi.org/10.21105/joss.01229). △ Less

Submitted 11 June, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

Comments: 28 pages, 13 figures, 1 table. Updated to reflect some changes made to published version

Journal ref: ApJS 242 22 (2019)

arXiv:1812.02206 [pdf, other]

doi 10.1093/mnras/stz942

The Secondary Spin Bias of Dark Matter Haloes

Authors: James W. Johnson, Ariyeh H. Maller, Andreas A. Berlind, Manodeep Sinha, J. Kelly Holley-Bockelmann

Abstract: We investigate the role of angular momentum in the clustering of dark matter haloes. We make use of data from two high-resolution N-body simulations spanning over four orders of magnitude in halo mass, from $10^{9.8}$ to $10^{14}\ h^{-1}\ \text{M}_\odot$. We explore the hypothesis that mass accretion in filamentary environments alters the angular momentum of a halo, thereby driving a correlation b… ▽ More We investigate the role of angular momentum in the clustering of dark matter haloes. We make use of data from two high-resolution N-body simulations spanning over four orders of magnitude in halo mass, from $10^{9.8}$ to $10^{14}\ h^{-1}\ \text{M}_\odot$. We explore the hypothesis that mass accretion in filamentary environments alters the angular momentum of a halo, thereby driving a correlation between the spin parameter $λ$ and the strength of clustering. However, we do not find evidence that the distribution of matter on large scales is related to the spin of haloes. We find that a halo's spin is correlated with its age, concentration, sphericity, and mass accretion rate. Removing these correlations strongly affects the strength of secondary spin bias at low halo masses. We also find that high spin haloes are slightly more likely to be found near another halo of comparable mass. These haloes that are found near a comparable mass neighbour - a \textit{twin} - are strongly spatially biased. We demonstrate that this \textit{twin bias}, along with the relationship between spin and mass accretion rates, statistically accounts for halo spin secondary bias. △ Less

Submitted 5 December, 2018; originally announced December 2018.

Comments: 11 pages, 6 figures; submitted to MNRAS, comments welcome

arXiv:1811.10090 [pdf, ps, other]

Exponential Separation between Quantum Communication and Logarithm of Approximate Rank

Authors: Makrand Sinha, Ronald de Wolf

Abstract: Chattopadhyay, Mande and Sherif (ECCC 2018) recently exhibited a total Boolean function, the sink function, that has polynomial approximate rank and polynomial randomized communication complexity. This gives an exponential separation between randomized communication complexity and logarithm of the approximate rank, refuting the log-approximate-rank conjecture. We show that even the quantum communi… ▽ More Chattopadhyay, Mande and Sherif (ECCC 2018) recently exhibited a total Boolean function, the sink function, that has polynomial approximate rank and polynomial randomized communication complexity. This gives an exponential separation between randomized communication complexity and logarithm of the approximate rank, refuting the log-approximate-rank conjecture. We show that even the quantum communication complexity of the sink function is polynomial, thus also refuting the quantum log-approximate-rank conjecture. Our lower bound is based on the fooling distribution method introduced by Rao and Sinha (ECCC 2015) for the classical case and extended by Anshu, Touchette, Yao and Yu (STOC 2017) for the quantum case. We also give a new proof of the classical lower bound using the fooling distribution method. △ Less

Submitted 25 November, 2018; originally announced November 2018.

Comments: The same lower bound has been obtained independently and simultaneously by Anurag Anshu, Naresh Goud Boddu and Dave Touchette

arXiv:1809.05252 [pdf, other]

CIMTDetect: A Community Infused Matrix-Tensor Coupled Factorization Based Method for Fake News Detection

Authors: Shashank Gupta, Raghuveer Thirukovalluru, Manjira Sinha, Sandya Mannarswamy

Abstract: Detecting whether a news article is fake or genuine is a crucial task in today's digital world where it's easy to create and spread a misleading news article. This is especially true of news stories shared on social media since they don't undergo any stringent journalistic checking associated with main stream media. Given the inherent human tendency to share information with their social connectio… ▽ More Detecting whether a news article is fake or genuine is a crucial task in today's digital world where it's easy to create and spread a misleading news article. This is especially true of news stories shared on social media since they don't undergo any stringent journalistic checking associated with main stream media. Given the inherent human tendency to share information with their social connections at a mouse-click, fake news articles masquerading as real ones, tend to spread widely and virally. The presence of echo chambers (people sharing same beliefs) in social networks, only adds to this problem of wide-spread existence of fake news on social media. In this paper, we tackle the problem of fake news detection from social media by exploiting the very presence of echo chambers that exist within the social network of users to obtain an efficient and informative latent representation of the news article. By modeling the echo-chambers as closely-connected communities within the social network, we represent a news article as a 3-mode tensor of the structure - <News, User, Community> and propose a tensor factorization based method to encode the news article in a latent embedding space preserving the community structure. We also propose an extension of the above method, which jointly models the community and content information of the news article through a coupled matrix-tensor factorization framework. We empirically demonstrate the efficacy of our method for the task of Fake News Detection over two real-world datasets. Further, we validate the generalization of the resulting embeddings over two other auxiliary tasks, namely: \textbf{1)} News Cohort Analysis and \textbf{2)} Collaborative News Recommendation. Our proposed method outperforms appropriate baselines for both the tasks, establishing its generalization. △ Less

Submitted 14 September, 2018; originally announced September 2018.

Comments: Presented at ASONAM'18

arXiv:1809.04622 [pdf, other]

doi 10.1093/mnras/sty2111

The Three Hundred project: a large catalogue of theoretically modelled galaxy clusters for cosmological and astrophysical applications

Authors: Weiguang Cui, Alexander Knebe, Gustavo Yepes, Frazer Pearce, Chris Power, Romeel Dave, Alexander Arth, Stefano Borgani, Klaus Dolag, Pascal Elahi, Robert Mostoghiu, Giuseppe Murante, Elena Rasia, Doris Stoppacher, Jesus Vega-Ferrero, Yang Wang, Xiaohu Yang, Andrew Benson, Sofía A. Cora, Darren J. Croton, Manodeep Sinha, Adam R. H. Stevens, Cristian A. Vega-Martínez, Jake Arthur, Anna S. Baldi , et al. (12 additional authors not shown)

Abstract: We introduce the THE THREE HUNDRED project, an endeavour to model 324 large galaxy clusters with full-physics hydrodynamical re-simulations. Here we present the data set and study the differences to observations for fundamental galaxy cluster properties and scaling relations. We find that the modelled galaxy clusters are generally in reasonable agreement with observations with respect to baryonic… ▽ More We introduce the THE THREE HUNDRED project, an endeavour to model 324 large galaxy clusters with full-physics hydrodynamical re-simulations. Here we present the data set and study the differences to observations for fundamental galaxy cluster properties and scaling relations. We find that the modelled galaxy clusters are generally in reasonable agreement with observations with respect to baryonic fractions and gas scaling relations at redshift z = 0. However, there are still some (model-dependent) differences, such as central galaxies being too massive, and galaxy colours (g - r) being bluer (about 0.2 dex lower at the peak position) than in observations. The agreement in gas scaling relations down to 10^{13} h^{-1} M_{\odot} between the simulations indicates that particulars of the sub-grid modelling of the baryonic physics only has a weak influence on these relations. We also include - where appropriate - a comparison to three semi-analytical galaxy formation models as applied to the same underlying dark-matter-only simulation. All simulations and derived data products are publicly available. △ Less

Submitted 12 September, 2018; originally announced September 2018.

Comments: 20 pages, 8 figures, 7 tables. MNRAS published version

Journal ref: Cui, W., Knebe, A., Yepes, G., et al.\ 2018, \mnras, 480, 2898

arXiv:1806.11284 [pdf, other]

doi 10.1093/mnrasl/sly122

The Indirect Influence of Quasars on Reionization

Authors: Jacob Seiler, Anne Hutter, Manodeep Sinha, Darren Croton

Abstract: The exact role of quasars during the Epoch of Reionization remains uncertain. With consensus leaning towards quasars producing a negligible amount of ionizing photons, we pose an alternate question: Can quasars indirectly contribute to reionization by allowing ionizing photons from stars to escape more easily? Using the Semi-Analytic Galaxy Evolution model to evolve a galaxy population through cos… ▽ More The exact role of quasars during the Epoch of Reionization remains uncertain. With consensus leaning towards quasars producing a negligible amount of ionizing photons, we pose an alternate question: Can quasars indirectly contribute to reionization by allowing ionizing photons from stars to escape more easily? Using the Semi-Analytic Galaxy Evolution model to evolve a galaxy population through cosmic time, we construct an idealized scenario in which the escape fraction of stellar ionizing photons ($f_\mathrm{esc}$) is boosted following quasar wind events, potentially for several dynamical times. We find that under this scenario, the mean value of $f_\mathrm{esc}$ as a function of galaxy stellar mass peaks for intermediate mass galaxies. This mass dependence will have consequences for the 21cm power spectrum, enhancing power at small scales and suppressing it at large scales. This hints that whilst quasars may not directly contribute to the ionizing photon budget, they could influence reionization indirectly by altering the topology of ionized regions. △ Less

Submitted 29 June, 2018; originally announced June 2018.

Comments: 5 pages, 3 figures

arXiv:1806.07402 [pdf, other]

doi 10.1093/mnras/sty2650

Connecting and dissecting galaxies' angular momenta and neutral gas in a hierarchical universe: cue DARK SAGE

Authors: Adam R. H. Stevens, Claudia del P. Lagos, Danail Obreschkow, Manodeep Sinha

Abstract: We explore the connection between the atomic gas fraction, f_atm, and `global disc stability' parameter, q, of galaxies within a fully cosmological context by examining galaxies in the Dark Sage semi-analytic model. The q parameter is determined by the ratio of disc specific angular momentum to mass. Dark Sage is well suited to our study, as it includes the numerical evolution of one-dimensional d… ▽ More We explore the connection between the atomic gas fraction, f_atm, and `global disc stability' parameter, q, of galaxies within a fully cosmological context by examining galaxies in the Dark Sage semi-analytic model. The q parameter is determined by the ratio of disc specific angular momentum to mass. Dark Sage is well suited to our study, as it includes the numerical evolution of one-dimensional disc structure, making both j_disc and q predicted quantities. We show that Dark Sage produces a clear correlation between gas fraction and j_disc at fixed disc mass, in line with recent results from observations and hydrodynamic simulations. This translates to a tight q--f_atm sequence for star-forming central galaxies, which closely tracks the analytic prediction of Obreschkow et al. The scatter in this sequence is driven by the probability distribution function of mass as a function of j (PDF of j) within discs, specifically where it peaks. We find that halo mass is primarily responsible for the peak location of the PDF of j, at least for low values of q. Two main mechanisms of equal significance are then identified for disconnecting f_atm from q. Mergers in the model can trigger quasar winds, with the potential to blow out most of the gas disc, while leaving the stellar disc relatively unharmed. Ram-pressure stripping of satellite galaxies has a similar effect, where f_atm can drop drastically with only a minimal effect to q. We highlight challenges associated with following these predictions up with observations. △ Less

Submitted 17 August, 2018; v1 submitted 19 June, 2018; originally announced June 2018.

Comments: 13 pages, 8 figures (excluding references and appendices). Submitted to MNRAS. Revisions after referee's report

arXiv:1803.06348 [pdf, other]

doi 10.1093/mnras/stz558

Likelihood Non-Gaussianity in Large-Scale Structure Analyses

Authors: ChangHoon Hahn, Florian Beutler, Manodeep Sinha, Andreas Berlind, Shirley Ho, David W. Hogg

Abstract: Standard present day large-scale structure (LSS) analyses make a major assumption in their Bayesian parameter inference --- that the likelihood has a Gaussian form. For summary statistics currently used in LSS, this assumption, even if the underlying density field is Gaussian, cannot be correct in detail. We investigate the impact of this assumption on two recent LSS analyses: the Beutler et al. (… ▽ More Standard present day large-scale structure (LSS) analyses make a major assumption in their Bayesian parameter inference --- that the likelihood has a Gaussian form. For summary statistics currently used in LSS, this assumption, even if the underlying density field is Gaussian, cannot be correct in detail. We investigate the impact of this assumption on two recent LSS analyses: the Beutler et al. (2017) power spectrum multipole ($P_\ell$) analysis and the Sinha et al. (2017) group multiplicity function ($ζ$) analysis. Using non-parametric divergence estimators on mock catalogs originally constructed for covariance matrix estimation, we identify significant non-Gaussianity in both the $P_\ell$ and $ζ$ likelihoods. We then use Gaussian mixture density estimation and Independent Component Analysis on the same mocks to construct likelihood estimates that approximate the true likelihood better than the Gaussian $pseudo$-likelihood. Using these likelihood estimates, we accurately estimate the true posterior probability distribution of the Beutler et al. (2017) and Sinha et al. (2017) parameters. Likelihood non-Gaussianity shifts the $fσ_8$ constraint by $-0.44σ$, but otherwise, does not significantly impact the overall parameter constraints of Beutler et al. (2017). For the $ζ$ analysis, using the pseudo-likelihood significantly underestimates the uncertainties and biases the constraints of Sinha et al. (2017) halo occupation parameters. For $\log M_1$ and $α$, the posteriors are shifted by $+0.43σ$ and $-0.51σ$ and broadened by $42\%$ and $66\%$, respectively. The divergence and likelihood estimation methods we present provide a straightforward framework for quantifying the impact of likelihood non-Gaussianity and deriving more accurate parameter constraints. △ Less

Submitted 16 March, 2018; originally announced March 2018.

Comments: 33 pages, 7 figures

arXiv:1712.02797 [pdf, other]

doi 10.1093/mnras/sty2000

Small- and Large-Scale Galactic Conformity in SDSS DR7

Authors: Victor F. Calderon, Andreas A. Berlind, Manodeep Sinha

Abstract: Galactic conformity is the phenomenon whereby galaxy properties exhibit excess correlations across distance than that expected if these properties only depended on halo mass. We perform a comprehensive study of conformity at low redshift using a galaxy group catalogue from the SDSS DR7 spectroscopic sample. We study correlations both between central galaxies and their satellites (1-halo), and betw… ▽ More Galactic conformity is the phenomenon whereby galaxy properties exhibit excess correlations across distance than that expected if these properties only depended on halo mass. We perform a comprehensive study of conformity at low redshift using a galaxy group catalogue from the SDSS DR7 spectroscopic sample. We study correlations both between central galaxies and their satellites (1-halo), and between central galaxies in separate haloes (2-halo). We use the quenched fractions and the marked correlation function (MCF), to probe for conformity in three galaxy properties, $(g-r)$ colour, specific star formation rate (sSFR), and morphology. We assess the statistical significance of conformity signals with a suite of mock galaxy catalogues that have no built-in conformity, but contain the same group-finding and mass assignment errors as the real data. In the case of 1-halo conformity, quenched fractions show strong signals at all group masses. However, these signals are equally strong in mock catalogues, indicating that the conformity signal is spurious and likely entirely caused by group-finding systematics, calling into question previous claims of 1-halo conformity detection. The MCF reveals a significant detection of radial segregation within massive groups, but no evidence of conformity. In the case of 2-halo conformity, quenched fractions show no significant evidence of conformity in colour or sSFR once compared with mock catalogues, but a clear signal using morphology. In contrast, the MCF reveals a small, yet highly significant signal for all three properties in low mass groups and scales of $0.8-4\ h^{-1}\textrm{Mpc}$, possibly representing the first robust detection of 2-halo conformity. △ Less

Submitted 15 August, 2018; v1 submitted 7 December, 2017; originally announced December 2017.

Comments: 16 pages, 5 figures, accepted by MNRAS

arXiv:1711.10145 [pdf, other]

Lower Bounds for Approximating the Matching Polytope

Authors: Makrand Sinha

Abstract: We prove that any extended formulation that approximates the matching polytope on $n$-vertex graphs up to a factor of $(1+\varepsilon)$ for any $\frac2n \le \varepsilon \le 1$ must have at least $\binom{n}{α/{\varepsilon}}$ defining inequalities where $0<α<1$ is an absolute constant. This is tight as exhibited by the $(1+\varepsilon)$ approximating linear program obtained by dropping the odd set c… ▽ More We prove that any extended formulation that approximates the matching polytope on $n$-vertex graphs up to a factor of $(1+\varepsilon)$ for any $\frac2n \le \varepsilon \le 1$ must have at least $\binom{n}{α/{\varepsilon}}$ defining inequalities where $0<α<1$ is an absolute constant. This is tight as exhibited by the $(1+\varepsilon)$ approximating linear program obtained by dropping the odd set constraints of size larger than $({1+\varepsilon})/{\varepsilon}$ from the description of the matching polytope. Previously, a tight lower bound of $2^{Ω(n)}$ was only known for $\varepsilon = O\left(\frac{1}{n}\right)$ [Rothvoss, STOC '14; Braun and Pokutta, IEEE Trans. Information Theory '15] whereas for $\frac2n \le \varepsilon \le 1$, the best lower bound was $2^{Ω\left({1}/{\varepsilon}\right)}$ [Rothvoss, STOC '14]. The key new ingredient in our proof is a close connection to the non-negative rank of a lopsided version of the unique disjointness matrix. △ Less

Submitted 28 November, 2017; originally announced November 2017.

Comments: To appear in proceedings of SODA '18

arXiv:1711.07567 [pdf, other]

Edge Estimation with Independent Set Oracles

Authors: Paul Beame, Sariel Har-Peled, Sivaramakrishnan Natarajan Ramamoorthy, Cyrus Rashtchian, Makrand Sinha

Abstract: We study the task of estimating the number of edges in a graph with access to only an independent set oracle. Independent set queries draw motivation from group testing and have applications to the complexity of decision versus counting problems. We give two algorithms to estimate the number of edges in an $n$-vertex graph, using (i) $\mathrm{polylog}(n)$ bipartite independent set queries, or (ii)… ▽ More We study the task of estimating the number of edges in a graph with access to only an independent set oracle. Independent set queries draw motivation from group testing and have applications to the complexity of decision versus counting problems. We give two algorithms to estimate the number of edges in an $n$-vertex graph, using (i) $\mathrm{polylog}(n)$ bipartite independent set queries, or (ii) ${n}^{2/3} \cdot\mathrm{polylog}(n)$ independent set queries. △ Less

Submitted 11 March, 2020; v1 submitted 20 November, 2017; originally announced November 2017.

Comments: A preliminary version appeared in the proceedings of ITCS 2018

ACM Class: F.1.1; F.2

arXiv:1710.08150 [pdf, other]

doi 10.1093/mnras/stx2662

MultiDark-Galaxies: data release and first results

Authors: Alexander Knebe, Doris Stoppacher, Francisco Prada, Christoph Behrens, Andrew Benson, Sofia A. Cora, Darren J. Croton, Nelson D. Padilla, Andrés N. Ruiz, Manodeep Sinha, Adam R. H. Stevens, Cristian A. Vega-Martínez, Peter Behroozi, Violeta Gonzalez-Perez, Stefan Gottlöber, Anatoly A. Klypin, Gustavo Yepes, Harry Enke, Noam I. Libeskind, Kristin Riebe, Matthias Steinmetz

Abstract: We present the public release of the MultiDark-Galaxies: three distinct galaxy catalogues derived from one of the Planck cosmology MultiDark simulations (i.e. MDPL2, with a volume of (1 Gpc/$h$)$^{3}$ and mass resolution of $1.5 \times 10^{9} M_{\odot}/h$) by applying the semi-analytic models GALACTICUS, SAG, and SAGE to it. We compare the three models and their conformity with observational data… ▽ More We present the public release of the MultiDark-Galaxies: three distinct galaxy catalogues derived from one of the Planck cosmology MultiDark simulations (i.e. MDPL2, with a volume of (1 Gpc/$h$)$^{3}$ and mass resolution of $1.5 \times 10^{9} M_{\odot}/h$) by applying the semi-analytic models GALACTICUS, SAG, and SAGE to it. We compare the three models and their conformity with observational data for a selection of fundamental properties of galaxies like stellar mass function, star formation rate, cold gas fractions, and metallicities - noting that they sometimes perform differently reflecting model designs and calibrations. We have further selected galaxy subsamples of the catalogues by number densities in stellar mass, cold gas mass, and star formation rate in order to study the clustering statistics of galaxies. We show that despite different treatment of orphan galaxies, i.e. galaxies that lost their dark-matter host halo due to the finite mass resolution of the N-body simulation or tidal stripping, the clustering signal is comparable, and reproduces the observations in all three models - in particular when selecting samples based upon stellar mass. Our catalogues provide a powerful tool to study galaxy formation within a volume comparable to those probed by on-going and future photometric and redshift surveys. All model data consisting of a range of galaxy properties - including broad-band SDSS magnitudes - are publicly available. △ Less

Submitted 23 October, 2017; originally announced October 2017.

Comments: 29 pages, 16 figures, 8 tables, accepted for publication in MNRAS. All data incl. the complete galaxy catalogues for all models are publicly available from the CosmoSim database (http://www.cosmosim.org); a selected set of galaxy properties is available via the Skies & Universes website (http://www.skiesanduniverses.org)

arXiv:1708.08451 [pdf, other]

doi 10.1093/mnras/sty109

Spatial Clustering of Dark Matter Halos: Secondary Bias, Neighbor Bias, and the Influence of Massive Neighbors on Halo Properties

Authors: Andrés N. Salcedo, Ariyeh H. Maller, Andreas A. Berlind, Manodeep Sinha, Cameron K. McBride, Peter S. Behroozi, Risa H. Wechsler, David H. Weinberg

Abstract: We explore the phenomenon commonly known as halo assembly bias, whereby dark matter halos of the same mass are found to be more or less clustered when a second halo property is considered, for halos in the mass range $3.7 \times 10^{11} \; h^{-1} \mathrm{M_{\odot}} - 5.0 \times 10^{13} \; h^{-1} \mathrm{M_{\odot}}$. Using the Large Suite of Dark Matter Simulations (LasDamas) we consider nine commo… ▽ More We explore the phenomenon commonly known as halo assembly bias, whereby dark matter halos of the same mass are found to be more or less clustered when a second halo property is considered, for halos in the mass range $3.7 \times 10^{11} \; h^{-1} \mathrm{M_{\odot}} - 5.0 \times 10^{13} \; h^{-1} \mathrm{M_{\odot}}$. Using the Large Suite of Dark Matter Simulations (LasDamas) we consider nine commonly used halo properties and find that a clustering bias exists if halos are binned by mass or by any other halo property. This secondary bias implies that no single halo property encompasses all the spatial clustering information of the halo population. The mean values of some halo properties depend on their halo's distance to a more massive neighbor. Halo samples selected by having high values of one of these properties therefore inherit a neighbor bias such that they are much more likely to be close to a much more massive neighbor. This neighbor bias largely accounts for the secondary bias seen in halos binned by mass and split by concentration or age. However, halos binned by other mass-like properties still show a secondary bias even when the neighbor bias is removed. The secondary bias of halos selected by their spin behaves differently than that for other halo properties, suggesting that the origin of the spin bias is different than of other secondary biases. △ Less

Submitted 9 January, 2018; v1 submitted 28 August, 2017; originally announced August 2017.

Comments: 14 pages, LaTeX; minor revisions, and added references; results unchanged

arXiv:1708.04892 [pdf, other]

doi 10.1093/mnras/sty967

Towards Accurate Modelling of Galaxy Clustering on Small Scales: Testing the Standard $Λ\mathrm{CDM}$ + Halo Model

Authors: Manodeep Sinha, Andreas A. Berlind, Cameron K. McBride, Roman Scoccimarro, Jennifer A. Piscionere, Benjamin D. Wibking

Abstract: Interpreting the small-scale clustering of galaxies with halo models can elucidate the connection between galaxies and dark matter halos. Unfortunately, the modelling is typically not sufficiently accurate for ruling out models statistically. It is thus difficult to use the information encoded in small scales to test cosmological models or probe subtle features of the galaxy-halo connection. In th… ▽ More Interpreting the small-scale clustering of galaxies with halo models can elucidate the connection between galaxies and dark matter halos. Unfortunately, the modelling is typically not sufficiently accurate for ruling out models statistically. It is thus difficult to use the information encoded in small scales to test cosmological models or probe subtle features of the galaxy-halo connection. In this paper, we attempt to push halo modelling into the "accurate" regime with a fully numerical mock-based methodology and careful treatment of statistical and systematic errors. With our forward-modelling approach, we can incorporate clustering statistics beyond the traditional two-point statistics. We use this modelling methodology to test the standard $Λ\mathrm{CDM}$ + halo model against the clustering of SDSS DR7 galaxies. Specifically, we use the projected correlation function, group multiplicity function and galaxy number density as constraints. We find that while the model fits each statistic separately, it struggles to fit them simultaneously. Adding group statistics leads to a more stringent test of the model and significantly tighter constraints on model parameters. We explore the impact of varying the adopted halo definition and cosmological model and find that changing the cosmology makes a significant difference. The most successful model we tried (Planck cosmology with Mvir halos) matches the clustering of low luminosity galaxies, but exhibits a 2.3$σ$ tension with the clustering of luminous galaxies, thus providing evidence that the "standard" halo model needs to be extended. This work opens the door to adding interesting freedom to the halo model and including additional clustering statistics as constraints. △ Less

Submitted 6 August, 2018; v1 submitted 16 August, 2017; originally announced August 2017.

Comments: Replaced to match the published version

Journal ref: http://adsabs.harvard.edu/abs/2018MNRAS.478.1042S

arXiv:1701.00895 [pdf, ps, other]

doi 10.1088/1742-6596/861/1/012025

From microphysics to dynamics of magnetars

Authors: Armen Sedrakian, Xu-Guang Huang, Monika Sinha, John W. Clark

Abstract: MeV-scale magnetic fields in the interiors of magnetars suppress the pairing of neutrons and protons in the $S$-wave state. In the case of a neutron condensate the suppression is the consequence of the Pauli-paramagnetism of the neutron gas, i.e., the alignment of the neutron spins along the magnetic field. The proton $S$-wave pairing is suppressed because of the Landau diamagnetic currents of pro… ▽ More MeV-scale magnetic fields in the interiors of magnetars suppress the pairing of neutrons and protons in the $S$-wave state. In the case of a neutron condensate the suppression is the consequence of the Pauli-paramagnetism of the neutron gas, i.e., the alignment of the neutron spins along the magnetic field. The proton $S$-wave pairing is suppressed because of the Landau diamagnetic currents of protons induced by the field. The Ginzburg-Landau and BCS theories of the critical magnetic fields for unpairing are reviewed. The macrophysical implications of the suppression (unpairing) of the condensates are discussed for the rotational crust-core coupling in magnetars and the neutrino-dominated cooling era of their thermal evolution. △ Less

Submitted 3 January, 2017; originally announced January 2017.

Comments: 10 pages, 4 figures, Proceedings of "Compact Stars in the QCD phase diagram V", 23-27 May 2016 GSSI and LNGS, L'Aquila, Italy

Journal ref: Journal of Physics: Conf. Series 861 (2017) 012025

arXiv:1610.03159 [pdf, ps, other]

The Astropy Problem

Authors: Demitri Muna, Michael Alexander, Alice Allen, Richard Ashley, Daniel Asmus, Ruyman Azzollini, Michele Bannister, Rachael Beaton, Andrew Benson, G. Bruce Berriman, Maciej Bilicki, Peter Boyce, Joanna Bridge, Jan Cami, Eryn Cangi, Xian Chen, Nicholas Christiny, Christopher Clark, Michelle Collins, Johan Comparat, Neil Cook, Darren Croton, Isak Delberth Davids, Éric Depagne, John Donor , et al. (129 additional authors not shown)

Abstract: The Astropy Project (http://astropy.org) is, in its own words, "a community effort to develop a single core package for Astronomy in Python and foster interoperability between Python astronomy packages." For five years this project has been managed, written, and operated as a grassroots, self-organized, almost entirely volunteer effort while the software is used by the majority of the astronomical… ▽ More The Astropy Project (http://astropy.org) is, in its own words, "a community effort to develop a single core package for Astronomy in Python and foster interoperability between Python astronomy packages." For five years this project has been managed, written, and operated as a grassroots, self-organized, almost entirely volunteer effort while the software is used by the majority of the astronomical community. Despite this, the project has always been and remains to this day effectively unfunded. Further, contributors receive little or no formal recognition for creating and supporting what is now critical software. This paper explores the problem in detail, outlines possible solutions to correct this, and presents a few suggestions on how to address the sustainability of general purpose astronomical software. △ Less

Submitted 10 October, 2016; originally announced October 2016.

arXiv:1606.05138 [pdf, ps, other]

doi 10.1093/mnras/stw1492

Method for Determining AGN Accretion Phase in Field Galaxies

Authors: Miroslav Micic, Nemanja Martinovic, Manodeep Sinha

Abstract: Recent observations of AGN activity in massive galaxies (log Mstar / Msun > 10.4) show that: 1) at z < 1, AGN-hosting galaxies do not show enhanced merger signatures compared to normal galaxies, 2) also at z < 1, most AGNs are hosted by quiescent galaxies; and 3) at z > 1, percentage of AGNs in star forming galaxies increases and becomes comparable to AGN percentage in quiescent galaxies at z ~ 2.… ▽ More Recent observations of AGN activity in massive galaxies (log Mstar / Msun > 10.4) show that: 1) at z < 1, AGN-hosting galaxies do not show enhanced merger signatures compared to normal galaxies, 2) also at z < 1, most AGNs are hosted by quiescent galaxies; and 3) at z > 1, percentage of AGNs in star forming galaxies increases and becomes comparable to AGN percentage in quiescent galaxies at z ~ 2. How can major mergers explain AGN activity in massive quiescent galaxies which have no merger features and no star formation to indicate recent galaxy merger? By matching merger events in a cosmological N-body simulation to the observed AGN incidence probability in the COSMOS survey, we show that major merger triggered AGN activity is consistent with the observations. By distinguishing between "peak" AGNs (recently merger triggered and hosted by star forming galaxies) and "faded" AGNs (merger triggered a long time ago and now residing in quiescent galaxies), we show that the AGN occupation fraction in star forming and quiescent galaxies simply follows the evolution of the galaxy merger rate. Since the galaxy merger rate drops dramatically at z < 1, the only AGNs left to be observed are the ones triggered by old mergers and are now in the declining phase of their nuclear activity, hosted by quiescent galaxies. As we go toward higher redshifts the galaxy merger rate increases and the percentages of "peak" AGNs and "faded" AGNs become comparable. △ Less

Submitted 16 June, 2016; originally announced June 2016.

Comments: accepted by MNRAS

arXiv:1606.04106 [pdf, other]

doi 10.3847/1538-3881/aa859f

Forward Modeling of Large-Scale Structure: An open-source approach with Halotools

Authors: Andrew Hearin, Duncan Campbell, Erik Tollerud, Peter Behroozi, Benedikt Diemer, Nathan J. Goldbaum, Elise Jennings, Alexie Leauthaud, Yao-Yuan Mao, Surhud More, John Parejko, Manodeep Sinha, Brigitta Sipocz, Andrew Zentner

Abstract: We present the first stable release of Halotools (v0.2), a community-driven Python package designed to build and test models of the galaxy-halo connection. Halotools provides a modular platform for creating mock universes of galaxies starting from a catalog of dark matter halos obtained from a cosmological simulation. The package supports many of the common forms used to describe galaxy-halo model… ▽ More We present the first stable release of Halotools (v0.2), a community-driven Python package designed to build and test models of the galaxy-halo connection. Halotools provides a modular platform for creating mock universes of galaxies starting from a catalog of dark matter halos obtained from a cosmological simulation. The package supports many of the common forms used to describe galaxy-halo models: the halo occupation distribution (HOD), the conditional luminosity function (CLF), abundance matching, and alternatives to these models that include effects such as environmental quenching or variable galaxy assembly bias. Satellite galaxies can be modeled to live in subhalos, or to follow custom number density profiles within their halos, including spatial and/or velocity bias with respect to the dark matter profile. The package has an optimized toolkit to make mock observations on a synthetic galaxy population, including galaxy clustering, galaxy-galaxy lensing, galaxy group identification, RSD multipoles, void statistics, pairwise velocities and others, allowing direct comparison to observations. Halotools is object-oriented, enabling complex models to be built from a set of simple, interchangeable components, including those of your own creation. Halotools has an automated testing suite and is exhaustively documented on http://halotools.readthedocs.io, which includes quickstart guides, source code notes and a large collection of tutorials. The documentation is effectively an online textbook on how to build and study empirical models of galaxy formation with Python. △ Less

Submitted 22 September, 2017; v1 submitted 13 June, 2016; originally announced June 2016.

Comments: Revisions match version accepted for publication in AAS

arXiv:1510.04416 [pdf, ps, other]

doi 10.1103/PhysRevC.93.044601

Study of $^{26}$Mg through 1p pick up reaction $^{27}$Al(d,$^{3}$He)

Authors: Vishal Srivastava, C. Bhattacharya, T. K. Rana, S. Manna, S. Kundu, S. Bhattacharya, K. Banerjee, P. Roy, R. Pandey, G. Mukherjee, T. K. Ghosh, J. K. Meena, T. Roy, A. Chaudhuri, M. Sinha, A. K. Saha, Md. A. Asgar, A. Dey, Subinit Roy, Md. M. Shaikh

Abstract: The even-even nucleus $^{26}$Mg has been studied through the reaction $^{27}$Al(d,$^{3}$He) at 25 MeV beam energy. The spectroscopic factors have been extracted upto 7.50 MeV excitation energy using local, zero range distorted wave Born approximation. The comparison of the spectroscopic factors have been done with previously reported values using the same reaction probe. The extracted spectroscopi… ▽ More The even-even nucleus $^{26}$Mg has been studied through the reaction $^{27}$Al(d,$^{3}$He) at 25 MeV beam energy. The spectroscopic factors have been extracted upto 7.50 MeV excitation energy using local, zero range distorted wave Born approximation. The comparison of the spectroscopic factors have been done with previously reported values using the same reaction probe. The extracted spectroscopic factors for different excited states were found to be in good agreement with the previously reported values for the same. The present results were also compared with the predictions from shell model as well as rotational model. The analog states of $^{26}$Al and $^{26}$Mg were found to be in good agreement. △ Less

Submitted 15 October, 2015; originally announced October 2015.

Journal ref: Phys. Rev. C 93, 044601 (2016)

arXiv:1509.00482 [pdf, ps, other]

doi 10.1093/mnras/stw1080

Connecting massive galaxies to dark matter halos in BOSS - I. Is galaxy color a stochastic process in high-mass halos?

Authors: Shun Saito, Alexie Leauthaud, Andrew P. Hearin, Kevin Bundy, Andrew R. Zentner, Peter S. Behroozi, Beth A. Reid, Manodeep Sinha, Jean Coupon, Jeremy L. Tinker, Martin White, Donald P. Schneider

Abstract: We use subhalo abundance matching (SHAM) to model the stellar mass function (SMF) and clustering of the Baryon Oscillation Spectroscopic Survey (BOSS) "CMASS" sample at $z\sim0.5$. We introduce a novel method which accounts for the stellar mass incompleteness of CMASS as a function of redshift, and produce CMASS mock catalogs which include selection effects, reproduce the overall SMF, the projecte… ▽ More We use subhalo abundance matching (SHAM) to model the stellar mass function (SMF) and clustering of the Baryon Oscillation Spectroscopic Survey (BOSS) "CMASS" sample at $z\sim0.5$. We introduce a novel method which accounts for the stellar mass incompleteness of CMASS as a function of redshift, and produce CMASS mock catalogs which include selection effects, reproduce the overall SMF, the projected two-point correlation function $w_{\rm p}$, the CMASS $dn/dz$, and are made publicly available. We study the effects of assembly bias above collapse mass in the context of "age matching" and show that these effects are markedly different compared to the ones explored by Hearin et al. (2013) at lower stellar masses. We construct two models, one in which galaxy color is stochastic ("AbM" model) as well as a model which contains assembly bias effects ("AgM" model). By confronting the redshift dependent clustering of CMASS with the predictions from our model, we argue that that galaxy colors are not a stochastic process in high-mass halos. Our results suggest that the colors of galaxies in high-mass halos are determined by other halo properties besides halo peak velocity and that assembly bias effects play an important role in determining the clustering properties of this sample. △ Less

Submitted 24 May, 2016; v1 submitted 1 September, 2015; originally announced September 2015.

Comments: 22 pages. Appendix. B added. Matches the version accepted by MNRAS. Mock galaxy catalog and HOD table are available at http://www.massivegalaxies.com

Report number: IPMU15-0129

Showing 51–100 of 145 results for author: Sinha, M