subscribe to arXiv mailings

doi 10.1145/3659604

From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models

Authors: Zachary Englhardt, Chengqian Ma, Margaret E. Morris, Xuhai "Orson" Xu, Chun-Cheng Chang, Lianhui Qin, Daniel McDuff, Xin Liu, Shwetak Patel, Vikram Iyer

Abstract: Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, developing analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental hea… ▽ More Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, developing analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental health. To address these challenges, we take a novel approach that leverages large language models (LLMs) to synthesize clinically useful insights from multi-sensor data. We develop chain of thought prompting methods that use LLMs to generate reasoning about how trends in data such as step count and sleep relate to conditions like depression and anxiety. We first demonstrate binary depression classification with LLMs achieving accuracies of 61.1% which exceed the state of the art. While it is not robust for clinical use, this leads us to our key finding: even more impactful and valued than classification is a new human-AI collaboration approach in which clinician experts interactively query these tools and combine their domain expertise and context about the patient with AI generated reasoning to support clinical decision-making. We find models like GPT-4 correctly reference numerical data 75% of the time, and clinician participants express strong interest in using this approach to interpret self-tracking data. △ Less

Submitted 25 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

arXiv:2303.12830 [pdf, other]

doi 10.3847/1538-4365/acda30

A Quick Look at the 3GHz Radio Sky. II. Hunting for DRAGNs in the VLA Sky Survey

Authors: Yjan A. Gordon, Lawrence Rudnick, Heinz Andernach, Leah K. Morabito, Christopher P. O'Dea, Kaylan-Marie Achong, Stefi A. Baum, Caryelis Bayona-Figueroa, Eric J. Hooper, Beatriz Mingo, Melissa E. Morris, Adrian N. Vantyghem

Abstract: Active Galactic Nuclei (AGN) can often be identified in radio images as two lobes, sometimes connected to a core by a radio jet. This multi-component morphology unfortunately creates difficulties for source-finders, leading to components that are a) separate parts of a wider whole, and b) offset from the multiwavelength cross identification of the host galaxy. In this work we define an algorithm,… ▽ More Active Galactic Nuclei (AGN) can often be identified in radio images as two lobes, sometimes connected to a core by a radio jet. This multi-component morphology unfortunately creates difficulties for source-finders, leading to components that are a) separate parts of a wider whole, and b) offset from the multiwavelength cross identification of the host galaxy. In this work we define an algorithm, \textsc{DRAGNhunter}, for identifying Double Radio Sources associated with Active Galactic Nuclei (DRAGNs) from component catalog data in the first epoch \textit{Quick Look} images of the high resolution ($\approx 3''$ beam size) Very Large Array Sky Survey (VLASS). We use \textsc{DRAGNhunter} to construct a catalog of $>17,000$ DRAGNs in VLASS for which contamination from spurious sources is estimated at $\approx 11\,\%$. A `high-fidelity' sample consisting of $90\,\%$ of our catalog is identified for which contamination is $<3\,\%$. Host galaxies are found for $\approx 13,000$ DRAGNs as well as for an additional $234,000$ single-component radio sources. Using these data we explore the properties of our DRAGNs, finding them to be typically consistent with Fanaroff-Riley class II sources and allowing us to report the discovery of $31$ new giant radio galaxies identified using VLASS. △ Less

Submitted 22 May, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

Comments: Accepted for publication in ApJS. 33 pages, 25 figures, 6 tables. Minor textual changes with respect to previous version, One table added, two example tables removed. Catalog data will be available via https://cirada.ca/, https://vizier.cds.unistra.fr/viz-bin/VizieR and in the online version of the ApJS article

arXiv:2211.02733 [pdf, other]

GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization

Authors: Xuhai Xu, Han Zhang, Yasaman Sefidgar, Yiyi Ren, Xin Liu, Woosuk Seo, Jennifer Brown, Kevin Kuehn, Mike Merrill, Paula Nurius, Shwetak Patel, Tim Althoff, Margaret E. Morris, Eve Riskin, Jennifer Mankoff, Anind K. Dey

Abstract: Recent research has demonstrated the capability of behavior signals captured by smartphones and wearables for longitudinal behavior modeling. However, there is a lack of a comprehensive public dataset that serves as an open testbed for fair comparison among algorithms. Moreover, prior studies mainly evaluate algorithms using data from a single population within a short period, without measuring th… ▽ More Recent research has demonstrated the capability of behavior signals captured by smartphones and wearables for longitudinal behavior modeling. However, there is a lack of a comprehensive public dataset that serves as an open testbed for fair comparison among algorithms. Moreover, prior studies mainly evaluate algorithms using data from a single population within a short period, without measuring the cross-dataset generalizability of these algorithms. We present the first multi-year passive sensing datasets, containing over 700 user-years and 497 unique users' data collected from mobile and wearable sensors, together with a wide range of well-being metrics. Our datasets can support multiple cross-dataset evaluations of behavior modeling algorithms' generalizability across different users and years. As a starting point, we provide the benchmark results of 18 algorithms on the task of depression detection. Our results indicate that both prior depression detection algorithms and domain generalization techniques show potential but need further research to achieve adequate cross-dataset generalizability. We envision our multi-year datasets can support the ML community in developing generalizable longitudinal behavior modeling algorithms. △ Less

Submitted 4 March, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

Comments: Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track

MSC Class: 68T09 ACM Class: I.2.1; E.m

arXiv:2204.08510 [pdf, other]

doi 10.3847/1538-3881/ac66db

How does environment affect the morphology of radio AGN?

Authors: Melissa Elizabeth Morris, Eric Wilcots, Eric Hooper, Sebastian Heinz

Abstract: Galaxies hosting Active Galactic Nuclei (AGN) with bent radio jets are used as tracers of dense environments, such as galaxy groups and clusters. The assumption behind using these jets is that they are bent under ram pressure from a dense, gaseous medium through which the host galaxy moves. However, there are many AGN in groups and clusters with jets that are not bent, which leads us to ask: why a… ▽ More Galaxies hosting Active Galactic Nuclei (AGN) with bent radio jets are used as tracers of dense environments, such as galaxy groups and clusters. The assumption behind using these jets is that they are bent under ram pressure from a dense, gaseous medium through which the host galaxy moves. However, there are many AGN in groups and clusters with jets that are not bent, which leads us to ask: why are some AGN jets affected so much by their environment while others are seemingly not? We present the results of an environmental study on a sample of 185 AGN with bent jets and 191 AGN with unbent jets in which we characterize their environments by searching for neighboring galaxies using a Friends-of-Friends algorithm. We find that AGN with bent jets are indeed more likely to reside in groups and clusters, while unbent AGN are more likely to exist in singles or pairs. When considering only AGN in groups of 3 or more galaxies, we find that bent AGN are more likely to exist in halos with more galaxies than unbent AGN. We also find that unbent AGN are more likely than bent AGN to be the brightest group galaxy. Additionally, groups hosting AGN with bent jets have a higher density of galaxies than groups hosting unbent AGN. Curiously, there is a population of AGN with bent jets that are in seemingly less dense regions of space, indicating they may be embedded in a cosmic web filament. Overall, our results indicate that bent doubles are more likely to exist in in larger, denser, and less relaxed environments than unbent doubles, potentially linking a galaxy's radio morphology to its environment. △ Less

Submitted 18 April, 2022; originally announced April 2022.

Comments: 18 pages, 15 figures, accepted for publication by AJ

arXiv:2111.13266 [pdf, other]

Examining Needs and Opportunities for Supporting Students Who Experience Discrimination

Authors: Yasaman S. Sefidgar, Paula S. Nurius, Amanda Baughan, Lisa A. Elkin, Anind K. Dey, Eve Riskin, Jennifer Mankoff, Margaret E. Morris

Abstract: Perceived discrimination is common and consequential. Yet, little support is available to ease handling of these experiences. Addressing this gap, we report on a need-finding study to guide us in identifying relevant technologies and their requirements. Specifically, we examined unfolding experiences of perceived discrimination among college students and found factors to address in providing meani… ▽ More Perceived discrimination is common and consequential. Yet, little support is available to ease handling of these experiences. Addressing this gap, we report on a need-finding study to guide us in identifying relevant technologies and their requirements. Specifically, we examined unfolding experiences of perceived discrimination among college students and found factors to address in providing meaningful support. We used semi-structured retrospective interviews with 14 students to understand their perceptions, emotions, and coping in response to discriminatory behaviors within the prior ten-week period. These 14 students were among 90 who provided experience sampling reports of unfair treatment over the same ten-week period. We found that discrimination is more distressing if students face related academic and social struggles or when the incident triggers beliefs of inefficacy. We additionally identified patterns of effective coping. By grounding the findings in an extended stress processing framework, we offer a principled approach to intervention design, which we illustrate through incident-specific and proactive intervention paradigms. △ Less

Submitted 25 November, 2021; originally announced November 2021.

ACM Class: J.4

arXiv:1706.04336 [pdf, other]

doi 10.2478/ijcss-2018-0002

Predictive modelling of training loads and injury in Australian football

Authors: David L. Carey, Kok-Leong Ong, Rod Whiteley, Kay M. Crossley, Justin Crow, Meg E. Morris

Abstract: To investigate whether training load monitoring data could be used to predict injuries in elite Australian football players, data were collected from elite athletes over 3 seasons at an Australian football club. Loads were quantified using GPS devices, accelerometers and player perceived exertion ratings. Absolute and relative training load metrics were calculated for each player each day (rolling… ▽ More To investigate whether training load monitoring data could be used to predict injuries in elite Australian football players, data were collected from elite athletes over 3 seasons at an Australian football club. Loads were quantified using GPS devices, accelerometers and player perceived exertion ratings. Absolute and relative training load metrics were calculated for each player each day (rolling average, exponentially weighted moving average, acute:chronic workload ratio, monotony and strain). Injury prediction models (regularised logistic regression, generalised estimating equations, random forests and support vector machines) were built for non-contact, non-contact time-loss and hamstring specific injuries using the first two seasons of data. Injury predictions were generated for the third season and evaluated using the area under the receiver operator characteristic (AUC). Predictive performance was only marginally better than chance for models of non-contact and non-contact time-loss injuries (AUC$<$0.65). The best performing model was a multivariate logistic regression for hamstring injuries (best AUC=0.76). Learning curves suggested logistic regression was underfitting the load-injury relationship and that using a more complex model or increasing the amount of model building data may lead to future improvements. Injury prediction models built using training load data from a single club showed poor ability to predict injuries when tested on previously unseen data, suggesting they are limited as a daily decision tool for practitioners. Focusing the modelling approach on specific injury types and increasing the amount of training data may lead to the development of improved predictive models for injury prevention. △ Less

Submitted 14 June, 2017; originally announced June 2017.

Comments: 15 pages, 5 figures

Showing 1–6 of 6 results for author: Morris, M E