-
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models
Authors:
Zachary Englhardt,
Chengqian Ma,
Margaret E. Morris,
Xuhai "Orson" Xu,
Chun-Cheng Chang,
Lianhui Qin,
Daniel McDuff,
Xin Liu,
Shwetak Patel,
Vikram Iyer
Abstract:
Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, developing analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental hea…
▽ More
Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, developing analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental health. To address these challenges, we take a novel approach that leverages large language models (LLMs) to synthesize clinically useful insights from multi-sensor data. We develop chain of thought prompting methods that use LLMs to generate reasoning about how trends in data such as step count and sleep relate to conditions like depression and anxiety. We first demonstrate binary depression classification with LLMs achieving accuracies of 61.1% which exceed the state of the art. While it is not robust for clinical use, this leads us to our key finding: even more impactful and valued than classification is a new human-AI collaboration approach in which clinician experts interactively query these tools and combine their domain expertise and context about the patient with AI generated reasoning to support clinical decision-making. We find models like GPT-4 correctly reference numerical data 75% of the time, and clinician participants express strong interest in using this approach to interpret self-tracking data.
△ Less
Submitted 25 November, 2023; v1 submitted 21 November, 2023;
originally announced November 2023.
-
A Quick Look at the 3GHz Radio Sky. II. Hunting for DRAGNs in the VLA Sky Survey
Authors:
Yjan A. Gordon,
Lawrence Rudnick,
Heinz Andernach,
Leah K. Morabito,
Christopher P. O'Dea,
Kaylan-Marie Achong,
Stefi A. Baum,
Caryelis Bayona-Figueroa,
Eric J. Hooper,
Beatriz Mingo,
Melissa E. Morris,
Adrian N. Vantyghem
Abstract:
Active Galactic Nuclei (AGN) can often be identified in radio images as two lobes, sometimes connected to a core by a radio jet. This multi-component morphology unfortunately creates difficulties for source-finders, leading to components that are a) separate parts of a wider whole, and b) offset from the multiwavelength cross identification of the host galaxy. In this work we define an algorithm,…
▽ More
Active Galactic Nuclei (AGN) can often be identified in radio images as two lobes, sometimes connected to a core by a radio jet. This multi-component morphology unfortunately creates difficulties for source-finders, leading to components that are a) separate parts of a wider whole, and b) offset from the multiwavelength cross identification of the host galaxy. In this work we define an algorithm, \textsc{DRAGNhunter}, for identifying Double Radio Sources associated with Active Galactic Nuclei (DRAGNs) from component catalog data in the first epoch \textit{Quick Look} images of the high resolution ($\approx 3''$ beam size) Very Large Array Sky Survey (VLASS). We use \textsc{DRAGNhunter} to construct a catalog of $>17,000$ DRAGNs in VLASS for which contamination from spurious sources is estimated at $\approx 11\,\%$. A `high-fidelity' sample consisting of $90\,\%$ of our catalog is identified for which contamination is $<3\,\%$. Host galaxies are found for $\approx 13,000$ DRAGNs as well as for an additional $234,000$ single-component radio sources. Using these data we explore the properties of our DRAGNs, finding them to be typically consistent with Fanaroff-Riley class II sources and allowing us to report the discovery of $31$ new giant radio galaxies identified using VLASS.
△ Less
Submitted 22 May, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization
Authors:
Xuhai Xu,
Han Zhang,
Yasaman Sefidgar,
Yiyi Ren,
Xin Liu,
Woosuk Seo,
Jennifer Brown,
Kevin Kuehn,
Mike Merrill,
Paula Nurius,
Shwetak Patel,
Tim Althoff,
Margaret E. Morris,
Eve Riskin,
Jennifer Mankoff,
Anind K. Dey
Abstract:
Recent research has demonstrated the capability of behavior signals captured by smartphones and wearables for longitudinal behavior modeling. However, there is a lack of a comprehensive public dataset that serves as an open testbed for fair comparison among algorithms. Moreover, prior studies mainly evaluate algorithms using data from a single population within a short period, without measuring th…
▽ More
Recent research has demonstrated the capability of behavior signals captured by smartphones and wearables for longitudinal behavior modeling. However, there is a lack of a comprehensive public dataset that serves as an open testbed for fair comparison among algorithms. Moreover, prior studies mainly evaluate algorithms using data from a single population within a short period, without measuring the cross-dataset generalizability of these algorithms. We present the first multi-year passive sensing datasets, containing over 700 user-years and 497 unique users' data collected from mobile and wearable sensors, together with a wide range of well-being metrics. Our datasets can support multiple cross-dataset evaluations of behavior modeling algorithms' generalizability across different users and years. As a starting point, we provide the benchmark results of 18 algorithms on the task of depression detection. Our results indicate that both prior depression detection algorithms and domain generalization techniques show potential but need further research to achieve adequate cross-dataset generalizability. We envision our multi-year datasets can support the ML community in developing generalizable longitudinal behavior modeling algorithms.
△ Less
Submitted 4 March, 2023; v1 submitted 4 November, 2022;
originally announced November 2022.
-
How does environment affect the morphology of radio AGN?
Authors:
Melissa Elizabeth Morris,
Eric Wilcots,
Eric Hooper,
Sebastian Heinz
Abstract:
Galaxies hosting Active Galactic Nuclei (AGN) with bent radio jets are used as tracers of dense environments, such as galaxy groups and clusters. The assumption behind using these jets is that they are bent under ram pressure from a dense, gaseous medium through which the host galaxy moves. However, there are many AGN in groups and clusters with jets that are not bent, which leads us to ask: why a…
▽ More
Galaxies hosting Active Galactic Nuclei (AGN) with bent radio jets are used as tracers of dense environments, such as galaxy groups and clusters. The assumption behind using these jets is that they are bent under ram pressure from a dense, gaseous medium through which the host galaxy moves. However, there are many AGN in groups and clusters with jets that are not bent, which leads us to ask: why are some AGN jets affected so much by their environment while others are seemingly not? We present the results of an environmental study on a sample of 185 AGN with bent jets and 191 AGN with unbent jets in which we characterize their environments by searching for neighboring galaxies using a Friends-of-Friends algorithm. We find that AGN with bent jets are indeed more likely to reside in groups and clusters, while unbent AGN are more likely to exist in singles or pairs. When considering only AGN in groups of 3 or more galaxies, we find that bent AGN are more likely to exist in halos with more galaxies than unbent AGN. We also find that unbent AGN are more likely than bent AGN to be the brightest group galaxy. Additionally, groups hosting AGN with bent jets have a higher density of galaxies than groups hosting unbent AGN. Curiously, there is a population of AGN with bent jets that are in seemingly less dense regions of space, indicating they may be embedded in a cosmic web filament. Overall, our results indicate that bent doubles are more likely to exist in in larger, denser, and less relaxed environments than unbent doubles, potentially linking a galaxy's radio morphology to its environment.
△ Less
Submitted 18 April, 2022;
originally announced April 2022.
-
Examining Needs and Opportunities for Supporting Students Who Experience Discrimination
Authors:
Yasaman S. Sefidgar,
Paula S. Nurius,
Amanda Baughan,
Lisa A. Elkin,
Anind K. Dey,
Eve Riskin,
Jennifer Mankoff,
Margaret E. Morris
Abstract:
Perceived discrimination is common and consequential. Yet, little support is available to ease handling of these experiences. Addressing this gap, we report on a need-finding study to guide us in identifying relevant technologies and their requirements. Specifically, we examined unfolding experiences of perceived discrimination among college students and found factors to address in providing meani…
▽ More
Perceived discrimination is common and consequential. Yet, little support is available to ease handling of these experiences. Addressing this gap, we report on a need-finding study to guide us in identifying relevant technologies and their requirements. Specifically, we examined unfolding experiences of perceived discrimination among college students and found factors to address in providing meaningful support. We used semi-structured retrospective interviews with 14 students to understand their perceptions, emotions, and coping in response to discriminatory behaviors within the prior ten-week period. These 14 students were among 90 who provided experience sampling reports of unfair treatment over the same ten-week period. We found that discrimination is more distressing if students face related academic and social struggles or when the incident triggers beliefs of inefficacy. We additionally identified patterns of effective coping. By grounding the findings in an extended stress processing framework, we offer a principled approach to intervention design, which we illustrate through incident-specific and proactive intervention paradigms.
△ Less
Submitted 25 November, 2021;
originally announced November 2021.
-
Predictive modelling of training loads and injury in Australian football
Authors:
David L. Carey,
Kok-Leong Ong,
Rod Whiteley,
Kay M. Crossley,
Justin Crow,
Meg E. Morris
Abstract:
To investigate whether training load monitoring data could be used to predict injuries in elite Australian football players, data were collected from elite athletes over 3 seasons at an Australian football club. Loads were quantified using GPS devices, accelerometers and player perceived exertion ratings. Absolute and relative training load metrics were calculated for each player each day (rolling…
▽ More
To investigate whether training load monitoring data could be used to predict injuries in elite Australian football players, data were collected from elite athletes over 3 seasons at an Australian football club. Loads were quantified using GPS devices, accelerometers and player perceived exertion ratings. Absolute and relative training load metrics were calculated for each player each day (rolling average, exponentially weighted moving average, acute:chronic workload ratio, monotony and strain). Injury prediction models (regularised logistic regression, generalised estimating equations, random forests and support vector machines) were built for non-contact, non-contact time-loss and hamstring specific injuries using the first two seasons of data. Injury predictions were generated for the third season and evaluated using the area under the receiver operator characteristic (AUC). Predictive performance was only marginally better than chance for models of non-contact and non-contact time-loss injuries (AUC$<$0.65). The best performing model was a multivariate logistic regression for hamstring injuries (best AUC=0.76). Learning curves suggested logistic regression was underfitting the load-injury relationship and that using a more complex model or increasing the amount of model building data may lead to future improvements. Injury prediction models built using training load data from a single club showed poor ability to predict injuries when tested on previously unseen data, suggesting they are limited as a daily decision tool for practitioners. Focusing the modelling approach on specific injury types and increasing the amount of training data may lead to the development of improved predictive models for injury prevention.
△ Less
Submitted 14 June, 2017;
originally announced June 2017.