Author ORCID Identifier
https://orcid.org/0000-0002-7919-7138
Date of Award
2026
Document Type
Thesis (Master's)
Department or Program
Computer Science
First Advisor
Andrew Thomas Campbell
Abstract
Multimodal health sensing offers rich behavioral signals for assessing mental health, yet translating these numerical time-series measurements into natural language remains challenging. Current LLMs cannot natively ingest long-duration sensor streams, and paired sensor–text datasets are scarce. To address these challenges, we introduce LENS, a framework that aligns multimodal sensing data with language models to generate clinically grounded mental-health narratives. LENS first constructs a large-scale dataset by transforming Ecological Momentary Assessment (EMA) responses related to depression and anxiety symptoms into natural-language descriptions, yielding over 100,000 sensor–text QA pairs from 258 participants. To enable native time-series integration, we train a patch-level encoder that projects raw sensor signals directly into an LLM’s representation space. Our results show that LENS outperforms strong baselines on standard NLP metrics and task-specific measures of symptom-severity accuracy. A user study with 13 mental-health professionals further indicates that LENS-produced narratives are comprehensive and clinically meaningful. Ultimately, our approach advances LLMs as interfaces for health sensing, providing a scalable path toward models that can reason over raw behavioral signals and support downstream clinical decision-making.
Original Citation
@misc{xu2026lensllmenablednarrativesynthesis,
  title={LENS: LLM-Enabled Narrative Synthesis for Mental Health by Aligning Multimodal Sensing with Language Models},
  author={Wenxuan Xu and Arvind Pillai and Subigya Nepal and Amanda C Collins and Daniel M Mackin and Michael V Heinz and Tess Z Griffin and Nicholas C Jacobson and Andrew Campbell},
  year={2026},
  eprint={2512.23025},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2512.23025},
}
Recommended Citation
Xu, Wenxuan, "LENS: LLM-ENABLED NARRATIVE SYNTHESIS FOR MENTAL HEALTH BY ALIGNING MULTIMODAL SENSING WITH LANGUAGE MODELS" (2026). Dartmouth College Master’s Theses. 280.
https://digitalcommons.dartmouth.edu/masters_theses/280
