Transforming Wearable Data into Personal Health Insights using Large Language Model Agents (PHIA)

🔥 Please remember to ⭐ this repo if you find it useful and cite our work if you end up using it in your work! 🔥

🔥 If you have any questions or concerns, please create an issue 📝! 🔥

The official repository for the paper "Transforming Wearable Data into Personal Health Insights using Large Language Model Agents" and its corresponding Personal Health Insights Agent (PHIA).

🔧 Setup

Python 3.11 and higher, and conda, are required. Run bash setup.sh to fully setup the phia conda environment. The entire setup process should be automatic from end-to-end, though has been tested on a limited number of machines. Please report any issues as you encounter them.

Once setup is complete, you can activate the environment using conda activate phia in your terminal for subsequent usage via the terminal. Most typical usage will involve invoking the conda environment, either via terminal or VSCode, as a kernel to utilize for various notebooks in the repo. If you open a notebook in VSCode, you should be able to select the phia environment as the kernel in the top-right corner.

💻 Usage

Notable parts of our repo are as follows:

figs contains all code necessary to reproduce figures from the paper.
data contains model outputs and human annotations.
Objective Query - PHIA.xlsx contains 4000 objective queries. For example: "What was the distance of my longest run in the past 21 days?"
Open-Ended Query - PHIA.xlsx contains 172 open-ended queries. For example: "How do I reduce stress?"
real_wearable_users contains a set of deidentified real wearable users. All subjects are used in evaluation. The deidentification process includes generation of a random user ID, conversion of dates into a day of the week and ordinal date based on chronological order, conversion of times into HH:MM format without the date, and conversion of ages into age buckets (e.g., [30-34]). Note that the columns and format of this data may differ from what the agent may expect - you may have to modify data_utils.py and prompt_templates.py accordingly.
synthetic_wearable_users contains a set of synthetic wearable users. Subject 465, 333, 171, and 41 are used in evaluation.
few_shots contains all of our few-shot examples that are utilized by PHIA.
phia_agent.py contains the core agent logic for PHIA.
prompt_templates.py contains key prompt templates (e.g., agent preamble) utilized by PHIA.
phia_demo.ipynb contains code to try out PHIA. API keys must be provided as noted in the notebook.

Beyond referencing various artifacts, the primary runnable notebooks of interest in this repo are in the figs folder (for reproducing figures using source data) and in phia_demo.ipynb (for trying out PHIA). When trying out PHIA, take note of particular notebook cells and their purpose, especially what data (e.g., synthetic user summary dataframe, exercise dataframe) is being loaded and whether or not you want to change what data is being loaded.

Note: you can obtain a Google / Gemini API key from here with certain rate limits. Similarly, tavily offers a free usage tier and corresponding API key for researchers here.

📜 Citation

If you find our paper or this code release useful for your research, please cite our work.

@article{merrill2024transforming,
  title={Transforming wearable data into health insights using large language model agents},
  author={Merrill, Mike A and Paruchuri, Akshay and Rezaei, Naghmeh and Kovacs, Geza and Perez, Javier and Liu, Yun and Schenck, Erik and Hammerquist, Nova and Sunshine, Jake and Tailor, Shyam and others},
  journal={arXiv preprint arXiv:2406.06464},
  year={2024}
}

License

Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transforming Wearable Data into Personal Health Insights using Large Language Model Agents (PHIA)

🔧 Setup

💻 Usage

📜 Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
few_shots		few_shots
figs		figs
real_wearable_users		real_wearable_users
synthetic_wearable_users		synthetic_wearable_users
.gitignore		.gitignore
LICENSE		LICENSE
Objective Query - PHIA.xlsx		Objective Query - PHIA.xlsx
Open-Ended Query - PHIA.xlsx		Open-Ended Query - PHIA.xlsx
README.md		README.md
colab_utils.py		colab_utils.py
data_utils.py		data_utils.py
phia_agent.py		phia_agent.py
phia_demo.ipynb		phia_demo.ipynb
prompt_templates.py		prompt_templates.py
requirements.txt		requirements.txt
setup.sh		setup.sh
teaser.gif		teaser.gif

Folders and files

Latest commit

History

Repository files navigation

Transforming Wearable Data into Personal Health Insights using Large Language Model Agents (PHIA)

🔧 Setup

💻 Usage

📜 Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages