[NeurIPS 2024] Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models

This repo holds the code for Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models.

Introduction

The discovery of dynamical systems is crucial across a range of fields, including pharmacology, epidemiology, and physical sciences. Accurate and interpretable modeling of these systems is essential for understanding complex temporal processes, optimizing interventions, and minimizing adverse effects. In pharmacology, for example, precise modeling of drug dynamics is vital to maximize therapeutic efficacy while minimizing patient harm, as in chemotherapy. However, current models, often developed by human experts, are limited by high cost, lack of scalability, and restriction to existing human knowledge. In this paper, we present the Data-Driven Discovery (D3) framework, a novel approach leveraging Large Language Models (LLMs) to iteratively discover and refine interpretable models of dynamical systems, demonstrated here with pharmacological applications. Unlike traditional methods, D3 enables the LLM to propose, acquire, and integrate new features, validate, and compare dynamical systems models, uncovering new insights into pharmacokinetics. Experiments on a pharmacokinetic Warfarin dataset reveal that D3 identifies a new plausible model that is well-fitting, highlighting its potential for precision dosing in clinical applications.

Setup

To get started:

Clone this repo

git clone https://github.com/samholt/DataDrivenDiscovery && cd ./DataDrivenDiscovery

Follow the installation instructions in setup/install.sh to install the required packages.

./setup/install.sh

Replicating the main results

In the main terminal, perform the following steps:

Modify the configuration files in folder config. The main config file that specifies baselines, datasets and other run parameters is in config/config.yaml
Run python run.py to run all baselines on all datasets. This will generate a log file in the logs folder.
Once a run has completed, process the log file generated output into the logs folder, with the script process_result_file.py. Note, you will need to edit the process_result_file.py to read this generated log file, i.e., specify the path variable of where it is. This will generate the main tables as presented in the paper.

Cite

If you use our work in your research, please cite:

@inproceedings{
    holt2024datadriven,
    title={Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models},
    author={Samuel Holt and Zhaozhi Qian and Tennison Liu and Jim Weatherall and Mihaela van der Schaar},
    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
    year={2024},
    url={https://openreview.net/forum?id=KIrZmlTA92}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
figures		figures
libs		libs
setup		setup
utils		utils
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
agents.py		agents.py
envs.py		envs.py
executor.py		executor.py
llm_utils.py		llm_utils.py
long_vars.py		long_vars.py
process_result_file.py		process_result_file.py
pylintrc		pylintrc
pytype.cfg		pytype.cfg
rate_limiter.py		rate_limiter.py
requirements.txt		requirements.txt
ruff.toml		ruff.toml
run.py		run.py
simulate.py		simulate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[NeurIPS 2024] Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models

Introduction

Setup

Replicating the main results

Cite

About

Releases

Packages

Languages

License

samholt/DataDrivenDiscovery

Folders and files

Latest commit

History

Repository files navigation

[NeurIPS 2024] Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models

Introduction

Setup

Replicating the main results

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages