MixRx

Authors: Risha Surana, Cameron Saidock, Hugo Chacon
Affiliation: University of Southern California

Abstract

MixRx uses Large Language Models (LLMs) to classify drug combination interactions as Additive, Synergistic, or Antagonistic, given a multi-drug patient history. We evaluate the performance of GPT-2 and Mistral-7B-Instruct, along with fine-tuned variants of each. In our experiments, the fine-tuned Mistral-7B-Instruct model achieves an average accuracy of ~81.5% and remains robust to “messy” clinical input formats. These results demonstrate the feasibility of applying LLMs to biological interaction prediction and suggest a pathway toward real-time clinical decision support, particularly in emergency medicine, where interaction complexity and time constraints are significant factors.

Overview

Most clinical drug interaction checkers evaluate only pairwise drug relationships and depend on exact drug name spelling. Real patients, however, often take 3–10 or more drugs simultaneously, and clinical notes frequently contain abbreviations, misspellings, and non-standard naming.

MixRx addresses this by:

  1. Constructing multi-drug combinations where all pairwise synergy metrics are known.
  2. Converting those combinations into structured LLM prompts with relevant synergy context.
  3. Using an LLM to infer a combination-level classification.
  4. Validating predictions through a rule-based synergy scoring algorithm.
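
As a rough sketch of step 4, the snippet below aggregates known pairwise synergy scores into a combination-level label. The drug names, scores, and the ±0.2 threshold are illustrative assumptions, not the exact rules used by MixRx:

import itertools
from statistics import mean

# Hypothetical pairwise synergy lookup: (drug_a, drug_b) -> score.
# Positive scores lean synergistic, negative lean antagonistic.
PAIRWISE_SYNERGY = {
    ("aspirin", "warfarin"): -0.42,
    ("aspirin", "metformin"): 0.05,
    ("warfarin", "metformin"): 0.31,
}

def lookup(a, b):
    # Scores are stored once per unordered pair.
    return PAIRWISE_SYNERGY.get((a, b), PAIRWISE_SYNERGY.get((b, a)))

def classify_combination(drugs, threshold=0.2):
    # Average all pairwise scores and bucket the result.
    scores = [lookup(a, b) for a, b in itertools.combinations(drugs, 2)]
    if any(s is None for s in scores):
        raise ValueError("combination has an unknown pairwise score")
    avg = mean(scores)
    if avg >= threshold:
        return "Synergistic"
    if avg <= -threshold:
        return "Antagonistic"
    return "Additive"

print(classify_combination(["aspirin", "warfarin", "metformin"]))  # Additive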

Repository Structure

mixrx/
│
├── Models/                         # Model weights, checkpoints, and fine-tuned outputs
│   └── mistral/                    # Fine-tuned Mistral-7B-Instruct model artifacts
│
├── evaluate/                       # Evaluation scripts and metric calculations
│
├── preprocessing/                  # Data preprocessing and prompt generation
│   └── messy/                      # Input perturbation logic (spelling swaps, truncations)
│
├── .gitignore
├── README.md
│
├── embeddings.ipynb                # Embedding and representation analysis
│
├── final.csv                       # Full processed synergy dataset
├── final_reduced.csv               # Cleaned dataset for prompting
├── final_reduced_messy.csv         # Perturbed dataset for robustness testing
│
├── generate_messy.py               # Script to create spelling/formatting perturbations
├── requirements.txt                # Python dependencies
└── synergy.ipynb                   # Model inference and evaluation notebook

Installation

Requires Python 3.9+.

pip install -r requirements.txt

If using a hosted LLM API (OpenAI, Anthropic, etc.):

export OPENAI_API_KEY=your_api_key
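
Project scripts can then pick the key up from the environment; a minimal sketch, assuming the standard os.environ pattern rather than a project-specific config loader:

import os

# Read the key exported above; raises KeyError if it is missing.
api_key = os.environ["OPENAI_API_KEY"]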

Workflow

1. Preprocess the Synergy Data

python preprocessing/generate_validation_data.py
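
This step corresponds to item 1 of the overview: keeping only multi-drug combinations whose pairwise synergy metrics are all known. The sketch below shows one way to do that filtering; the column names for final.csv are assumptions, not the script's actual schema:

import itertools
import pandas as pd

# Assumed layout: one row per drug pair with a numeric synergy score.
pairs = pd.read_csv("final.csv")  # assumed columns: drug_a, drug_b, synergy_score
known = {frozenset(p) for p in zip(pairs["drug_a"], pairs["drug_b"])}
drugs = sorted(set(pairs["drug_a"]) | set(pairs["drug_b"]))

def pairwise_complete(combo):
    # True only if every pair in the combination has a known score.
    return all(frozenset(p) in known for p in itertools.combinations(combo, 2))

# All 3-drug combinations with complete pairwise coverage.
triples = [c for c in itertools.combinations(drugs, 3) if pairwise_complete(c)]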

2. Create LLM Prompt Data

python preprocessing/generate_model_data.py

3. Generate Messy Input Variant (optional)

python generate_messy.py
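
generate_messy.py introduces the spelling and formatting noise used for robustness testing. The function below is a hypothetical sketch of that kind of perturbation (an adjacent-character swap or a truncation), not the script's actual logic:

import random

def perturb(name, rng):
    # Apply one random perturbation: adjacent-character swap or truncation.
    if len(name) > 3 and rng.random() < 0.5:
        i = rng.randrange(len(name) - 1)
        chars = list(name)
        chars[i], chars[i + 1] = chars[i + 1], chars[i]  # spelling swap
        return "".join(chars)
    return name[: max(3, len(name) - 2)]  # truncation

rng = random.Random(0)
print(perturb("warfarin", rng))  # e.g. "warfar"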

4. Run Evaluation / Model Inference

Open:

synergy.ipynb
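
If the notebook ends with a table of gold labels and model predictions, overall and per-class accuracy can be computed as below; predictions.csv and its column names are illustrative assumptions:

import pandas as pd

# Hypothetical output: one row per combination with gold and predicted labels.
df = pd.read_csv("predictions.csv")  # assumed columns: label, prediction

overall = (df["label"] == df["prediction"]).mean()

# Accuracy broken out by Additive / Synergistic / Antagonistic.
per_class = (
    df.assign(correct=df["label"] == df["prediction"])
      .groupby("label")["correct"]
      .mean()
)

print(f"overall accuracy: {overall:.3f}")
print(per_class)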

Citation

If using MixRx in academic work, please cite:

Surana, R., Saidock, C., & Chacon, H. (2024). MixRx: Predicting Drug Combination Interactions with LLMs. University of Southern California.

About

Code for the paper "MixRx: Predicting Drug Combination Interactions with LLMs".
