FHIR Serialization Strategies Impact LLM Medication Reconciliation Performance

A new study compares how different FHIR data serialization formats affect LLM performance in medication reconciliation tasks. The findings highlight significant differences in accuracy across serialization methods.

Researchers have conducted the first systematic comparison of how different FHIR data serialization strategies impact the performance of large language models (LLMs) in medication reconciliation tasks. The study, published on arXiv, evaluated four serialization methods: Raw JSON, Markdown Table, Clinical Narrative, and Chronological Timeline across five open-weight models, including Phi-3.5-mini, Mistral-7B, BioMistral-7B, and Llama-3.

The study found that the choice of serialization strategy significantly affects the accuracy and reliability of LLMs in medication reconciliation, a critical process in clinical handoffs. For instance, the Chronological Timeline format showed the highest accuracy, reducing errors by up to 20% compared to Raw JSON. This underscores the importance of optimizing data presentation for LLMs in healthcare applications.

The findings suggest that future research should focus on developing standardized serialization methods tailored for LLMs in clinical settings. Healthcare providers and developers should consider these insights when implementing LLMs for medication reconciliation to ensure patient safety and accuracy. The study also opens questions about how other data formats and models might perform in similar tasks.