Patient Engagement

Med Datasource Comparison

Patient Engagement 1.1
Objective
Validate LLM-generated outputs against trusted sources of medical truth ensuring that any AI-enhanced decision-making in healthcare rests on reliable foundations.
Real-World Impact
By providing a transparent benchmarking system for AI-generated medication content, MedCompare promotes responsible AI adoption in healthcare. It empowers clinicians to verify outputs against established, trusted sources—improving safety, building trust, and ultimately driving better patient outcomes.

Benchmarking AI models for trusted medication data in healthcare​


MedCompare is an AI-driven benchmarking system designed to evaluate the accuracy and completeness of medication information provided by various large language models (LLMs), including Gemini, DeepSeek, LLaMA 4. A fourth LLM, ChatGPT 4o, was used to independently evaluate and score medication data found in the other three. What sets this project apart is its emphasis on validating LLM-generated outputs against trusted sources of medical truth ensuring that any AI-enhanced decision-making in healthcare rests on reliable foundations.

Key Features
  • Semantic Similarity Scoring: Utilizes advanced algorithms to assess the alignment between LLM outputs and official drug information.
  • Fuzzy Matching: Detects minor discrepancies in drug names and codes to ensure data integrity.
  • Batch Processing: Evaluates multiple medications simultaneously for greater scalability.
  • User-Friendly Interface: Intuitive design for clinicians and researchers to explore results easily.
Get in touch

Want In?

Members receive access to the full Leap of Faith ecosystem: AI tools, implementation support, and specialized engineering resources. Let's talk about how we can accelerate your AI strategy.