ARTFEED — Contemporary Art Intelligence

M3 System Enables Natural Language Queries of MIMIC-IV Clinical Database

other · 2026-05-22

A team of researchers has introduced M3, a system designed for natural language queries of the Medical Information Mart for Intensive Care (MIMIC-IV), which is among the largest open-source electronic health record databases globally. Historically, accessing MIMIC-IV necessitated expertise in SQL and clinical knowledge. M3 utilizes the Model Context Protocol to facilitate queries in plain English. It can retrieve MIMIC-IV from PhysioNet with a single command, either by launching a local SQLite instance or connecting to hosted BigQuery. In evaluations against the EHRSQL 2024 benchmark, Claude Sonnet 4 achieved an accuracy of 94%, while the open-weights gpt-oss-20B reached 93% on a set of one hundred answerable questions. The system's goal is to enhance secure access to clinical data for researchers.

Key facts

  • M3 enables natural language querying of MIMIC-IV data.
  • MIMIC-IV is one of the world's largest open-source electronic health record databases.
  • M3 uses the Model Context Protocol.
  • M3 retrieves data from PhysioNet.
  • M3 can launch a local SQLite instance or connect to hosted BigQuery.
  • Evaluated on EHRSQL 2024 benchmark.
  • Claude Sonnet 4 achieved 94% accuracy.
  • gpt-oss-20B achieved 93% accuracy.

Entities

Institutions

  • PhysioNet
  • MIMIC-IV
  • EHRSQL

Sources