FHIR Implementation Guides (IGs) are comprehensive specifications that explain how to use FHIR for specific healthcare needs — from national patient profiles to specialized clinical workflows. They define which resources to use, what codes to include, and how systems should behave so that data can move smoothly between organizations.
Today, developing an IG is still a highly technical task. It requires creating CodeSystems
, ValueSets
, StructureDefinitions
, and Extensions
, then publishing them as a cohesive, validated specification. The process is complex and often slows down projects.
This article looks at a new approach: using AI agents to generate and test FHIR Implementation Guides directly from plain language. The goal is to make guide development faster, easier, and more accessible — not only for technical specialists, but also for clinicians, researchers, and policy makers who understand the healthcare context but don’t want to write JSON code.
Anyone who's worked with FHIR knows the struggle. Creating canonical resources — the building blocks of healthcare interoperability — is complex. JSON files are verbose, the constraints are intricate, and the learning curve is steep enough to make experienced developers question their career choices.
Because of this complexity, the community created tools to simplify the process. One of them is FHIR Shorthand (FSH) a more compact and readable language compared to raw JSON. Still it requires developers to learn yet another syntax and understand the nuances of FHIR's underlying architecture.
The Traditional IG Publishing Pipeline
Even with FSH, building an IG is still a complex and error-prone process. The typical workflow looks like this:
This multi-step pipeline requires deep knowledge of multiple tools: FSH syntax, SUSHI compilation, IG Publisher configuration, and FHIR validation rules. The result is a process that creates barriers that exclude healthcare domain experts from direct participation.
So, what alternatives are there to this conventional, tool-heavy approach? Recent advances in AI may offer a way to rethink how IGs are developed and tested.
What if we could eliminate this translation layer? Imagine healthcare experts describing their requirements in plain English and having AI generate production-ready FHIR resources automatically.
Recent advances in Large Language Models (LLMs) make it possible. Modern AI can:
The key insight is that AI can bridge the gap between human intent and technical implementation — exactly what we need in FHIR Implementation Guide development. This means reducing the technical overhead and letting both developers and healthcare experts contribute directly.
Generating FHIR resources is only half the battle. The real magic happens when AI agents can immediately test their creations against a live FHIR server, creating a rapid feedback loop that enables continuous refinement.
Aidbox FHIR server makes this possible by providing:
This creates a powerful development cycle:
The result is sub-minute feedback, faster development, while ensuring clinical accuracy and technical compliance.
To see how this works in practice, let’s look at an example project.
Initial AI Prompt:
Create an MD file for a CodeSystem that includes biological sex codes: 1 for Male, 2 for Female, 3 for Intersex, 93 for 'Not reported' when Civil Registry updates occur, and 99 for Unknown cases.
The AI generates a structured markdown file like src/CS/CSSexoBiologico.MD, and you iterate on it until it looks right. Once you're satisfied with the markdown specification, you simply ask the AI to "create a FHIR resource for the CSSexoBiologico code system" and it converts it to FHIR JSON resource in the target/ folder: target/CSSexoBiologico.json.
Here’s where validation comes in. The AI can test the generated resources right away using Claude Code's commands and .http files.
Using Claude Code, you can simply run:
/test-cs CSSexoBiologico
This triggers the custom Claude command defined in .claude/commands/test-cs.md, which:
src/CS/CSSexoBiologico.MD
target/CSSexoBiologico.json
docker-compose up -d
)Generated Test File (test/CS/test-CSSexoBiologico.http):
@fhirServer = http://localhost:8080/fhir
@auth = Basic basic:secret
### Create CSSexoBiologico CodeSystem
PUT {{fhirServer}}/CodeSystem/CSSexoBiologico
Authorization: {{auth}}
Content-Type: application/json
< ../../target/CSSexoBiologico.json
### Test $lookup operation for code "1" (Male)
GET {{fhirServer}}/CodeSystem/$lookup?system=https://interoperability.testcompany.cl/CodeSystem/CSSexoBiologico&code=1
Authorization: {{auth}}
### Test $lookup operation for code "2" (Female)
GET {{fhirServer}}/CodeSystem/$lookup?system=https://interoperability.testcompany.cl/CodeSystem/CSSexoBiologico&code=2
Authorization: {{auth}}
... Other tests ...
### Test $lookup operation for invalid code "999"
GET {{fhirServer}}/CodeSystem/$lookup?system=https://interoperability.testcompany.cl/CodeSystem/CSSexoBiologico&code=999
Authorization: {{auth}}
### Verify CodeSystem was created
GET {{fhirServer}}/CodeSystem/CSSexoBiologico
Authorization: {{auth}}
### Delete CSSexoBiologico CodeSystem
DELETE {{fhirServer}}/CodeSystem/CSSexoBiologico
Authorization: {{auth}}
This setup allows developers and domain experts to quickly validate resources, see how they behave on a live server, and refine them without leaving the workflow.
The same AI-powered approach can be applied to all FHIR resource types needed for a complete Implementation Guide:
Each resource type follows the same pattern: describe requirements in natural language → AI generates markdown → convert to FHIR JSON → instant validation in Aidbox.
This makes it possible to scale from a single CodeSystem
to a complete, production-ready Implementation Guide.
Once all resources are validated, the final step is publishing the complete guide. This usually involves:
Together, these steps produce a full set of documentation and resources, ready for use and distribution.
At this point, the entire cycle comes together:
AI-supported IG development is still an emerging practice. While it can already reduce repetitive work and shorten feedback loops, there are areas that require careful attention:
Looking ahead, we see a future where healthcare experts and developers work side by side with AI agents: experts describe the intent, AI produces the first draft, and both refine the result together. This would make Implementation Guide development more inclusive, faster, and closer to real-world needs.
All the source code is available in the agentic-coding-ig-development repository. You can explore the workflows, run the commands, and test how AI-driven IG development works step by step.
We believe this approach is only the beginning. By combining domain expertise, modern AI, and tools like Aidbox, Implementation Guides can be developed faster, with less overhead, and with broader participation from the healthcare community.
Try it yourself with Aidbox: spin up a FHIR server, run the validation cycle, and see how AI can generate and test resources in minutes. Get started today and share your experience — your feedback will help shape the future of AI-powered FHIR development.
Get in touch with us today!