Bridging Health Literacy Gaps in Spine Care: Using ChatGPT-4o to Improve Patient-Education Materials
J Bone Joint Surg Am. 2025 Jun 19. doi: 10.2106/JBJS.24.01484. Online ahead of print.
ABSTRACT
BACKGROUND: Patient-education materials (PEMs) are essential to improve health literacy, engagement, and treatment adherence, yet many exceed the recommended readability levels. Therefore, individuals with limited health literacy are at a disadvantage. This study evaluated the readability of spine-related PEMs from the American Academy of Orthopaedic Surgeons (AAOS), the North American Spine Society (NASS), and the American Association of Neurological Surgeons (AANS), and examined the potential of artificial intelligence (AI) in optimizing PEMs for improved patient comprehension.
METHODS: A total of 146 spine-related PEMs from the AAOS, NASS, and AANS websites were analyzed. Readability was assessed using the Flesch-Kincaid Grade Level (FKGL) and Simple Measure of Gobbledygook (SMOG) Index scores, as well as other metrics, including language complexity and use of the passive voice. ChatGPT-4o was used to revise the PEMs to a sixth-grade reading level, and post-revision readability was assessed. Test-retest reliability was evaluated, and paired t tests were used to compare the readability scores of the original and AI-modified PEMs.
RESULTS: The original PEMs had a mean FKGL of 10.2 ± 2.6, which significantly exceeded both the recommended sixth-grade reading level and the average U.S. eighth-grade reading level (p < 0.05). ChatGPT-4o generated articles with a significantly reduced mean FKGL of 6.6 ± 1.3 (p < 0.05). ChatGPT-4o also improved other readability metrics, including the SMOG Index score, language complexity, and use of the passive voice, while maintaining accuracy and adequate detail. Excellent test-retest reliability was observed across all of the metrics (intraclass correlation coefficient [ICC] range, 0.91 to 0.98).
CONCLUSIONS: Spine-related PEMs from the AAOS, the NASS, and the AANS remain excessively complex, despite minor improvements to readability over the years. ChatGPT-4o demonstrated the potential to enhance PEM readability while maintaining content quality. Future efforts should integrate AI tools with visual aids and user-friendly platforms to create inclusive and comprehensible PEMs to address diverse patient needs and improve health-care delivery.
PMID:40536932 | DOI:10.2106/JBJS.24.01484