Semantic Understanding And Contextual Explanation of Medical Documents
DOI:
https://doi.org/10.33425/3066-1226.1080Keywords:
Medical Document Processing, Optical Character Recognition (ocr), Natural Language Processing (NLP), Paddle OCR, GPT API, Health Informatics Patient Centered Healthcare, Diagnostic Report SummarizationAbstract
This study introduces an AI-driven system designed to efficiently extract and summarize medical documents using Optical Character Recognition (OCR) and Natural Language Processing (NLP). By leveraging Tesseract OCR, the system accurately retrieves textual data from diagnostic reports. The extracted content is then processed through a GPT-based API to generate brief, patient friendly summaries in 2-3 lines. By simplifying complex medical terms, the system enhances patient understanding, supports informed decision-making, and improves communication between healthcare professionals and patients. This method seeks to bridge the gap between technical medical reports and patient awareness, ultimately contributing to better healthcare outcomes. The implementation is conducted on Google Colab, ensuring cloud-based execution for improved scalability and accessibility.