GLM-OCR实战案例：医院检验报告OCR→结构化字段+异常值自动标红提醒

你踩到我法袍了

289人浏览 · 2026-02-13 00:16:17

你踩到我法袍了 · 2026-02-13 00:16:17 发布

GLM-OCR实战案例：医院检验报告OCR→结构化字段+异常值自动标红提醒

1. 引言：当AI遇到检验报告

想象一下，你是一家医院的检验科医生，每天要处理上百份检验报告。这些报告格式五花八门，有的来自不同厂商的设备，有的是手写填写的表格。你需要从这些报告中提取关键指标——白细胞计数、血红蛋白、血糖值等，然后逐一核对是否在正常范围内，对异常值进行标记。

这个过程有多耗时？一份报告至少需要3-5分钟，一天下来就是好几个小时。更麻烦的是，人工核对难免会有疏漏，万一漏掉一个关键异常值，后果可能很严重。

现在，有了GLM-OCR，这一切都可以自动化。今天我就带你看看，如何用这个强大的多模态OCR模型，把医院检验报告从图片变成结构化数据，还能自动标出异常值，让医生一眼就能看到需要关注的地方。

2. GLM-OCR：不只是识别文字

2.1 它到底是什么？

GLM-OCR不是一个简单的文字识别工具。你可以把它理解为一个"能看懂文档的AI助手"。它基于GLM-V编码器-解码器架构，专门为理解复杂文档而设计。

传统的OCR只能告诉你"图片上有什么字"，但GLM-OCR能理解"这些字是什么意思，它们之间有什么关系"。比如，它不仅能识别出"白细胞计数：8.5×10^9/L"这行文字，还能理解：

"白细胞计数"是一个检验项目名称
"8.5"是具体的数值
"×10^9/L"是单位
这个数值需要和正常范围对比

2.2 为什么选择它来做检验报告？

我选择GLM-OCR来处理检验报告，主要是看中了它的几个核心能力：

多模态理解能力 检验报告不只是文字，还有表格、单位符号、上下标（比如10^9）、参考范围标注等。GLM-OCR的CogViT视觉编码器能同时处理文字和视觉信息，准确识别这些复杂元素。

表格识别专长 大多数检验报告都是表格形式。GLM-OCR专门优化了表格识别，能准确理解表格结构，把数据按行列提取出来，而不是把整个表格当成一段文字。

上下文理解 它能理解"参考范围"和"检验结果"之间的关系。当看到"正常范围：3.5-9.5"和"结果：12.8"时，它能判断出这个结果偏高。

稳定可靠 引入了多令牌预测损失函数和稳定的全任务强化学习机制，训练更高效，识别更准确，泛化能力更强。简单说就是，不容易出错，对各种格式的报告都能处理。

3. 实战准备：快速搭建环境

3.1 一键启动服务

如果你已经在CSDN星图镜像广场找到了GLM-OCR的镜像，部署就特别简单。启动服务只需要两行命令：

# 进入项目目录
cd /root/GLM-OCR

# 启动服务
./start_vllm.sh

第一次启动需要加载模型，大概等1-2分钟。看到服务运行起来后，在浏览器打开 http://你的服务器IP:7860，就能看到GLM-OCR的Web界面了。

3.2 环境检查

确保你的环境配置正确：

Python版本：3.10.19
PyTorch版本：2.9.1
模型路径：/root/ai-models/ZhipuAI/GLM-OCR/

如果缺少依赖，可以这样安装：

/opt/miniconda3/envs/py310/bin/pip install \
    git+https://github.com/huggingface/transformers.git \
    gradio

3.3 准备测试数据

找几份检验报告的图片作为测试数据。可以是：

手机拍摄的纸质报告照片
扫描的PDF转成的图片
医院系统导出的报告截图

保存为PNG或JPG格式，放在一个方便的目录里，比如 /root/test_reports/。

4. 基础功能演示：从图片到文字

4.1 文本识别初体验

我们先从最简单的开始——让GLM-OCR识别检验报告上的文字。

在Web界面中：

点击"上传图片"，选择一份检验报告
在Prompt输入框里写：Text Recognition:
点击"开始识别"

几秒钟后，你就会看到识别结果。比如，一份血常规报告可能被识别成这样的文字：

患者姓名：张三
性别：男
年龄：45岁
送检科室：内科
检验项目        结果        单位        参考范围
白细胞计数      8.5        10^9/L      3.5-9.5
血红蛋白       135         g/L        130-175
血小板计数     210         10^9/L     125-350

注意观察：GLM-OCR不仅识别了文字，还基本保持了表格的格式。项目名称、结果、单位、参考范围被分成了不同的列。

4.2 表格识别模式

对于格式规整的检验报告，使用表格识别模式效果更好：

上传同一份报告图片
Prompt写：Table Recognition:
点击识别

这次的结果会更加结构化：

{
  "table_data": [
    ["检验项目", "结果", "单位", "参考范围"],
    ["白细胞计数", "8.5", "10^9/L", "3.5-9.5"],
    ["血红蛋白", "135", "g/L", "130-175"],
    ["血小板计数", "210", "10^9/L", "125-350"]
  ]
}

看到了吗？表格识别模式直接把数据转换成了结构化的JSON格式，后续处理起来方便多了。

5. 核心实战：检验报告结构化处理

5.1 设计数据结构

我们要把检验报告转换成什么样的结构化数据？我设计了一个简单的Python类来表示：

class LabTestItem:
    """单个检验项目"""
    def __init__(self, name, value, unit, reference_range):
        self.name = name  # 项目名称，如"白细胞计数"
        self.value = float(value)  # 检验结果值
        self.unit = unit  # 单位，如"10^9/L"
        self.reference_range = reference_range  # 参考范围，如"3.5-9.5"
        self.is_abnormal = False  # 是否异常
        self.abnormal_type = None  # 异常类型：'high'偏高, 'low'偏低
        
    def check_abnormal(self):
        """检查结果是否异常"""
        if '-' in self.reference_range:
            # 处理范围值，如"3.5-9.5"
            low, high = map(float, self.reference_range.split('-'))
            if self.value < low:
                self.is_abnormal = True
                self.abnormal_type = 'low'
            elif self.value > high:
                self.is_abnormal = True
                self.abnormal_type = 'high'
        # 还可以处理其他格式的参考范围，比如">100"、"<5"等
        
class LabReport:
    """完整的检验报告"""
    def __init__(self):
        self.patient_name = ""
        self.patient_age = ""
        self.patient_gender = ""
        self.test_date = ""
        self.items = []  # 多个检验项目
        
    def add_item(self, item):
        self.items.append(item)
        
    def check_all_abnormal(self):
        """检查所有项目是否异常"""
        for item in self.items:
            item.check_abnormal()
            
    def get_abnormal_items(self):
        """获取所有异常项目"""
        return [item for item in self.items if item.is_abnormal]

5.2 解析GLM-OCR的输出

GLM-OCR识别出来的文字需要进一步解析，提取出我们需要的信息。下面是一个解析函数：

import re
import json

def parse_lab_report_from_ocr(ocr_text):
    """从OCR识别结果解析检验报告"""
    
    report = LabReport()
    
    # 先提取患者基本信息（简单正则匹配）
    name_match = re.search(r'患者姓名[：:]\s*(\S+)', ocr_text)
    if name_match:
        report.patient_name = name_match.group(1)
    
    age_match = re.search(r'年龄[：:]\s*(\d+)', ocr_text)
    if age_match:
        report.patient_age = age_match.group(1)
    
    # 提取检验项目数据
    # 假设数据以表格形式存在，每行有4列
    lines = ocr_text.split('\n')
    
    for line in lines:
        # 跳过空行和表头
        if not line.strip() or '检验项目' in line or '参考范围' in line:
            continue
            
        # 尝试按空格或制表符分割
        parts = re.split(r'\s{2,}|\t', line.strip())
        if len(parts) >= 4:
            # 假设格式：项目名称 结果 单位 参考范围
            name = parts[0]
            value = parts[1]
            unit = parts[2] if len(parts) > 2 else ""
            ref_range = parts[3] if len(parts) > 3 else ""
            
            # 清理数据
            value = value.replace('×10^9', '')  # 处理科学计数法
            value = value.replace('*', '')
            
            try:
                item = LabTestItem(name, value, unit, ref_range)
                report.add_item(item)
            except ValueError:
                # 如果值转换失败，跳过这个项目
                continue
    
    return report

5.3 完整的处理流程

现在我们把所有步骤串起来，形成一个完整的处理流程：

from gradio_client import Client

def process_lab_report(image_path):
    """处理单张检验报告图片的完整流程"""
    
    # 1. 连接GLM-OCR服务
    client = Client("http://localhost:7860")
    
    # 2. 调用OCR识别（使用表格识别模式）
    print(f"正在识别报告: {image_path}")
    result = client.predict(
        image_path=image_path,
        prompt="Table Recognition:",
        api_name="/predict"
    )
    
    # 3. 解析识别结果
    print("解析识别结果...")
    report = parse_lab_report_from_ocr(result)
    
    # 4. 检查异常值
    print("检查异常值...")
    report.check_all_abnormal()
    
    # 5. 输出结果
    abnormal_items = report.get_abnormal_items()
    if abnormal_items:
        print(f"\n发现{len(abnormal_items)}个异常项目:")
        for item in abnormal_items:
            print(f"  {item.name}: {item.value} {item.unit} ({item.reference_range}) - {item.abnormal_type}")
    else:
        print("\n所有项目均在正常范围内")
    
    return report

# 使用示例
if __name__ == "__main__":
    report = process_lab_report("/path/to/your/lab_report.png")

运行这个脚本，你就能看到一份检验报告被自动识别、解析，并标记出异常项目。

6. 高级功能：异常值自动标红

6.1 生成带标记的HTML报告

仅仅在控制台输出异常项目还不够直观。我们可以生成一个HTML报告，用红色高亮显示异常值：

def generate_html_report(report, output_path="lab_report.html"):
    """生成带颜色标记的HTML报告"""
    
    html_content = """
    <!DOCTYPE html>
    <html>
    <head>
        <meta charset="UTF-8">
        <title>检验报告分析</title>
        <style>
            body { font-family: Arial, sans-serif; margin: 40px; }
            .header { background: #f5f5f5; padding: 20px; border-radius: 5px; }
            .patient-info { margin-bottom: 20px; }
            .test-table { width: 100%; border-collapse: collapse; margin-top: 20px; }
            .test-table th, .test-table td { 
                border: 1px solid #ddd; 
                padding: 12px; 
                text-align: left; 
            }
            .test-table th { background: #4CAF50; color: white; }
            .abnormal-high { background: #ffcccc; color: #cc0000; font-weight: bold; }
            .abnormal-low { background: #ccffff; color: #0066cc; font-weight: bold; }
            .summary { 
                margin-top: 30px; 
                padding: 15px; 
                background: #fff3cd; 
                border-left: 4px solid #ffc107;
            }
        </style>
    </head>
    <body>
        <div class="header">
            <h1>检验报告智能分析</h1>
            <div class="patient-info">
                <p><strong>患者姓名:</strong> {patient_name}</p>
                <p><strong>年龄:</strong> {patient_age}岁</p>
                <p><strong>分析时间:</strong> {analysis_time}</p>
            </div>
        </div>
        
        <h2>检验项目详情</h2>
        <table class="test-table">
            <tr>
                <th>检验项目</th>
                <th>检验结果</th>
                <th>单位</th>
                <th>参考范围</th>
                <th>状态</th>
            </tr>
            {table_rows}
        </table>
        
        <div class="summary">
            <h3>分析总结</h3>
            <p>共分析 {total_items} 个检验项目，其中异常项目 {abnormal_count} 个。</p>
            {abnormal_list}
        </div>
    </body>
    </html>
    """
    
    # 生成表格行
    table_rows = ""
    abnormal_count = 0
    
    for item in report.items:
        # 根据异常类型设置CSS类
        if item.is_abnormal:
            abnormal_count += 1
            if item.abnormal_type == 'high':
                row_class = "abnormal-high"
                status = "偏高 ↑"
            else:
                row_class = "abnormal-low"
                status = "偏低 ↓"
        else:
            row_class = ""
            status = "正常"
        
        table_rows += f"""
        <tr class="{row_class}">
            <td>{item.name}</td>
            <td>{item.value}</td>
            <td>{item.unit}</td>
            <td>{item.reference_range}</td>
            <td>{status}</td>
        </tr>
        """
    
    # 生成异常项目列表
    abnormal_list = ""
    abnormal_items = report.get_abnormal_items()
    if abnormal_items:
        abnormal_list = "<ul>"
        for item in abnormal_items:
            arrow = "↑" if item.abnormal_type == 'high' else "↓"
            abnormal_list += f"<li>{item.name}: {item.value}{item.unit} {arrow} (参考范围: {item.reference_range})</li>"
        abnormal_list += "</ul>"
    
    # 填充模板
    from datetime import datetime
    html_content = html_content.format(
        patient_name=report.patient_name or "未识别",
        patient_age=report.patient_age or "未识别",
        analysis_time=datetime.now().strftime("%Y-%m-%d %H:%M:%S"),
        table_rows=table_rows,
        total_items=len(report.items),
        abnormal_count=abnormal_count,
        abnormal_list=abnormal_list
    )
    
    # 保存HTML文件
    with open(output_path, 'w', encoding='utf-8') as f:
        f.write(html_content)
    
    print(f"HTML报告已生成: {output_path}")
    return output_path

# 在完整流程中添加HTML生成
def process_lab_report_with_html(image_path):
    """完整流程，包含HTML报告生成"""
    report = process_lab_report(image_path)
    html_path = generate_html_report(report, "analysis_report.html")
    
    # 在控制台输出HTML文件路径，方便用户直接打开
    print(f"\n打开以下文件查看带颜色标记的报告:")
    print(f"file://{os.path.abspath(html_path)}")
    
    return report, html_path

6.2 批量处理多份报告

在医院实际场景中，往往需要批量处理大量报告。我们可以轻松扩展这个功能：

import os
from concurrent.futures import ThreadPoolExecutor

def batch_process_reports(reports_dir, output_dir="reports_output"):
    """批量处理目录下的所有检验报告图片"""
    
    # 创建输出目录
    os.makedirs(output_dir, exist_ok=True)
    
    # 获取所有图片文件
    image_extensions = ['.png', '.jpg', '.jpeg', '.webp']
    report_files = []
    
    for file in os.listdir(reports_dir):
        if any(file.lower().endswith(ext) for ext in image_extensions):
            report_files.append(os.path.join(reports_dir, file))
    
    print(f"找到 {len(report_files)} 份检验报告需要处理")
    
    # 使用线程池并行处理（根据服务器性能调整线程数）
    results = []
    with ThreadPoolExecutor(max_workers=4) as executor:
        # 提交所有任务
        future_to_file = {
            executor.submit(process_single_report, file, output_dir): file 
            for file in report_files
        }
        
        # 收集结果
        for future in future_to_file:
            try:
                result = future.result()
                results.append(result)
                print(f"✓ 已完成: {os.path.basename(future_to_file[future])}")
            except Exception as e:
                print(f"✗ 处理失败: {os.path.basename(future_to_file[future])}, 错误: {e}")
    
    print(f"\n批量处理完成！共处理 {len(results)}/{len(report_files)} 份报告")
    return results

def process_single_report(image_path, output_dir):
    """处理单份报告并保存结果"""
    try:
        report = process_lab_report(image_path)
        
        # 生成HTML报告
        base_name = os.path.splitext(os.path.basename(image_path))[0]
        html_path = os.path.join(output_dir, f"{base_name}_analysis.html")
        generate_html_report(report, html_path)
        
        # 同时保存结构化数据为JSON
        json_path = os.path.join(output_dir, f"{base_name}_data.json")
        save_report_as_json(report, json_path)
        
        return {
            'image': image_path,
            'html': html_path,
            'json': json_path,
            'abnormal_count': len(report.get_abnormal_items())
        }
    except Exception as e:
        raise Exception(f"处理 {image_path} 失败: {str(e)}")

def save_report_as_json(report, json_path):
    """将报告保存为JSON格式"""
    report_dict = {
        'patient_info': {
            'name': report.patient_name,
            'age': report.patient_age,
            'gender': report.patient_gender
        },
        'test_items': [],
        'summary': {
            'total_items': len(report.items),
            'abnormal_count': len(report.get_abnormal_items())
        }
    }
    
    for item in report.items:
        report_dict['test_items'].append({
            'name': item.name,
            'value': item.value,
            'unit': item.unit,
            'reference_range': item.reference_range,
            'is_abnormal': item.is_abnormal,
            'abnormal_type': item.abnormal_type
        })
    
    import json
    with open(json_path, 'w', encoding='utf-8') as f:
        json.dump(report_dict, f, ensure_ascii=False, indent=2)

7. 实际应用与优化建议

7.1 在医院场景中的实际应用

通过上面的代码，我们已经构建了一个完整的检验报告智能处理系统。在实际医院环境中，它可以这样应用：

检验科医生助手

自动识别纸质报告或扫描件
快速提取关键指标
自动标记异常值，减少漏检
生成标准化电子记录

临床医生工作流

在电子病历系统中集成
新报告自动分析，异常结果优先显示
历史数据对比，跟踪指标变化趋势

医院管理

批量处理历史报告，建立电子档案
统计分析科室常见异常指标
质量控制和审计追踪

7.2 性能优化建议

处理速度优化

使用异步处理：对于批量任务，使用异步IO避免等待
缓存模型：GLM-OCR模型加载后常驻内存，避免重复加载
图片预处理：上传前压缩图片，减少传输和处理时间

from PIL import Image
import io

def preprocess_image(image_path, max_size=1024):
    """预处理图片，调整大小并压缩"""
    img = Image.open(image_path)
    
    # 调整大小，保持长边不超过max_size
    if max(img.size) > max_size:
        ratio = max_size / max(img.size)
        new_size = tuple(int(dim * ratio) for dim in img.size)
        img = img.resize(new_size, Image.Resampling.LANCZOS)
    
    # 保存为优化后的JPEG
    buffer = io.BytesIO()
    img.save(buffer, format='JPEG', quality=85, optimize=True)
    buffer.seek(0)
    
    return buffer

准确率提升

模板匹配：针对特定医院或设备的报告格式，创建模板
后处理规则：添加领域知识规则，纠正常见识别错误
人工复核接口：关键指标提供人工复核功能

系统稳定性

错误重试机制：网络波动或服务暂时不可用时自动重试
服务健康检查：定期检查GLM-OCR服务状态
日志记录：详细记录处理过程和错误信息，便于排查

7.3 扩展功能思路

趋势分析 不仅分析单次报告，还可以连接历史数据，分析指标变化趋势：

def analyze_trends(patient_id, test_name, days=30):
    """分析某个检验项目在指定时间段内的趋势"""
    # 从数据库获取历史数据
    historical_data = get_historical_results(patient_id, test_name, days)
    
    if len(historical_data) < 2:
        return "数据不足进行趋势分析"
    
    # 简单趋势判断
    values = [d['value'] for d in historical_data]
    dates = [d['date'] for d in historical_data]
    
    # 计算变化趋势
    from scipy import stats
    slope, intercept, r_value, p_value, std_err = stats.linregress(
        range(len(values)), values
    )
    
    if slope > 0.1:
        trend = "上升趋势"
    elif slope < -0.1:
        trend = "下降趋势"
    else:
        trend = "基本稳定"
    
    return {
        'trend': trend,
        'slope': slope,
        'latest_value': values[-1],
        'data_points': len(values)
    }

智能提醒 根据异常严重程度和临床重要性，设置分级提醒：

class AlertSystem:
    """智能提醒系统"""
    
    CRITICAL_TESTS = ['白细胞计数', '血红蛋白', '血小板计数', '血糖']
    
    def generate_alert(self, report):
        alerts = []
        
        for item in report.get_abnormal_items():
            severity = self.calculate_severity(item)
            
            alert = {
                'test_name': item.name,
                'value': item.value,
                'unit': item.unit,
                'reference_range': item.reference_range,
                'severity': severity,
                'message': self.get_alert_message(item, severity)
            }
            
            alerts.append(alert)
        
        return alerts
    
    def calculate_severity(self, item):
        """计算异常严重程度"""
        # 根据偏离正常范围的程度分级
        low, high = map(float, item.reference_range.split('-'))
        range_mid = (low + high) / 2
        
        if item.abnormal_type == 'high':
            deviation = (item.value - high) / (high - range_mid)
        else:
            deviation = (low - item.value) / (range_mid - low)
        
        if deviation > 1.0:
            return 'critical'  # 严重异常
        elif deviation > 0.5:
            return 'warning'   # 中度异常
        else:
            return 'notice'    # 轻度异常
    
    def get_alert_message(self, item, severity):
        """生成提醒消息"""
        messages = {
            'critical': f"【紧急】{item.name}严重异常！请立即处理。",
            'warning': f"【注意】{item.name}异常，需要关注。",
            'notice': f"【提示】{item.name}轻度异常，建议复查。"
        }
        return messages.get(severity, "检测到异常值")