Abstract
王安然,吴思竹,刘盛宇,修晓蕾,周佳茵,胡拯涌,段一凡.面向FAIR数据共享的医学通用数据模型比较研究[J].Chinese journal of Epidemiology,2023,44(5):828-836
面向FAIR数据共享的医学通用数据模型比较研究
Comparative study of medical common data models for FAIR data sharing
Received:October 25, 2022  
DOI:10.3760/cma.j.cn112338-20221025-00908
KeyWord: 健康医疗大数据  观察性研究  通用数据模型  语义互操作  数据标准化
English Key Word: Healthcare big data  Observational study  Common data model  Semantic interoperability  Data standardization
FundProject:中国医学科学院医学与健康科技创新工程(2021-I2M-1-057);国家重点研发计划(2021YFC2701301)
Author NameAffiliationE-mail
Wang Anran Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Wu Sizhu Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China wu.sizhu@imicams.ac.cn 
Liu Shengyu Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Xiu Xiaolei Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Zhou Jiayin Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Hu Zhengyong Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Duan Yifan Department of Medical Data Sharing, Institute of Medical Information, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China  
Hits: 3265
Download times: 1400
Abstract:
      通用数据模型(CDM)是促进多源异构健康医疗大数据标准化整合、增强数据语义理解一致性、推动多方协同分析的重要工具,经CDM标准化后的数据集合可为开展大规模人群队列等观察性研究提供有力支撑。本文深入比较分析了三项国际典型医学CDM的数据存储结构、术语映射模式和辅助工具研发情况,系统梳理各模型的优势、局限,总结了我国在CDM应用过程中所面临的挑战与机遇。期望通过探索国外在健康医疗大数据开放共享过程中的先进技术理念与实践模式,为推动我国健康医疗数据资源FAIR化建设,即数据可发现(findable)、可访问(accessible)、可互操作(interoperable)和可重用(reusable),解决当前数据资源质量不佳、语义化程度低、无法实现打通共享和重复利用等实际问题提供借鉴。
English Abstract:
      The common data model (CDM) is an important tool to facilitate the standardized integration of multi-source heterogeneous healthcare big data, enhance the consistency of data semantic understanding, and promote multi-party collaborative analysis. The data collections standardized by CDM can provide powerful support for observational studies, such as large-scale population cohort study. This paper provides an in-depth comparative analysis of the data storage structure, term mapping pattern, and auxiliary tools development of the three international typical CDMs, then analyzes the advantages and limitations of each CDM and summarizes the challenges and opportunities faced in the CDM application in China. It is expected that exploring the advanced technical concepts and practical patterns of foreign countries in data management and sharing will provide references for promoting FAIR (findable, accessible, interoperable, reusable) construction of healthcare big data in China and solving the current practical problems, such as the poor quality of data resources, the low degree of semantization, and the inabilities of data sharing and reuse.
View Fulltext   Html FullText     View/Add Comment  Download reader
Close