The Data Characterization data model is designed to support rapid querying of the output produced by data quality and characterization activities in distributed research networks. It is based on the output of the Mini-Sentinel data quality review process for the Mini-Sentinel Distributed Database, and on similar data quality review output run against the PCORnet Common Data Model, but it is generalizable and extensible to other data quality review output.

The Data Characterization data model can be queried directly in the PopMedNet System; see /wiki/spaces/DOC/pages/8880343.

The Data Characterization data model includes thirteen tables containing information regarding the distribution and missingness of variables among the data held by each data partner:

| Table | Description |
|---|---|
| Age | Distribution of patients' ages stratified by data partner |
| Diagnoses | Counts of diagnosis codes stratified by code, code type, and data partner |
| Height | Distribution of patients' heights stratified by data partner |
| Hispanic | Ethnicity distribution stratified by data partner |
| Metadata | Completeness and availability of data stratified by table, minimum and maximum date, and data partner |
| NDCS | Presence of National Drug Codes stratified by data partner |
| PDX | Distribution of discharge diagnosis types stratified by encounter type and data partner |
| Procedures | Counts of procedure codes stratified by code, code type, and data partner |
| Race | Race distribution stratified by data partner |
| RxAmt | Distribution of pharmacy dispensing amount supplied stratified by data partner |
| RxSup | Distribution of pharmacy dispensing days supply stratified by data partner |
| Sex | Distribution of patients' sex stratified by data partner |
| Weight | Distribution of patients' weights stratified by data partner |
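
To illustrate what "stratified by data partner" means in practice, the following is a minimal sketch of how a table such as Sex could be summarized per data partner. This page does not list column definitions, so the column names (`data_partner`, `sex`, `n`), the in-memory SQLite database, the helper `sex_distribution`, and the sample rows are all hypothetical and exist only for illustration. They are not the actual Data Characterization schema or real data.

```python
import sqlite3

# ASSUMPTION: the column names and rows below are made up for illustration;
# they are not the real Data Characterization column definitions or data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Sex (data_partner TEXT, sex TEXT, n INTEGER)")
conn.executemany(
    "INSERT INTO Sex VALUES (?, ?, ?)",
    [
        ("DP1", "F", 60),
        ("DP1", "M", 40),
        ("DP2", "F", 30),
        ("DP2", "M", 50),
        ("DP2", "U", 20),
    ],
)


def sex_distribution(connection):
    """Return {data_partner: {sex: proportion}} for the hypothetical Sex table."""
    # Total patient count per data partner, used as the denominator.
    totals = dict(
        connection.execute(
            "SELECT data_partner, SUM(n) FROM Sex GROUP BY data_partner"
        )
    )
    result = {}
    rows = connection.execute(
        "SELECT data_partner, sex, n FROM Sex ORDER BY data_partner, sex"
    )
    for partner, sex, n in rows:
        # Convert each count into a share of that partner's total.
        result.setdefault(partner, {})[sex] = n / totals[partner]
    return result


distribution = sex_distribution(conn)
```

Each data partner gets its own set of proportions, so distributions can be compared across partners without pooling the underlying counts.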