Skip to main content
MenuSearch & Directory

Data Resources

The AHEAD Institute warehouses several large databases and assists in identifying, obtaining and maintaining data to meet your research needs.

Many databases are de-identified and using them has been deemed non-human subjects research by the SLU Institution Review Board. Other databases require Data Use Agreements and special training. Below is a brief description of the primary data resources available within the institute.

Midwestern Healthcare System Electronic Health Records

The SLU/SSM Virtual Data Warehouse (VDW) captures academic and non- academic ambulatory and inpatient clinical encounters from more than 5 million patients (birth to age > 90 years) starting in 1/1/2008 to present. The VDW is updated monthly. SLU/SSM is a member site of the Health Care Systems Research Network (HCSRN), and the VDW was created per HCSRN specifications. 

This resource captures electronic health record (EHR) information from rural and urban settings in the St. Louis, Missouri, metropolitan area, mid-Missouri, southern Illinois, Oklahoma City, Oklahoma and surrounding areas, and southern Wisconsin. The VDW variables include ICD-9 and ICD-10 diagnostic codes; Current Procedural Terminology (CPT), ICD-9-PCS, and ICD-10-PCS procedure codes; prescription orders; laboratory orders and results; vital signs; provider and clinic type; and demographics.

VDW usage requires a Data Use Agreement and a Data Request.

If you have questions about this data resource, please email joanne.salas@health.slu.edu.

Healthcare Cost and Utilization Project 

The Healthcare Cost and Utilization Project (HCUP) includes the largest collection of longitudinal hospital care data in the United States. 

The AHEAD Institute warehouses the following HCUP data resources:

  • Nationwide Inpatient Sample (1998-2017)
    • The National (Nationwide) Inpatient Sample (NIS) is the largest publicly available all payer inpatient care database in the United States, containing data on more than seven million hospital stays. Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, uncommon treatments and special populations.
  • Kids’ Inpatient Database (1997-2016)
    • The Kids' Inpatient Database (KID) is the largest publicly available all-payer pediatric inpatient care database in the United States, containing data from two to three million hospital stays. Its large sample size is ideal for developing national and regional estimates and enables analyses of rare conditions, such as congenital anomalies, as well as uncommon treatments, such as organ transplantation. The KID has been produced every three years.
  • Nationwide Emergency Department Sample (2003-2017)
    • The Nationwide Emergency Department Sample (NEDS) produces national estimates about emergency department visits across the country. The NEDS describes emergency department visits, regardless of whether they result in admission. One of the most distinctive features of the NEDS is its large sample size, which allows for analysis across hospital types and the study of relatively uncommon disorders and procedures.
  • Nationwide Readmissions Database (2016-2019)
    • The Nationwide Readmissions Database (NRD) is a unique and powerful database designed to support various types of analyses of national readmission rates for all patients regardless of the expected payer for the hospital stay. The NRD includes discharges for patients with and without repeat hospital visits in a year and those who have died in the hospital. Repeat stays may or may not be related. This database addresses a large gap in health care data: the lack of nationally representative information on hospital readmissions for all ages. 

HCUP usage requires a Data Use Agreement and a Data Request.

If you have questions about this data resource, please email paula.buchanan@health.slu.edu

TriNetX

TriNetX is a global research network with a federated architecture. This database provides access to more than 200 million, de-identified patient lives (EHR), including demographics, diagnoses, procedures, medication orders, lab results and vital signs.

TriNetX Data Snapshot: 
  • Free of charge to unfunded, contributing health care organizations
  • Access to more than 200 million, de-identified patient lives (EHR)
  • EHR (demographics, diagnoses, procedures, medication orders, lab results and vital signs)
  • Access to linked third-party claims and mortality data (subset of about 11.6 million)
  • Limited access: Notes/reports (NLP), cancer registry, tumor registry

TriNetX usage requires a Data Request approval by TriNetX.

If you have questions about this data resource, please email joanne.salas@health.slu.edu.

All of Us 

The All of Us Research Program from the National Institutes of Health (NIH) aims to improve health care outcomes by building one of the most diverse health databases. This data resource offers a Research Hub, which stores health data from diverse participants across the United States, with a focus on underrepresented minorities in biomedical research (POC, LGBTQIA). Registered researchers can use the Researcher Workbench to access different data tiers and analysis tools. Anyone at SLU can sign up online to access All of Us data snapshots.

All of Us Data Snapshot: 

  • Free of charge
  • More than 507,000 participants recruited from partner sites (EHR) 
  • Genomics (blood, saliva and/or urine)
  • Surveys (identity, background, health, lifestyle, medical history, health care access, COVID-19 experiences, etc.) 
  • Physical measurements (height, weight, hip/waist circumference, blood pressure and heart rate)
  • Digital health (wearable trackable devices, e.g. Fitbit) 

All of Us Training Video  

If you have questions about this data resource, please email ruizhi.huang@health.slu.edu

Public Data Sources

The AHEAD Institute can work with publicly available databases accessible via organizational websites including the Centers for Disease Control and the US Department of Health and Human Services. Some examples include CDC Wonder Data and Inter-university Consortium for Political and Social Research.

Other Data Sources

The AHEAD team can work with investigator supplied data, Epic pulls, and retrospective chart reviews. The AHEAD team will work alongside investigators to select the ideal data source for all research projects.