Gaining access to EMR Data

Johns Hopkins Medicine uses the EPIC EMR at all locations except All Children's Hospital (ACH). The Epic EMR backs up nightly into a Clarity relational database comprising of over 20k tables. Those tables are brought into the PMAP data commons and projected to prospective research databases. As such, the data will have a 24 hour latency.

Getting access to the Asthma Dataset

Most of the tutorials in this section are based on a de-identified dataset projected from Epic on patients with asthma based upon an IRB from Dr. Christopher Chute. The dataset encompasses 60k patients with over 110M data elements encompassing encounters, medications, labs, procedures, symptoms, and vital measurements. The data only includes categorical fields (hence no narrative notes). To compile relevant Jupyter notebook tutorials you will need to be granted access to the PMAP database with this data.

You will need to attest here that you will:

Connecting to the PMAP Database

To access the PMAP Database you will need to use your JHEDID and password which you can hide by using the getpass library. PMAP maintains a list of approved users for each data projection based upon IRB protocols.

Database connection

The data sits on a database server that is designed to query large data sets quickly. This script connects to the PMAP database server and allows access to the CAMP_PMCoe_Projection which is the de-identified asthma dataset.