You will work through two healthcare data-analysis exercises using whatever AI tool you prefer (ChatGPT, Claude, Microsoft Copilot, etc.). Both exercises follow the same arc: download a dataset, clean it, ask an AI good analytical questions, and verify the answers.
The Joint Outpatient Experience Survey (JOES) combines and standardizes outpatient satisfaction surveys across Army, Navy, and Air Force medical facilities. It focuses on beneficiary experience with care received at Military Treatment Facilities (MTFs). The Defense Health Agency uses JOES data quarterly to generate "best of the best" reports ranking top-performing clinics, providers, and support staff across the Military Health System.
Key aspects evaluated:
This exercise uses fictionalized data modeled after authentic JOES datasets to provide realistic training while maintaining data privacy.
Obtain the one-year JOES dataset above. Save it somewhere accessible on your computer.
Healthcare data often contains inconsistencies, duplicate entries, missing values, and formatting errors that undermine analysis accuracy. Use your AI tool to identify and fix these issues. Keep a record of what changes you make.
Once cleaning is complete and you have verified the corrections, upload the cleaned dataset to an AI platform capable of data analysis — ChatGPT, Claude, or Microsoft Copilot with data analysis capabilities.
Structured, context-rich prompts significantly improve AI accuracy. Try prompts like:
"Compare patient satisfaction scores for Aspen Pass Army Medical Center against the average of comparable Army facilities. Identify areas where Aspen Pass performs above or below benchmark."
"Analyze satisfaction trends by department or clinic type. Which specialties have the highest and lowest patient satisfaction ratings?"
"Identify the top three factors contributing to patient dissatisfaction at Aspen Pass based on survey responses."
"Calculate the percentage of patients rating overall satisfaction as excellent, good, fair, or poor for Aspen Pass versus comparator facilities."
Effective prompts specify the analytical goal, the desired output format, and the relevant context.
Don't trust AI output blindly. Validation techniques:
Never rely solely on AI output for clinical or operational decisions without human verification.
| Variable | Type | Label | JOES Question / Meaning |
|---|---|---|---|
SEX | Categorical | Patient Sex | Patient sex/gender identifier |
MEPRS4 | Categorical | MEPRS Clinic Code | Medical Expense and Performance Reporting System (MEPRS) clinic/service code |
CLINIC_NAME | Text | Clinic Name | Name of the clinic or outpatient service |
BENCAT | Categorical | Beneficiary Category | Military beneficiary category of patient |
PATAGE | Numeric | Patient Age | Patient age in years |
QGENERAL_1 | Likert (1–5) | General Satisfaction #1 | "Overall, how satisfied were you with your visit?" |
QGENERAL_2 | Likert (1–5) | General Satisfaction #2 | "Overall, how satisfied were you with the clinic/facility?" |
QVISITOEv1 | Open-ended | Visit Comment | "Please provide comments about your visit/provider experience." |
QFAC_OEv1 | Open-ended | Facility Comment | "Please provide comments about the facility or clinic experience." |
QSEEPROE | Open-ended | Access to Provider Comment | "Please provide comments about your ability to see your provider when needed." |
facility_name | Text | MTF Name | Name of military treatment facility |
return_date | Date | Survey Return Date | Date survey was submitted or returned |
| Code | Meaning | Code | Meaning |
|---|---|---|---|
AD | Active Duty | RETFAM | Retiree Family Member |
ADFAM | Active Duty Family Member | DR | Dependent Retiree |
DA | Dependent Adult | MCR | Medicare Eligible Retiree |
DC | Dependent Child | MCRFAM | Medicare Eligible Retiree Family Member |
RET | Retiree | RES | Reservist |
NG | National Guard | CIV | Civilian |
OTHER | Other Beneficiary Category |
Using AI tools, analyze the DMLSS transaction dataset to determine each customer's consumption patterns and recommend a revised delivery schedule.
What is the maximum delivery interval for each facility based on their ordering patterns?
Assuming you reduce the delivery schedule to account for the average / min / max service interval for each account, does this solve your problem? Can you achieve your proposed delivery schedule given:
If your proposed schedule doesn't solve the problem, how would you refine your analysis to ensure that the new delivery schedule is achievable?
Data source: DMLSS transaction records, WHMC Lackland AFB, 2010–2011 (fictionalized for training).