The support
dataset is a random sample of 1000 patients from Phases I & II of SUPPORT (Study to Understand Prognoses Preferences Outcomes and Risks of Treatment). This dataset is very good for learning how to fit highly nonlinear predictor effects, imputing missing data, and for modeling 4 types of response variables:
See the following reference for more information about the SUPPORT study (funded by the Robert Wood Johnson Foundation): Knaus WA, Harrell FE, Lynn J et al. (1995): The SUPPORT prognostic model: Objective estimates of survival for seriously ill hospitalized adults. Annals of Internal Medicine 122:191-203.
You may want to use the following normal values that have been found to be satisfactory in imputing missing baseline physiologic data:
Baseline Variable | Normal Fill-in Value |
---|---|
Serum albumin | 3.5 |
PaO2/FiO2 ratio (pafi ) |
333.3 |
Bilirubin | 1.01 |
Creatinine | 1.01 |
BUN | 6.51 |
White blood count | 9 (thousands) |
Urine output | 2502 |
The ordinal functional disability variable sfdm2
has the following levels (in order):
Level | Meaning |
---|---|
1 | Patient lived 2 months to be able to get 2 month interview, and from this interview there were no signs of moderate to severe functional disability |
2 | Patient was unable to do 4 or more activities of daily living at month 2 after study entry. If the patient was not interviewed but the patient’s surrogate was, the cutoff for disability was ADL ³ 5 |
3 | Sickness Impact Profile total score at 2 months ³ 30 |
4 | Patient intubated or in coma at 2 months |
5 | Patient died before 2 months after study entry |
There are 159 patients surviving 2 months for whom there were nopatient or surrogate interviews. These patients have missing sfdm2
.
support2
DatasetData for all 9105 SUPPORT patients are available. These files contains all variables in the above support
plus the following (see variable labels for more documentation). As before, most of the patient assessments are made on the patient’s third study day.
Variable | Meaning |
---|---|
sps | SUPPORT day 3 physiology score |
aps | APACHE III day 3 physiology score |
surv2m | SUPPORT model 2-month survival estimate |
surv6m | SUPPORT model 6-month survival estimate |
hday | Day in hospital at which patient entered study |
diabetes | Diabetes as a comorbidity |
dementia | Dementia as a comorbidity |
ca | Patient has cancer |
prg2m | Physician’s 2-month survival estimate for pt. |
prg6m | Physician’s 6-month survival estimate for pt. |
dnr | Patient had DNR order |
dnrday | Day of DNR order (<0 if before study) |
To develop models without using findings from previous models, be sure not to use aps, sps, surv2m, surv6m
as predictors. You also will probably not want to use prg2m, prg6m, dnr, dnrday
.