CAST, 2024-11-18

Background Question

  • Do you wonder why we have so many special cases in statistics that require seemingly different methods?
    • time to first event
    • recurrent events
    • recurrent events with an absorbing state
    • cure models (two absorbing states)
    • competing risks

Background, continued

  • More special cases
    • Wilcoxon, Kruskal-Wallis, logrank tests
    • zero inflation adaptations of Poisson and negative binomial models
    • longitudinal data analysis

Semiparametric Regression

Univariate Proportional Odds Model

  • \(\Pr(Y \geq y | X) = \mathrm{expit}(\alpha_{y} + X\beta)\)
  • \(Y\)-transformation invariant, does not use \(Y\) spacings
  • Handles arbitrarily heavy ties or continuous \(Y\), bi-modality, …
  • Direct competitor of the linear model

Longitudinal Models

Full Likelihood Extensions

  • Random intercepts
    • massive lack of fit for correlation structure
      • implies there is interest in individual pt-level outcomes vs. group level (treatment level)
      • absorbing states destroy the correlation pattern
      • typically assumes that > 6 observations per patient do not increase power

Full Likelihood Extensions, continued

  • Random intercepts and slopes
    • more flexible correlation structure but still may not fit
      • too many parameters to estimate
    • can’t have absorbing states
  • Markov models
    • most flexible, fastest, easiest to program
      • trivial to implement with ML (until you want state occupancy probabilities)

Full Likelihood Extensions, continued

  • Markov models apply to all \(Y\)
    • binary
    • unordered categorical
    • ordinal categorical
    • ordinal continuous
    • ordinal mixed continuous and categorical
    • continuous
    • left, right, and interval censored
    • require unconditioning on previous \(Y\) to get marginal distributions

First-Order Discrete Time Markov Proportional Odds Model

  • Current state depends only on covariates, previous state
  • Let measurement times be \(t_{1}, t_{2}, \dots, t_{m}\), and the measurement for a patient at time \(t\) be denoted \(Y(t)\) \[\Pr(Y(t_{i}) \geq y | X, Y(t_{i-1})) =\] \[\mathrm{expit}(\alpha_{y} + X\beta + g(Y(t_{i-1}), t_{i}))\]

Unifying Approach

Markov PO Model: A Unified Approach

  • Time to terminating event
    • transition probability = discrete hazard rate
    • OR \(\approx\) HR when time intervals small
    • easily handles time-dependent covariates, left-truncation
  • Recurrent binary events
  • Recurrent binary events + a terminal event

Unified Approach, continued

  • Competing risks
    • death explicitly handled as a bad outcome
    • easier to interpret than competing risk models
  • Serial current status data
    • events of different severities
    • no need to judge whether an early heart attack is worse than a late death
  • Missing data and interval-censored \(Y\)

Unified Approach, continued

  • Standard longitudinal continuous \(Y\)
  • Longitudinal continuous or ordinal \(Y\) interrupted by clinical events
  • Easily handles multiple absorbing states
  • Serial correlation: condition on previous outcome
  • Random intercepts (compound symmetry correlation): condition on average of all previous outcomes

Examples of Longitudinal Ordinal Outcomes

  • 0=alive 1=dead
    • censored at 3w: 000
    • death at 2w: 01
  • 0=at home 1=hospitalized 2=MI 3=dead
    • hospitalized at 3w, rehosp at 7w, MI at 8w & stays in hosp, f/u ends at 10w: 0010001211

Examples, continued

  • 0-6 QOL excellent–poor, 7=MI 8=stroke 9=dead
    • QOL varies, not assessed in 3w but pt event free, stroke at 8w, death 9w: 12[0-6]334589
    • MI status unknown at 7w: 12[0-6]334[5,7]89
  • Can make first 200 levels be a continuous response variable and the remaining values represent clinical event overrides

From Transition Probabilities to State Occupancy Probabilities

Unconditioning on Previous States

  • For equal time spacing:
    \(\Pr(Y(t)=y | X) =\)
    \(\sum_{j=1}^{k}\Pr(Y(t)=y | X, Y(t-1) = j) \times\)
    \(\Pr(Y(t-1) = j | X)\)
  • Use this recursively
  • Yields a semiparametric unconditional (except for \(X\)) distribution of \(Y\) at each \(t\) (SOPs)
  • soprobMarkovOrd* functions in the R Hmisc package make this easy for frequentist and Bayesian models

Estimands

  • Transition odds ratios (original parameters)
  • Prior state and covariate-specific transition probabilities
  • State occupancy probabilities (SOPs; marginalize over time)
  • Covariate-specific SOPs
    • \(\Pr(\)stroke in week 4 or death in or before week 4\()\); \(\Pr(\)stroke and alive\()\)

Estimands, continued

  • Time in state \(Y=y\) (like RMST)
  • Time in states \(Y \geq y\) (e.g., mean time unwell)
  • Differences in mean time in state between treatments
  • Continuous \(Y\) example: mean time with SBP < 130mmHg
    • Nice way to handle treatment \(\times\) time interaction
    • No categorization of SBP

Popular OLM Readout

Another OLM Example: ORBITA-2

FA Simader et al (2024): Symptoms as a predictor of the placebo-controlled efficacy of PCI in stable coronary artery disease. JACC 84: 13-24.

  • Daily angina frequencies
  • Y = ordinal scale
    • Angina frequencies magnified by # units of anti-anginal meds required to control angina
    • Clinical event overrides at top of scale
    • Graphic design by Matthew Shun-Shin, Imperial College London

ORBITA-2, continued


OLM, continued

Extension of OLM: Partial Proportional Odds Markov Model

  • Allows effect of Tx to vary over outcome categories
  • Example: Tx may affect mortality differently than how it affects function/Sx
  • Bayesian prior can specify limits on the amount of borrowing of Tx effects across outcomes
  • Example Bayesian posterior prob.: probability that Tx affects death by \(> 1.5\times\) effect on function
  • See fharrell.com/post/yborrow

Replacement for Surrogate Endpoints

  • Surrogate endpoint: an endpoint affected by Tx if and only if the Tx affects the gold standard (GS) endpoint
  • Surrogates are used to mirror and give an early preview to the GS
  • Very difficult to establish surrogacy, and it is unreasonable to assume Pr(surrogacy) is 0 or 1
  • If you think of this as a multiple imputation problem (using surrogate to impute the GS), \(N\) for an RCT based on a surrogate may be greater than \(N\) needed for GS!

Surrogate Endpoints, continued

Instead:

  • Include the (lower powered) GS as part of an ordinal outcome
  • Construct an ordinal outcome built on consensus about severities of outcomes
  • Decide on how much information you want to borrow across endpoints
  • I.e., how does a posterior distribution for Tx effect on nonfatal endpoint help build the posterior for the effect on GS

More Information

  • hbiostat.org/ordinal
  • hbiostat.org/endpoint
  • fharrell.com/talk/cmstat