Thread by @raj_mehta, 1/ Tweetorial at use of Race/Ethnicity in prediction models, specifically looking at [...]

1/ Tweetorial at use of Race/Ethnicity in prediction models, specifically looking at eGFR based on CKD-EPI formula.

eGFR = estimated GFR http://ckdepi.org/

Home Page - CKD-EPI

Many kidney diseases progress slowly, and trials to evaluate treatments for kidney disease require extensive durations of follow-up to determine if the treatment has been effective, increasing...

http://ckdepi.org/

2/ First some background.

GFR can be directly measured, traditionally by measuring iothalamate clearance. This is process is time consuming, costly, and rather inconvenient. So for greater ease, we attempt to estimate GFR (eGFR) based on serum measurements of Creatinine.

3/ Prediction Models:
The way this works is by getting lots of values of Creatinine, comparing them to know values of GFR, and then putting together a prediction model to create a formula that lets us closely approximate the GFR values.

4/ The most important aspect here is the prediction model has to be calibrated & validated. Its not good if the eGFR is wrong! Calibrating the model requires adjusting based on other factors that can improve the prognostic value of the equation. https://jamanetwork.com/journals/jama/article-abstract/2656816

Users’ Guides to Discrimination and Calibration of Clinical Prediction Models

This Users’ Guide to the Medical Literature discusses discrimination and calibration, 2 primary ways to measure and compare the accuracy of clinical risk prediction models.

https://jamanetwork.com/journals/jama/article-abstract/2656816

5/ The Creatinine EPI-CKD model, based on available data-sets at that time, found that Age, Sex, and Race adjustments offered the best "improvement" predictions. Since creatinine is influenced by activity level, diet, and (presumably) genetics, this was seen as plausible model.

6/ Not all predictor variables are causal.

A confusing point!

For example, a mortality model might use hospitalization to improve prediction of death. But the hospital itself is not causing death; it is merely a surrogate for disease severity (which is harder to measure).

7/ Many people think of age, sex, and race as causal predictors of eGFR. Here is one example of a causal model, where race is a surrogate for (unmeasured) genetics:

8/ However, their is a problem with this approach; race/ethnicity is considered a complex social construct that is more than just genetics or skin pigmentation.

https://www.annualreviews.org/doi/abs/10.1146/annurev-polisci-032015-010015

9/ Here is one example of a eGFR causal model using social constructs. Factors such at inequality, Socio-Economic-Status, cultural culinary habits, etc can influence diet/activity & serum Cr values.

10/ In such a model, inadequate consideration for confounders while controlling for race is going to bias eGFR predictions.

For the 10 studies used to calibrate the CKD-EPI eGFR model, demographic info is available for some of the major data sets.

Ref:
https://rajking.github.io/ckdepi/

11/ The majority of AA patients came from the AASK trial. The demographic data of the AASK study had lower income, education, & more smokers compared to available demographic data in the other studies.

12/ Their two large problems for the Cr based prediction model.

First, the different cohorts of data & study sites can have lots of variations. To avoid accidentally over-weighting one study, its important to do a cluster analaysis, which the CKD-EPI study does not do.

13/ Second, the patients in the AASK trial had SES differences (jobs, environment, diet, income, etc) that would confound/bias eGFR predictions.

(Note, even if we could adjust for SES, we still cannot account for all confounders related to race)

https://jamanetwork.com/journals/jama/fullarticle/2703957

14/ Interestingly, we can also estimate GFR values using Cystatin C (its more expensive), which is not influenced by diet/activity.

The prediction models using cystatin-c did not find improved prediction when adjusting for race, as they do for age/sex. http://ckdepi.org/equations/estimating-equations/cystatin-c-based-equations/

Cystatin C Based Equations - CKD-EPI

CKD-EPI Cystatin C was developed in 2012 to estimate GFR from serum cystatin C, age, and sex. The equation has similar accuracy to the CKD-EPI Creatinine 2009 Equation. The advantage of the CKD-EPI...

http://ckdepi.org/equations/estimating-equations/cystatin-c-based-equations/

15/ Should we continue to use race/ethnicity in the Creatinine CKD-EPI prediction model?

After reviewing the data, its hard to have confidence in it.

Additionally, their are concerns that inaccurate eGFR can cause harm if it influences entering transplant wait lists, etc.

16/ For common use of Creatinine eGFR like dosing adjustment, etc, using age/sex alone is probably still sufficient.

For more important decisions, perhaps we need to do more decision analysis to decide when Cystatin C or direct (iothalimate) measures offer greater utility.

17/ A more difficult question: should we continue to use race/ethnicity for biomedical prediction models in general?

This is a more nuanced topic, to which I dont think their is one answer.

18/ Not all variables in prediction models are causal. Models specifically looking at predicting social determinants of health could use race/ethnicity if done with appreciation of the complex components involved.

19/ The ASCVD risk prediction, for example, considers race/ethnicity. Is race/ethnicity is "causal", or a surrogate for social determinants of health? Should we remove race? Would removing race cause harm if it worsens prediction? Should we let patients decide?

20/ If we think race/ethnicity is important for prediction, we should do our best to define how and why.

And if we want to collect data, we should do so with a more thoughtful, multi-dimensional understanding of identify. https://jamanetwork.com/journals/jama/article-abstract/2703954

Race, Ancestry, and Medical Research

The discussion of race and medicine in the United States is challenging and emotionally charged. Substantial disparities in health outcomes, based on race, ethnicity, and socioeconomic status,...

https://jamanetwork.com/journals/jama/article-abstract/2703954

21/ We should be thoughtful about any aspect of a person's identity, and the utility of using such information to inform decision making.

/End

Latest Threads Unrolled: