Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort studyAbstract Objective To derive and validate a risk prediction algorithm to estimate hospital admission and mortality outcomes from coronavirus disease 2019 (covid-19) in adults. Design Population based cohort study. Setting and participants QResearch database, comprising 1205 general practices in England with linkage to covid-19 test results, Hospital Episode Statistics, and death registry data. 6.08 million adults aged 19-100 years were included in the derivation dataset and 2.17 million in the validation dataset. The derivation and first validation cohort period was 24 January 2020 to 30 April 2020. The second temporal validation cohort covered the period 1 May 2020 to 30 June 2020. Main outcome measures The primary outcome was time to death from covid-19, defined as death due to confirmed or suspected covid-19 as per the death certification or death occurring in a person with confirmed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in the period 24 January to 30 April 2020. The secondary outcome was time to hospital admission with confirmed SARS-CoV-2 infection. Models were fitted in the derivation cohort to derive risk equations using a range of predictor variables. Performance, including measures of discrimination and calibration, was evaluated in each validation time period. Results 4384 deaths from covid-19 occurred in the derivation cohort during follow-up and 1722 in the first validation cohort period and 621 in the second validation cohort period. The final risk algorithms included age, ethnicity, deprivation, body mass index, and a range of comorbidities. The algorithm had good calibration in the first validation cohort. For deaths from covid-19 in men, it explained 73.1% (95% confidence interval 71.9% to 74.3%) of the variation in time to death (R 2 ); the D statistic was 3.37 (95% confidence interval 3.27 to 3.47), and Harrell’s C was 0.928 (0.919 to 0.938). Similar results were obtained for women, for both outcomes, and in both time periods. In the top 5% of patients with the highest predicted risks of death, the sensitivity for identifying deaths within 97 days was 75.7%. People in the top 20% of predicted risk of death accounted for 94% of all deaths from covid-19. Conclusion The QCOVID population based risk algorithm performed well, showing very high levels of discrimination for deaths and hospital admissions due to covid-19. The absolute risks presented, however, will change over time in line with the prevailing SARS-C0V-2 infection rate and the extent of social distancing measures in place, so they should be interpreted with caution. The model can be recalibrated for different time periods, however, and has the potential to be dynamically updated as the pandemic evolves.
The current status of risk-stratified breast screeningAsh Kieran Clift, David Dodwell, Simon Lord et al.|British Journal of Cancer|2021 Apart from high-risk scenarios such as the presence of highly penetrant genetic mutations, breast screening typically comprises mammography or tomosynthesis strategies defined by age. However, age-based screening ignores the range of breast cancer risks that individual women may possess and is antithetical to the ambitions of personalised early detection. Whilst screening mammography reduces breast cancer mortality, this is at the risk of potentially significant harms including overdiagnosis with overtreatment, and psychological morbidity associated with false positives. In risk-stratified screening, individualised risk assessment may inform screening intensity/interval, starting age, imaging modality used, or even decisions not to screen. However, clear evidence for its benefits and harms needs to be established. In this scoping review, the authors summarise the established and emerging evidence regarding several critical dependencies for successful risk-stratified breast screening: risk prediction model performance, epidemiological studies, retrospective clinical evaluations, health economic evaluations and qualitative research on feasibility and acceptability. Family history, breast density or reproductive factors are not on their own suitable for precisely estimating risk and risk prediction models increasingly incorporate combinations of demographic, clinical, genetic and imaging-related parameters. Clinical evaluations of risk-stratified screening are currently limited. Epidemiological evidence is sparse, and randomised trials only began in recent years.
Association Between Race and COVID-19 Outcomes Among 2.6 Million Children in EnglandImportance: Although children mainly experience mild COVID-19 disease, hospitalization rates are increasing, with limited understanding of underlying factors. There is an established association between race and severe COVID-19 outcomes in adults in England; however, whether a similar association exists in children is unclear. Objective: To investigate the association between race and childhood COVID-19 testing and hospital outcomes. Design, Setting, Participants: In this cohort study, children (0-18 years of age) from participating family practices in England were identified in the QResearch database between January 24 and November 30, 2020. The QResearch database has individually linked patients with national SARS-CoV-2 testing, hospital admission, and mortality data. Exposures: The main characteristic of interest is self-reported race. Other exposures were age, sex, deprivation level, geographic region, household size, and comorbidities (asthma; diabetes; and cardiac, neurologic, and hematologic conditions). Main Outcomes and Measures: The primary outcome was hospital admission with confirmed COVID-19. Secondary outcomes were SARS-CoV-2-positive test result and any hospital attendance with confirmed COVID-19 and intensive care admission. Results: Of 2 576 353 children (mean [SD] age, 9.23 [5.24] years; 48.8% female), 410 726 (15.9%) were tested for SARS-CoV-2 and 26 322 (6.4%) tested positive. A total of 1853 children (0.07%) with confirmed COVID-19 attended hospital, 343 (0.01%) were admitted to the hospital, and 73 (0.002%) required intensive care. Testing varied across race. White children had the highest proportion of SARS-CoV-2 tests (223 701/1 311 041 [17.1%]), whereas Asian children (33 213/243 545 [13.6%]), Black children (7727/93 620 [8.3%]), and children of mixed or other races (18 971/147 529 [12.9%]) had lower proportions. Compared with White children, Asian children were more likely to have COVID-19 hospital admissions (adjusted odds ratio [OR], 1.62; 95% CI, 1.12-2.36), whereas Black children (adjusted OR, 1.44; 95% CI, 0.90-2.31) and children of mixed or other races (adjusted OR, 1.40; 95% CI, 0.93-2.10) had comparable hospital admissions. Asian children were more likely to be admitted to intensive care (adjusted OR, 2.11; 95% CI, 1.07-4.14), and Black children (adjusted OR, 2.31; 95% CI, 1.08-4.94) and children of mixed or other races (adjusted OR, 2.14; 95% CI, 1.25-3.65) had longer hospital admissions (≥36 hours). Conclusions and Relevance: In this large population-based study exploring the association between race and childhood COVID-19 testing and hospital outcomes, several race-specific disparities were observed in severe COVID-19 outcomes. However, ascertainment bias and residual confounding in this cohort study should be considered before drawing any further conclusions. Overall, findings of this study have important public health implications internationally.
Development and internal-external validation of statistical and machine learning models for breast cancer prognostication: cohort studyOBJECTIVE: To develop a clinically useful model that estimates the 10 year risk of breast cancer related mortality in women (self-reported female sex) with breast cancer of any stage, comparing results from regression and machine learning approaches. DESIGN: Population based cohort study. SETTING: QResearch primary care database in England, with individual level linkage to the national cancer registry, Hospital Episodes Statistics, and national mortality registers. PARTICIPANTS: 141 765 women aged 20 years and older with a diagnosis of invasive breast cancer between 1 January 2000 and 31 December 2020. MAIN OUTCOME MEASURES: Four model building strategies comprising two regression (Cox proportional hazards and competing risks regression) and two machine learning (XGBoost and an artificial neural network) approaches. Internal-external cross validation was used for model evaluation. Random effects meta-analysis that pooled estimates of discrimination and calibration metrics, calibration plots, and decision curve analysis were used to assess model performance, transportability, and clinical utility. RESULTS: During a median 4.16 years (interquartile range 1.76-8.26) of follow-up, 21 688 breast cancer related deaths and 11 454 deaths from other causes occurred. Restricting to 10 years maximum follow-up from breast cancer diagnosis, 20 367 breast cancer related deaths occurred during a total of 688 564.81 person years. The crude breast cancer mortality rate was 295.79 per 10 000 person years (95% confidence interval 291.75 to 299.88). Predictors varied for each regression model, but both Cox and competing risks models included age at diagnosis, body mass index, smoking status, route to diagnosis, hormone receptor status, cancer stage, and grade of breast cancer. The Cox model's random effects meta-analysis pooled estimate for Harrell's C index was the highest of any model at 0.858 (95% confidence interval 0.853 to 0.864, and 95% prediction interval 0.843 to 0.873). It appeared acceptably calibrated on calibration plots. The competing risks regression model had good discrimination: pooled Harrell's C index 0.849 (0.839 to 0.859, and 0.821 to 0.876, and evidence of systematic miscalibration on summary metrics was lacking. The machine learning models had acceptable discrimination overall (Harrell's C index: XGBoost 0.821 (0.813 to 0.828, and 0.805 to 0.837); neural network 0.847 (0.835 to 0.858, and 0.816 to 0.878)), but had more complex patterns of miscalibration and more variable regional and stage specific performance. Decision curve analysis suggested that the Cox and competing risks regression models tested may have higher clinical utility than the two machine learning approaches. CONCLUSION: In women with breast cancer of any stage, using the predictors available in this dataset, regression based methods had better and more consistent performance compared with machine learning approaches and may be worthy of further evaluation for potential clinical use, such as for stratified follow-up.
Ethnic disparities in COVID-19 outcomes: a multinational cohort study of 20 million individuals from England and CanadaBACKGROUND: Heterogeneous studies have demonstrated ethnic inequalities in the risk of SARS-CoV-2 infection and adverse COVID-19 outcomes. This study evaluates the association between ethnicity and COVID-19 outcomes in two large population-based cohorts from England and Canada and investigates potential explanatory factors for ethnic patterning of severe outcomes. METHODS: We identified adults aged 18 to 99 years in the QResearch primary care (England) and Ontario (Canada) healthcare administrative population-based datasets (start of follow-up: 24th and 25th Jan 2020 in England and Canada, respectively; end of follow-up: 31st Oct and 30th Sept 2020, respectively). We harmonised the definitions and the design of two cohorts to investigate associations between ethnicity and COVID-19-related death, hospitalisation, and intensive care (ICU) admission, adjusted for confounders, and combined the estimates obtained from survival analyses. We calculated the 'percentage of excess risk mediated' by these risk factors in the QResearch cohort. RESULTS: There were 9.83 million adults in the QResearch cohort (11,597 deaths; 21,917 hospitalisations; 2932 ICU admissions) and 10.27 million adults in the Ontario cohort (951 deaths; 5132 hospitalisations; 1191 ICU admissions). Compared to the general population, pooled random-effects estimates showed that South Asian ethnicity was associated with an increased risk of COVID-19 death (hazard ratio: 1.63, 95% CI: 1.09-2.44), hospitalisation (1.53; 1.32-1.76), and ICU admission (1.67; 1.23-2.28). Associations with ethnic groups were consistent across levels of deprivation. In QResearch, sociodemographic, lifestyle, and clinical factors accounted for 42.9% (South Asian) and 39.4% (Black) of the excess risk of COVID-19 death. CONCLUSION: International population-level analyses demonstrate clear ethnic inequalities in COVID-19 risks. Policymakers should be cognisant of the increased risks in some ethnic populations and design equitable health policy as the pandemic continues.