
Nuffield Orthopaedic Centre
Hospital / health systemOxford, United Kingdom
Research output, citation impact, and the most-cited recent papers from Nuffield Orthopaedic Centre (United Kingdom). Aggregated across the NobleBlocks index of 300M+ scholarly works.
Top-cited papers from Nuffield Orthopaedic Centre
Assessment of risk of bias is regarded as an essential component of a systematic review on the effects of an intervention. The most commonly used tool for randomised trials is the Cochrane risk-of-bias tool. We updated the tool to respond to developments in understanding how bias arises in randomised trials, and to address user feedback on and limitations of the original tool.
Non-randomised studies of the effects of interventions are critical to many areas of healthcare evaluation, but their results may be biased. It is therefore important to understand and appraise their strengths and weaknesses. We developed ROBINS-I (“Risk Of Bias In Non-randomised Studies - of Interventions”), a new tool for evaluating risk of bias in estimates of the comparative effectiveness (harm or benefit) of interventions from studies that did not use randomisation to allocate units (individuals or clusters of individuals) to comparison groups. The tool will be particularly useful to those undertaking systematic reviews that include non-randomised studies.
Complex interventions are widely used in the health service, in public health practice, and in areas of social policy that have important health consequences, such as education, transport, and housing. They present various problems for evaluators, in addition to the practical and methodological difficulties that any successful evaluation must overcome. In 2000, the Medical Research Council (MRC) published a framework 1 to help researchers and research funders to recognise and adopt appropriate methods. The framework has been highly influential, and the accompanying BMJ paper is widely cited. 2 However, much valuable experience has since accumulated of both conventional and more innovative methods. This has now been incorporated in comprehensively revised and updated guidance recently released by the MRC (www.mrc.ac.uk/ complexinterventionsguidance). In this article we summarise the issues that prompted the revision and the key messages of the new guidance.
Without a complete published description of interventions, clinicians and patients cannot reliably implement interventions that are shown to be useful, and other researchers cannot replicate or build on research findings. The quality of description of interventions in publications, however, is remarkably poor. To improve the completeness of reporting, and ultimately the replicability, of interventions, an international group of experts and stakeholders developed the Template for Intervention Description and Replication (TIDieR) checklist and guide. The process involved a literature review for relevant checklists and research, a Delphi survey of an international panel of experts to guide item selection, and a face to face panel meeting. The resultant 12 item TIDieR checklist (brief name, why, what (materials), what (procedure), who provided, how, where, when and how much, tailoring, modifications, how well (planned), how well (actual)) is an extension of the CONSORT 2010 statement (item 5) and the SPIRIT 2013 statement (item 11). While the emphasis of the checklist is on trials, the guidance is intended to apply across all evaluative study designs. This paper presents the TIDieR checklist and guide, with an explanation and elaboration for each item, and examples of good reporting. The TIDieR checklist and guide should improve the reporting of interventions and make it easier for authors to structure accounts of their interventions, reviewers and editors to assess the descriptions, and readers to use the information.
STUDY DESIGN: The Oswestry Disability Index (ODI) has become one of the principal condition-specific outcome measures used in the management of spinal disorders. This review is based on publications using the ODI identified from the authors' personal databases, the Science Citation Index, and hand searches of Spine and current textbooks of spinal disorders. OBJECTIVES: To review the versions of this instrument, document methods by which it has been validated, collate data from scores found in normal and back pain populations, provide curves for power calculations in studies using the ODI, and maintain the ODI as a gold standard outcome measure. SUMMARY OF BACKGROUND DATA: It has now been 20 years since its original publication. More than 200 citations exist in the Science Citation Index. The authors have a large correspondence file relating to the ODI, that is cited in most of the large textbooks related to spinal disorders. METHODS: All the published versions of the questionnaire were identified. A systematic review of this literature was made. The various reports of validation were collated and related to a version. RESULTS: Four versions of the ODI are available in English and nine in other languages. Some published versions contain misprints, and many omit the scoring system. At least 114 studies contain usable data. These data provide both validation and standards for other users and indicate the power of the instrument for detecting change in sample populations. CONCLUSIONS: The ODI remains a valid and vigorous measure and has been a worthwhile outcome measure. The process of using the ODI is reviewed and should be the subject of further research. The receiver operating characteristics should be explored in a population with higher self-report disabilities. The behavior of the instrument is incompletely understood, particularly in sensitivity to real change.
Prediction models are developed to aid health care providers in estimating the probability or risk that a specific disease or condition is present (diagnostic models) or that a specific event will occur in the future (prognostic models), to inform their decision making. However, the overwhelming evidence shows that the quality of reporting of prediction model studies is poor. Only with full and clear reporting of information on all aspects of a prediction model can risk of bias and potential usefulness of prediction models be adequately assessed. The Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) Initiative developed a set of recommendations for the reporting of studies developing, validating, or updating a prediction model, whether for diagnostic or prognostic purposes. This article describes how the TRIPOD Statement was developed. An extensive list of items based on a review of the literature was created, which was reduced after a Web based survey and revised during a three day meeting in June 2011 with methodologists, health care professionals, and journal editors. The list was refined during several meetings of the steering group and in e-mail discussions with the wider group of TRIPOD contributors. The resulting TRIPOD Statement is a checklist of 22 items, deemed essential for transparent reporting of a prediction model study. The TRIPOD Statement aims to improve the transparency of the reporting of a prediction model study regardless of the study methods used. The TRIPOD Statement is best used in conjunction with the TRIPOD explanation and elaboration document. To aid the editorial process and readers of prediction model studies, it is recommended that authors include a completed checklist in their submission (also available at www.tripod-statement.org). To encourage dissemination of the TRIPOD Statement, this article is freely accessible on the <i>Annals of Internal Medicine</i> Web site (www.annals.org) and will be also published in <i>BJOG</i>, <i>British Journal of Cancer</i>, <i>British Journal of Surgery</i>, <i>BMC Medicine</i>, <i>The BMJ</i>, <i>Circulation</i>, <i>Diabetic Medicine</i>, <i>European Journal of Clinical Investigation</i>, <i>European Urology</i>, and <i>Journal of Clinical Epidemiology</i>. The authors jointly hold the copyright of this article. An accompanying explanation and elaboration article is freely available only on www.annals.org; <i>Annals of Internal Medicine</i> holds copyright for that article.
BACKGROUND: Approximately 30% of people over 65 years of age living in the community fall each year. This is an update of a Cochrane review first published in 2009. OBJECTIVES: To assess the effects of interventions designed to reduce the incidence of falls in older people living in the community. SEARCH METHODS: We searched the Cochrane Bone, Joint and Muscle Trauma Group Specialised Register (February 2012), CENTRAL (The Cochrane Library 2012, Issue 3), MEDLINE (1946 to March 2012), EMBASE (1947 to March 2012), CINAHL (1982 to February 2012), and online trial registers. SELECTION CRITERIA: Randomised trials of interventions to reduce falls in community-dwelling older people. DATA COLLECTION AND ANALYSIS: Two review authors independently assessed risk of bias and extracted data. We used a rate ratio (RaR) and 95% confidence interval (CI) to compare the rate of falls (e.g. falls per person year) between intervention and control groups. For risk of falling, we used a risk ratio (RR) and 95% CI based on the number of people falling (fallers) in each group. We pooled data where appropriate. MAIN RESULTS: We included 159 trials with 79,193 participants. Most trials compared a fall prevention intervention with no intervention or an intervention not expected to reduce falls. The most common interventions tested were exercise as a single intervention (59 trials) and multifactorial programmes (40 trials). Sixty-two per cent (99/159) of trials were at low risk of bias for sequence generation, 60% for attrition bias for falls (66/110), 73% for attrition bias for fallers (96/131), and only 38% (60/159) for allocation concealment.Multiple-component group exercise significantly reduced rate of falls (RaR 0.71, 95% CI 0.63 to 0.82; 16 trials; 3622 participants) and risk of falling (RR 0.85, 95% CI 0.76 to 0.96; 22 trials; 5333 participants), as did multiple-component home-based exercise (RaR 0.68, 95% CI 0.58 to 0.80; seven trials; 951 participants and RR 0.78, 95% CI 0.64 to 0.94; six trials; 714 participants). For Tai Chi, the reduction in rate of falls bordered on statistical significance (RaR 0.72, 95% CI 0.52 to 1.00; five trials; 1563 participants) but Tai Chi did significantly reduce risk of falling (RR 0.71, 95% CI 0.57 to 0.87; six trials; 1625 participants).Multifactorial interventions, which include individual risk assessment, reduced rate of falls (RaR 0.76, 95% CI 0.67 to 0.86; 19 trials; 9503 participants), but not risk of falling (RR 0.93, 95% CI 0.86 to 1.02; 34 trials; 13,617 participants).Overall, vitamin D did not reduce rate of falls (RaR 1.00, 95% CI 0.90 to 1.11; seven trials; 9324 participants) or risk of falling (RR 0.96, 95% CI 0.89 to 1.03; 13 trials; 26,747 participants), but may do so in people with lower vitamin D levels before treatment.Home safety assessment and modification interventions were effective in reducing rate of falls (RR 0.81, 95% CI 0.68 to 0.97; six trials; 4208 participants) and risk of falling (RR 0.88, 95% CI 0.80 to 0.96; seven trials; 4051 participants). These interventions were more effective in people at higher risk of falling, including those with severe visual impairment. Home safety interventions appear to be more effective when delivered by an occupational therapist.An intervention to treat vision problems (616 participants) resulted in a significant increase in the rate of falls (RaR 1.57, 95% CI 1.19 to 2.06) and risk of falling (RR 1.54, 95% CI 1.24 to 1.91). When regular wearers of multifocal glasses (597 participants) were given single lens glasses, all falls and outside falls were significantly reduced in the subgroup that regularly took part in outside activities. Conversely, there was a significant increase in outside falls in intervention group participants who took part in little outside activity.Pacemakers reduced rate of falls in people with carotid sinus hypersensitivity (RaR 0.73, 95% CI 0.57 to 0.93; three trials; 349 participants) but not risk of falling. First eye cataract surgery in women reduced rate of falls (RaR 0.66, 95% CI 0.45 to 0.95; one trial; 306 participants), but second eye cataract surgery did not.Gradual withdrawal of psychotropic medication reduced rate of falls (RaR 0.34, 95% CI 0.16 to 0.73; one trial; 93 participants), but not risk of falling. A prescribing modification programme for primary care physicians significantly reduced risk of falling (RR 0.61, 95% CI 0.41 to 0.91; one trial; 659 participants).An anti-slip shoe device reduced rate of falls in icy conditions (RaR 0.42, 95% CI 0.22 to 0.78; one trial; 109 participants). One trial (305 participants) comparing multifaceted podiatry including foot and ankle exercises with standard podiatry in people with disabling foot pain significantly reduced the rate of falls (RaR 0.64, 95% CI 0.45 to 0.91) but not the risk of falling.There is no evidence of effect for cognitive behavioural interventions on rate of falls (RaR 1.00, 95% CI 0.37 to 2.72; one trial; 120 participants) or risk of falling (RR 1.11, 95% CI 0.80 to 1.54; two trials; 350 participants).Trials testing interventions to increase knowledge/educate about fall prevention alone did not significantly reduce the rate of falls (RaR 0.33, 95% CI 0.09 to 1.20; one trial; 45 participants) or risk of falling (RR 0.88, 95% CI 0.75 to 1.03; four trials; 2555 participants).No conclusions can be drawn from the 47 trials reporting fall-related fractures.Thirteen trials provided a comprehensive economic evaluation. Three of these indicated cost savings for their interventions during the trial period: home-based exercise in over 80-year-olds, home safety assessment and modification in those with a previous fall, and one multifactorial programme targeting eight specific risk factors. AUTHORS' CONCLUSIONS: Group and home-based exercise programmes, and home safety interventions reduce rate of falls and risk of falling.Multifactorial assessment and intervention programmes reduce rate of falls but not risk of falling; Tai Chi reduces risk of falling.Overall, vitamin D supplementation does not appear to reduce falls but may be effective in people who have lower vitamin D levels before treatment.
BACKGROUND: The study of microbial communities has been revolutionised in recent years by the widespread adoption of culture independent analytical techniques such as 16S rRNA gene sequencing and metagenomics. One potential confounder of these sequence-based approaches is the presence of contamination in DNA extraction kits and other laboratory reagents. RESULTS: In this study we demonstrate that contaminating DNA is ubiquitous in commonly used DNA extraction kits and other laboratory reagents, varies greatly in composition between different kits and kit batches, and that this contamination critically impacts results obtained from samples containing a low microbial biomass. Contamination impacts both PCR-based 16S rRNA gene surveys and shotgun metagenomics. We provide an extensive list of potential contaminating genera, and guidelines on how to mitigate the effects of contamination. CONCLUSIONS: These results suggest that caution should be advised when applying sequence-based techniques to the study of microbiota present in low biomass environments. Concurrent sequencing of negative control samples is strongly advised.
The Consolidated Standards of Reporting Trials (CON-SORT) statement (www.consort-statement.org) is a guideline designed to improve the transparency and quality of the reporting of randomised trials.
Prediction models are developed to aid health-care providers in estimating the probability or risk that a specific disease or condition is present (diagnostic models) or that a specific event will occur in the future (prognostic models), to inform their decision making. However, the overwhelming evidence shows that the quality of reporting of prediction model studies is poor. Only with full and clear reporting of information on all aspects of a prediction model can risk of bias and potential usefulness of prediction models be adequately assessed. The Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) Initiative developed a set of recommendations for the reporting of studies developing, validating, or updating a prediction model, whether for diagnostic or prognostic purposes. This article describes how the TRIPOD Statement was developed. An extensive list of items based on a review of the literature was created, which was reduced after a Web-based survey and revised during a 3-day meeting in June 2011 with methodologists, health-care professionals, and journal editors. The list was refined during several meetings of the steering group and in e-mail discussions with the wider group of TRIPOD contributors. The resulting TRIPOD Statement is a checklist of 22 items, deemed essential for transparent reporting of a prediction model study. The TRIPOD Statement aims to improve the transparency of the reporting of a prediction model study regardless of the study methods used. The TRIPOD Statement is best used in conjunction with the TRIPOD explanation and elaboration document. To aid the editorial process and readers of prediction model studies, it is recommended that authors include a completed checklist in their submission (also available at www.tripod-statement.org).
Incomplete reporting has been identified as a major source of avoidable waste in biomedical research. Essential information is often not provided in study reports, impeding the identification, critical appraisal, and replication of studies. To improve the quality of reporting of diagnostic accuracy studies, the Standards for Reporting Diagnostic Accuracy (STARD) statement was developed. Here we present STARD 2015, an updated list of 30 essential items that should be included in every report of a diagnostic accuracy study. This update incorporates recent evidence about sources of bias and variability in diagnostic accuracy and is intended to facilitate the use of STARD. As such, STARD 2015 may help to improve completeness and transparency in reporting of diagnostic accuracy studies.
OBJECTIVE: To review and appraise the validity and usefulness of published and preprint reports of prediction models for diagnosing coronavirus disease 2019 (covid-19) in patients with suspected infection, for prognosis of patients with covid-19, and for detecting people in the general population at increased risk of covid-19 infection or being admitted to hospital with the disease. DESIGN: Living systematic review and critical appraisal by the COVID-PRECISE (Precise Risk Estimation to optimise covid-19 Care for Infected or Suspected patients in diverse sEttings) group. DATA SOURCES: PubMed and Embase through Ovid, up to 1 July 2020, supplemented with arXiv, medRxiv, and bioRxiv up to 5 May 2020. STUDY SELECTION: Studies that developed or validated a multivariable covid-19 related prediction model. DATA EXTRACTION: At least two authors independently extracted data using the CHARMS (critical appraisal and data extraction for systematic reviews of prediction modelling studies) checklist; risk of bias was assessed using PROBAST (prediction model risk of bias assessment tool). RESULTS: 37 421 titles were screened, and 169 studies describing 232 prediction models were included. The review identified seven models for identifying people at risk in the general population; 118 diagnostic models for detecting covid-19 (75 were based on medical imaging, 10 to diagnose disease severity); and 107 prognostic models for predicting mortality risk, progression to severe disease, intensive care unit admission, ventilation, intubation, or length of hospital stay. The most frequent types of predictors included in the covid-19 prediction models are vital signs, age, comorbidities, and image features. Flu-like symptoms are frequently predictive in diagnostic models, while sex, C reactive protein, and lymphocyte counts are frequent prognostic factors. Reported C index estimates from the strongest form of validation available per model ranged from 0.71 to 0.99 in prediction models for the general population, from 0.65 to more than 0.99 in diagnostic models, and from 0.54 to 0.99 in prognostic models. All models were rated at high or unclear risk of bias, mostly because of non-representative selection of control patients, exclusion of patients who had not experienced the event of interest by the end of the study, high risk of model overfitting, and unclear reporting. Many models did not include a description of the target population (n=27, 12%) or care setting (n=75, 32%), and only 11 (5%) were externally validated by a calibration plot. The Jehi diagnostic model and the 4C mortality score were identified as promising models. CONCLUSION: Prediction models for covid-19 are quickly entering the academic literature to support medical decision making at a time when they are urgently needed. This review indicates that almost all pubished prediction models are poorly reported, and at high risk of bias such that their reported predictive performance is probably optimistic. However, we have identified two (one diagnostic and one prognostic) promising models that should soon be validated in multiple cohorts, preferably through collaborative efforts and data sharing to also allow an investigation of the stability and heterogeneity in their performance across populations and settings. Details on all reviewed models are publicly available at https://www.covprecise.org/. Methodological guidance as provided in this paper should be followed because unreliable predictions could cause more harm than benefit in guiding clinical decisions. Finally, prediction model authors should adhere to the TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) reporting guideline. SYSTEMATIC REVIEW REGISTRATION: Protocol https://osf.io/ehc47/, registration https://osf.io/wy245. READERS' NOTE: This article is a living systematic review that will be updated to reflect emerging evidence. Updates may occur for up to two years from the date of original publication. This version is update 3 of the original article published on 7 April 2020 (BMJ 2020;369:m1328). Previous updates can be found as data supplements (https://www.bmj.com/content/369/bmj.m1328/related#datasupp). When citing this paper please consider adding the update number and date of access for clarity.
Misinterpretation and abuse of statistical tests, confidence intervals, and statistical power have been decried for decades, yet remain rampant. A key problem is that there are no interpretations of these concepts that are at once simple, intuitive, correct, and foolproof. Instead, correct use and interpretation of these statistics requires an attention to detail which seems to tax the patience of working scientists. This high cognitive demand has led to an epidemic of shortcut definitions and interpretations that are simply wrong, sometimes disastrously so-and yet these misinterpretations dominate much of the scientific literature. In light of this problem, we provide definitions and a discussion of basic statistics that are more general and critical than typically found in traditional introductory expositions. Our goal is to provide a resource for instructors, researchers, and consumers of statistics whose knowledge of statistical theory and technique may be limited but who wish to avoid and spot misinterpretations. We emphasize how violation of often unstated analysis protocols (such as selecting analyses for presentation based on the P values they produce) can lead to small P values even if the declared test hypothesis is correct, and can lead to large P values even if that hypothesis is incorrect. We then provide an explanatory list of 25 misinterpretations of P values, confidence intervals, and power. We conclude with guidelines for improving statistical interpretation and reporting.
The Consolidated Standards of Reporting Trials (CONSORT) statement is a guideline designed to improve the transparency and quality of the reporting of randomised controlled trials (RCTs). In this article we present an extension to that statement for randomised pilot and feasibility trials conducted in advance of a future definitive RCT. The checklist applies to any randomised study in which a future definitive RCT, or part of it, is conducted on a smaller scale, regardless of its design (eg, cluster, factorial, crossover) or the terms used by authors to describe the study (eg, pilot, feasibility, trial, study). The extension does not directly apply to internal pilot studies built into the design of a main trial, non-randomised pilot and feasibility studies, or phase II studies, but these studies all have some similarities to randomised pilot and feasibility studies and so many of the principles might also apply. The development of the extension was motivated by the growing number of studies described as feasibility or pilot studies and by research that has identified weaknesses in their reporting and conduct. We followed recommended good practice to develop the extension, including carrying out a Delphi survey, holding a consensus meeting and research team meetings, and piloting the checklist. The aims and objectives of pilot and feasibility randomised studies differ from those of other randomised trials. Consequently, although much of the information to be reported in these trials is similar to those in randomised controlled trials (RCTs) assessing effectiveness and efficacy, there are some key differences in the type of information and in the appropriate interpretation of standard CONSORT reporting items. We have retained some of the original CONSORT statement items, but most have been adapted, some removed, and new items added. The new items cover how participants were identified and consent obtained; if applicable, the prespecified criteria used to judge whether or how to proceed with a future definitive RCT; if relevant, other important unintended consequences; implications for progression from pilot to future definitive RCT, including any proposed amendments; and ethical approval or approval by a research review committee confirmed with a reference number. This article includes the 26 item checklist, a separate checklist for the abstract, a template for a CONSORT flowchart for these studies, and an explanation of the changes made and supporting examples. We believe that routine use of this proposed extension to the CONSORT statement will result in improvements in the reporting of pilot trials. Editor’s note: In order to encourage its wide dissemination this article is freely accessible on the BMJ and Pilot and Feasibility Studies journal websites.
Prediction models are developed to aid health care providers in estimating the probability or risk that a specific disease or condition is present (diagnostic models) or that a specific event will occur in the future (prognostic models), to inform their decision making. However, the overwhelming evidence shows that the quality of reporting of prediction model studies is poor. Only with full and clear reporting of information on all aspects of a prediction model can risk of bias and potential usefulness of prediction models be adequately assessed. The Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) Initiative developed a set of recommendations for the reporting of studies developing, validating, or updating a prediction model, whether for diagnostic or prognostic purposes. This article describes how the TRIPOD Statement was developed. An extensive list of items based on a review of the literature was created, which was reduced after a Web-based survey and revised during a 3-day meeting in June 2011 with methodologists, health care professionals, and journal editors. The list was refined during several meetings of the steering group and in e-mail discussions with the wider group of TRIPOD contributors. The resulting TRIPOD Statement is a checklist of 22 items, deemed essential for transparent reporting of a prediction model study. The TRIPOD Statement aims to improve the transparency of the reporting of a prediction model study regardless of the study methods used. The TRIPOD Statement is best used in conjunction with the TRIPOD explanation and elaboration document. To aid the editorial process and readers of prediction model studies, it is recommended that authors include a completed checklist in their submission (also available at www.tripod-statement.org).
BACKGROUND: Literature searches underlie the foundations of systematic reviews and related review types. Yet, the literature searching component of systematic reviews and related review types is often poorly reported. Guidance for literature search reporting has been diverse, and, in many cases, does not offer enough detail to authors who need more specific information about reporting search methods and information sources in a clear, reproducible way. This document presents the PRISMA-S (Preferred Reporting Items for Systematic reviews and Meta-Analyses literature search extension) checklist, and explanation and elaboration. METHODS: The checklist was developed using a 3-stage Delphi survey process, followed by a consensus conference and public review process. RESULTS: The final checklist includes 16 reporting items, each of which is detailed with exemplar reporting and rationale. CONCLUSIONS: The intent of PRISMA-S is to complement the PRISMA Statement and its extensions by providing a checklist that could be used by interdisciplinary authors, editors, and peer reviewers to verify that each component of a search is completely reported and therefore reproducible.
Diagnostic accuracy studies are, like other clinical studies, at risk of bias due to shortcomings in design and conduct, and the results of a diagnostic accuracy study may not apply to other patient groups and settings. Readers of study reports need to be informed about study design and conduct, in sufficient detail to judge the trustworthiness and applicability of the study findings. The STARD statement (Standards for Reporting of Diagnostic Accuracy Studies) was developed to improve the completeness and transparency of reports of diagnostic accuracy studies. STARD contains a list of essential items that can be used as a checklist, by authors, reviewers and other readers, to ensure that a report of a diagnostic accuracy study contains the necessary information. STARD was recently updated. All updated STARD materials, including the checklist, are available at http://www.equator-network.org/reporting-guidelines/stard Here, we present the STARD 2015 explanation and elaboration document. Through commented examples of appropriate reporting, we clarify the rationale for each of the 30 items on the STARD 2015 checklist, and describe what is expected from authors in developing sufficiently informative study reports.
Clinical prediction models aim to predict outcomes in individuals, to inform diagnosis or prognosis in healthcare. Hundreds of prediction models are published in the medical literature each year, yet many are developed using a dataset that is too small for the total number of participants or outcome events. This leads to inaccurate predictions and consequently incorrect healthcare decisions for some individuals. In this article, the authors provide guidance on how to calculate the sample size required to develop a clinical prediction model.
We have developed a 12-item questionnaire for patients having a total knee replacement (TKR). We made a prospective study of 117 patients before operation and at follow-up six months later, asking them to complete the new questionnaire and the form SF36. Some also filled in the Stanford Health Assessment Questionnaire (HAQ). An orthopaedic surgeon completed the American Knee Society (AKS) clinical score. The single score derived from the new questionnaire had high internal consistency, and its reproducibility, examined by test-retest reliability, was found to be satisfactory. Its validity was established by obtaining significant correlations in the expected direction with the AKS scores and the relevant parts of the SF36 and HAQ. Sensitivity to change was assessed by analysing the differences between the preoperative scores and those at follow-up. We also compared change in scores with the patients' retrospective judgement of change in their condition. The effect size for the new questionnaire compared favourably with those for the relevant parts of the SF36. The change scores for the new knee questionnaire were significantly greater (p < 0.0001) for patients who reported the most improvement in their condition. The new questionnaire provides a measure of outcome for TKR that is short, practical, reliable, valid and sensitive to clinically important changes over time.
A recent literature has constructed top income shares time series over the long run for more than twenty countries using income tax statistics. Top incomes represent a small share of the population but a very significant share of total income and total taxes paid. Hence, aggregate economic growth per capita and Gini inequality indexes are sensitive to excluding or including top incomes. We discuss the estimation methods and issues that arise when constructing top income share series, including income definition and comparability over time and across countries, tax avoidance, and tax evasion. We provide a summary of the key empirical findings. Most countries experience a dramatic drop in top income shares in the first part of the twentieth century in general due to shocks to top capital incomes during the wars and depression shocks. Top income shares do not recover in the immediate postwar decades. However, over the last thirty years, top income shares have increased substantially in English speaking countries and in India and China but not in continental European countries or Japan. This increase is due in part to an unprecedented surge in top wage incomes. As a result, wage income comprises a larger fraction of top incomes than in the past. Finally, we discuss the theoretical and empirical models that have been proposed to account for the facts and the main questions that remain open. (JEL D31, D63, H26, N30)