The reproducibility of psychiatric evaluations of work disability: two reliability and agreement studies

Bibliographic Details
Published in	BMC Psychiatry Vol. 19; no. 1; pp. 205 - 15
Main Authors	Kunz, Regina, von Allmen, David Y, Marelli, Renato, Hoffmann-Richter, Ulrike, Jeger, Joerg, Mager, Ralph, Colomb, Etienne, Schaad, Heinz J, Bachmann, Monica, Vogel, Nicole, Busse, Jason W, Eichhorn, Martin, Bänziger, Oskar, Zumbrunn, Thomas, de Boer, Wout E L, Fischer, Katrin
Format	Journal Article
Language	English
Published	England: BioMed Central Ltd, 03.07.2019
Subjects	Agreements; Analysis; Clinical trials; Disability; Disability evaluation; Employment; Evidence-based medicine; Experts; Female; Humans; Interviews; Male; Medical research; Mental disorders; Mental Disorders - diagnosis; Mentally ill persons; Middle Aged; Observer variation; Patients; Psychiatrists; Psychiatry; Psychiatry - methods; Ratings & rankings; Reproducibility; Reproducibility of results; Retirement benefits; Return to work; Social security; Switzerland; Work capacity; Work capacity evaluation

More Information
Summary:	Expert psychiatrists conducting work disability evaluations often disagree on work capacity (WC) when assessing the same patient. More structured and standardised evaluations focusing on function could improve agreement. The RELY studies aimed to establish the inter-rater reproducibility (reliability and agreement) of 'functional evaluations' in patients with mental disorders applying for disability benefits and to compare the effect of limited versus intensive expert training on reproducibility. We performed two multi-centre reproducibility studies on standardised functional WC evaluation (RELY 1 and 2). Trained psychiatrists interviewed 30 and 40 patients respectively and determined WC using the Instrument for Functional Assessment in Psychiatry (IFAP). Three psychiatrists per patient estimated WC from videotaped evaluations. We analysed reliability (intraclass correlation coefficients [ICC]) and agreement ('standard error of measurement' [SEM] and proportions of comparisons within prespecified limits) between expert evaluations of WC. Our primary outcome was WC in alternative work, on a scale from 100% to 0%. Secondary outcomes were WC in the last job (100% to 0%), patients' perceived fairness of the evaluation (10 to 0, higher is better), and usefulness to psychiatrists. Inter-rater reliability for WC in alternative work was fair in RELY 1 (ICC 0.43; 95% CI 0.22-0.60) and RELY 2 (ICC 0.44; 0.25-0.59). Agreement was low in both studies: the SEM for WC in alternative work was 24.6 percentage points (20.9-28.4) and 19.4 percentage points (16.9-22.0) respectively. Using a 'maximum acceptable difference' of 25 percentage points WC between two experts, 61.6% of comparisons in RELY 1 and 73.6% of comparisons in RELY 2 fell within these limits. A post-hoc secondary analysis of RELY 2 versus RELY 1 showed a significant change in SEM (-5.2 percentage points WC [95% CI -9.7 to -0.6]) and in the proportion of differences ≤ 25 percentage points WC between two experts (p = 0.008). Patients perceived the functional evaluation as fair (RELY 1: mean 8.0; RELY 2: 9.4), and psychiatrists perceived it as useful. Evidence from non-randomised studies suggests that intensive training in functional evaluation may increase agreement on WC between experts, but fell short of stakeholders' expectations. It did not alter reliability. Isolated efforts in training psychiatrists may not suffice to reach the expected level of agreement. A societal discussion about achievable goals and readiness to consider procedural changes in WC evaluations may deserve consideration.
ISSN:	1471-244X
DOI:	10.1186/s12888-019-2171-y