- Research article
- Open Access
- Open Peer Review
Rating general practitioner consultation performance in cancer care: does the specialty of assessors matter? A simulated patient study
BMC Family Practice volume 15, Article number: 152 (2014)
Patients treated for prostate cancer may present to general practitioners (GPs) for treatment follow up, but may be reticent to have their consultations recorded. Therefore the use of simulated patients allows practitioner consultations to be rated. The aim of this study was to determine whether the speciality of the assessor has an impact on how GP consultation performance is rated.
Six pairs of scenarios were developed for professional actors in two series of consultations by GPs. The scenarios included: chronic radiation proctitis, Prostate Specific Antigen (PSA) ‘bounce’, recurrence of cancer, urethral stricture, erectile dysfunction and depression or anxiety. Participating GPs were furnished with the patient’s past medical history, current medication, prostate cancer details and treatment, details of physical examinations. Consultations were video recorded and assessed for quality by two sets of assessors- a team of two GPs and two Radiation Oncologists deploying the Leicester Assessment Package (LAP). LAP scores by the GPs and Radiation Oncologists were compared.
Eight GPs participated. In Series 1 the range of LAP scores by GP assessors was 61%-80%, and 67%-86% for Radiation Oncologist assessors. The range for GP LAP scores in Series 2 was 51%- 82%, and 56%-89% for Radiation Oncologist assessors. Within GP assessor correlations for LAP scores were 0.31 and 0.87 in Series 1 and 2 respectively. Within Radiation Oncologist assessor correlations were 0.50 and 0.72 in Series 1 and 2 respectively. Radiation Oncologist and GP assessor scores were significantly different for 4 doctors and for some scenarios. Anticipatory care was the only domain where GPs scored participants higher than Radiation Oncologist assessors.
The assessment of GP consultation performance is not consistent across assessors from different disciplines even when they deploy the same assessment tool.
It is important that the general practitioners (GPs) who are consulted by patients treated for prostate cancer are able to consult skilfully. Patients with prostate cancer often experience stigma by virtue of the psychosocial impact of the disease . Many such patients have unmet needs which may not be explicitly presented and yet expect to be supported by their GP [2–4]. Relative to some conditions (e.g. diabetes) patients who have completed treatment for cancer present infrequently in general practice . Consultations with ‘real’ patients are challenging to record and analyse. It is difficult to reliably report GP performance at individual consultations for appropriate diagnosis and management of cancer related issues given the myriad of possible confounding factors, including the need to recruit ill or distressed patients . Also comparison is only possible when the patient presents consistently to all GP participants. Therefore, the deployment of actor-patient simulations (henceforth called simulated patients) in which confounding variables can be controlled is a possible solution to explore GP cancer care . The use of ‘live’ simulations in which it is possible to observe the interaction between ‘patient’ and doctor adds to the validity of the data. Research deploying simulated patients have been reported in the literature [8–10]. A number of limitations exist with the use of simulations, including the validity of the assessment scores . This paper will focuses on the assessment of GP performance in such consultations. In previous research assessors from different specialities have used different assessment measures and have noted significant differences in GP assessment and management of patients. This is difficult to interpret . A particular criticism of this approach is that doctor consultation performance is being measured on different benchmarks, by assessors with different backgrounds. Comparisons may be neither reliable nor valid. This is especially important when reviewing the care of people who have been treated for cancer. In the present study GP consultation performance is rated on the same measure, but by assessors from different specialties. The aim of this study was to determine whether the speciality of the assessor affects how GP consultation performance is rated.
This study was approved by the Human Research Ethics Committee at Curtin University (RD-12-10).
Simulated consultations were video-recorded and assessed by two GP assessors and two Radiation Oncologist assessors.
Six pairs of scenarios were developed for two series of consultations by the research team using previous literature and with reference to two radiation oncologists, two GPs, a cancer nurse coordinator and a urologist. The scenarios included: chronic radiation proctitis, PSA bounce, recurrence of cancer, urethral stricture, erectile dysfunction and depression or anxiety.
Series 1 scenarios
Chronic radiation proctitis.
Prostate Specific Antigen (PSA) bounce after radiation therapy.
Recurrence of prostate cancer with bone metastases.
Urethral stricture after radiotherapy. Late urinary toxicity. Not infection.
Erectile dysfunction after radiation therapy.
Depression after prostate cancer treatment.
Series 2 scenarios
Late radiation bowel toxicity: Radiation proctitis OR Radiation proctopathy.
PSA elevation post radiotherapy.
Recurrence of prostate cancer with spinal bone metastases.
Urinary symptoms post-Brachytherapy. No infection.
Erectile Dysfunction (ED), post-treatment for prostate cancer.
Anxiety after prostate cancer treatment.
These scenarios were selected as those that might be presented to a GP for advice [12, 13]. They were performed by professional actors. Information available to the GPs at the time of the consultation included the patient’s past medical history, current medication, prostate cancer details and treatment, details of physical examinations.
Participants were recruited by convenience sampling through personal contact with the research team.
Series 1: Six GPs participated in Series 1 (males = 4, females = 2). Participants were offered brief feedback on the correct diagnosis and management of each case.
Series 2: Five GPs participated in Series 2 (males = 2, females = 3) including 3 GPs who participated in Series 1.
GPs were invited to consult with the simulated patients as though the person had previously visited the practice for ongoing medical problems. The GPs were aware that the ‘patient’ presenting to their clinic was an actor. A medical record with the relevant past medical history was prepared for each patient and was made available to the GP. Physical examination cards, describing examination findings were presented to the GP by the simulated patient when requested by the doctor. No ‘patient’ was actually examined. The practitioners were allowed up to 15 minutes and the consultation was video recorded. The scenarios were presented to the GP participants as a series of consecutive cases. Scenarios were presented in a random order to GPs in each series.
Quality of consultation
The Leicester Assessment Package (LAP) is an established measure of consultation competence for general practice consultations . Five of the seven LAP categories of consultation competence (interviewing and history taking, problem solving, patient management, anticipatory care and behaviour/relationship with patients) were assessed in this study.
Assessment by GPs and Radiation Oncologists
The recordings were independently rated by two GPs, who had previously assessed consultations using LAP scoring [9, 15]. The GP assessors then compared scores and agreed on a consensus score which represented the quality of the consultations. Two Radiation Oncologists were also trained by a GP familiar with LAP scoring. The Radiation Oncologists assessed the consultations independently and then compared their LAP scores to derive a consensus score. Similar scoring was completed for both series of consultations by all the assessors. Each participating GP was given a total score for each scenario, one from the GP LAP assessors and one from the Radiation Oncology LAP assessors.
Power calculations for correlated data using simulations, to avoid an underlying assumption of normality, indicated that a sample of 5 doctors with each doctor completing 6 scenarios would be sufficient to detect an intraclass correlation coefficient (ICC) of 0.75 in LAP scores between doctors at 90% power and 5% level of significance, assuming null ICC is 0.10. Sample size calculations were performed using the ‘sampicc’ command in Stata®.
The mean LAP scores from both the GP and Radiation Oncologist assessors for each GP participant were calculated. The differences in mean scores between individual doctors were then estimated in a standard unadjusted linear regression model. In order to determine whether the mean LAP scores varied by each of the simulated patient scenarios, Generalised Estimating Equations (GEEs) were used to fit a linear model using simulated patient scenario as the single independent variable (Table 1). The mean differences between GP and Radiation Oncologist assessors relating to GP performance in each clinical domain (e.g. “interviewing/history taking”, etc.) were evaluated by using the multilevel mixed effect models, with patients nested in GPs and GPs nested in assessors (Table 2). All analyses were performed using Stata (Intercooled 9.2, StataCorp, College Station, TX, USA).
Comparison between scores from GP and Radiation Oncology assessors using LAP scoring
LAP scores by the GPs and Radiation Oncologist assessors were collated and compared. The overall range for GP LAP scores in Series 1 was 61%-80% and the overall range for Radiation Oncologist LAP scores in Series 1 was 67%-86%. The overall range for GP LAP scores in Series 2 was 51%-82% and the overall range for Radiation Oncologist LAP scores in Series 2 was 56%-89%. These LAP scores are consistent with scores achieved in our previous work [9, 15].
LAP scores for each series were then compared using linear regression models to determine whether GP and Radiation Oncologist assessors similarly scored participating GPs (Table 1). The mean difference in scores (β coefficient) relative to doctor number 1 and associated p-value are also shown as estimated from each of the four standard unadjusted linear regression models. Radiation Oncologist and GP assessor scores were significantly different on 4 occasions. Radiation Oncologist assessors mean scores per consultation were higher than the GP assessors mean scores.
Within GP assessor correlations (Intraclass Correlation Coefficients) for LAP scores were 0.31 and 0.87 in Series 1 and 2 respectively. Within Radiation Oncologist assessor correlations were 0.50 and 0.72 in Series 1 and 2 respectively.
LAP scoring by domains
Because we detected a difference in the scoring between GP and Radiation Oncologist assessment we explored whether the scores varied depending on the different domains within the LAP. Table 2 provides a comparison of GP and Radiation Oncologist assessor scores by LAP domain. Radiation Oncologist assessors LAP scores were significantly higher for most domains in both series, apart from problem solving in series one and anticipatory care in both series where the scores were not significantly different.
In a previous study with simulated patients the type of assessment for consultation performance as well as the background of the assessors was reported to be relevant . In this study Radiation Oncologists assessors rated GP consultation better than GP assessors on almost every domain of the same tool. Relatively underperforming GPs were scored significantly better by specialists than by GPs.
The use of simulated patients has long been established to assess practitioner performance . A recent review concludes that
“..correlations with written examinations were modest, adding empirical support to the notion that standardized patients assess aspects of competence not addressed directly by the traditional written measures.” 
However, the issue of the expertise and background of the assessor of simulated patient consultations receives limited attention in reviews around assessment of practitioners . For example, in medical training courses practitioners are mentored and assessed by a variety of practitioners and yet may choose a career in general practice . This assessment could be improved by ensuring that the consultation skills are assessed by a person practicing in the target specialty . Anticipatory care was the only domain of LAP where GPs appeared to score higher than radiation oncologists, this may reflect the value of ensuring continuity of care in general practice .
The strength of this study was the use of simulated patients to research assessment of consultation skills in the management of defined issues presented by a specific group of interest. There are three key limitations; firstly the technique offers a proxy measure for clinical competence and the validity of this method may be diminished by any undetected differences in simulated patient performance. However, experts agree that simulated patient studies are valid in assessing consultation skills . Also the patients were not physically examined and the value of skilful examination in facilitating disclosure of symptoms may be significant. Secondly, the training of the assessors may have had a bearing on the scores assigned. It is also possible that the two specialist practitioners were atypically generous in their assessment. It has long been recognised that assessors in the same specialty also vary . Finally, the practitioners in this study all performed well with relatively high LAP scores compared to performance rated in other simulated patient studies [8, 9].
This is the second study using simulated patients in which we report that the management of patients with problems related to or associated with prostate cancer treatment is challenging . The assessment of GP consultation performance in this context is not consistent across assessors from different disciplines even when they deploy the same assessment tool. Future research might explore the question of whether LAP scores based on consultations where management is inappropriate also vary. In other words, do specialists and GPs differ in their assessment and recognition of errors in practice?
Prostate Specific Antigen
Leicester Assessment Package.
Fergus KD, Gray RE, Fitch MI: Sexual dysfunction and the preservation of manhood: experiences of men with prostate cancer. J Health Psychol. 2002, 7: 303-316. 10.1177/1359105302007003223.
Ream E, Quennell A, Fincham L, Faithfull S, Khoo V, Wilson-Barnett J, Richardson A: Supportive care needs of men living with prostate cancer in England: a survey. Br J Cancer. 2008, 98: 1903-1909. 10.1038/sj.bjc.6604406.
Armes J, Crowe M, Colbourne L, Morgan H, Murrells T, Oakley C, Palmer N, Ream E, Young A, Richardson A: Patients’ supportive care needs beyond the end of cancer treatment: a prospective, longitudinal survey. J Clin Oncol. 2009, 27: 6172-6179. 10.1200/JCO.2009.22.5151.
Bulsara C, Ward AM, Joske D: Patient perceptions of the GP role in cancer management. Aust Fam Physician. 2005, 34: 299-
Britt H, Harrison C, Miller G, Knox S: Prevalence and patterns of multimorbidity in Australia. Med J Aust. 2008, 189: 72-77.
Steinhauser KE, Clipp EC, Hays JC, Olsen M, Arnold R, Christakis NA, Lindquist JH, Tulsky JA: Identifying, recruiting, and retaining seriously-ill patients and their caregivers in longitudinal research. Palliat Med. 2006, 20: 745-754. 10.1177/0269216306073112.
Jiwa M, O’Shea C, McKinley R, Mitchell G, Sibbritt D, Burridge L, Smith M, Chan She Ping Delfos W, Halkett G: Developing and evaluating interventions for primary care – a focus on consultations in general practice. Med J Aust. 2009, 2: 5-
Jiwa M, McKinley R, O’Shea C, Arnet H, Spilsbury K, Smith M: Investigating the impact of extraneous distractions on consultations in general practice: lessons learned. BMC Med Res Methodol. 2009, 9: 1471-2288.
Jiwa M, O’Shea C, Merriman G, Halkett G, Spilsbury K: Psychosexual problems in general practice: measuring consultation competence using two different measures. Qual Prim Care. 2010, 18: 243-250.
Jiwa M, McKinley RK, Spilsbury K, Arnet H, Smith M: Deploying a clinical innovation in the context of actor-patient consultations in general practice: a prelude to a formal clinical trial. BMC Med Res Methodol. 2009, 9: 1471-2288.
McKinley RK, Fraser RC, Baker R: Model for directly assessing and improving clinical competence and performance in revalidation of clinicians. BMJ. 2001, 322: 712-715. 10.1136/bmj.322.7288.712.
Schultheiss TE, Lee WR, Hunt MA, Hanlon AL, Peter RS, Hanks GE: Late GI and GU complications in the treatment of prostate cancer. Int J Radiat Oncol Biol Phys. 1997, 37: 3-11. 10.1016/S0360-3016(96)00468-3.
Heins M, Korevaar J, Rijken P, Schellevis F: For which health problems do cancer survivors visit their general practitioner?. Eur J Cancer. 2013, 49: 211-218. 10.1016/j.ejca.2012.07.011.
Fraser R, McKinley R, Mulholland H: Consultation competence in general practice: establishing the face validity of prioritized criteria in the Leicester assessment package. Br J Gen Pract. 1994, 44: 109-113.
Halkett G, Jiwa M, O’Shea C, Smith M, Leong E, Jackson M, Meng X, Spry N: Management of cases that might benefit from radiotherapy: a standardised patient study in primary care. Eur J Cancer Care (Engl). 2012, 21: 259-265. 10.1111/j.1365-2354.2011.01314.x.
Kinnersley P, Pill R: Potential of using simulated patients to study the performance of general practitioners. Br J Gen Pract. 1993, 43: 297-300.
Norcini JJ, McKinley DW: Assessment methods in medical education. Teach Teach Educ. 2007, 23: 239-250. 10.1016/j.tate.2006.12.021.
Duffy FD, Gordon GH, Whelan G, Cole-Kelly K, Frankel R: Assessing competence in communication and interpersonal skills: the Kalamazoo II report. Acad Med. 2004, 79: 495-507. 10.1097/00001888-200406000-00002.
Deveugele M, Derese A, Maesschalck SD, Willems S, Driel MV, Maeseneer JD: Teaching communication skills to medical students, a challenge in the curriculum?. Patient Educ Couns. 2005, 58: 265-270. 10.1016/j.pec.2005.06.004.
Freeman GK, Olesen F, Hjortdahl P: Continuity of care: an essential element of modern general practice?. Fam Pract. 2003, 20: 623-627. 10.1093/fampra/cmg601.
Jiwa M, Halkett G, Meng X, Pillai V, Berg M, Shaw T: Supporting patients treated for prostate cancer: a video vignette study with an email-based educational program in general practice. J Med Internet Res. 2014, 16: e63-10.2196/jmir.3003.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2296/15/152/prepub
This study was funded through the Priority-driven Collaborative Cancer Research Scheme by the Australian Government with the support of Cancer Australia. We would like to thank Dr. Carolyn O’Shea, Dr. Marthe Smith, Dr. Eugene Leong and Dr. Hui Chin.
The authors declare that they have no competing interests.
MJ and GH conceived and designed the study, contributed to its coordination, interpreted the data and drafted and revised the manuscript. MB participated in data acquisition and collation, contributed to the manuscript draft and its revisions. XM analysed the data. All authors read and approved the final manuscript.