Examinees who initially fail and later repeat a standardized patient (SP)-based clinical skills exam typically exhibit large score gains on their second attempt, suggesting the possibility that examinees were not well measured on one of those attempts. This study evaluates score precision for examinees who repeated an SP-based clinical skills test administered as part of the US Medical Licensing Examination sequence. Generalizability theory was used as the basis for computing conditional standard errors of measurement (SEM) for individual examinees. Conditional SEMs were computed for approximately 60,000 single-take examinees and 5,000 repeat examinees who completed the Step 2 Clinical Skills Examination® between 2007 and 2009. The study focused exclusively on ratings of communication and interpersonal skills. Conditional SEMs for single-take and repeat examinees were nearly indistinguishable across most of the score scale. US graduates and international medical graduates (IMGs) were measured with equal levels of precision at all score levels, as were examinees with differing levels of skill speaking English. There was no evidence that examinees with the largest score changes were measured poorly on either their first or second attempt. The large score increases for repeat examinees on this SP-based exam probably cannot be attributed to unexpectedly large errors of measurement.
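As a rough illustration of what a conditional SEM for an individual examinee can look like, the sketch below uses the standard single-facet (persons crossed with encounters) generalizability-theory result for absolute error: the standard error of one examinee's mean score across their own encounters. This is an assumption made for illustration only. The abstract does not state the study's measurement design or estimator, and the function name `conditional_sem` and the example ratings are hypothetical.

```python
import math


def conditional_sem(scores):
    """Absolute-error conditional SEM for one examinee.

    Assumes a single-facet design in which `scores` holds the examinee's
    rating from each encounter. This is an illustrative simplification,
    not the design reported in the study.
    """
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two encounter scores")
    mean = sum(scores) / n
    # Unbiased within-person variance across encounters.
    within_var = sum((x - mean) ** 2 for x in scores) / (n - 1)
    # Standard error of the examinee's mean score over n encounters.
    return math.sqrt(within_var / n)


# Hypothetical ratings for one examinee across five encounters.
ratings = [14.0, 16.0, 15.0, 17.0, 13.0]
print(round(conditional_sem(ratings), 4))
```

Under this simplification, an examinee whose ratings vary widely from encounter to encounter has a larger conditional SEM than one whose ratings are consistent, which is the kind of per-examinee precision comparison the abstract describes between single-take and repeat examinees.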
Advances in Health Sciences Education – Springer Journals
Published: Oct 1, 2011