1.

A Note on Decisionmaking Processes for Multiple-Choice Test Items

Rand R. Wilcox Karen Thompson Wilcox Jacob Chung 《Journal of Educational Measurement》1988,25(3):247-250

2.

A Statistical Test for Detecting Answer Copying on Multiple-Choice Tests

Wim J. van der Linden Leonardo Sotaridona 《Journal of Educational Measurement》2004,41(4):361-377

A statistical test for the detection of answer copying on multiple-choice tests is presented. The test is based on the idea that the answers of examinees to test items may be the result of three possible processes: (1) knowing, (2) guessing, and (3) copying, but that examinees who do not have access to the answers of other examinees can arrive at their answers only through the first two processes. This assumption leads to a distribution for the number of matched incorrect alternatives between the examinee suspected of copying and the examinee believed to be the source that belongs to a family of "shifted binomials." Power functions for the tests for several sets of parameter values are analyzed. An extension of the test to include matched numbers of correct alternatives would lead to improper statistical hypotheses.
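The "shifted binomial" idea in this abstract can be illustrated with a minimal sketch. This is not the authors' procedure. It assumes, purely for illustration, that there are n items on which the source chose an incorrect alternative, that a non-copying examinee matches each of them independently with the same probability p, and that a copier reproduces c of them outright, which shifts the binomial count by c. The function names `binom_pmf`, `shifted_binom_pmf`, and `upper_tail` are hypothetical.

```python
from math import comb


def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p); zero outside 0..n."""
    if k < 0 or k > n:
        return 0.0
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def shifted_binom_pmf(m, n, p, c):
    """P(M = m) when c matches are certain (assumed copied) and the
    remaining n - c items match by chance with probability p."""
    return binom_pmf(m - c, n - c, p)


def upper_tail(m_obs, n, p, c=0):
    """P(M >= m_obs) under shift c; c = 0 is the no-copying case."""
    return sum(shifted_binom_pmf(m, n, p, c) for m in range(m_obs, n + 1))
```

Under these assumptions, `upper_tail(m_obs, n, p)` plays the role of a p-value for the observed number of matches when no copying occurred, and evaluating the same tail with `c > 0` at a fixed critical value gives one point on a power function.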

3.

Book Review of Developing and Validating Multiple-Choice Test Items

《Structural equation modeling》2013,20(2):313-315

4.

The Foundations of Multiple-Choice Items

李迎旭 吴克勤 周磊 《时代教育》2012,(21):129

This paper examines the foundations of multiple-choice items from three aspects: psycholinguistics and pedagogy, structural linguistics, and audiolingualism. The examination is expected to reveal some of the nature and characteristics of such items, which paves the way for an actual evaluation of the multiple-choice technique in testing.

5.

Equivalence of Free-Response and Multiple-Choice Items

Randy Elliot Bennett Donald A. Rock Minhwei Wang 《Journal of Educational Measurement》1991,28(1):77-92

This study examined the relationship of multiple-choice and free-response items contained on the College Board's Advanced Placement Computer Science (APCS) examination. Confirmatory factor analysis was used to test the fit of a two-factor model where each item format marked its own factor. Results showed a single-factor solution to provide the most parsimonious fit in each of two random-half samples. This finding might be accounted for by several mechanisms, including overlap in the specific processes assessed by the multiple-choice and free-response items and the limited opportunity for skill differentiation afforded by the year-long APCS course.

6.

True-False, Alternate-Choice, and Multiple-Choice Items

Steven M. Downing 《Educational Measurement》1992,11(3):27-30

What does research tell us about true-false tests? Is there a place for this format in standardized tests?

7.

Why Students Answer TIMSS Science Test Items the Way They Do

Harlow Ann Jones Alister 《Research in Science Education》2004,34(2):221-238

The purpose of this study was to explore how Year 8 students answered Third International Mathematics and Science Study (TIMSS) questions and whether the test questions represented the scientific understanding of these students. One hundred and seventy-seven students were tested using written test questions taken from the science test used in the Third International Mathematics and Science Study. The degree to which a sample of 38 children represented their understanding of the topics in a written test compared to the level of understanding that could be elicited by an interview is presented in this paper. In exploring student responses in the interview situation this study hoped to gain some insight into the science knowledge that students held and whether or not the test items had been able to elicit this knowledge successfully. We question the usefulness and quality of data from large-scale summative assessments on their own to represent student scientific understanding and conclude that large-scale written test items, such as TIMSS, on their own are not a valid way of exploring students' understanding of scientific concepts. Considerable caution is therefore needed in exploiting the outcomes of international achievement testing when considering educational policy changes or using TIMSS data on their own to represent student understanding.

8.

Models of Decisionmaking Processes for Multiple-Choice Test Items: An Analysis of Spatial Ability

Rand R. Wilcox Karen Thompson Wilcox 《Journal of Educational Measurement》1988,25(2):125-136

Latent class models of decisionmaking processes related to multiple-choice test items are extremely important and useful in mental test theory. However, building realistic models or studying the robustness of existing models is very difficult. One problem is that there are a limited number of empirical studies that address this issue. The purpose of this paper is to describe and illustrate how latent class models, in conjunction with the answer-until-correct format, can be used to examine the strategies used by examinees for a specific type of task. In particular, suppose an examinee responds to a multiple-choice test item designed to measure spatial ability, and the examinee gets the item wrong. This paper empirically investigates various latent class models of the strategies that might be used to arrive at an incorrect response. The simplest model is a random guessing model, but the results reported here strongly suggest that this model is unsatisfactory. Models for the second attempt of an item, under an answer-until-correct scoring procedure, are proposed and found to give a good fit to data in most situations. Some results on strategies used to arrive at the first choice are also discussed.

9.

How to Answer Multiple-Choice Questions in General Physics

黄秋平《唐山师范学院学报》2000,(2)

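An accurate grasp of the basic theories and methods of physics is the key to answering multiple-choice questions correctly, and fluent application of theorems and laws is what makes fast problem solving possible. Using worked examples, the paper introduces several techniques for answering single-answer multiple-choice questions: the direct method, the indirect method, the elimination method, the graphical method, and others.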

10.

Answer Changing on Objective Tests

《The Journal of educational research》2012,105(6):313-315

In an attempt to identify some of the causes of answer changing behavior, the effects of four test- and item-specific variables were evaluated. Three samples of New Zealand school children of different ages were administered tests of study skills. The number of answer changes per item was compared with the position of each item in a group of items, the position of each item in the test, the discrimination index and the difficulty index of each item. It is shown that answer changes were more likely to be made on items occurring early in a group of items and toward the end of a test. There was also a tendency for difficult items and items with poor discriminations to be changed more frequently. Some implications of answer changing in the design of tests are discussed.

11.

Partial Identification of Answer Reviewing Effects in Multiple-Choice Exams

Yongnam Kim 《Journal of Educational Measurement》2020,57(4):511-526

Does reviewing previous answers during multiple-choice exams help examinees increase their final score? This article formalizes the question using a rigorous causal framework, the potential outcomes framework. Viewing examinees’ reviewing status as a treatment and their final score as an outcome, the article first explains the challenges of identifying the causal effect of answer reviewing in regular exam-taking settings. In addition to the incapability of randomizing the treatment selection (reviewing status) and the lack of other information to make this selection process ignorable, the treatment variable itself is not fully known to researchers. Looking at examinees’ answer sheet data, it is unclear whether an examinee who did not change his or her answer on a specific item reviewed it but retained the initial answer (treatment condition) or chose not to review it (control condition). Despite such challenges, however, the article develops partial identification strategies and shows that the sign of the answer reviewing effect can be reasonably inferred. By analyzing a statewide math assessment data set, the article finds that reviewing initial answers is generally beneficial for examinees.

12.

Gender Differences in Performance on Multiple-Choice and Constructed Response Mathematics Items

《教育实用测度》2013,26(1):29-51

13.

Guess Where: The Position of Correct Answers in Multiple-Choice Test Items as a Psychometric Variable

Yigal Attali Maya Bar-Hillel 《Journal of Educational Measurement》2003,40(2):109-128

In this article, the authors show that test makers and test takers have a strong and systematic tendency for hiding correct answers (or, respectively, for seeking them) in middle positions. In single, isolated questions, both prefer middle positions to extreme ones in a ratio of up to 3 or 4 to 1. Because test makers routinely, deliberately, and excessively balance the answer key of operational tests, middle bias almost, though not quite, disappears in those keys. Examinees taking real tests also produce answer sequences that are more balanced than their single question tendencies but less balanced than the correct key. In a typical four-choice test, about 55% of erroneous answers are in the two central positions. The authors show that this bias is large enough to have real psychometric consequences, as questions with middle correct answers are easier and less discriminating than questions with extreme correct answers, a fact of which some implications are explored.

14.

A Comparison of Multiple-Choice and Constructed Figural Response Items

Michael E. Martinez 《Journal of Educational Measurement》1991,28(2):131-145

In contrast to multiple-choice test questions, figural response items call for constructed responses and rely upon figural material, such as illustrations and graphs, as the response medium. Figural response questions in various science domains were created and administered to a sample of 4th-, 8th-, and 12th-grade students. Item and test statistics from parallel sets of figural response and multiple-choice questions were compared. Figural response items were generally more difficult, especially for questions that were difficult (p < .5) in their constructed-response forms. Figural response questions were also slightly more discriminating and reliable than their multiple-choice counterparts, but they had higher omit rates. This article addresses the relevance of guessing to figural response items and the diagnostic value of the item type. Plans for future research on figural response items are discussed.

15.

An Investigation of Explanation Multiple-Choice Items in Science Assessment

Ou Lydia Liu Hee-Sun Lee Marcia C. Linn 《Educational Assessment》2013,18(3):164-184

Both multiple-choice and constructed-response items have known advantages and disadvantages in measuring scientific inquiry. In this article we explore the function of explanation multiple-choice (EMC) items and examine how EMC items differ from traditional multiple-choice and constructed-response items in measuring scientific reasoning. A group of 794 middle school students was randomly assigned to answer either constructed-response or EMC items following regular multiple-choice items. By applying a Rasch partial-credit analysis, we found that there is a consistent alignment between the EMC and multiple-choice items. Also, the EMC items are easier than the constructed-response items but are harder than most of the multiple-choice items. We discuss the potential value of the EMC items as a learning and diagnostic tool.

16.

An Attempt to Identify Matching Answer Sheets from Multiple-Choice Answers

《中国考试》2013,(7)

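By studying the maximum possible number of identical wrong answers on the multiple-choice items in examinees' answer sheets, this paper builds a model for identifying matching answer sheets and analyzes examinees' multiple-choice responses to judge whether matching answer sheets exist. The method was checked against real examination data and shown to be feasible and effective.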

17.

Using the Distractor Categories of Multiple-Choice Items to Improve IRT Linking

Jee-Seon Kim 《Journal of Educational Measurement》2006,43(3):193-213

Simulation and real data studies are used to investigate the value of modeling multiple-choice distractors on item response theory linking. Using the characteristic curve linking procedure for Bock's (1972) nominal response model presented by Kim and Hanson (2002), all-category linking (i.e., a linking based on all category characteristic curves of the linking items) is compared against correct-only (CO) linking (i.e., linking based on the correct category characteristic curves only) using a common-item nonequivalent groups design. The CO linking is shown to represent an approximation to what occurs when using a traditional correct/incorrect item response model for linking. Results suggest that the number of linking items needed to achieve an equivalent level of linking precision declines substantially when incorporating the distractor categories.

18.

Creating Multiple-Choice Questions with PowerPoint 2002

曾屹《物理教学探讨》2007,25(21):54

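Creating interactive multiple-choice questions with PowerPoint 2002 is simple enough for beginners to pick up easily, and the results are no worse than those produced with Authorware or Flash. Taking a multiple-choice question on horizontal projectile motion as an example, the article walks through the process of creating an interactive multiple-choice question with PowerPoint 2002, as shown in Figure 1.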

19.

Overcorrection for Guessing in Multiple-Choice Test Scoring

《The Journal of educational research》2012,105(6):245-252

20.

Are Tests Comprising Both Multiple-Choice and Free-Response Items Necessarily Less Unidimensional Than Multiple-Choice Tests? An Analysis of Two Tests

David Thissen Howard Wainer Xiang-Bo Wang 《Journal of Educational Measurement》1994,31(2):113-123

We consider the relationship between the multiple-choice and free-response sections on the Computer Science and Chemistry tests of the College Board's Advanced Placement program. Restricted factor analysis shows that the free-response sections measure the same underlying proficiency as the multiple-choice sections for the most part. However, there is also a significant, if relatively small, amount of local dependence among the free-response items that produces a small degree of multidimensionality for each test.