期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

宋吉祥李付鹏杜海燕《考试研究》2021,(1):51-57

RCMLM模型是基于Rasch测量理论的通用拓展模型。利用RCMLM模型对一份普通高中数学试卷进行不同性别的DIF分析。结果表明:该模型可对具有二分计分和多分计分的试题同时进行DIF分析,避免了以往分别对两种计分方式试题进行DIF分析的弊端,保持了试卷的完整性,使DIF分析结果更加有效。相似文献

2.

Is academic tracking related to gains in learning competence? Using propensity score matching and differential item change functioning analysis for better understanding of tracking implications

《Learning and Instruction》2020

This study analyzes gains in cognitive components of learning competence with respect to cohorts based on ability tracking in a Czech longitudinal study. Propensity score matching is used to form parallelized samples of academic and non-academic track students and to eliminate the effect of selective school intake. We applied regression models on the total scores to test for the overall track effect. Furthermore, we analyze scores and gains on the subscores and check for differential item functioning in Grade 6 and in change to Grade 9. While after 3 years, no significant difference between the two tracks was apparent in the total learning competence score, we did, however, find significant differences in some subscores and in the functioning of some items. We argue that item-level analysis is important for deeper understanding of the tracking implications and may provide the basis for more precise evidence-based decisions regarding the tracking policy. 相似文献

3.

Application of Think Aloud Protocols for Examining and Confirming Sources of Differential Item Functioning Identified by Expert Reviews

Kadriye Ercikan Rubab Arim Danielle Law Jose Domene France Gagnon Serge Lacroix 《Educational Measurement》2010,29(2):24-35

This paper demonstrates and discusses the use of think aloud protocols (TAPs) as an approach for examining and confirming sources of differential item functioning (DIF). The TAPs are used to investigate to what extent surface characteristics of the items that are identified by expert reviews as sources of DIF are supported by empirical evidence from examinee thinking processes in the English and French versions of a Canadian national assessment. In this research, the TAPs confirmed sources of DIF identified by expert reviews for 10 out of 20 DIF items. The moderate agreement between TAPs and expert reviews indicates that evidence from expert reviews cannot be considered sufficient in deciding whether DIF items are biased and such judgments need to include evidence from examinee thinking processes. 相似文献

4.

语言测试中的性别偏差与公平性

李静《科教文汇》2009,(13):239-239,241

要保证语言测试的效度和公平性,必须采取有效方法侦测及消除测试中存在的偏差。本文从性别偏差的角度对测试偏差作一简要介绍,探讨了性别偏差和公平性的概念,并介绍了用来侦测性别偏差的方法。相似文献

5.

General and Specific Factors in Reading Comprehension

A. Sterl Artley 《Journal of Experimental Education》2013,81(3):181-186

One explanation for why many students routinely choose to do their homework with a radio or TV operating is that this distraction facilitates studying. To investigate the validity of this explanation, the authors carried out two experiments with sixth-grade students who worked on two difficulty levels of individualized mathematics assignments (N = 12) and reading assignments (N = 9) under three distraction conditions: quiet; self-selected, self-regulated radio; and self-selected, self-regulated TV. The data yielded no evidence of sizable distractor effects on students' time spent studying, computational accuracy, reading comprehension, or reading rate. These results generally are consistent with the theoretical conceptualization of attention as flexible and adaptable. 相似文献

6.

Examining type I error and power for detection of differential item and testlet functioning

Young-Sun Lee Allan Cohen Maritsa Toro 《Asia Pacific Education Review》2009,10(3):365-375

In this study, the effectiveness of detection of differential item functioning (DIF) and testlet DIF using SIBTEST and Poly-SIBTEST were examined in tests composed of testlets. An example using data from a reading comprehension test showed that results from SIBTEST and Poly-SIBTEST were not completely consistent in the detection of DIF and testlet DIF. Results from a simulation study indicated that SIBTEST appeared to maintain type I error control for most conditions, except in some instances in which the magnitude of simulated DIF tended to increase. This same pattern was present for the Poly-SIBTEST results, although Poly-SIBTEST demonstrated markedly less control of type I errors. Type I error control with Poly-SIBTEST was lower for those conditions for which the ability was unmatched to test difficulty. The power results for SIBTEST were not adversely affected, when the size and percent of simulated DIF increased. Although Poly-SIBTEST failed to control type I errors in over 85% of the conditions simulated, in those conditions for which type I error control was maintained, Poly-SIBTEST demonstrated higher power than SIBTEST. 相似文献

7.

DIF分析实际应用中的常见问题及其研究新进展 总被引：1，自引：0，他引：1

李凌艳张勋《考试研究》2010,(2):73-82

多等级计分题、小样本、匹配变量不纯以及DIF检验后的原因分析是DIF检验面临的常见问题,对多等级计分题目进行DSF分析,小样本情况下DIF检测的平滑方法,匹配变量不纯情况下采用MIMIC法,以及运用Logistic模型进行DIF检验后的原因分析是DIF研究中的一些新进展。对这些进展的分析使我们相信,多种检验方法的配合使用、运用DIF研究进行多维IRT框架下的潜在变量探究等,都有可能使DIF研究成为测量学未来的基础研究领域之一。相似文献

8.

Reasons for Gender-Related Differential Item Functioning in a College Admissions Test

Jonathan Wedman 《Scandinavian Journal of Educational Research》2018,62(6):959-970

Gender fairness in testing can be impeded by the presence of differential item functioning (DIF), which potentially causes test bias. In this study, the presence and causes of gender-related DIF were investigated with real data from 800 items answered by 250,000 test takers. DIF was examined using the Mantel–Haenszel and logistic regression procedures. Little DIF was found in the quantitative items and a moderate amount was found in the verbal items. Vocabulary items favored women if sampled from traditionally female domains but generally not vice versa if sampled from male domains. The sentence completion item format in the English reading comprehension subtest favored men regardless of content. The findings, if supported in a cross-validation study, can potentially lead to changes in how vocabulary items are sampled and in the use of the sentence completion format in English reading comprehension, thereby increasing gender fairness in the examined test. 相似文献

9.

第二语言学习者专业背景对HSK阅读成绩影响的项目功能差异检验 总被引：1，自引：0，他引：1

黄春霞《考试研究》2011,(5):59-66

本文旨在考察HSK应试者的专业背景是否会对他们的阅读成绩产生影响。运用MH方法和SIBTEST方法对2009年HSK(初中等)考试阅读题目进行DIF筛查,把专业背景为自然科学的HSK考生设为目标组,专业背景为人文社会科学的HSK考生设为参照组。MH方法的结果是没有找到含有DIF的题目;SIBTEST方法的结果如下:第一轮DIF筛查检测到一个题目,第二轮DBF筛查检测到一组题目。这组题目有利于人文社会学科专业背景的被试。就检测DIF的方法而言,本研究认为SIBTEST方法更加敏感,DBF检验更加适合像阅读理解测验这样的一组或多组相互关联的题目。相似文献

10.

我国成就测验的项目功能差异研究述评

朱乙艺韦小满《教育与考试》2012,(1):78-81

本文系统梳理了我国成就测验的项目功能差异研究,主要包括介绍国外的项目功能差异研究的综合介绍性研究、利用我国的成就测验进行DIF检测方法的比较研究和影响因素研究、对我国的各种成就测验进行项目功能差异分析的应用性研究。在此基础上,指出了我国成就测验的项目功能差异研究存在的问题。相似文献