1.
2.
This study analyzes gains in cognitive components of learning competence with respect to cohorts based on ability tracking in a Czech longitudinal study. Propensity score matching is used to form parallelized samples of academic and non-academic track students and to eliminate the effect of selective school intake. We apply regression models to the total scores to test for the overall track effect. Furthermore, we analyze scores and gains on the subscores and check for differential item functioning in Grade 6 and in the change from Grade 6 to Grade 9. Although no significant difference between the two tracks was apparent in the total learning competence score after 3 years, we did find significant differences in some subscores and in the functioning of some items. We argue that item-level analysis is important for a deeper understanding of the implications of tracking and may provide the basis for more precise evidence-based decisions regarding tracking policy.
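As an illustration of the matching step mentioned in this abstract, here is a minimal sketch of greedy one-to-one nearest-neighbor matching on propensity scores. The abstract does not say which matching algorithm the authors used, so the greedy strategy, the `caliper` value, and the function name `greedy_match` are all assumptions made for illustration. The propensity scores are taken as already estimated.

```python
def greedy_match(treated, control, caliper=0.05):
    """Greedy 1:1 nearest-neighbor matching without replacement.

    treated, control: dicts mapping a student id to an already
    estimated propensity score (hypothetical inputs).
    caliper: largest score distance accepted for a pair (assumed value).
    Returns a list of (treated_id, control_id) pairs.
    """
    available = dict(control)  # controls not yet used in a pair
    pairs = []
    # Process treated units in order of increasing propensity score.
    for t_id, t_score in sorted(treated.items(), key=lambda kv: kv[1]):
        if not available:
            break
        # Closest remaining control unit by absolute score distance.
        c_id = min(available, key=lambda c: abs(available[c] - t_score))
        if abs(available[c_id] - t_score) <= caliper:
            pairs.append((t_id, c_id))
            del available[c_id]  # matching without replacement
    return pairs


if __name__ == "__main__":
    academic = {"a": 0.30, "b": 0.80}
    non_academic = {"x": 0.32, "y": 0.50, "z": 0.79}
    print(greedy_match(academic, non_academic))
```

Treated units with no control inside the caliper are left unmatched, which trades sample size for closer pairs.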
3.
Kadriye Ercikan Rubab Arim Danielle Law Jose Domene France Gagnon Serge Lacroix 《Educational Measurement》2010,29(2):24-35
This paper demonstrates and discusses the use of think aloud protocols (TAPs) as an approach for examining and confirming sources of differential item functioning (DIF). The TAPs are used to investigate to what extent surface characteristics of the items that are identified by expert reviews as sources of DIF are supported by empirical evidence from examinee thinking processes in the English and French versions of a Canadian national assessment. In this research, the TAPs confirmed sources of DIF identified by expert reviews for 10 out of 20 DIF items. The moderate agreement between TAPs and expert reviews indicates that evidence from expert reviews cannot be considered sufficient in deciding whether DIF items are biased and such judgments need to include evidence from examinee thinking processes.
4.
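To ensure the validity and fairness of language tests, effective methods must be used to detect and eliminate bias in the tests. This paper gives a brief introduction to test bias from the perspective of gender bias, discusses the concepts of gender bias and fairness, and introduces methods used to detect gender bias.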
5.
A. Sterl Artley 《Journal of Experimental Education》2013,81(3):181-186
One explanation for why many students routinely choose to do their homework with a radio or TV operating is that this distraction facilitates studying. To investigate the validity of this explanation, the authors carried out two experiments with sixth-grade students who worked on two difficulty levels of individualized mathematics assignments (N = 12) and reading assignments (N = 9) under three distraction conditions: quiet; self-selected, self-regulated radio; and self-selected, self-regulated TV. The data yielded no evidence of sizable distractor effects on students' time spent studying, computational accuracy, reading comprehension, or reading rate. These results generally are consistent with the theoretical conceptualization of attention as flexible and adaptable.
6.
In this study, the effectiveness of detection of differential item functioning (DIF) and testlet DIF using SIBTEST and Poly-SIBTEST was examined in tests composed of testlets. An example using data from a reading comprehension test showed that results from SIBTEST and Poly-SIBTEST were not completely consistent in the detection of DIF and testlet DIF. Results from a simulation study indicated that SIBTEST appeared to maintain type I error control for most conditions, except in some instances in which the magnitude of simulated DIF tended to increase. This same pattern was present for the Poly-SIBTEST results, although Poly-SIBTEST demonstrated markedly less control of type I errors. Type I error control with Poly-SIBTEST was lower for those conditions for which the ability was unmatched to test difficulty. The power results for SIBTEST were not adversely affected when the size and percent of simulated DIF increased. Although Poly-SIBTEST failed to control type I errors in over 85% of the conditions simulated, in those conditions for which type I error control was maintained, Poly-SIBTEST demonstrated higher power than SIBTEST.
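The type I error and power figures that a simulation study like this reports can be computed from per-replication flagging decisions. The sketch below is a generic illustration, not the authors' procedure: the function name `rejection_rates` and the input layout (one list of boolean flags per replication, plus a list marking which items had DIF simulated) are assumptions.

```python
def rejection_rates(flags, has_dif):
    """Empirical type I error rate and power of a DIF procedure.

    flags: list of replications; each is a list of booleans, one per
           item, True when the procedure flagged the item as DIF.
    has_dif: list of booleans, one per item, True when DIF was
             actually simulated in that item.
    Returns (type_i_error_rate, power); a rate is NaN when no item of
    the relevant kind exists.
    """
    false_pos = null_total = hits = dif_total = 0
    for rep in flags:
        for flagged, dif in zip(rep, has_dif):
            if dif:
                dif_total += 1
                hits += bool(flagged)       # correct detection
            else:
                null_total += 1
                false_pos += bool(flagged)  # false alarm
    type_i = false_pos / null_total if null_total else float("nan")
    power = hits / dif_total if dif_total else float("nan")
    return type_i, power


if __name__ == "__main__":
    flags = [[True, False, True], [False, False, True]]
    has_dif = [False, False, True]
    print(rejection_rates(flags, has_dif))
```

A procedure is usually said to control type I error when the first rate stays near the nominal significance level; power comparisons are only meaningful for conditions where that holds, which is the caveat the abstract makes about Poly-SIBTEST.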
7.
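Common Problems in the Practical Application of DIF Analysis and Recent Research Advances
Polytomously scored items, small samples, impure matching variables, and the analysis of causes after DIF detection are common problems in DIF testing. Recent advances in DIF research include differential step functioning (DSF) analysis for polytomously scored items, smoothing methods for DIF detection in small samples, the MIMIC method when the matching variable is impure, and the use of logistic models to analyze causes after DIF detection. An analysis of these advances leads us to believe that combining multiple detection methods, and using DIF research to explore latent variables within a multidimensional IRT framework, could make DIF research one of the basic research areas of measurement in the future.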
8.
Jonathan Wedman 《Scandinavian Journal of Educational Research》2018,62(6):959-970
Gender fairness in testing can be impeded by the presence of differential item functioning (DIF), which potentially causes test bias. In this study, the presence and causes of gender-related DIF were investigated with real data from 800 items answered by 250,000 test takers. DIF was examined using the Mantel–Haenszel and logistic regression procedures. Little DIF was found in the quantitative items and a moderate amount was found in the verbal items. Vocabulary items favored women if sampled from traditionally female domains but generally not vice versa if sampled from male domains. The sentence completion item format in the English reading comprehension subtest favored men regardless of content. The findings, if supported in a cross-validation study, can potentially lead to changes in how vocabulary items are sampled and in the use of the sentence completion format in English reading comprehension, thereby increasing gender fairness in the examined test.
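For readers unfamiliar with the Mantel–Haenszel procedure named in this abstract, the following is a minimal sketch of its core statistic for a single dichotomous item: the common odds ratio across matched score strata, and its transformation to the ETS delta scale. It is a generic textbook form, not the study's own code; the function name `mantel_haenszel_dif` and the input layout are assumptions, and the significance test and any purification of the matching score are omitted.

```python
import math
from collections import defaultdict


def mantel_haenszel_dif(responses):
    """Mantel-Haenszel common odds ratio and ETS delta for one item.

    responses: iterable of (group, matching_score, correct) tuples,
    where group is "ref" or "focal", matching_score is the stratum
    (for example the total test score), and correct is 0 or 1.
    Returns (alpha_mh, delta_mh).
    """
    # Per stratum: [ref correct, ref incorrect, focal correct, focal incorrect]
    strata = defaultdict(lambda: [0, 0, 0, 0])
    for group, score, correct in responses:
        idx = (0 if group == "ref" else 2) + (0 if correct else 1)
        strata[score][idx] += 1

    num = den = 0.0
    for a, b, c, d in strata.values():
        n = a + b + c + d
        num += a * d / n  # reference correct x focal incorrect
        den += b * c / n  # reference incorrect x focal correct
    if num == 0 or den == 0:
        raise ValueError("odds ratio is undefined for these data")

    alpha = num / den
    delta = -2.35 * math.log(alpha)  # negative values favor the reference group
    return alpha, delta


if __name__ == "__main__":
    data = (
        [("ref", 1, 1)] * 15 + [("ref", 1, 0)] * 5
        + [("focal", 1, 1)] * 5 + [("focal", 1, 0)] * 5
    )
    print(mantel_haenszel_dif(data))
```

An odds ratio of 1 (delta of 0) means that reference and focal test takers with the same matching score have the same odds of answering the item correctly.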
9.
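A Differential Item Functioning Test of the Effect of Second Language Learners' Academic Major on HSK Reading Scores
This paper examines whether the academic major of HSK test takers affects their reading scores. The MH and SIBTEST methods were used to screen the reading items of the 2009 HSK (Elementary-Intermediate) for DIF, with test takers from natural science majors as the focal group and those from humanities and social science majors as the reference group. The MH method found no items showing DIF. With SIBTEST, the first round of DIF screening flagged one item, and the second round of DBF screening flagged one bundle of items, which favored examinees with a humanities and social science background. Regarding DIF detection methods, the study concludes that SIBTEST is more sensitive and that DBF testing is better suited to one or more sets of interrelated items, such as those in a reading comprehension test.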
10.
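This paper systematically reviews research on differential item functioning in achievement tests in China. The work falls into three main groups: general introductory studies that present DIF research from abroad; studies that use Chinese achievement tests to compare DIF detection methods and examine influencing factors; and applied studies that analyze DIF in various Chinese achievement tests. On this basis, the paper identifies problems in China's DIF research on achievement tests.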