Similar Literature
 Found 20 similar documents; search took 15 ms
1.
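Academic achievement evaluation is one of the current hot topics in research on the new curriculum reform. Designing and developing test items scientifically is of great significance for deepening the reform and for monitoring the quality of basic education. PISA is an authoritative international student assessment programme with high comparability, credibility, and validity. The PISA 2006 science assessment framework comprises four interrelated aspects: contexts, knowledge, attitudes, and competencies. Its item design and development techniques adopted a "double-digit coding" scoring design, added items that assess attitudes, and ensured alignment between the items and the standards.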

2.
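Students' mathematical literacy has a multidimensional structure, so a literacy-oriented assessment of mathematics achievement needs to report examinees' performance on each dimension rather than only a single total score. Taking the PISA mathematical literacy structure as the theoretical model and multidimensional item response theory (MIRT) as the measurement model, this study used the MIRT package in R to process and analyze item data from a grade 8 mathematical literacy assessment in one region, in order to investigate multidimensional methods for measuring mathematical literacy. The results show that MIRT combines the advantages of unidimensional item response theory and factor analysis. It can be used to analyze a test's construct validity and item quality, and to provide a multidimensional cognitive diagnosis of examinees' abilities.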
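
As a rough illustration of what "multidimensional" means in the measurement model, the sketch below computes a response probability under a compensatory multidimensional two-parameter logistic (M2PL) model. The abstract does not say which MIRT model was fitted, so the model choice is an assumption, and the function name `m2pl_probability` is hypothetical. It is written in Python for illustration and is not the API of the R package used in the study.

```python
import math


def m2pl_probability(theta, a, d):
    """Probability of a correct response under a compensatory M2PL model.

    theta: latent abilities, one value per dimension
    a:     item discrimination (slope) parameters, one value per dimension
    d:     scalar item intercept
    (All names are illustrative assumptions, not taken from the abstract.)
    """
    if len(theta) != len(a):
        raise ValueError("theta and a must have the same length")
    # Weighted sum of abilities plus intercept: a high ability on one
    # dimension can offset a low ability on another (compensatory).
    logit = sum(a_k * t_k for a_k, t_k in zip(a, theta)) + d
    return 1.0 / (1.0 + math.exp(-logit))


# Example call: two dimensions, an item that loads more on the first one.
p = m2pl_probability(theta=[0.5, -0.2], a=[1.2, 0.4], d=-0.3)
```

Because each examinee has one ability value per dimension, a model of this kind can report a profile of scores rather than a single total.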

3.
Drawing valid inferences from item response theory (IRT) models is contingent upon a good fit of the data to the model. Violations of model‐data fit have numerous consequences, limiting the usefulness and applicability of the model. This instructional module provides an overview of methods used for evaluating the fit of IRT models. Upon completing this module, the reader will have an understanding of traditional and Bayesian approaches for evaluating model‐data fit of IRT models, the relative advantages of each approach, and the software available to implement each method.

4.
This paper illustrates that the psychometric properties of scores and scales that are used with mixed‐format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is on mixed‐format tests in situations for which raw scores are integer‐weighted sums of item scores. Four associated real‐data examples include (a) effects of weights associated with each item type on reliability, (b) comparison of psychometric properties of different scale scores, (c) evaluation of the equity property of equating, and (d) comparison of the use of unidimensional and multidimensional procedures for evaluating psychometric properties. Throughout the paper, and especially in the conclusion section, the examples are related to issues associated with test interpretation and test use.

5.
Large‐scale assessments such as the Programme for International Student Assessment (PISA) have field trials where new survey features are tested for utility in the main survey. Because of resource constraints, there is a trade‐off between how much of the sample can be used to test new survey features and how much can be used for the initial item response theory (IRT) scaling. Utilizing real assessment data of the PISA 2015 Science assessment, this article demonstrates that using fixed item parameter calibration (FIPC) in the field trial yields stable item parameter estimates in the initial IRT scaling for samples as small as n = 250 per country. Moreover, the results indicate that for the recovery of the country‐specific latent trait distributions, the estimates of the trend items (i.e., the information introduced into the calibration) are crucial. Thus, concerning the country‐level sample size of n = 1,950 currently used in the PISA field trial, FIPC is useful for increasing the number of survey features that can be examined during the field trial without the need to increase the total sample size. This enables international large‐scale assessments such as PISA to keep up with state‐of‐the‐art developments regarding assessment frameworks, psychometric models, and delivery platform capabilities.

6.
Large data sets from a state reading assessment for third and fifth graders were analyzed to examine differential item functioning (DIF), differential distractor functioning (DDF), and differential omission frequency (DOF) between students with particular categories of disabilities (speech/language impairments, learning disabilities, and emotional behavior disorders) and students without disabilities. Multinomial logistic regression was employed to compare response characteristic curves (RCCs) of individual test items. Although no evidence for serious test bias was found for the state assessment examined in this study, the results indicated that students in different disability categories showed different patterns of DIF, DDF, and DOF, and that the use of RCCs helps clarify the implications of DIF and DDF.

7.
The purpose of this paper is to define and evaluate the categories of cognitive models underlying at least three types of educational tests. We argue that while all educational tests may be based, explicitly or implicitly, on a cognitive model, the categories of cognitive models underlying tests often range in their development and in the psychological evidence gathered to support their value. For researchers and practitioners, awareness of different cognitive models may facilitate the evaluation of educational measures for the purpose of generating diagnostic inferences, especially about examinees' thinking processes, including misconceptions, strengths, and/or abilities. We think a discussion of the types of cognitive models underlying educational measures is useful not only for taxonomic ends, but also for becoming increasingly aware of evidentiary claims in educational assessment and for promoting the explicit identification of cognitive models in test development. We begin our discussion by defining the term cognitive model in educational measurement. Next, we review and evaluate three categories of cognitive models that have been identified for educational testing purposes using examples from the literature. Finally, we highlight the practical implications of blending models for the purpose of improving educational measures.

8.
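Abstract: As China's new curriculum reform continues to deepen, the practical application of situated cognition theory has become a topic that teachers keep exploring in their teaching. Based on the meaning of situated cognition theory, writing teachers can follow the principles of guidance, interactivity, effectiveness, and diversity when creating teaching situations. In line with the requirements of the new curriculum reform, and taking into account the specific circumstances of students in different grades and the teaching objectives, they can flexibly design teaching situations and choose suitable teaching models.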

9.
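To compare the roles of structural equation modeling (SEM) and the IRT graded response model in item selection for personality scales, this study drew on 7,229 actual observations from the Chinese College Student Personality Scale (《中国大学生人格量表》). For Factor 2, "straightforwardness" (爽直), parameter estimation and model fitting were carried out for SEM with Lisrel8.70 and for the graded response model with Multilog7.03, and the item selection results of the two methods were compared. Both sets of results indicated that items 5, 6, 7, and 8 fit poorly. Under SEM this showed up as low factor loadings and unsatisfactory overall fit indices. Under the graded response model it showed up as unsatisfactory discrimination and location parameters, with poorly shaped item characteristic curves and information curves for the items concerned. However, SEM tended to rate items 6 and 8 as worse, whereas the graded response model tended to rate items 5 and 6 as worse. Overall, the statistical inferences that SEM and the IRT graded response model yield for personality scale items are consistent, with slight differences on individual items. Each has its own advantages, and the two can be used together.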
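
To make the "discrimination and location parameters" concrete, here is a minimal Python sketch of category probabilities under Samejima's graded response model in its usual logistic form. The function name and the example parameter values are hypothetical, chosen only for illustration. They are not estimates from the study, and this is not how Multilog is invoked.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under the graded response model.

    theta:      latent trait value of the respondent
    a:          item discrimination parameter
    thresholds: ordered location parameters b_1 < ... < b_{m-1}
    Returns a list of m probabilities, one per response category.
    """
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be in increasing order")
    # Cumulative probabilities of responding in category k or higher,
    # padded with 1.0 (lowest boundary) and 0.0 (beyond the top category).
    cum = [1.0]
    cum += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cum.append(0.0)
    # Each category probability is the difference of adjacent cumulatives.
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


# Illustrative values only: a 4-category item with moderate discrimination.
probs = grm_category_probs(theta=0.3, a=1.1, thresholds=[-1.0, 0.0, 1.2])
```

A low discrimination value flattens these curves, so the item separates respondents poorly. That is the kind of pattern the abstract describes for the poorly fitting items.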

10.
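Petition work (信访, "letters and visits") is the channel through which government administrative departments handle the people's reasonable demands and resolve the various conflicts and problems they encounter. A clear understanding and accurate grasp of how petition and stability work is developing and changing under the new circumstances is the precondition and foundation for doing this work well. To do petition work well, one must fully grasp the current theoretical and practical positioning of petition and stability work, and continually strengthen the sense of responsibility for doing it well under the new circumstances.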

11.
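English for Water Supply and Drainage Engineering is a highly applied course, and reforming how it is taught is imperative under the new circumstances. In light of the problems in current teaching of this course, the author discusses reforms in four areas: optimizing the teaching schedule, updating the teaching content, enriching the teaching process, and reforming assessment methods. The aim is to lay a solid foundation for cultivating graduates with an international outlook and the capacity to innovate.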

12.
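Using a common-item nonequivalent groups design, this paper compares five IRT-based methods for equating item parameters. The data come from examinees' results on the standardized senior high school entrance examination (中考) in mathematics in curriculum-reform experimental districts and non-reform districts of a pilot region in Hubei. A large-sample criterion and other criteria served as the standards for comparing the equating methods, the RMSD index served as the operational criterion, and the STUIRT program was used for the equating analysis. The results show that, for the equating scenario set up in this study, the MS method was the least robust. For equating item difficulty parameters, concurrent calibration performed best, followed by the SL characteristic curve method. For equating item discrimination parameters, the MM method was the most accurate.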
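
For readers unfamiliar with moment-based linking, the sketch below shows the textbook mean/sigma and mean/mean transformations on common-item parameters, plus an RMSD between rescaled and target values. It assumes that "MS" and "MM" in the abstract stand for these two methods. All function names are hypothetical, and the code does not reproduce the STUIRT program or the study's exact RMSD criterion.

```python
import math
from statistics import mean, pstdev


def mean_sigma(b_from, b_to):
    """Linking constants (A, B) from common-item difficulty parameters."""
    A = pstdev(b_to) / pstdev(b_from)
    B = mean(b_to) - A * mean(b_from)
    return A, B


def mean_mean(a_from, a_to, b_from, b_to):
    """Linking constants (A, B) using mean discriminations for the slope."""
    A = mean(a_from) / mean(a_to)
    B = mean(b_to) - A * mean(b_from)
    return A, B


def rescale(a_from, b_from, A, B):
    """Put item parameters on the target scale: a* = a / A, b* = A * b + B."""
    return [a / A for a in a_from], [A * b + B for b in b_from]


def rmsd(x, y):
    """Root mean squared difference between two equal-length sequences."""
    return math.sqrt(mean([(xi - yi) ** 2 for xi, yi in zip(x, y)]))


# Made-up common-item parameters, for illustration only.
a_new, b_new = [1.0, 1.2, 0.8], [-1.0, 0.0, 1.0]
a_old, b_old = [0.5, 0.6, 0.4], [-1.5, 0.5, 2.5]
A, B = mean_sigma(b_new, b_old)
a_star, b_star = rescale(a_new, b_new, A, B)
error_b = rmsd(b_star, b_old)
```

Mean/sigma relies on the spread of the difficulty estimates while mean/mean relies on the average discrimination, so the two can rank differently depending on which parameters are estimated more stably. This is consistent with the abstract reporting different results for difficulty and discrimination.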

13.
As access to and reliance on technology continue to increase, so does the use of computerized testing for admissions, licensure/certification, and accountability exams. Nonetheless, full computer‐based test (CBT) implementation can be difficult due to limited resources. As a result, some testing programs offer both CBT and paper‐based test (PBT) administration formats. In such situations, evidence that scores obtained from different formats are comparable must be gathered. In this study, we illustrate how contemporary statistical methods can be used to provide evidence regarding the comparability of CBT and PBT scores at the total test score and item levels. Specifically, we looked at the invariance of test structure and item functioning across test administration modes for subgroups of students defined by SES and sex. Multiple replications of both confirmatory factor analysis and Rasch differential item functioning analyses were used to assess invariance at the factorial and item levels. Results revealed a unidimensional construct with moderate statistical support for strong factorial‐level invariance across SES subgroups, and moderate support of invariance across sex. Issues involved in applying these analyses to future evaluations of the comparability of scores from different versions of a test are discussed.

14.
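The experiments by Mo Lei (莫雷) and Robins on "the influence of material mode and cognitive load on students' analogical reasoning transfer" produced clearly different results: Mo Lei's conclusions support active response theory, whereas Robins's support active learning theory. To determine the causes and conditions of this difference, and whether their conclusions are correct, we carried out a further experimental test. Analysis of variance and multiple comparisons of the data led to two conclusions: (1) the main difference between Mo Lei's and Robins's experimental conclusions arises from the large age difference between the participants they selected; (2) active learning theory and active response theory do not contradict each other. Each holds under its own preconditions and applies to a different age group.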

15.
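By analyzing the history of evaluating the quality of item writing for China's senior high school entrance examination (中考) since 1999, this paper shows that the difficulties in evaluating item-writing quality for academic proficiency examinations stem from five factors: the lack of a definition of item-writing quality evaluation, the lack of item-writing quality standards, the lack of standards for who conducts the evaluation, a "technique-only" mindset in evaluation practice, and the lack of a culture of item-writing quality evaluation. On this basis, it proposes three remedies: establishing an error-tolerance system for item writing in academic proficiency examinations, administering examinations according to law, and pursuing professionalization and standardization in tandem.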

16.
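Embodied cognition holds that cognition involves the body, and that knowledge is represented through the interaction and integration of body, environment, perception, and mind. By combining physical venues with advanced information technology, science and technology museums create a setting for embodied learning. Drawing on embodied cognition theory, the article first designs a learning scaffold for science and technology museums that integrates three environments (physical, social, and psychological) and three types of embodiment (direct sensory embodiment, situated embodiment, and offline embodiment) into embodied learning in such museums. It then carries out two rounds of iterative design research within a themed unit on "the magnetic effect of electric current" to verify how well the scaffold works in use. Finally, it presents a revised learning scaffold intended to strengthen learners' sense of active, bodily present participation, to help them internalize bodily experience and construct understanding from it, and to provide effective support for embodied learning in science and technology museums.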

17.
The premise of a great deal of current research guiding policy development has been that accommodations are the catalyst for student performance differences. Rather than accepting this premise, two studies were conducted to investigate the influence of extended time and content knowledge on the performance of ninth‐grade students who took a statewide mathematics test with and without accommodations. Each study involved 1,250 accommodated students (extended time only) with learning disabilities and 1,250 nonaccommodated students demonstrating no disabilities. In Study One, a standard differential item functioning (DIF) analysis illustrated that the usual approach to studying the effects of accommodations contributes little to our understanding of the reason for performance differences across students. Next, a mixture item response theory DIF model was used to explore the most likely cause(s) for performance differences across the population. The results from both studies suggest that students for whom items were functioning differently were not accurately characterized by their accommodation status but rather by their content knowledge. That is, knowing students' accommodation status (i.e., accommodated or nonaccommodated) contributed little to understanding why accommodated and nonaccommodated students differed in their test performance. Rather, the data would suggest that a more likely explanation is that mathematics competency differentiated the groups of student learners regardless of their accommodation and/or reading levels.

18.
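The application and spread of modern educational technology is reshaping the entire basic education teaching system, at the heart of which lies the classroom teaching model. Research on classroom teaching models in a modern educational technology environment is therefore bound to become a new focus of basic education teaching reform in China in the new century. After analyzing the shortcomings of traditional teaching models, this paper uses modern educational technology and constructivist theory to build a teaching model in which emotion and cognition reinforce each other (情知互促), and explains how to carry out classroom teaching under this model.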

19.
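Exploring the Construction of a "Post-Led, Project-Driven" Curriculum Model   Total citations: 4; self-citations: 0; citations by others: 4
The "post-led, project-driven" (岗位主导,项目带动) curriculum model lets the occupational post requirements defined by a program's training objectives guide the program's curriculum structure and the selection of course content. It organizes course content around projects, which in turn drive course delivery. Its core is building a system of project practice, and its goal is to orient students toward the work system of the post. Its curriculum development integrates ideas such as CBE, MES, and action orientation.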

20.
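Gu Ke (谷科). 《科教导刊》, 2020, (2): 38-39
Under a training system organized by broad disciplinary category, several questions together form a teaching-reform topic worth studying: how to plan the general courses, shared foundational courses, and specialized courses of each major from the top down, establishing both their connections and their distinctions; how to reform the foundational curriculum for students in the broad computer science category; and how to strengthen education in basic theory while cultivating students' interest in professional development and sound moral character. Guided by the requirements of the education policy of the new era, and based on the training objectives and training plans of existing computer-related majors, this paper discusses and refines the research content and objectives of reforming the foundational curriculum within the training plan for the broad computer science category at provincial key universities. These mainly include the necessity of the reform under the new-era education policy, the existing problems and their causes, concrete measures for reform and practice, and an analysis of the reform's effectiveness.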

