首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 9 毫秒
1.
高中学业水平考试是高考改革新方案中的重要一环。要做好高中学业水平考试,避免出现大规模学生反复多考、放弃重要学科盲目追求A等、大量高水平考生获得过低等级分数等问题,就必须提前做好标杆试卷的研发、学业标准的设定、分数常模的研制以及测验等值的实现四项工作。建议学科专家、测量学专家和教育管理工作者共同合作,力争在正式试行高中学业水平考试之前一年完成这些工作,以免对新高考方案、学业水平考试、基础教育以及高等教育带来负面影响。  相似文献   

2.
Because of the unique nature of the students eligible for alternate assessments based on modified academic achievement standards, their varied access to the general education curriculum, and their unique learning needs, innovative psychometric thinking and practice is needed to assure high technical quality of alternate assessments. Indeed, we at least must marshal state-of-the-art procedures to secure strong psychometric evidence to support appropriate and meaningful design and use of these important assessments. The authors contributing work to this special issue, Alternate Assessments Based on Modified Academic Achievement Standards, address important issues and provide guidance to policymakers, test developers, and educators. They also each raise important technical quality issues. This article offers a brief review of such psychometric considerations, in light of the work and comments of the special issue authors.  相似文献   

3.
在"不再制定考试大纲"的条件下,根据课程标准命题是今后高考、中考等大规模教育考试工作的常态。文章首先从知识、能力、功能和质量四个角度,具体讨论了取消考试大纲所面临的命题挑战,然后从课程标准的操作性解读、现代教育测量模型的中国化处理、命题团队的建设与培养、题库建设四个方面,提出了取消考试大纲后大规模教育考试命题的具体建议。  相似文献   

4.
针对已经出台的6省市高考综合改革方案,从教育测量学角度,就多次考试、赋分方法和多元评价3个方面存在的问题进行了探讨,分析了其中的难点和潜在风险,提出了解决等值难题、参考代表性样本而不是选考样本确定等级分数线、尝试跨栏式选才而不是求总分式选才等应对策略,为其他省市制定高考综合改革方案提供了一些新的视角。  相似文献   

5.
Research on psychometric methods is heavily dependent on software. The quality, availability, and documentation of such software are critical to the advancement of the field. In 2000, an ad hoc committee of NCME recommended that NCME adopt policies that promote greater availability and better documentation of software. This article follows the ad hoc committee's report by examining the use of software in four top-tiered journals in recent years. The results indicated that the most frequently cited programs were those written by the articles' authors. The documentation and availability for these programs are often not clear, particularly for software used for simulations. The use of proprietary software was not widespread in the four journals, but there is still room for concern in the future. This article recommends that NCME form a permanent committee to address software issues.  相似文献   

6.
古代寓言     
  相似文献   

7.
《教育实用测度》2013,26(3):203-205
Many credentialing agencies today are either administering their examinations by computer or are likely to be doing so in the coming years. Unfortunately, although several promising computer-based test designs are available, little is known about how well they function in examination settings. The goal of this study was to compare fixed-length examinations (both operational forms and newly constructed forms) with several variations of multistage test designs for making pass-fail decisions. Results were produced for 3 passing scores. Four operational 60-item examinations were compared to (a) 3 new 60-item forms, (b) 60-item 3-stage tests, and (c) 40-item 2-stage tests; all were constructed using automated test assembly software. The study was carried out using computer simulation techniques that were set to mimic common examination practices. All 60-item tests, regardless of design or passing score, produced accurate ability estimates and acceptable and similar levels of decision consistency and decision accuracy. One interesting finding was that the 40-item test results were poorer than the 60-item test results, as expected, but were in the range of acceptability. This raises the practical policy question of whether content-valid 40-item tests with lower item exposure levels and/or savings in item development costs are an acceptable trade-off for a small loss in decision accuracy and consistency.  相似文献   

8.
花丛拾锦     
月份与花 1.January snowdrop雪花.象征希望和幸运 2.February violet紫罗兰,象征谦虚、同情心及心平气和  相似文献   

9.
10.
Selected acts     
  相似文献   

11.
花丛拾锦     
  相似文献   

12.
13.
This study investigated the psychometric characteristics of constructed-response (CR) items referring to choice and non-choice passages administered to students in Grades 3, 5, and 8. The items were scaled using item response theory (IRT) methodology. The results indicated no consistent differences in the difficulty and discrimination of the items referring to the two types of passages. On the average, students' scale scores on the choice and non-choice passages were comparable. Finally, the choice passages differed in terms of overall popularity and in their attractiveness to different gender and ethnic groups  相似文献   

14.
Psychometric properties of item response theory proficiency estimates are considered in this paper. Proficiency estimators based on summed scores and pattern scores include non-Bayes maximum likelihood and test characteristic curve estimators and Bayesian estimators. The psychometric properties investigated include reliability, conditional standard errors of measurement, and score distributions. Four real-data examples include (a) effects of choice of estimator on score distributions and percent proficient, (b) effects of the prior distribution on score distributions and percent proficient, (c) effects of test length on score distributions and percent proficient, and (d) effects of proficiency estimator on growth-related statistics for a vertical scale. The examples illustrate that the choice of estimator influences score distributions and the assignment of examinee to proficiency levels. In particular, for the examples studied, the choice of Bayes versus non-Bayes estimators had a more serious practical effect than the choice of summed versus pattern scoring.  相似文献   

15.
The psychometric test results of a sample of 100 LD students with severe achievement problems were cluster analyzed. The variables included in this analysis were the subtests of the WISC-R, the Bender Gestalt, the Benton Visual Retention, the Purdue Perceptual-Motor, and the Lindamood Auditory Conceptualization tests. Using K-means iterative clustering procedures, three clusters were obtained. The first cluster was defined by low scores on attention and concentration subtests; the second was defined by low scores on subtests of verbal-associative intelligence; the third was defined by low scores on visual-spatial and motoric subtests. Limitations of the study, in the scope of the psychometric testing and the lack of pediatric and neurologic diagnoses, are discussed.  相似文献   

16.
Abstract

This study investigated the effectiveness of praise and knowledge of results in increasing the performance of lower- and middle-class fifth-grade pupils on a numeral cancellation test. The effectiveness of the reinforcers was based upon the mean gain (loss) scores of Tests 9 and 10 over Test 1. Middle-class boys made significantly larger gains under the knowledge-of-results condition than under the praise condition. Under the knowledge-of-results condition, lower-class boys made significantly larger gains than lower-class girls. Other differences were insignificant.  相似文献   

17.
Pupil monitoring systems support the teacher in tailoring teaching to the individual level of a student and in comparing the progress and results of teaching with national standards. The systems are based on the availability of an item bank calibrated using item response theory. The assessment of the students’ progress and results can be further supported by using computerized adaptive testing where the items selected from the item bank are targeted at the specific ability level of the student. The present article discusses psychometric issues of pupil monitoring systems, such as ability estimation, the optimal construction of tests from the item bank and monitoring of progress.  相似文献   

18.
Many American children are currently receiving a reduced quality education because of the increasingly widespread misuse of educational tests. Employing a religious metaphor, the author argues that members of the educational measurement community are culpable, at least in part, for this calamity. During recent decades, our nation's assessment personnel have failed to speak out vigorously against the increasingly prevalent improper use of traditionally constructed achievement tests to appraise school quality. This absence of action, it is claimed, constitutes a nontrivial sin of omission. To secure absolution for that sin, it is contended that measurement specialists must promote widespread assessment literacy.  相似文献   

19.
The Parent Role Questionnaire (PRQ) is a recently developed instrument to study individual perceptions of the parent role. The PRQ was piloted, revised, and studied psychometrically. This study examines PRQ internal consistency, test-retest reliability, and validity. Results reveal that the instrument has relatively high internal consistency, moderate test-retest reliability, and high validity. The discussion includes implications for further research and study on parents, parenting, and the parent role.  相似文献   

20.
传统的测量模型有一重要假设,即被试在完成测验过程中自始至终采用同一种策略.事实上,被试会根据题目类型不同而改变其解题策略,称之为策略转换.使用策略转换模型和潜在类别分析两种方法对平衡秤任务测验作答过程中的策略转换现象进行了分析比较.结果显示:策略转换模型存在策略位置参数越界、顺序混乱等缺陷,不适于策略转换问题研究;使用潜在类别分析方法可有效分析被试的策略转换行为,儿童在完成平衡秤任务测验时呈现出不同的策略转换路径.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号