期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

John R. Hills 《Educational Measurement》1984,3(2):43-44

相似文献

2.

《中国考试》2016,(6)

为适应现代社会对人才的需要,美国教育考试服务处对美国大学入学考试SAT进行了改革。改革后的SAT数学考试更注重与学生课堂学习的联系,更强调对数学概念的理解,进一步加强了对逻辑推理能力和计算能力的考查;对数学运算的准确和熟练程度都提出了更高的要求;在丰富的应用场景中,考查数学在职业、科学和在社会研究中的应用;试卷的长度加长,综合程度提高。这些都为我国的高考改革提供了有益的启示和借鉴。相似文献

3.

Comparing State SAT Scores: Problems, Biases, and Corrections

Stephan F. Gohmann 《Journal of Educational Measurement》1988,25(2):137-148

Comparing SAT scores among states using regression analysis leads to biased results because states differ in the proportion of students taking the exam. When the proportion of students taking the exam is included in the regression equation, the results can be biased because of misspecifieation bias. A method intended to correct for selection bias is presented, and empirical results suggest that sample selection bias is present in SAT score regressions. Regression equations and state rankings are compared between the selection-corrected equation and equations for which the selection problem is not addressed. The proposed method is one of many available as possible solutions to the selection problem. Alternative methods may produce different results 相似文献

4.

Implications of low SAT scores

John C. Wenger 《Academic Questions》2000,13(4):7-8

相似文献

5.

Grades and Test Scores: Accounting for Observed Differences 总被引：1，自引：0，他引：1

Warren W. Willingham Judith M. Pollack Charles Lewis 《Journal of Educational Measurement》2002,39(1):1-37

Why do grades and test scores often differ? A framework of possible differences is proposed in this article. An approximation of the framework was tested with data on 8,454 high school seniors from the National Education Longitudinal Study. Individual and group differences in grade versus test performance were substantially reduced by focusing the two measures on similar academic subjects, correcting for grading variations and unreliability, and adding teacher ratings and other information about students. Concurrent prediction of high school average was thus increased from 0.62 to 0.90; differential prediction in eight subgroups was reduced to 0.02 letter‐grades. Grading variation was a major source of discrepancy between grades and test scores. Other major sources were teacher ratings and Scholastic Engagement, a promising organizing principle for understanding student achievement. Engagement was defined by three types of observable behavior: employing school skills, demonstrating initiative, and avoiding competing activities. While groups varied in average achievement, group performance was generally similar on grades and tests. Major factors in achievement were similarly constituted and similarly related from group to group. Differences between grades and tests give these measures complementary strengths in high‐stakes assessment. If artifactual differences between the two measures are not corrected, common statistical estimates of validity and fairness are unduly conservative. 相似文献

6.

性别和种族对美国SAT考试成绩影响的定量分析

叶露邵佳锰《民族高等教育研究》2019,(1):67-72

SAT(Scholastic Assessment Test)作为美国目前广为接受的大学入学考试,其公平性一直遭受质疑,尤其是在性别、种族等敏感领域。基于美国某高中学生的SAT数据,运用最小二乘估计法,建立了关于SAT考试成绩的单方程线性回归模型。回归结果显示在保持模型中其他因素不变的情况下,SAT考试的确存在性别和种族歧视,且性别对成绩的影响要大于种族对成绩的影响。最后结合2016年SAT考试的公平性改革,探究SAT的未来发展方向及对我国新高考改革的借鉴。相似文献

7.

SAT考试:高考制度改革可资借鉴的一面铜镜 总被引：3，自引：2，他引：3

孙崇文《教育发展研究》2001,21(7):80-82

自1999年开始,我国高考制度改革的重心实现了向考试科目设置以及高考形式和内容的改革方向的转移,江苏、浙江、吉林和山西四省分别推出了“3 综合”的考试新模式,广东省也积极进行了“3 X”考试模式的新探索,并将逐步推广到全国其他省市自治区。新一轮普通高校的招生考试制度改革,普遍摒弃了以往以单纯的知识测试作为录取学生的唯一依据的传统考试模式,突出和强调了对学生综合素质的考察,这反映了高等教育“大众化”发展趋势的要求,也体现了人们对于实施素质教育思想的高度认同。一、素质教育的实施,要求我们必须改变传统的教育… 相似文献

8.

非标准化试题的智能评分

陈慈弟林远明黄聪田民格《三明学院学报》2011,28(2):7-10

通过对函数S-粗集和动态规划算法的研究,提出了相似度和可信度概念,给出了非标准化试题实现评分的方案和步骤,其中关键步骤是迁移处理和计算最长公共子序列长度。主要阐述了基于函数S-粗集的迁移处理,并分析了计算最长公共子序列长度解的结构和计算方法,最后分别给出了迁移函数和计算最长公共子序列长度函数的源程序。相似文献

9.

高考分数的科学解释和利用——ACT考试分数量表评介

《中国考试》2015,(11)

高考虽然是选拔性考试,但需要应用标准参照考试的理论,深入细致地分析考试数据和考生答题情况,这样既可知道考生在群体中的地位,更可以知道考试分数的意义以及考生能力发展水平和知识掌握程度,对考生做出科学合理的评价。进而使招生的高校更加具体、深入地了解考生的学业水平和学科特长,挑选满足自身招生要求、适合本专业培养的考生,也将会更有利于人才的选拔,也更有利于人才的培养。相似文献

10.

The Reliability of Test Scores

《The Journal of educational research》2012,105(5):370-379

相似文献

11.

Uses and Abuses of Achievement Test Scores

Susan Bobbitt Nolen Thomas M. Haladyna Nancy S. Haas 《Educational Measurement》1992,11(2):9-15

Are variations in test-preparation practices from school to school undermining the meaningfulness of achievement test results? Is there pressure to raise achievement test scores by the use of educationally unsound practices? What uses of achievement test scores are most common? Do teachers and administrators have reasonably accurate views of test score uses? 相似文献

12.

Combining Multiple-Choice and Constructed-Response Test Scores: Toward a Marxist Theory of Test Construction

《教育实用测度》2013,26(2):103-118

Assessment instruments of the future will probably be composed of a combination of different types of questions. Even though different kinds of questions require different scoring procedures, there may be a need to have those different scores combined as a composite. In this article, we describe how mixtures of such scores may be efficaciously combined. Also, if no post hoc adjustment is desired, we provide two characterizations of measurement effectiveness to aid in making unadjusted score combinations efficient. In addition, we explore the implications for test construction of some typical findings. 相似文献

13.

New Perspectives on the Correlation of SAT Scores, High School Grades, and Socioeconomic Factors

Rebecca Zwick Jennifer Greif Green 《Journal of Educational Measurement》2007,44(1):23-45

In studies of the SAT, correlations of SAT scores, high school grades, and socioeconomic factors (SES) are usually obtained using a university as the unit of analysis. This approach obscures an important structural aspect of the data: The high school grades received by a given institution come from a large number of high schools, all of which have potentially different grading standards. SAT scores, on the other hand, can be assumed to have the same meaning across high schools. Our analyses of a large national sample show that, when pooled within-high-school analyses are applied, high school grades and class rank have larger correlations with family income and education than is evident in the results of typical analyses, and SAT scores have smaller associations with socioeconomic factors. SAT scores and high school grades, therefore, have more similar associations with SES than they do when only the usual across-high-school correlations are considered . 相似文献

14.

Alignment and Implications for Test Takers

Catherine J. Welch Stephen B. Dunbar 《Educational Measurement》2020,39(2):8-17

The use of assessment results to inform school accountability relies on the assumption that the test design appropriately represents the content and cognitive emphasis reflected in the state's standards. Since the passage of the Every Student Succeeds Act and the certification of accountability assessments through federal peer review practices, the content validity arguments supporting accountability have relied almost exclusively on the alignment of statewide assessments to state standards. It is assumed that if alignment does not hold, the scores will not provide valid inferences regarding the degree to which test takers have performed. Although alignment results are commonly used as evidence of test appropriateness, Polikoff (this issue) would argue that given the importance of alignment in policy decisions, research related to alignment is surprisingly limited. Few studies have addressed the adequacy of alignment methodologies and results as support for the inferences to be made (i.e., proficient on state standards). This paper uses an example of test taker performance (and common performance indicators) to investigate to what extent the degree of alignment impacts inferences made about performance (i.e., classification into performance levels, estimates of student ability, and student rank order). 相似文献

15.

Schooling and the Norming of Intelligence Test Scores

Sorel Cahan 《Educational Measurement》2000,19(3):26-32

How does schooling affect the development of intelligence in children? How should the amount of schooling be considered when developing norms for turning intelligence test performance into IQ scores? 相似文献

16.

Variability in Reading Scores on a Given Level of Intelligence Test Scores

《The Journal of educational research》2012,105(6):440-446

ABSTRACT

Previous studies have shown that several key variables influence student achievement in geometry, but no research has been conducted to determine how these variables interact. A model of achievement in geometry was tested on a sample of 102 high school students. Structural equation modeling was used to test hypothesized relationships among variables linked to successful problem solving in geometry. These variables, including motivation, achievement emotions, pictorial representation, and categorization skills, were examined for their influence on geometry achievement. Results indicated that the model fit well. Achievement emotions, specifically boredom and enjoyment, had a significant influence on student motivation. Student motivation influenced students’ use of pictorial representations and achievement. Pictorial representation also directly influenced achievement. Categorization skills had a significant influence on pictorial representations and student achievement. The implications of these findings for geometry instruction and for future research are discussed. 相似文献

17.

Maintaining Equivalent Cut Scores for Small Sample Test Forms

Andrew C. Dwyer 《Journal of Educational Measurement》2016,53(1):3-22

This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common‐item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common‐item equating methodology to standard setting ratings to account for systematic differences between standard setting panels) has received almost no attention in the literature. Identity equating was also examined to provide context. Data from a standard setting form of a large national certification test (N examinees = 4,397; N panelists = 13) were split into content‐equivalent subforms with common items, and resampling methodology was used to investigate the error introduced by each approach. Common‐item equating (circle‐arc and nominal weights mean) was evaluated at samples of size 10, 25, 50, and 100. The standard setting approaches (resetting and rescaling the standard) were evaluated by resampling (N = 8) and by simulating panelists (N = 8, 13, and 20). Results were inconclusive regarding the relative effectiveness of resetting and rescaling the standard. Small‐sample equating, however, consistently produced new form cut scores that were less biased and less prone to random error than new form cut scores based on resetting or rescaling the standard. 相似文献

18.

考试分数的强化与评价视域的窄化——有关考试与评价问题的几点辨析

吴维宁 ;高凌飚《考试研究》2009,(4):42-51

在应试教育的背景下,考试分数的作用被无限夸大。考试分数的强化窄化了评价视域,简化了课程目标,进而异化了基础教育。异化的教育又将考试分数的虚高价值进一步推向极致,最终形成教育的怪圈。本文在对考试分数的不当使用进行案例分析的基础上,提出正确理解与把握评价的几对关系,以期对走出教育的怪圈有所启示。相似文献

19.

Responses to Two Letters: Grading Practices and Improving Standardized Test Scores

John R. Hills 《Educational Measurement》1990,9(3):33-33

相似文献

20.

On State SAT Research: A Response to Wainer

Brian Powell Lala Carr Steelman 《Journal of Educational Measurement》1987,24(1):84-89

相似文献