共查询到20条相似文献,搜索用时 15 毫秒
1.
Elina Saeki Natasha Segool Laura Pendergast Nathaniel von der Embse 《Psychology in the schools》2018,55(4):391-403
This study examined the potential influence of test‐based accountability policies on school environment and teacher stress among early elementary teachers. Structural equation modeling of data from 541 kindergarten through second grade teachers across three states found that use of student performance on high‐stakes tests to evaluate teachers indirectly was related to teachers’ professional investment via test stress in the environment. Although students in kindergarten through second grade do not take high‐stakes assessments, early elementary teachers reported high levels of stress associated with test‐based accountability policies. This study provides data across multiple states that test‐based accountability policies may have negative influences on school environment and teacher stress among early elementary teachers. Implications for practice and research are discussed. 相似文献
2.
Most studies predicting college performance from high‐school grade point average (HSGPA) and college admissions test scores use single‐level regression models that conflate relationships within and between high schools. Because grading standards vary among high schools, these relationships are likely to differ within and between schools. We used two‐level regression models to predict freshman grade point average from HSGPA and scores on both college admissions and state tests. When HSGPA and scores are considered together, HSGPA predicts more strongly within high schools than between, as expected in the light of variations in grading standards. In contrast, test scores, particularly mathematics scores, predict more strongly between schools than within. Within‐school variation in mathematics scores has no net predictive value, but between‐school variation is substantially predictive. Whereas other studies have shown that adding test scores to HSGPA yields only a minor improvement in aggregate prediction, our findings suggest that a potentially more important effect of admissions tests is statistical moderation, that is, partially offsetting differences in grading standards across high schools. 相似文献
3.
4.
Much of the recent focus of educational policymakers has been on improving the measurement of teacher effectiveness. Linking student growth to teacher effects has been a large part of reform efforts. To date, neither researchers nor practitioners have arrived at a consensus on how to treat test scores from students with disabilities in growth‐based teacher effectiveness indicators, despite the fact that these students make up approximately 13% of the K‐12 student population. In this study, we leverage longitudinal data from the population of teachers in one state to explore practical questions related to including general assessment scores from students with disabilities in teacher evaluation. Findings suggest that including test scores from students with disabilities allows more teachers to be evaluated and does not substantially affect teachers’ scores. Moreover, including disability‐related covariates can allow for fairer evaluations for teachers with many students with disabilities in their class. 相似文献
5.
Jennifer Booher‐Jennings 《British Journal of Sociology of Education》2008,29(2):149-160
Although high‐stakes tests play an increasing role in students’ schooling experiences, scholars have not examined these tests as sites for socialisation. Drawing on qualitative data collected at an American urban primary school, this study explores what educators teach students about motivation and effort through high‐stakes testing, how students interpret and internalise these messages, and how student hierarchies develop as a result. I found that teachers located boys’ failure in their poor behavior and attitudes, while arguing that girls simply needed more self‐esteem to pass the test. Most boys accepted their teachers’ diagnosis of the problem. However, the boys who felt that they were already ‘doing their best’ and ‘working hard’ began to doubt that educational success is a function of merit and effort. I conclude that students learn about much more than the three Rs through their experiences with high‐stakes testing, and argue that future research should attend to the social dimensions of these experiences. 相似文献
6.
《Educational Philosophy and Theory》2013,45(3):357-360
I will argue in what follows, following the insights of James Marshall on busno‐cratic power, that resistance to this new power is already well underway, and that this resistance is potentially problematic and potentially transgressive (in Marshall's words ‘a reflective reconstitution’) . The self is not only a chooser in busno‐cratic land, it is also re‐commodifying itself and in so doing, beginning to struggle at the limits of its commodified situation. I will argue that commodified selves, as much as they are constrained, are also potent sites for resistance. Part of that resistance is being waged in the terrain of the high stakes test, where the self that could ‘choose’ runs headlong into a product that definitively limits its range of choice. In order to engage critically with this resistance, I examine the cracks in the monolithic power of testing, cracks that point to the uncertainty of numbers and the ambivalent anxieties of test takers. 相似文献
7.
Nathaniel P. von der Embse Alexander M. Schoemann Stephen P. Kilgus Maribeth Wicoff Mark Bowler 《教育心理学》2017,37(3):312-331
The present study examined the use of student test performance for merit pay and teacher evaluation as predictive of both educator stress and counterproductive teaching practices, and the moderating role of perceived test value. Structural equation modelling of data from a sample of 7281 educators in a South-eastern state in the United States supported the hypothesis that educators who perceived the test as an invalid measure of teaching effectiveness were more likely to report high levels of test stress and to use counterproductive teaching practices, including fear appeals, in an attempt to motivate students for test-taking. This study provides initial evidence for the hypothesised relationships of test-based accountability policy with teacher mental health and instructional practices. Implications for research and practice are discussed. 相似文献
8.
应用项目反应理论对中考命题质量进行分析,可以排除抽样干扰,准确评估试题的难度,客观精细地描述试题的区分度,评估整套试卷和各试题对学生能力估计的精度,查找赋分标准和阅卷过程中存在的问题。 相似文献
9.
本研究利用建构图设计一套含有六大部分的30道试题。题型包括拼写题、选择题和简答题。共有175名6到14岁儿童参加了此项考试。Rasch分析结果发现题组内局部题目依赖并不严重。信度为0.85。考题的难度和考生能力的配合度相当良好。我们根据建构图来编写考题,因此有一定程度的内容效度。但有9道题的难度稍微与原先预期略有出入。有5道题不大吻合Rasch模式的预期,没有发现在性别上有明显的项目功能差异。考生能力与学习英语的时间有正相关。最后探讨了基于信息通讯技术的远程计算机自适应测验的技术问题。 相似文献
10.
Wayne W. Au 《British Journal of Sociology of Education》2008,29(6):639-651
High‐stakes, standardized testing has become the central tool for educational reform and regulation in many industrialized nations in the world, and it has been implemented with particular intensity in the United States and the United Kingdom. Drawing on research on high‐stakes testing and its effect on classroom practice and pedagogic discourse in the United States, the present paper applies Bernstein’s concept of the pedagogic device to explain how high‐stakes tests operate as a relay in the reproduction of dominant social relations in education. This analysis finds that high‐stakes tests, through the structuring of knowledge, actively select and regulate student identities, and thus contribute to the selection and regulation of students’ educational success. 相似文献
11.
Damazo T. Kadengye Eva Ceulemans Wim Van Den Noortgate 《Journal of Experimental Education》2015,83(2):175-202
In educational environments, monitoring persons' progress over time may help teachers to evaluate the effectiveness of their teaching procedures. Electronic learning environments are increasingly being used as part of formal education and resulting datasets can be used to understand and to improve the environment. This study presents longitudinal models based on the item response theory (IRT) for measuring persons' ability within and between study sessions in data from web-based learning environments. Two empirical examples are used to illustrate the presented models. Results show that by incorporating time spent within- and between-study sessions into an IRT model; one is able to track changes in ability of a population of persons or for groups of persons at any time of the learning process. 相似文献
12.
Stephanie Abraham Beth A. Wassell Kathryn McGinn Luet Nancy Vitalone-Racarro 《教育政策杂志》2019,34(4):523-546
This study is a critical discourse analysis of the New Jersey Opt-Out Movement. In 2015, and in response to the increasing standardization of US public school instruction, and over-use of high-stakes testing, NJ parents began to refuse to allow their children to take a key end-of-the year exam, the PARCC. We employ the concepts of master and counter narratives to exam the qualitative data that was collected in an online-questionnaire, a focus group, and refusal letters. We argue that these parents have refused this exam for their children based on differing reasons, that parents effectively changed the master narrative of the PARCC to grow their movement, and their refusal letters were key in promoting a new counter narrative. 相似文献
13.
14.
Liang-Ting Tsai 《Environmental Education Research》2019,25(2):264-279
This study established a Chinese scale for measuring high school students’ ocean literacy. This included testing its reliability, validity, and differential item functioning (DIF) with the aim of compensating for the lack of DIF tests focusing on current scales. The construct validity and reliability were verified and tested by analyzing the established scale’s items using the Rasch model, and a gender DIF test was conducted to ensure the test results’ fairness when distinct groups were compared simultaneously. The results indicated that the scale established in this study is unidimensional and possesses favorable internal consistency and construct validity. The gender DIF test results indicated that several items were difficult for either female or male students to correctly answer; however, the experts and scholars discussed these items individually and suggested retaining them. The final Chinese version of the ocean literacy scale developed here comprises 48 items that can reflect high school students’ understanding of ocean literacy—which helps students understand the topics of marine science encountered in real life. 相似文献
15.
Jaekyung Lee 《教育政策杂志》2013,28(1):73-93
This study tracks American states’ policy choices under the No Child Left Behind Act and explores their consequences for student achievement. Using the path analysis of relationships among state‐level policy input, context, and outcome variables, the study portrays a Halloween‐like ‘trick‐or‐treating’ game between the federal and state governments in the new ecology of the test‐driven education accountability system. States that chose the ‘trick’ path with a calculative policy negotiation and manipulation strategy made significant gains on their own state assessments but not on the national assessment. In contrast, states that followed the ‘treat’ path with a faithful policy implementation for funding strategy have not yet brought about significant gains on either the national or state assessments. The first‐generation accountability states with a prior history of high‐stakes testing tended to employ both strategies at the same time. However, neither effective illusion nor ineffective implementation serves the goal of long‐term, sustainable academic improvement. Implications for research and policy are discussed. 相似文献
16.
This paper analyzes the contradictory location of the professional and managerial new middle class within the rising tension between old systems of the industrial capitalist model of education, epitomized by a reliance on high‐stakes, standardized testing and the newer forms of production associated with the ‘fast’ capitalism of the global economy. The author argues that the professional and managerial new middle class is faced with a dilemma since they benefit from systems of high‐stakes, standardized testing, yet require schools to also teach the types of skills and flexibility associated with knowledge economy. The analysis suggests that this dilemma represents the contradictory class location of the new middle class relative to both discursive and productive resources. 相似文献
17.
王善桃 《廊坊师范学院学报(自然科学版)》2009,9(4):34-35,39
自适应考试系统具有公正、安全、高效的特点,它的理论基础是项目反应模型,运用迭代法来求解非线性方程,采用专家估计和极大似然估计相结合的方法来计算应试者的能力值,为自适应考试系统的抽题算法打下基础。 相似文献
18.
Qingping He 《Educational research; a review for teachers and all concerned with progress in education》2013,55(1):89-112
Background:?Although on-demand testing is being increasingly used in many areas of assessment, it has not been adopted in high stakes examinations like the General Certificate of Secondary Education (GCSE) and General Certificate of Education Advanced level (GCE A level) offered by awarding organisations (AOs) in the UK. One of the major issues with on-demand testing is that some of the methods used for maintaining the comparability of standards over time in conventional testing are no longer available and the development of new methods is required. Purpose:?This paper proposes an item response theory (IRT) framework for implementing on-demand testing and maintaining the comparability of standards over time for general qualifications, including GCSEs and GCE A levels, in the UK and discusses procedures for its practical implementation. Sources of evidence:?Sources of evidence include literature from the fields of on-demand testing, the design of computer-based assessment, the development of IRT, and the application of IRT in educational measurement. Main argument:?On-demand testing presents many advantages over conventional testing. In view of the nature of general qualifications, including the use of multiple components and multiple question types, the advances made in item response modelling over the past 30 years, and the availability of complex IRT analysis software systems, coupled with increasing IRT expertise in awarding organisations, IRT models could be used to implement on-demand testing in high stakes examinations in the UK. The proposed framework represents a coherent and complete approach to maintaining standards in on-demand testing. The procedures for implementing the framework discussed in the paper could be adapted by people to suit their own needs and circumstances. Conclusions:?The use of IRT to implement on-demand testing could prove to be one of the viable approaches to maintaining standards over time or between test sessions for UK general qualifications. 相似文献
19.
Substantial growth in the numbers of English language learners (ELLs) in the United States and Canada in recent years has significantly affected the educational systems of both countries. This article focuses on critical issues and concerns related to the assessment of ELLs in U.S. and Canadian schools and emphasizes assessment approaches for test developers and decision makers that will facilitate increased equity, meaningfulness, and accuracy in assessment and accountability efforts. It begins by examining the crucial issue of defining ELLs as a group. Next, it examines the impact of testing originating from the No Child Left Behind Act of 2001 (NCLB) in the U.S. and government‐mandated standards‐driven testing in Canada by briefly describing each country's respective legislated testing requirements and outlining their consequences at several levels. Finally, the authors identify key points that test developers and decision makers in both contexts should consider in testing this ever‐increasing group of students. 相似文献
20.
In this study, we examined the impact of covariate measurement error (ME) on the estimation of quantile regression and student growth percentiles (SGPs), and find that SGPs tend to be overestimated among students with higher prior achievement and underestimated among those with lower prior achievement, a problem we describe as ME endogeneity in this article. We proceeded to assess the effect of covariate ME correction on SGP estimation at two levels—the individual (student) and the aggregate (classroom). Our ME correction approach was limited to the simulation‐extrapolation method known as SIMEX. For both the individual and aggregate SGP, we find SIMEX effective in bias reduction. Further, because SIMEX is especially effective in reducing SGP bias for students with very high or very low prior achievement, it significantly weakens the ME endogeneity. SIMEX is also effective in reducing the MSE of aggregate SGP, provided that the students are sorted to some extent on their latent prior achievement. Our empirical study confirms the pattern of the simulation results: SIMEX mainly affects the mean SGP of classes in the highest and lowest quintiles of the prior score distribution, and significantly lowers the correlation between class SGP and prior achievement. 相似文献