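PISA and TIMSS are two of the more active international assessment programmes of recent years. They differ in their assessment purposes, the assessment frameworks they use, and the format of their test items, yet they also share certain similarities. Analysing these similarities and differences helps us better understand the key elements of mathematics curriculum implementation and of the assessment of mathematical literacy.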

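The Trends in International Mathematics and Science Study (TIMSS) and the Programme for International Student Assessment (PISA) are two well-known international comparative studies of student academic achievement. Dozens of countries participate in them, and they have a broad influence on education worldwide. To help educators in China better understand and use the findings of these two programmes, this article introduces their basic features and focuses on the similarities and differences in their assessment purposes, theoretical models, and assessment instruments.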


There appears to be something of an intellectual and philosophical gulf between education researchers who seek insights from statistical analyses of complex data-sets such as those provided by the OECD (PISA), and others who seek to develop rich, contextualised socio-historical understandings that can shed light upon why particular classroom practices operate and are sustained within a given milieu. This paper outlines these different perspectives, with particular reference to non-cognitive factors. Detailed analysis of the roots of high academic achievement, and associated challenges to student wellbeing, in many East Asian countries, is provided. The important influence of broad political and societal factors is highlighted by reference to cross-cultural differences across a large number of countries. The paper concludes by stating that while data from large datasets can assist in gaining greater understanding of cross-cultural differences, to be meaningful, such analyses should be incorporated within complex ecosystemic accounts.

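The TIMSS 2007 science assessment built its assessment framework on the science curricula of the participating countries and developed standardised science assessment instruments. Based on an examination of the TIMSS 2007 science assessment framework, this article analyses the development, structure, and item-writing characteristics of the assessment instruments and draws implications for reforming academic achievement assessment under China's new curriculum.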

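This article reviews the background, meaning, and dimensions of creative thinking in PISA 2021, analyses the factors that influence creative thinking in classroom teaching and how it manifests there, and describes the assessment objectives, domain design, creativity test design, and sample items of the PISA 2021 creative thinking assessment. Creative thinking is an essential core competence for students adapting to the 21st century. Experts from multiple fields should be brought together for theoretical research on creative thinking; schools should uphold the idea of promoting students' creative thinking; and teachers should become facilitators of students' creative thinking, cultivating it through the curriculum and classroom teaching.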

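By analysing the assessment philosophy and techniques of the Programme for International Student Assessment (PISA), a large-scale educational assessment programme with considerable international influence, and drawing on lessons learned from the PISA 2006 China pilot study conducted by the Ministry of Education Examination Center, this article sets out the author's reflections and recommendations on the research and development of educational assessment in China.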

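From the standpoint of test administration, different tests may have different administration rules and procedures, but all of them aim to have students take the examination or test under equal conditions and in equivalent settings, thereby reducing measurement error and keeping the results objective, fair, and accurate. Seen in this light, HSA offers not only an internationally oriented approach to testing; at the level of concrete test administration it also provides much experience that domestic education quality monitoring can draw on.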

This study investigates measurement invariance of the mathematics, science, and ICT scales across the 47 countries that participated in the PISA 2015 ICT Familiarity Questionnaire. Knowing whether the same constructs and measurements can be reliably compared across countries constitutes an important goal. The Alignment method is employed to test the measurement invariance of the three scales. The results show that mathematics and science scores are highly invariant and can be used to compare countries, whereas the ICT scale is mostly non-invariant and cannot be used to reliably compare ICT means across all participating countries. Implications and limitations are discussed.

The present study aimed to examine the measurement invariance of the information, communication and technology (ICT) engagement questionnaire in the Programme for International Student Assessment (PISA) across countries, and further to explore the effects of ICT engagement on student literacy. The sample included 98,758 15-year-old students from 16 countries in PISA 2018. The multi-group confirmatory factor analyses results indicated that the ICT engagement questionnaire was invariant at the residual level so that meaningful cross-country comparisons can be made at the observed level. Furthermore, the multi-level model results indicated that perceived ICT autonomy was a positive predictor, while use of social media was a negative predictor of student literacy across the countries. However, the relations of interest in ICT and perceived ICT competence to student literacy were inconsistent, with some countries showing positive and linear relationships, while others negative and linear relationships.


This paper accepts that the OECD’s PISA has become influential in policy terms globally, but analyses the ways that the main PISA and PISA for Schools tests are positioned differently in Australia and the USA because of contrasting educational federalisms in the two nations. Our argument is that while PISA is undoubtedly influential, its effects are nonetheless mediated by the political structures – here, the different models of federalism – present within different nations, which in turn leads to quite distinct ‘PISA effects’. For instance, Australia oversamples on main PISA to make its data available for national and state-level policymaking, whereas the USA, with its focus on local governance in schooling, does not oversample, meaning that main PISA does not have a comparable policy salience as in Australia. Conversely, the newer PISA for Schools test originated in the USA with pressure from educators and philanthropic interests and has been implemented in a good number of schools, but it has not been taken up in the same way in Australia. Our analyses show how these differences reflect the idiosyncratic workings of federalism in the two nations, in which the federal government has a stronger policy and funding role in Australia than has hitherto been the case for the federal government in the USA.

The Standards for Educational and Psychological Testing identify several strands of validity evidence that may be needed as support for particular interpretations and uses of assessments. Yet assessment validation often does not seem guided by these Standards, with validations lacking a particular strand even when it appears relevant to an assessment. Consequently, the degree to which validity evidence supports the proposed interpretation and use of the assessment may be compromised. Guided by the Standards, this article presents an independent validation of OECD's PISA assessment of mathematical self-efficacy (MSE) as an instructive example of this issue. OECD identifies MSE as one of a number of “factors” explaining student performance in mathematics, thereby serving the “policy orientation” of PISA. However, this independent validation identifies significant shortcomings in the strands of validity evidence available to support this interpretation and use of the assessment. The article therefore demonstrates how the Standards can guide the planning of a validation to ensure it generates the validity evidence relevant to an interpretive argument, particularly for an international large-scale assessment such as PISA. The implication is that assessment validation could yet benefit from the Standards as what Zumbo calls “a global force for testing”.

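The TIMSS 2011 mathematics assessment shows that fourth- and eighth-grade students in five East Asian education systems (Singapore, Korea, Chinese Taipei, Hong Kong (China), and Japan) continue to lead the world in mathematics achievement and in the share of high-performing students. Scores in the content and cognitive domains rise in step with overall scores; geometry scores are relatively weak, and scores on understanding are relatively better than those on applying and reasoning. Key influencing factors such as the home environment, school resources and climate, teacher staffing, and classroom teaching (the mathematics curriculum) are positively correlated with mathematics achievement.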


Research related to the “teacher characteristics” dimension of teacher quality has proven inconclusive and weakly related to student success, and addressing the teaching contexts may be crucial for furthering this line of inquiry. International large-scale assessments are well positioned to undertake such questions due to their systematic sampling of students, schools, and education systems. However, researchers are frequently prohibited from answering such questions due to measurement invariance related issues. This study uses the traditional multiple group confirmatory factor analysis (MGCFA) and an alignment optimization method to examine measurement invariance in several constructs from the teacher questionnaires in the Trends in International Mathematics and Science Study (TIMSS) 2015 across 46 education systems. Constructs included mathematics teacher’s Job satisfaction, School emphasis on academic success, School condition and resources, Safe and orderly school, and teacher’s Self-efficacy. The MGCFA results show that just three constructs achieve invariance at the metric level. However, an alignment optimization method is applied, and results show that all five constructs fall within the threshold of acceptable measurement non-invariance. This study therefore presents an argument that they can be validly compared across education systems, and a subsequent comparison of latent factor means compares differences across the groups. Future research may utilize the estimated factor means from the aligned models in order to further investigate the role of teacher characteristics and contexts in student outcomes.


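Among the many participating countries, the United States has studied the various kinds of information collected by TIMSS in considerable depth, and this research has had a far-reaching influence on the direction of national education policy. This article first summarises the characteristics of US TIMSS research in terms of its subjects, content, and means of dissemination. On that basis it examines the impact of TIMSS research on US education, analysing four aspects: federal and local education reform, a new research method (the video study), the expansion of federal authority over education, and the shift in educational thinking from individualism to standardisation.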

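The TIMSS 2019 mathematics assessment found that primary and secondary school students' mathematics achievement is gradually improving and that the proportion reaching the international benchmarks is also rising, with differences by region, gender, content domain, and cognitive domain. In addition, background factors such as the home and school environment, teachers' and students' preparation for teaching and learning, students' attitudes toward learning mathematics, and the mathematics curriculum and instruction all affect mathematics learning. To promote the healthy development of mathematics education in China, we should carry forward the best of China's educational traditions, absorb advanced educational ideas, update the content of the mathematics curriculum, improve the evaluation of education and teaching, and foster positive attitudes toward learning.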

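In April 2009, Shanghai took part in the Programme for International Student Assessment (PISA) for the first time. Using biology examples, this article briefly introduces how PISA assesses scientific literacy.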

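The latest results from the Trends in International Mathematics and Science Study (TIMSS) show that the overall international level of students' average science achievement is gradually rising, that East Asian countries and regions are in the lead, and that Singapore ranks first in the world. Research indicates that factors such as early educational enrichment, educational resources, school climate and teaching staff, and students' attitudes and confidence affect average science achievement. To promote the healthy development of science education in China, it is necessary first to build a safe learning environment and foster students' sense of school belonging and positive attitudes toward science; second, to train professional teachers who can support students' science practices; and finally, to connect curriculum materials with students' everyday experience and develop science curriculum resources sensibly.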

Motivational constructs and students’ engagements have great impacts on students’ mathematics achievements, yet they have not been theoretically investigated using international large-scale assessment data. This study utilized the mathematics data of the Trends in International Mathematics and Science Study 2011 to conduct a comparative and empirical study on exploring: (1) the changes of motivational constructs from 4th grade to 8th grade; and (2) the effects of motivational constructs from the expectancy-value model and students’ engagements on mathematics achievements. The countries investigated include Chinese Taipei, Singapore, and the USA. The results showed that: (1) students’ motivations deteriorate over school years; (2) self-confidence in mathematics (SCM) has the strongest relationship with mathematics achievements. Furthermore, it is evident that Singapore has the most effective schools in students’ mathematics education. More findings of this comparative study are subsequently discussed.

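PISA is currently one of the best-known international programmes for assessing educational outcomes. Drawing on practical experience from the 2006 PISA trial in Tianjin, this article introduces PISA's approach to administration and its operating procedures in terms of the background of the PISA test, the administration process, and security measures. On this basis, and from the perspective of test administration, it summarises issues that educational examination bodies in China could learn from.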

