Similar literature
 20 similar documents found (search time: 142 ms)
1.
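With the launch of the national College English teaching reform and the promulgation of the College English Curriculum Requirements (《大学英语课程教学要求》), Hainan Normal University reformed its College English testing system and incorporated an oral test into each semester's final examination. An analysis of the oral tests for College English (1), (2), and (3) taken by non-English-major students of the 2006 cohort shows that the test topics relate to students' lives, but that some items use input formats lacking authentic context, the output format lacks interactivity, and the scoring criteria have shortcomings. To address these problems, the oral testing system should be improved by making the test design communicative, by protecting test validity against candidates reciting memorized answers, and by raising scoring reliability through objective scoring criteria.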

2.
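Design and formulation of scoring criteria for College English oral tests   Total citations: 1, self-citations: 0, citations by others: 1
The scoring method is an important and rather thorny part of oral testing. Its validity and objectivity directly determine whether an oral test succeeds. This paper discusses how the scoring criteria were formulated in the College English oral test reform at Beihang University (北京航空航天大学) and the theoretical basis for them. The criteria draw on the strengths of both criterion-referenced and norm-referenced oral tests: on the one hand, test criteria are established to improve validity; on the other hand, norms are set to control oral test scoring at the macro level and thereby improve reliability.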

3.
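Design and implementation of large-scale College English oral testing   Total citations: 1, self-citations: 0, citations by others: 1
Oral proficiency testing at regular institutions of higher education has a positive washback effect on College English teaching. Given the very large number of College English (non-English-major) students, however, how to conduct valid and reliable large-scale oral testing is a question that deserves serious study. Through a case analysis of the large-scale Digital Recording Oral Test at Xinzhou Teachers University (忻州师范学院), the article describes in detail the content, procedure, and scoring criteria of a large-scale recorded oral test, and discusses the feasibility of large-scale digital recorded oral testing at regular institutions of higher education.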

4.
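The reliability and validity of English oral examinations are affected by many factors, including the test format, the scoring criteria, and the competence of the examiners. Improving validity and reliability requires keeping the form and content of the oral test consistent and designing scoring criteria that are scientific, objective, and workable. An English oral test with high reliability and validity has a positive washback effect on teaching.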

5.
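Recorded English oral tests differ from traditional oral tests in format and scoring process, and offer certain advantages. As a type of subjective test, their scoring process involves many complex factors. Investigating the process and model of scoring in recorded oral tests is therefore very important for improving their reliability and promoting their wider use. Given the important role of such institutions in China's higher English education, the study takes as its case the English re-examination for non-English-major postgraduate candidates at the School of Foreign Languages, North Minzu University (北方民族大学). Centering on the rater, the core of scoring theory, it examines how oral test raters handle the scoring criteria and how the scoring model of Milanovic et al. is applied in practice, and thereby explores the process and model of scoring recorded English oral tests at universities for ethnic minorities.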

6.
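This paper describes the main defects of the Test for English Majors, Band 4 (TEM4) from four angles: its negative washback effect, the item types of the TEM4 oral test and its recorded delivery, the prompt format and scoring criteria of the first part of the writing section, and the allocation of marks between the two parts of the writing section.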

7.
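PETS is a standardized English examination whose oral test has its own distinctive format and features. The PETS oral test focuses on assessing communicative language ability, and its scoring criteria likewise center on practical ability. The main purpose of this paper is to re-examine the reliability and validity of the PETS oral test in light of modern communicative testing theory, and to discuss the causes of errors in the oral test and strategies for candidates.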

8.
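The advanced oral test of the Hanyu Shuiping Kaoshi (HSK, the Chinese Proficiency Test) is currently the most influential test of spoken Chinese. An oral test should reflect a candidate's true language level, and its scoring criteria should be refined and made more scientific in six respects: naturalness of discourse, form of expression, clarity of expression, appropriateness, accuracy of information, and amount of relevant information.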

9.
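The Public English Test System (PETS) consists of a written test and an oral test and has five levels. The scoring method of the PETS-3 oral test has strengths as well as shortcomings, and it has had some effect on the reliability of the test. Scoring reliability depends on the combined effect of three things: the scoring criteria, the basis for scoring, and the raters. Together these ultimately ensure that scores are accurate and dependable.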

10.
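Cheng Li (程利). Overseas English (《海外英语》), 2011, (1): 7-8, 10
This study tests the effect of vocabulary breadth on oral English fluency under second-language acquisition conditions. Eight fluency indicators were used to measure the oral fluency of the participants (39 second-year English majors at Chongqing University). The study found that vocabulary breadth does have some effect on oral English fluency but is not the decisive factor. In foreign language teaching, vocabulary instruction should be kept in proper perspective: the creative use of general vocabulary in oral communication should be strengthened, and the key to teaching students English is teaching them how to use it flexibly to communicate.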

11.
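Using the think-aloud method supplemented by stimulated recall, this study collected verbal report data from four raters with different personality tendencies as they scored a paired oral test. Qualitative analysis shows that, in actual scoring, raters differ greatly in how they understand and use the rating scale. Specifically: (1) extroverted raters are more lenient in scoring than introverted raters; (2) introverted raters pay more attention to the specific indicators and criteria in the rating scale, whereas extroverted raters emphasize task completion and the comparison, communication, and interaction between candidates; (3) extroverted raters rely less on the rating scale than introverted raters and make more use of non-linguistic features. The findings have implications for revising test scoring criteria and for rater training.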

12.
This study investigated the agreement and stability of 3 teacher rating scales used to assess ADHD in preschool children: the ADHD Rating Scale, the Child Attention Profile (CAP), and the Conners' Teacher Rating Scale-28 (CTRS-28). A sample of suburban children (n = 60) was observed and rated by their teachers and assistant teachers at the preschool level (Time 1) and 4 years later at the elementary school level (Time 2). Agreement among the rating scales and interrater agreement between teacher and assistant teacher ratings yielded noticeably stronger correlations at Time 2 than at Time 1. Over the 4-year interval of the study, there was a significant change in the number of children identified as potential ADHD risks. It is probable that there was a high number of false-positive indications in the preschool ADHD screenings. It is also possible that the immature behavior of preschool children may mimic ADHD behavior at the elementary school level.

13.
Two complementary studies of poor and better attenders are presented. To measure emotional and behavioural difficulties (EBD) different teacher-completed rating scales were employed, and to determine social difficulties, the studies used sociometry and some items from the scales. One study had a longitudinal design. It revealed that, after controlling for social class, gender and EBD in Year 2, poor attendance between Years 2 and 6 had the effect of significantly increasing EBD in Year 6. Both studies also had a cross-sectional design element which gave an indication of some of the poor attenders’ social difficulties and confirmed the longitudinal findings. The rating scales also provided measurements of “neurotic” and “antisocial” disorders. These suggested that a “neurotic” disorder may have played a bigger part than an “antisocial” disorder in the poor attenders’ EBD. The article ends with a list of recent references dealing with intervention issues.

14.
This study examined the statistical effect of changing from a 10-point to a 5-point response scale on students' evaluation of instruction. Participants were 5,616 students enrolled in classes offered by the College of Social Sciences and Public Affairs at a large Western university, who completed both the old evaluation (10-point response) and the new evaluation (5-point response). On average, students used a greater range of points on the 5-point scale (32 percent) than on the 10-point scale (19 percent). The effects of changing the response scale are discussed.

15.
Several forms of assessment tools, including behavioral rating scales and objective tests such as the Continuous Performance Test (CPT), can be used to measure inattentive and hyperactive/impulsive behaviors associated with Attention-Deficit/Hyperactivity Disorder (ADHD). However, research with school-age children has shown that the correlations between parent ratings, teacher ratings, and scores on objective measures of ADHD-characteristic behaviors are modest at best. In this study, we examined the relations between parent and teacher ratings of ADHD and CPT scores in a sample of 65 preschoolers ranging from 50 to 72 months of age. No significant associations between teacher and parent ratings of ADHD were found. Parent ratings of both inattention and hyperactivity/impulsivity accounted for variance in CPT omission errors but not CPT commission errors. Teacher ratings showed evidence of convergent and discriminant validity when entered simultaneously in a hierarchical regression. These tools may be measuring different aspects of inattention and hyperactivity/impulsivity.

16.
17.
In this study, the authors examined 8 of the methods that have been proposed for computing confidence intervals around coefficient alpha, to determine which of these, if any, is the most accurate and precise. Monte Carlo methods were used to simulate samples under known and controlled population conditions in which the underlying item distribution is nonnormal and the item responses are those of rating scales rather than dichotomous items. Overall, despite concerns expressed over the use of Fisher's method for coefficient alpha, it generally outperformed the other methods. Larger sample sizes and larger coefficient alphas also resulted in better band coverage, whereas a smaller number of items resulted in poorer band coverage.
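For readers unfamiliar with the quantities named in this abstract, the following is a minimal sketch of coefficient alpha and one interval built with Fisher's z-transform. It is an illustration under stated assumptions, not the study's procedure: the function names are hypothetical, and treating alpha like a correlation with standard error 1/sqrt(n - 3) is one common formulation that may differ from the exact variant the authors evaluated.

```python
import math
from statistics import variance


def cronbach_alpha(rows):
    """Coefficient alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def fisher_interval(alpha, n, z_crit=1.959964):
    """Approximate 95% interval for alpha via Fisher's z-transform.

    Assumption: alpha is treated like a correlation, with standard
    error 1 / sqrt(n - 3) on the transformed scale.
    """
    z = math.atanh(alpha)  # 0.5 * ln((1 + alpha) / (1 - alpha))
    half_width = z_crit / math.sqrt(n - 3)
    return math.tanh(z - half_width), math.tanh(z + half_width)
```

Calling `fisher_interval(cronbach_alpha(rows), len(rows))` gives a lower and upper bound; the interval is asymmetric around alpha because the transform is nonlinear.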

18.
Credentialing examination developers rely on task (job) analyses for establishing inventories of task and knowledge areas in which competency is required for safe and successful practice in target occupations. There are many ways in which task-related information may be gathered from practitioner ratings, each with its own advantages and limitations. Two of the many alternative task analysis rating approaches are compared in situ: one establishing relative task saliency through a single scale of rated importance and another employing a composite of several independent scales. Outcomes regarding tasks ranked by two practitioner groups are compared. A relatively high degree of association is observed between tasks ranked through each approach, yielding comparable, though not identical, examination blueprints.

19.
Students’ attitude towards science (SAS) is often a subject of investigation in science education research. Rating-scale surveys are commonly used to study SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information about SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an existing dataset of the Trends in International Mathematics and Science Study (TIMSS) 2011. Data of all the eighth-grade participants from Hong Kong and Singapore (N = 9942) were retrieved for analysis. Additional insights from Rasch analysis that are not commonly available from conventional test and item analyses are discussed, such as invariant measurement of SAS, unidimensionality of the SAS construct, optimum utilization of SAS rating categories, and the item difficulty hierarchy in the SAS scale. Recommendations on how TIMSS items measuring SAS can be better designed are discussed. The study also highlights the importance of using Rasch estimates for the parametric statistical tests (e.g. ANOVA, t-test) that are common in science education research for group comparisons.

20.
The short-range property of interactions between scales in compressible turbulent flow was examined. An estimate of the scope of the short-range scales and formulae for the short-range eddy stress, heat transfer, and related quantities were given. A concept of resonant-range interactions between extremely contiguous scales was introduced, and formulae for the resonant-range eddy stress, heat transfer, and related quantities were also given. Multi-scale equations for compressible turbulent flows were presented. The multi-scale equations are approximately closed and do not contain any empirical constants. The compressibility effects on turbulence are determined by the Favre-averaged variables and the nonlinear relationships between the Favre-averaged and physical-averaged variables.
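As background for the last sentence, the density-weighted (Favre) average is conventionally defined as below. This is the standard textbook definition, given here as a sketch; the symbols (density ρ, a generic flow variable f, an overbar for the ordinary average) are assumptions and may not match the paper's notation.

```latex
\tilde{f} = \frac{\overline{\rho f}}{\overline{\rho}}, \qquad
f = \tilde{f} + f'', \qquad
\overline{\rho f''} = 0
```

Because the weighting by ρ sits inside the average, the Favre average and the ordinary average of f differ by a term involving the density-fluctuation correlation, which is why the relationship between the two sets of variables is nonlinear.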
