Similar Documents
20 similar documents found (search time: 22 ms)
1.
This paper provides new evidence on the disparity between student evaluation of teaching (SET) ratings when evaluations are conducted online versus in class. Using multiple regression analysis, we show that after controlling for many of the class and student characteristics not under the direct control of the instructor, average SET ratings from evaluations conducted online are significantly lower than average SET ratings from evaluations conducted in class. Further, we demonstrate the importance of controlling for factors outside the instructor's control when using SET ratings to evaluate faculty performance in the classroom. We do not suggest that moving to online evaluation is overly problematic, only that it is difficult to compare evaluations done online with evaluations done in class. While we do not suppose that one method is 'more accurate' than the other, we do believe that institutions would benefit from either moving all evaluations online or continuing to conduct all evaluations in class.
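As an illustration of the kind of analysis this abstract describes, here is a minimal regression sketch in Python, assuming a hypothetical data file and column names (avg_set_rating, online, class_size, and so on); the paper's actual controls are not reproduced here.

```python
# Hypothetical sketch of the regression described above: SET rating regressed
# on an online-administration dummy plus class/student controls. File and
# variable names are assumptions, not the authors' actual data.
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("set_ratings.csv")  # one row per course section (hypothetical file)

model = smf.ols(
    "avg_set_rating ~ online + class_size + course_level + "
    "avg_expected_grade + response_rate",
    data=df,
).fit()
print(model.params["online"])  # a negative coefficient would match the reported online penalty
```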

2.
In recent years many universities switched from paper-based to online student evaluation of teaching (SET) without knowing the consequences for data quality. Based on a series of three consecutive field experiments (a split-half design, twin courses, and pre-post measurements), this paper examines the effects of survey mode on SET. First, all three studies reveal marked differences in non-response between online and paper-based SET, and systematic but small differences in the overall course ratings. On average, online SET paint a slightly less optimistic picture of teaching quality as perceived by students. At the same time, the web survey mode does not impair the reliability of student ratings. Second, we highlight the importance of taking selection and class absenteeism into account when studying survey mode effects, and show that it is both necessary and informative to survey the subgroup of no-shows when evaluating teaching. Third, we empirically demonstrate the need to account for the contextual setting of the survey (in class vs. after class) and the specific type of online survey mode (TAN vs. email). Previous research either confounded contextual setting with variation in survey mode or generalized results for a specific online mode to web surveys in general. Our findings suggest that higher response rates in email surveys can be achieved if students are given the opportunity and time to evaluate directly in class.

3.
Using nine years of student evaluation of teaching (SET) data from a large US research university, we examine whether changes to the SET instrument have a substantial impact on overall instructor scores. Our study exploits four distinct natural experiments that arose when the SET instrument was changed. To maximise power, we compare the same course/instructor before and after each of the four changes occurred. We find that switching from in-class, paper course evaluations to online evaluations generates an average change of −0.14 points on a five-point scale, or 0.25 standard deviations (SDs), in the overall instructor rating. Changing the labelling of the scale and the wording of the overall instructor question generates another decrease in the average rating: −0.15 points (0.27 SDs). In contrast, extending the evaluation period to include the final examination and offering an incentive (early grade release) for completing the evaluations do not have a statistically significant effect on the overall instructor rating. The cumulative impact of these individual changes is −0.29 points (0.52 SDs). This large decrease shows that SET scores are not comparable over time when instruments change. Therefore, administrators should measure and account for such changes when using historical benchmarks for evaluative purposes (e.g. appointments and compensation).
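A quick consistency check of the quoted effect sizes (a sketch, assuming the SD figures are simply the point changes divided by the standard deviation of overall instructor ratings):

```python
# Back-of-the-envelope check of the standardized effects quoted above,
# assuming "SDs" means the point change divided by the SD of overall ratings.
change_points = -0.14
effect_sds = 0.25
implied_sd = abs(change_points) / effect_sds   # ~0.56 points on a 5-point scale

cumulative_points = -0.29
print(cumulative_points / implied_sd)          # ~ -0.52, matching the 0.52 SDs reported
```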

4.
This paper examines the effects of two background variables on students' ratings of teaching effectiveness (SETs): class size and student motivation (proxied by students' likelihood of responding randomly). A resampling simulation methodology was employed to test the sensitivity of the SET scale for three hypothetical instructors (excellent, average, and poor). In an ideal scenario without confounding factors, SET statistics unmistakably distinguish the instructors. However, at different class sizes and levels of random responding, SET class averages are significantly biased. The results suggest that evaluations based on SET statistics should look at more than class averages. Resampling methodology (bootstrap simulation) is useful in SET research for scale sensitivity studies, validation of research results, and analysis of actual SET scores. Examples are given of how bootstrap simulation can be applied to real-life SET data comparison.
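The following is a minimal bootstrap sketch in the spirit of the simulation described above; the rating distribution, its spread, and the random-response model are all assumptions for illustration.

```python
# Simulate SET class averages for a hypothetical instructor at a given class
# size, with a fraction of students answering randomly. All parameters are
# illustrative assumptions, not the authors' simulation settings.
import numpy as np

rng = np.random.default_rng(0)

def simulate_class_mean(true_rating, class_size, p_random, n_boot=10_000):
    means = np.empty(n_boot)
    for i in range(n_boot):
        honest = rng.normal(true_rating, 0.5, class_size)
        random_mask = rng.random(class_size) < p_random
        random_scores = rng.integers(1, 6, class_size)   # uniform on the 1-5 scale
        scores = np.where(random_mask, random_scores, honest).clip(1, 5)
        means[i] = scores.mean()
    return means

# An "excellent" instructor (true rating 4.5) in a class of 20 with 30% random responders:
dist = simulate_class_mean(4.5, class_size=20, p_random=0.3)
print(dist.mean(), np.percentile(dist, [2.5, 97.5]))  # bias plus spread of class averages
```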

5.
Student evaluation of teaching (SET) is now common practice across higher education, with the results used for both course improvement and quality assurance purposes. While much research has examined the validity of SETs for measuring teaching quality, few studies have investigated the factors that influence student participation in the SET process. This study aimed to address this deficit through the analysis of an SET respondent pool at a large Canadian research-intensive university. The findings were largely consistent with the available research, showing the influence of student gender, age, specialisation area and final grade on SET completion. However, the study also identified additional course-specific factors, such as term of study, course year level and course type, as statistically significant. Collectively, these findings point to substantively significant patterns of bias in the characteristics of the respondent pool. Further research is needed to specify and quantify the impact (if any) on SET scores. We conclude, however, that such bias does not invalidate SET implementation; rather, it should be acknowledged and reported within standard institutional practice, allowing better understanding of the feedback received and driving future efforts at recruiting student respondents.

6.
This paper addresses the determination of statistically desirable response rates in student surveys, with emphasis on assessing the effect of underlying variability in the student evaluation of teaching (SET). We discuss factors affecting the determination of adequate response rates and highlight challenges caused by non-response and lack of randomization. Estimates of underlying variability were obtained over a period of 4 years from online evaluations at the University of British Columbia (UBC). Simulations were used to examine the effect of underlying variability on desirable response rates, and the UBC response rates were compared to those reported in the literature. The results indicate that small differences in underlying variability may not affect the desired rates. We present acceptable response rates for a range of variability scenarios, class sizes, confidence levels, and margins of error. The stability of the estimates observed at UBC over the 4-year period indicates that valid model-based inferences about SET could be made.
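To make the response-rate logic concrete, here is a sketch using the standard margin-of-error formula with a finite population correction; the variability estimate and targets are illustrative assumptions, not the UBC figures.

```python
# Smallest number of respondents n such that
#   z * sd/sqrt(n) * sqrt((N - n)/(N - 1)) <= moe
# for a class of size N. sd, moe, and z are illustrative assumptions.
import math

def required_respondents(class_size, sd, moe, z=1.96):
    N = class_size
    n = (z**2 * sd**2 * N) / ((N - 1) * moe**2 + z**2 * sd**2)
    return math.ceil(n)

for N in (20, 50, 200):
    n = required_respondents(N, sd=0.8, moe=0.3)
    print(N, n, f"{n / N:.0%}")  # class size, respondents needed, implied response rate
```

Note how the implied response rate falls sharply with class size: small classes need most students to respond before the class mean is precise, which is the core of the argument above.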

7.
In the past decade student evaluation of teaching has expanded within Australian universities, with the results of evaluations increasingly being used to make judgements about teaching quality, career advancement and the funding of teaching. An important component of any student evaluation of teaching system is, therefore, the communication of rating results in a manner that enables fair and meaningful interpretations and comparisons of results by the wide range of evaluation users. This paper describes the development in one Australian university of Rating Interpretation Guides (RIGs) which take into account the influence of different teaching contexts on ratings and encourage evaluation users to explore ratings in terms of a range of scores rather than focusing on a single mean score. RIGs represent an important innovation in the communication of evaluation results for both formative and summative evaluation purposes. They are designed to enable meaningful interpretations of ratings which in turn will encourage appropriate use of results.

8.
In recent years, there have been increasing calls from the government and other organizations to provide easy public access to student evaluations of teaching. Indeed, the increasing ease of displaying and viewing large quantities of information, and competition among universities and majors for students, makes it likely that an era of greater transparency of this type of information is at hand. While students' evaluation of teaching (SET) is one quantitative metric that rates the instructor, it may be influenced by factors that are often beyond the instructor's control. In this study, we analyze a longitudinal data set from both engineering and business schools of a large public university, and identify factors that influence SET. We show which factors have the highest influence on overall SET scores, and contrast these between engineering and business colleges. Colleges within the same university may have differences in the factors affecting SET, and recognition of this is important in effectively and fairly evaluating SET scores. We also provide recommendations regarding information that should be displayed along with the SET, particularly when SET scores are made public, so that instructors are not unduly penalized when their evaluations can be influenced by factors over which they have no control.

9.
Student evaluation of teaching (SET) ratings are used to evaluate faculty's teaching effectiveness based on a widespread belief that students learn more from highly rated professors. The key evidence cited in support of this belief comes from meta-analyses of multisection studies showing small-to-moderate correlations between SET ratings and student achievement (e.g., Cohen, 1980, 1981; Feldman, 1989). We re-analyzed previously published meta-analyses of the multisection studies and found that their findings were an artifact of small-sample studies and publication bias. Whereas the small-sample studies showed large and moderate correlations, the large-sample studies showed minimal or no correlation between SET ratings and learning. Our up-to-date meta-analysis of all multisection studies revealed no significant correlation between SET ratings and learning. These findings suggest that institutions focused on student learning and career success may want to abandon SET ratings as a measure of faculty's teaching effectiveness.
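The small-study artifact the authors describe can be illustrated with a minimal sketch comparing a naive average of study correlations with a sample-size-weighted average; the study values below are invented for illustration only.

```python
# Compare a naive mean of per-study SET/learning correlations with an
# n-weighted mean. Invented data: three small studies with large r, two
# large studies with near-zero r.
import numpy as np

r = np.array([0.45, 0.40, 0.30, 0.05, 0.02])   # per-study correlations
n = np.array([10, 12, 15, 200, 300])           # per-study section counts

naive_mean = r.mean()
weighted_mean = np.average(r, weights=n)       # large studies dominate

print(f"naive: {naive_mean:.2f}, n-weighted: {weighted_mean:.2f}")
# A large gap between the two is the small-study/publication-bias signature.
```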

10.
We proposed an extended form of the Govindarajulu and Barnett margin of error (MOE) equation and used it with an analysis of variance experimental design to examine the effects of aggregating student evaluations of teaching (SET) ratings on the MOE statistic. The interpretative validity of SET ratings can be questioned when the number of students enrolled in a course is low or when the response rate is low. A possible method of improving interpretative validity is to aggregate SET ratings data from two or more courses taught by the same instructor. Based on non-parametric comparisons of the generated MOE, we found that aggregating course evaluation data from two courses reduced the MOE in most cases. However, significant improvement was only achieved when combining course evaluation data for the same instructor for the same course. Significance did not hold when combining data from different courses. We discuss the implications of our findings and provide recommendations for practice.
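As an illustration of why aggregation helps, here is a sketch using the standard finite-population margin of error (the authors' extended Govindarajulu-Barnett form is not reproduced here); all numbers are illustrative assumptions.

```python
# Margin of error with finite population correction:
#   MOE = z * sd/sqrt(n) * sqrt((N - n)/(N - 1))
# Compare one course section with two pooled sections at the same response rate.
import math

def moe(n_respondents, class_size, sd, z=1.96):
    fpc = math.sqrt((class_size - n_respondents) / (class_size - 1))
    return z * sd / math.sqrt(n_respondents) * fpc

single = moe(12, 30, sd=0.8)   # one course: 12 of 30 students respond
pooled = moe(24, 60, sd=0.8)   # two sections pooled: 24 of 60 respond
print(f"single: {single:.2f}, pooled: {pooled:.2f}")  # pooling shrinks the MOE
```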

11.
Increasingly, student assessments of courses are being conducted online rather than administered in class. A growing body of research compares response rates and course ratings for courses evaluated online versus on paper. The present study extends this research by comparing student course assessments before and after the University of South Florida made online evaluations mandatory for all courses. This change only directly affected courses taught on campus, as online courses were already being assessed online. However, we examine the effect of this change on courses taught both on campus and online, because we expect the change in policy to have differential effects. We hypothesise that by making online assessments mandatory for all courses, online assessment went from being a novel method of evaluation to the norm, and therefore increased response rates for online courses but had the opposite effect for on-campus courses. We find mixed support for our hypothesis.

12.
This paper examines the effects of course characteristics on student ratings of university teaching. The multivariate effects of various course characteristics on student ratings of university teachers were examined by a 5×4×3×2×2 multivariate analysis of variance (MANOVA), with academic discipline, class size, level of course, type of course, and mode of study as independent variables and a set of six composite student ratings as the dependent variables. Data were obtained from a university in Hong Kong and consisted of 4996 course-average ratings collected from 25 departments over two academic years. Significant differences were found in course-average student ratings across academic disciplines, class size groups, course levels, types of course, and modes of study. The effect sizes of the largest differences between sub-groups within each factor were also estimated. Academic discipline and class size were found to have the largest effects on student ratings, with effect sizes of 0.43–0.86 and 0.55–1.01 respectively. Implications of the findings are discussed.
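A hypothetical sketch of such a factorial MANOVA in Python with statsmodels, assuming a data file with six composite rating columns (r1 through r6) and the five categorical factors named below:

```python
# Factorial MANOVA in the spirit of the study above. File and column names
# are assumptions about the data layout, not the authors' actual data.
import pandas as pd
from statsmodels.multivariate.manova import MANOVA

df = pd.read_csv("course_ratings.csv")  # one row per course average (hypothetical)

mv = MANOVA.from_formula(
    "r1 + r2 + r3 + r4 + r5 + r6 ~ "
    "C(discipline) + C(class_size_group) + C(course_level) "
    "+ C(course_type) + C(study_mode)",
    data=df,
)
print(mv.mv_test())  # Wilks' lambda and related statistics per factor
```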

13.
In recent years, colleges have been moving from traditional, classroom-based student evaluations of instruction to online evaluations. Because of the importance of these evaluations in decisions regarding retention, promotion and tenure, instructors are justifiably concerned about how this trend might affect their ratings. We recruited faculty members who were teaching two or more sections of the same course in a single semester and assigned at least one section to receive online evaluations and the other section(s) to receive classroom evaluations. We hypothesised that the online evaluations would yield a lower response rate than the classroom administration. We also predicted that there would be no significant differences in the overall ratings, the number of written comments, and the valence (positive/neutral/negative) of students' comments. A total of 32 instructors participated in the study over two semesters, providing evaluation data from 2057 students. As expected, online evaluations had a significantly lower response rate than classroom evaluations. Additionally, there were no differences in the mean ratings, the percentage of students who provided written comments or the proportion of comments in the three valence categories. Thus, even with the lower response rate for online evaluations, the two administration formats seemed to produce comparable data.

14.
A sample of 202 students who were filling in a student evaluation of teaching (SET) questionnaire was asked to complete a second questionnaire about the specific reasons for awarding a score to particular SET questionnaire items. The aim was to find out what influenced students' judgements on those items. It was found that students' interpretation of some questions differed from the 'expected' interpretation. Several factors, such as the placement of questions and the salience of items retrieved from memory, could influence a score. It was also found that asking for an explanation improved scores overall. The conclusions were that questionnaire completion can be understood as a form of problem solving and judgement under uncertainty, and that the specific heuristics used led to variability in students' interpretation of the task.

15.
In the context of increased emphasis on quality assurance of teaching, it is crucial that student evaluation of teaching (SET) methods be both reliable and workable in practice. Online SETs in particular tend to draw criticism from those most resistant to mechanisms of teaching accountability. However, most studies of SET processes have been conducted with small, cross-sectional convenience samples. Longitudinal studies are rare, as comparison studies of SET methodological approaches are generally pilot studies followed shortly afterwards by implementation. The investigation presented here contributes to the debate by examining the impact of the online administration method on a very large longitudinal sample at the course level rather than the student level, thereby accounting for the interdependence of responses from students taught by the same instructor. It explores the impact of the administration method (paper-based in-class vs. out-of-class online collection) on SET scores, using a longitudinal sample of over 63,000 student responses collected over a period of 10 years. After adjusting for the confounding effects of class size, faculty, year of evaluation, years of teaching experience and student performance, we observe that an effect of the administration method exists but is of negligible size.

16.
The use of student evaluation of teaching (SET) to evaluate and improve teaching is widespread among institutions of higher education. Many authors have searched for a conclusive understanding of the influence of student, course, and teacher characteristics on SET. One hotly debated issue concerns the interpretation of the positive and statistically significant relationship that has been found between course grades and SET scores. In addition to reviewing the literature, the main purpose of the present study is to examine the influence of course grades and other characteristics of students, courses, and teachers on SET. Data from 1244 evaluations were collected using the SET-37 instrument and analyzed by means of cross-classified multilevel models. The results show positive significant relationships between course grades, class attendance, the examination period in which students receive their highest course grades, and the SET score. These relationships, however, are subject to different interpretations. Future research should focus on providing a definitive and empirically supported interpretation of these relationships. In the absence of such an interpretation, it will remain unclear whether these relationships offer proof of the validity of SET or whether they constitute a biasing factor.
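A cross-classified model of this kind can be sketched in Python with statsmodels, which can fit crossed random effects through variance components under a single dummy group; the file and column names are assumptions, and the SET-37 model specification is not reproduced here.

```python
# Hypothetical cross-classified sketch: SET scores simultaneously grouped by
# student and by teacher (crossed, not nested). Column names are assumptions.
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("set37_evaluations.csv")  # hypothetical file
df["all"] = 1  # single top-level group so student and teacher effects are crossed

result = smf.mixedlm(
    "set_score ~ course_grade + class_attendance",
    data=df,
    groups="all",
    vc_formula={"student": "0 + C(student_id)", "teacher": "0 + C(teacher_id)"},
).fit()
print(result.summary())
```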

17.
Problem behaviors in preschool-aged children negatively affect teacher-child relationships and children's skill development. In this clinical replication of an initial study, we implemented Teacher-Child Interaction Training (TCIT), a teacher-delivered, universal intervention designed for early childhood settings. The initial study evaluated the TCIT program in a sample of 4- to 5-year-old children, whereas the current study focused on 2- to 3-year-old children. Teacher ratings of children's behavior indicated a significant main effect for time on children's protective factor scores, but not on behavioral concerns. However, for children whose ratings fell in the below-average range at baseline, significant large effect sizes were obtained for changes over time for both protective factors and behavioral concerns. Higher levels of teacher skill change were significantly associated with overall higher protective factor scores, as well as lower behavioral concern scores for children when baseline levels of behavioral concerns were high. Results provide further support for the effectiveness of TCIT as a universal intervention designed to improve children's behaviors through targeted improvements in teachers' relationship-building skills and classroom management strategies.

18.
In the last 10–15 years, many institutions of higher education have switched from paper-and-pencil methods to online methods of administering student evaluations of teaching (SETs). One consequence has been a significant reduction in the response rates to such instruments. The current study was conducted to identify whether offering in-class time to students to complete online SETs would increase response rates. A quasi-experiment (nonequivalent group design) was conducted in which one group of tenured faculty instructed students to bring electronic devices with internet capabilities on a specified day and offered in-class time to students to complete online SETs. A communication protocol for faculty members' use was developed and implemented. A comparison group of tenured faculty who did not offer in-class time for SET completion was identified, and the difference-in-differences method was used to compare the previous year's response rates for the same instructor teaching the same course across the two groups. Response rates were substantially higher when faculty provided in-class time to students to complete SETs. These results indicate that high response rates can be obtained for online SETs submitted by students in face-to-face classes if faculty communicate the importance of SETs in both their words and actions.
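The difference-in-differences logic is simple enough to show as a worked sketch; the response-rate numbers below are invented for illustration.

```python
# Worked difference-in-differences example matching the design above.
# Treated: faculty who offered in-class time this year; comparison: those who
# did not. Each pair is the same instructor teaching the same course.
treated_before, treated_after = 0.45, 0.80
comparison_before, comparison_after = 0.47, 0.50

did = (treated_after - treated_before) - (comparison_after - comparison_before)
print(f"DiD estimate: {did:+.2f}")
# +0.32: the response-rate gain attributable to offering in-class time,
# net of the common year-to-year trend captured by the comparison group.
```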

19.
The development of new technologies and ensuing pedagogical research has led many tertiary institutions to integrate and adopt online learning strategies. The authors of this study have incorporated online learning strategies into existing educational practices of a second year anatomy course, resulting in half of the course content delivered via face-to-face lectures, and half delivered online via tailored video vignettes, with accompanying worksheets and activities. The effect of the content delivery mode on student learning was analyzed by tailoring questions to content presented either face-to-face or online. Four practical tests were conducted across the semester with each consisting of four questions. Within each test, two questions were based on content delivered face-to-face, and two questions were based on content delivered online. Examination multiple choice questions were similarly divided and assessed. Findings indicate that student learning is consistent regardless of the mode of content delivery. However, student viewing habits had a significant impact on learning, with students who viewed videos multiple times achieving higher marks than those less engaged with the online content. Student comments also indicated that content delivery mode was not an influence on learning. Therefore student engagement, rather than the mode of content delivery, is a determinant of student learning and performance in human anatomy.

20.
This paper examines the stability and validity of a student evaluation of teaching (SET) instrument used by the administration at a university in the PR China. The SET scores for two semesters of courses taught by 435 teachers were collected. In total, 388 teachers (170 males and 218 females) were also invited to fill out the 60-item NEO Five-Factor Inventory together with a demographic information questionnaire. The SET responses were found to have very high internal consistency, and confirmatory factor analysis supported a one-factor solution. The SET re-test correlations were .62 both for teachers who taught the same course (n = 234) and for those who taught a different course in the second semester (n = 201). Linguistics teachers received higher SET scores than teachers in either the social sciences and humanities or science and technology. Student ratings were significantly related to Neuroticism and Extraversion. Regression results showed that the Big Five personality traits as a group explained only 2.6% of the total variance in student ratings, whereas academic discipline explained 12.7%. Overall, the stability and validity of the SET were supported, and future uses of SET scores in the PR China are discussed.
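The variance-partitioning result can be illustrated with a hierarchical-regression sketch comparing R² across nested models; the file and column names are assumptions, not the authors' data.

```python
# Compare R-squared from nested OLS models: discipline alone vs. discipline
# plus the Big Five traits. A sketch under assumed column names.
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("set_personality.csv")  # hypothetical file

base = smf.ols("set_score ~ C(discipline)", data=df).fit()
full = smf.ols(
    "set_score ~ C(discipline) + neuroticism + extraversion + "
    "openness + agreeableness + conscientiousness",
    data=df,
).fit()

print(f"discipline R2: {base.rsquared:.3f}")
print(f"Big Five delta R2: {full.rsquared - base.rsquared:.3f}")
```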
