This study was designed to examine the level of dependence within multiple true-false (MTF) test item clusters by computing sets of item intercorrelations with data from a test composed of both MTF and multiple choice (MC) items. It was posited that internal analysis reliability estimates for MTF tests would be spurious due to elevated MTF within-cluster intercorrelations. Results showed that, on the average, MTF within-cluster dependence was no greater than that found between MTF items from different clusters, between MC items, or between MC and MTF items. But item for item, there was greater dependence between items within the same cluster than between items of different clusters.

6.

THE RELATIVE MERITS OF MULTIPLE TRUE-FALSE ACHIEVEMENT TESTS

DAVID A. FRISBIE DARYL C. SWEENEY 《Journal of Educational Measurement》1982,19(1):29-35

7.

THE EFFECTS OF GUTTMAN WEIGHTS ON THE RELIABILITY AND PREDICTIVE VALIDITY OF OBJECTIVE TESTS WHEN OMISSIONS ARE NOT DIFFERENTIALLY WEIGHTED

PAUL RAFFELD 《Journal of Educational Measurement》1975,12(3):179-185

8.

INCREMENTAL RELIABILITY AND VALIDITY OF MULTIPLE-CHOICE TESTS WITH AN ANSWER-UNTIL-CORRECT PROCEDURE

GERALD S. HANNA 《Journal of Educational Measurement》1975,12(3):175-178

9.

THE INFLUENCE OF DIFFERENT STYLES OF TEXTBOOK USE ON INSTRUCTIONAL VALIDITY OF STANDARDIZED TESTS

DONALD J. FREEMAN GABRIELLA M. BELLI ANDREW C. PORTER ROBERT E. FLODEN WILLIAM H. SCHMIDT JOHN R. SCHWILLE 《Journal of Educational Measurement》1983,20(3):259-270

10.

SELF DESCRIPTION QUESTIONNAIRE III: THE CONSTRUCT VALIDITY OF MULTIDIMENSIONAL SELF-CONCEPT RATINGS BY LATE ADOLESCENTS

HERBERT W. MARSH ROSALIE O'NEILL 《Journal of Educational Measurement》1984,21(2):153-174

11.

THE USE OF "NONE-OF-THESE" VERSUS HOMOGENEOUS ALTERNATIVES ON MULTIPLE-CHOICE TESTS: EXPERIMENTAL RELIABILITY AND VALIDITY COMPARISONS

Malcom L. Williamson Kenneth D. Hopkins 《Journal of Educational Measurement》1967,4(2):53-58

12.

THE EFFECT OF ITEM TYPE ON THE CONSEQUENCES OF CHANGING ANSWERS ON MULTIPLE CHOICE TESTS

MALBERT SMITH III KINNARD P. WHITE RICHARD H. COOP 《Journal of Educational Measurement》1979,16(3):203-208

13.

THE INADEQUACY OF VERBAL TEACHING

Mowat G. Fraser 《Educational theory》1955,5(1):53-55

14.

IV. RELIABILITY AND VALIDITY OF THE CDI INVENTORIES

《Monographs of the Society for Research in Child Development》1994,59(5):25-31

15.

ADMINISTERING AND SCORING OF THE NARRATIVE TESTS

《Monographs of the Society for Research in Child Development》1996,61(1-2):230-238

16.

THE CONCURRENT VALIDITY OF STANDARDIZED ACHIEVEMENT TESTS BY CONTENT AREA USING TEACHERS' RATINGS AS CRITERIA

KENNETH D. HOPKINS CATHERINE A. GEORGE DAVID D. WILLIAMS 《Journal of Educational Measurement》1985,22(3):177-182

To assess the concurrent validity of standardized achievement tests using teachers' ratings (and rankings) of pupils' academic achievement as criteria, 42 teachers evaluated each of their students (n = 1,032) in each of five major curricular areas prior to the administration of a battery of standardized achievement tests. The teachers were directed to rate each student's proficiency disregarding attendance, attitude, deportment, and so on. Within-class correlation coefficients were computed to eliminate rater leniency bias. The standardized achievement tests were found to have substantial concurrent validity in reading, math, language arts, science, and social studies. The normalized teacher ranks yielded significantly higher validity coefficients than did the ratings, although the magnitude of the difference was small. The concurrent validity coefficients for language arts, reading, and math were significantly higher than those in science and social studies.
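
As a rough illustration of why a within-class correlation removes rater leniency, the sketch below centers both variables on their class means before correlating them, so a teacher who rates everyone high or low no longer shifts the result. The abstract does not say whether the coefficients were pooled or computed per class and then combined; the pooled form shown here, the function name `within_class_corr`, and the toy data are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def within_class_corr(classes, ratings, scores):
    """Pooled within-class Pearson correlation (illustrative sketch).

    Each rating and score is expressed as a deviation from its own
    class mean, so a constant per-teacher leniency offset cancels out.
    """
    groups = defaultdict(list)
    for c, r, s in zip(classes, ratings, scores):
        groups[c].append((r, s))

    s_rs = s_rr = s_ss = 0.0
    for pairs in groups.values():
        mean_r = sum(p[0] for p in pairs) / len(pairs)
        mean_s = sum(p[1] for p in pairs) / len(pairs)
        for r, s in pairs:
            dr, ds = r - mean_r, s - mean_s
            s_rs += dr * ds
            s_rr += dr * dr
            s_ss += ds * ds
    return s_rs / math.sqrt(s_rr * s_ss)


# Hypothetical data: teacher B rates 3 points higher than teacher A
# for students with identical test scores.
classes = ["A", "A", "A", "B", "B", "B"]
ratings = [1, 2, 3, 4, 5, 6]
scores = [10, 20, 30, 10, 20, 30]
r_within = within_class_corr(classes, ratings, scores)
```

In this toy data set the leniency offset lowers the ordinary correlation computed over all six students, while the within-class value reflects only how well each teacher orders his or her own students.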

17.

WHICH EXAMINEES ARE MOST FAVOURED BY THE USE OF MULTIPLE CHOICE TESTS?

GLENN L. ROWLEY 《Journal of Educational Measurement》1974,11(1):15-23

Scores were obtained from 198 ninth grade students on achievement motivation, test anxiety, testwiseness, and risktaking. Tests in mathematics and vocabulary were constructed in free response and multiple choice form, and administered to the subjects in that order, with an interval of 5 weeks between administrations. Partial correlations were computed between scores on the multiple choice tests and achievement motivation, test anxiety, testwiseness, and risktaking, with free response scores partialled out. The partial correlations were corrected for the unreliability in the free response scores, and tested for significance. All partials involving achievement motivation and test anxiety were nonsignificant, as were all partials based on mathematics scores. The partial correlations of vocabulary scores with testwiseness and risktaking were significant without exception. It was concluded that the use of multiple choice tests can favour certain examinees: those who are highly testwise and willing to take risks in the test situation. It was noted that the extent to which these examinees were favoured was dependent on the nature of the test, and that a verbal test seemed more susceptible than a numerical test.
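
The abstract does not give the formula used, but a partial correlation with the partialled-out variable corrected for unreliability is commonly written as in the sketch below. Here x stands for a multiple choice score, y for a trait such as testwiseness, and z for the free response score; the function name `partial_corr`, the parameter names, and the numeric inputs are assumptions for illustration, not values from the study.

```python
import math


def partial_corr(r_xy, r_xz, r_yz, r_zz=1.0):
    """Partial correlation of x and y with z partialled out.

    r_zz is the reliability of z. With r_zz = 1.0 this reduces to the
    ordinary first-order partial correlation; with r_zz < 1.0 the
    observed z is replaced by its true score (correction for
    unreliability in the covariate only).
    """
    numerator = r_zz * r_xy - r_xz * r_yz
    denominator = math.sqrt((r_zz - r_xz ** 2) * (r_zz - r_yz ** 2))
    return numerator / denominator


# Hypothetical correlations, chosen only to show the calculation.
uncorrected = partial_corr(0.6, 0.5, 0.5)           # z assumed perfectly reliable
corrected = partial_corr(0.6, 0.5, 0.5, r_zz=0.8)   # z reliability of 0.8
```

With these made-up inputs the corrected partial is smaller than the uncorrected one, because an unreliable covariate removes too little of the shared variance when it is taken at face value.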

18.

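VERBAL REPORTS: METHODS AND PROSPECTS

李贤 余嘉元 《内蒙古师范大学学报(哲学社会科学版)》2006,35(1):55-58

Verbal reporting is an important method for understanding human cognitive processes. Also known as the think-aloud method, it turns participants' thinking into overt speech, which lets researchers study complex human information processing directly. The authors describe the procedure for using the verbal report method, review applied research on it in China and abroad, and analyze its development trends and application prospects.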

19.

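ON APPROACHES AND COUNTERMEASURES FOR GRAIN SECURITY IN WESTERN CHINA

申亚民 徐宝勤 《西安文理学院学报》2002,17(2):1-4

The Western Development campaign and the "one retreat, three returns" (一退三还) policy have made grain security worse in the western region, which was already short of grain and economically underdeveloped, and have turned it into a hidden risk for China's grain security as a whole. This paper argues that the way to resolve grain security in the west is large-scale interregional circulation, and it proposes countermeasures covering the grain circulation system, a higher grain self-sufficiency rate, more stable supply, and greater purchasing power for western farmers, so as to achieve a virtuous cycle and sustainable development of grain security in the west and nationwide.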

20.

ESTIMATING THE RELIABILITY, VALIDITY, AND INVALIDITY OF ESSAY RATINGS

H. BLOK 《Journal of Educational Measurement》1985,22(1):41-52

In an essay rating study multiple ratings may be obtained by having different raters judge essays or by having the same rater(s) repeat the judging of essays. An important question in the analysis of essay ratings is whether multiple ratings, however obtained, may be assumed to represent the same true scores. When different raters judge the same essays only once, it is impossible to answer this question. In this study 16 raters judged 105 essays on two occasions; hence, it was possible to test assumptions about true scores within the framework of linear structural equation models. It emerged that the ratings of a given rater on the two occasions represented the same true scores. However, the ratings of different raters did not represent the same true scores. The estimated intercorrelations of the true scores of different raters ranged from .415 to .910. Parameters of the best fitting model were used to compute coefficients of reliability, validity, and invalidity. The implications of these coefficients are discussed.