1.

A BAYESIAN DECISION-THEORETIC PROCEDURE FOR USE WITH CRITERION-REFERENCED TESTS

H. SWAMINATHAN, RONALD K. HAMBLETON, JAMES ALGINA 《Journal of Educational Measurement》1975,12(2):87-98

2.

THE ROLE OF RELIABILITY IN CRITERION-REFERENCED TESTS

MICHAEL T. KANE 《Journal of Educational Measurement》1986,23(3):221-224

In discussion of the properties of criterion-referenced tests, it is often assumed that traditional reliability indices, particularly those based on internal consistency, are not relevant. However, if the measurement errors involved in using an individual's observed score on a criterion-referenced test to estimate his or her universe scores on a domain of items are compared to errors of an a priori procedure that assigns the same universe score (the mean observed test score) to all persons, the test-based procedure is found to improve the accuracy of universe score estimates only if the test reliability is above 0.5. This suggests that criterion-referenced tests with low reliabilities generally will have limited use in estimating universe scores on domains of items.
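
The 0.5 threshold can be reproduced from the classical test theory decomposition. The following is a minimal sketch, assuming observed score = universe score + uncorrelated error and ignoring sampling error in the group mean; the function names are hypothetical and not taken from the article.

```python
def error_variances(observed_variance, reliability):
    """Return (test_based_error, mean_based_error) as error variances.

    Assumes X = T + E with T and E uncorrelated, so that
    var(T) = reliability * var(X) and var(E) = (1 - reliability) * var(X).
    """
    # Using each person's observed score as the estimate: the error is E.
    test_based = (1.0 - reliability) * observed_variance
    # Assigning everyone the group mean: the error is T minus the mean,
    # whose variance is var(T).
    mean_based = reliability * observed_variance
    return test_based, mean_based


def observed_score_beats_mean(reliability):
    """True when the observed score is the more accurate estimate."""
    test_based, mean_based = error_variances(1.0, reliability)
    return test_based < mean_based


for rho in (0.3, 0.5, 0.7):
    print(rho, error_variances(100.0, rho), observed_score_beats_mean(rho))
```

Under these assumptions the two error variances are equal at a reliability of exactly 0.5, and the test-based estimate wins only above that value.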

3.

AN INTERPRETATION OF LIVINGSTON'S RELIABILITY COEFFICIENT FOR CRITERION-REFERENCED TESTS

CHESTER W. HARRIS 《Journal of Educational Measurement》1972,9(1):27-29

An alternative interpretation of Livingston's reliability coefficient is based on the notion of the relation of the size of the reliability coefficient to the range of talent. It is shown that the (generally) larger Livingston coefficient does not imply a smaller standard error of measurement and consequently does not imply a more dependable determination of whether or not a true score falls below (or exceeds) a given criterion value.
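
A small numeric sketch may make the point concrete. It assumes the commonly cited form of Livingston's coefficient, k² = (ρσ² + (μ − C)²) / (σ² + (μ − C)²), and the classical standard error of measurement σ√(1 − ρ). Neither formula appears in the abstract, and the function names and numbers are hypothetical.

```python
import math


def livingston_k2(reliability, variance, mean, cutoff):
    """Livingston-style coefficient for a given cutoff score (assumed form)."""
    shift = (mean - cutoff) ** 2
    return (reliability * variance + shift) / (variance + shift)


def standard_error_of_measurement(reliability, variance):
    """Classical SEM; note that it does not depend on the cutoff."""
    return math.sqrt(variance * (1.0 - reliability))


rho, var, mean = 0.6, 25.0, 30.0
for cutoff in (30.0, 35.0, 40.0):
    k2 = livingston_k2(rho, var, mean, cutoff)
    sem = standard_error_of_measurement(rho, var)
    print(f"cutoff={cutoff}: k2={k2:.3f}, SEM={sem:.3f}")
```

In this sketch the coefficient grows as the cutoff moves away from the mean, while the standard error of measurement stays the same for every cutoff.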

4.

A REPLY TO HARRIS "AN INTERPRETATION OF LIVINGSTON'S RELIABILITY COEFFICIENT FOR CRITERION-REFERENCED TESTS"

SAMUEL A. LIVINGSTON 《Journal of Educational Measurement》1972,9(1):31-31

5.

A CONSUMERS' GUIDE TO CRITERION-REFERENCED TEST RELIABILITY

RONALD A. BERK 《Journal of Educational Measurement》1980,17(4):323-349

6.

CRITERION-REFERENCED TESTING: COMMENTS ON RELIABILITY

RICHARD J. SHAVELSON, JAMES H. BLOCK, MICHAEL M. RAVITCH 《Journal of Educational Measurement》1972,9(2):133-137

Currently there is concern among some educators regarding the reliability of criterion-referenced (CR) measures. In this comment, a recent attempt to develop a theory of reliability for CR measures is examined, and some considerations for determining the reliability of CR measures are discussed. Conventional reliability statistics (e.g., coefficient alpha, standard error of measurement) are found appropriate for CR measures satisfying the assumptions of the measurement model underlying classical test theory. For measures with underlying multidimensional traits, conventional reliability statistics may be used at the homogeneous subscale level. When the confidence interval about a student's “below criterion score” includes the criterion, additional evidence about the student should be obtained. Two-stage sequential testing is suggested as one method for acquiring additional evidence.
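
The confidence-interval rule can be sketched as follows. This is an illustration only: it assumes a normal-theory interval of score ± z × SEM with SEM = SD × √(1 − reliability), and the function names, the 1.96 multiplier, and the example numbers are hypothetical.

```python
import math


def score_interval(score, sd, reliability, z=1.96):
    """Approximate confidence interval around an observed score."""
    sem = sd * math.sqrt(1.0 - reliability)
    return score - z * sem, score + z * sem


def needs_second_stage(score, criterion, sd, reliability, z=1.96):
    """Flag a below-criterion score whose interval still reaches the criterion."""
    low, high = score_interval(score, sd, reliability, z)
    return score < criterion and low <= criterion <= high


# SEM here is 5.0 * sqrt(0.16), about 2.0, so the interval half-width is about 3.92.
print(needs_second_stage(score=22, criterion=24, sd=5.0, reliability=0.84))
print(needs_second_stage(score=15, criterion=24, sd=5.0, reliability=0.84))
```

A student flagged by `needs_second_stage` would be a candidate for the additional testing stage; a student far below the criterion would not.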

7.

THE ISSUE OF ITEM AND TEST VARIANCE FOR CRITERION-REFERENCED TESTS: A REPLY

M. I. CHAS. E. WOODSON 《Journal of Educational Measurement》1974,11(2):139-140

It is a necessary condition that items and tests have variance and discrimination in the range of interest (population of observations) for which they are calibrated and selected. The basis for selection of the calibration sample determines the kind of scale which will be developed. A random sample from a population of individuals leads to a norm-referenced scale, and a sample representative of abilities of a range of a characteristic leads to a criterion-referenced scale.

8.

THE ISSUE OF ITEM AND TEST VARIANCE FOR CRITERION-REFERENCED TESTS: A CLARIFICATION

JASON MILLMAN, W. JAMES POPHAM 《Journal of Educational Measurement》1974,11(2):137-138

This note contends that item or score variability is an unnecessary characteristic of criterion-referenced tests as they have been traditionally conceived, namely, as measures of well-defined classes of examinee behaviors.

9.

A COMPARISON OF THREE METHODS OF ESTABLISHING CUT-OFF SCORES ON CRITERION-REFERENCED TESTS

CRAIG N. MILLS 《Journal of Educational Measurement》1983,20(3):283-292

10.

A NOTE ON THE INTERPRETATION OF THE CRITERION-REFERENCED RELIABILITY COEFFICIENT

SAMUEL A. LIVINGSTON 《Journal of Educational Measurement》1973,10(4):311-311

11.

ESTIMATING RELIABILITY FROM A SINGLE ADMINISTRATION OF A CRITERION-REFERENCED TEST

MICHAEL J. SUBKOVIAK 《Journal of Educational Measurement》1976,13(4):265-276

12.

DETERMINING THE LENGTHS FOR CRITERION-REFERENCED TESTS

RONALD K. HAMBLETON, CRAIG N. MILLS, ROBERT SIMON 《Journal of Educational Measurement》1983,20(1):27-38

13.

THE ISSUE OF ITEM AND TEST VARIANCE FOR CRITERION-REFERENCED TESTS

M. I. CHAS. E. WOODSON 《Journal of Educational Measurement》1974,11(1):63-64

It has been argued that item variance and test variance are not necessary characteristics for criterion-referenced tests, although they are necessary for norm-referenced tests. This position is in error because it considers sample statistics as the criteria for evaluating items and tests. Within a particular sample, an item or test may have no variance, but in the population of observations for which the test was designed, calibrated, and evaluated, both items and tests must have variance.

14.

TOWARD AN INTEGRATION OF THEORY AND METHOD FOR CRITERION-REFERENCED TESTS

RONALD K. HAMBLETON, MELVIN R. NOVICK 《Journal of Educational Measurement》1973,10(3):159-170

In this paper, an attempt has been made to synthesize some of the current thinking in the area of criterion-referenced testing as well as to provide the beginning of an integration of theory and method for such testing. Since criterion-referenced testing is viewed from a decision-theoretic point of view, approaches to reliability and validity estimation consistent with this philosophy are suggested. Also, to improve the decision-making accuracy of criterion-referenced tests, a Bayesian procedure for estimating true mastery scores has been proposed. This Bayesian procedure uses information about other members of a student's group (collateral information), but the resulting estimation is still criterion referenced rather than norm referenced in that the student is compared to a standard rather than to other students. In theory, the Bayesian procedure increases the “effective length” of the test by improving the reliability, the validity, and more importantly, the decision-making accuracy of the criterion-referenced test scores.
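
How collateral information can behave like extra test items may be illustrated with a conjugate beta-binomial model. This is a simplified stand-in chosen for illustration, not the procedure proposed in the paper; the prior parameters, the function names, and the 0.8 standard are all hypothetical.

```python
def posterior_mean_proportion(correct, n_items, prior_a, prior_b):
    """Posterior mean of the true proportion correct.

    Assumes a Beta(prior_a, prior_b) prior summarizing the student's group
    and a binomial likelihood for the student's own item responses.
    """
    return (prior_a + correct) / (prior_a + prior_b + n_items)


def effective_length(n_items, prior_a, prior_b):
    """Items answered plus the pseudo-items contributed by the prior."""
    return n_items + prior_a + prior_b


# Hypothetical group prior worth 10 pseudo-items at 60% correct.
prior_a, prior_b = 6.0, 4.0
correct, n_items = 7, 8
standard = 0.8  # hypothetical mastery standard

raw = correct / n_items
estimate = posterior_mean_proportion(correct, n_items, prior_a, prior_b)
print("raw proportion:", raw, "meets standard:", raw >= standard)
print("posterior mean:", estimate, "meets standard:", estimate >= standard)
print("effective length:", effective_length(n_items, prior_a, prior_b))
```

In both cases the student is compared to the fixed standard rather than to other students; the group information only changes the estimate that is compared.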

15.

EFFECTS OF DIFFERENT SAMPLES ON ITEM AND TEST CHARACTERISTICS OF CRITERION-REFERENCED TESTS

THOMAS MICHAEL HALADYNA 《Journal of Educational Measurement》1974,11(2):93-99

Although many have rejected classical test construction and analysis procedures for criterion-referenced tests, the present study was concerned with the possibility that classical procedures are both applicable and appropriate when samples of both mastery and nonmastery examinees are employed. A rationale for using these samples was presented, and empirical evidence was gathered which supported the practice of combining samples to increase the variance of test scores and thereby permit the proper estimate of reliability and item validities.

16.

ON THE USE OF CUT-OFF SCORES WITH CRITERION-REFERENCED TESTS IN INSTRUCTIONAL SETTINGS

RONALD K. HAMBLETON 《Journal of Educational Measurement》1978,15(4):277-290

17.

ESTIMATING THE RELIABILITY OF MULTIPLE TRUE-FALSE TESTS

DAVID A. FRISBIE, CYNTHIA A. DRUVA 《Journal of Educational Measurement》1986,23(2):99-105

This study was designed to examine the level of dependence within multiple true-false (MTF) test item clusters by computing sets of item intercorrelations with data from a test composed of both MTF and multiple choice (MC) items. It was posited that internal analysis reliability estimates for MTF tests would be spurious due to elevated MTF within-cluster intercorrelations. Results showed that, on the average, MTF within-cluster dependence was no greater than that found between MTF items from different clusters, between MC items, or between MC and MTF items. But item for item, there was greater dependence between items within the same cluster than between items of different clusters.

18.

A NOTE ON HUYNH'S NORMAL APPROXIMATION PROCEDURE FOR ESTIMATING CRITERION-REFERENCED RELIABILITY

CHAO-YING J. PENG, MICHAEL J. SUBKOVIAK 《Journal of Educational Measurement》1980,17(4):359-368

19.

ACCURACY OF TWO PROCEDURES FOR ESTIMATING RELIABILITY OF MASTERY TESTS

HUYNH HUYNH, JOSEPH C. SAUNDERS 《Journal of Educational Measurement》1980,17(4):351-358

20.

EMPIRICAL INVESTIGATION OF PROCEDURES FOR ESTIMATING RELIABILITY FOR MASTERY TESTS

MICHAEL J. SUBKOVIAK 《Journal of Educational Measurement》1978,15(2):111-116
