期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

TEST ITEM ARRANGEMENT, TESTING TIME, AND PERFORMANCE

RONALD N. MARSO 《Journal of Educational Measurement》1970,7(2):113-118

Two experiments were conducted to determine if a relationship exists between test item arrangements and student performance on power tests. The primary hypotheses were: item arrangements based upon item difficulty, similarity of content, or order of class presentation do not influence test score or required testing time. In the first experiment 122 subjects were randomly assigned to three item difficulty arrangements of 139 test items with a 0–100% difficulty range, and in the second experiment 156 subjects were randomly assigned to three item content arrangements of 103 items. Results of analyses of variance with test anxiety used as a classification factor supported the hypotheses. 相似文献

2.

EFFECTS OF ANXIETY TYPE AND ITEM-DIFFICULTY SEQUENCING ON MATHEMATICS TEST PERFORMANCE*

NELSON J. TOWLE PAUL F. MERRILL 《Journal of Educational Measurement》1975,12(4):241-249

相似文献

3.

EFFECTS OF DIFFERENT SAMPLES ON ITEM AND TEST CHARACTERISTICS OF CRITERION-REFERENCED TESTS

THOMAS MICHAEL HALADYNA 《Journal of Educational Measurement》1974,11(2):93-99

Although many have rejected classical test construction and analysis procedures for criterion-referenced tests, the present study was concerned with the possibility that classical procedures are both applicable and appropriate when samples of both mastery and nonmastery examinees are employed. A rationale for using these samples was presented, and empirical evidence was gathered which supported the practice of combining samples to increase the variance of test scores and thereby permit the proper estimate of reliability and item validities. 相似文献

4.

THE EFFECTS OF VARIOUS FORMS OF ITEM ARRANGEMENTS ON TEST PERORMANCE

Gilbert Sax Theodore R. Cromack 《Journal of Educational Measurement》1966,3(4):309-311

相似文献

5.

EFFECTS OF TEST FAMILIARIZATION ON SAT PERFORMANCE

DONALD E. POWERS DONALD L. ALDERMAN 《Journal of Educational Measurement》1983,20(1):71-79

相似文献

6.

TEST PERFORMANCE UNDER THE CONDITION OF KNOWN ITEM DIFFICULTY

SCHUYLER W. HUCK 《Journal of Educational Measurement》1978,15(1):53-58

相似文献

7.

THE EFFECTS OF MANIPULATED ITEM WRITING CONSTRAINTS ON THE HOMOGENEITY OF TEST ITEMS1

EVA L. BAKER 《Journal of Educational Measurement》1971,8(4):305-309

Described are the effects of four sets of instructions on the observed item inter- correlations of current events and subtraction items. The four conditions were: (a) general objective, (b) behavioral objective, (c) behavioral objective plus test item, and (d) behavioral objective plus item-form. Two tests, one in each subject matter, constructed by selecting four items generated from each of the experimental conditions, were administered to 51 seventh grade children. Not found were the expected tendencies toward greater homogeneity among items produced under the three conditions employing behavioral objectives. 相似文献

8.

THE EFFECTS OF ITEM ANALYSIS METHODS AND CONFIDENCE LEVELS UPON TEST VALIDITY AND CROSS-VALIDITY

J. C. Wofford 《Journal of Educational Measurement》1968,5(2):109-114

相似文献

9.

THE EFFECTS OF VIOLATIONS OF UNIDIMENSIONALITY ON THE ESTIMATION OF ITEM AND ABILITY PARAMETERS AND ON ITEM RESPONSE THEORY EQUATING OF THE GRE VERBAL SCALE

NEIL J. DORANS NEAL M. KINGSTON 《Journal of Educational Measurement》1985,22(4):249-262

One of the major assumptions of item response theory (IRT)models is that performance on a set of items is unidimensional, that is, the probability of successful performance by examinees on a set of items can be modeled by a mathematical model that has only one ability parameter. In practice, this strong assumption is likely to be violated. An important pragmatic question to consider is: What are the consequences of these violations? In this research, evidence is provided of violations of unidimensionality on the verbal scale of the GRE Aptitude Test, and the impact of these violations on IRT equating is examined. Previous factor analytic research on the GRE Aptitude Test suggested that two verbal dimensions, discrete verbal (analogies, antonyms, and sentence completions)and reading comprehension, existed. Consequently, the present research involved two separate calibrations (homogeneous) of discrete verbal items and reading comprehension items as well as a single calibration (heterogeneous) of all verbal item types. Thus, each verbal item was calibrated twice and each examinee obtained three ability estimates: reading comprehension, discrete verbal, and all verbal. The comparability of ability estimates based on homogeneous calibrations (reading comprehension or discrete verbal) to each other and to the all-verbal ability estimates was examined. The effects of homogeneity of item calibration pool on estimates of item discrimination were also examined. Then the comparability of IRT equatings based on homogeneous and heterogeneous calibrations was assessed. The effects of calibration homogeneity on ability parameter estimates and discrimination parameter estimates are consistent with the existence of two highly correlated verbal dimensions. IRT equating results indicate that although violations of unidimensionality may have an impact on equating, the effect may not be substantial. 相似文献

10.

INTERACTION OF RACE AND TEST ON READING PERFORMANCE SCORES

NORMAN EAGLE ANNA S. HARRIS 《Journal of Educational Measurement》1969,6(3):131-135

This study examines the relationship between race and performance on two nationally standardized reading tests. The appropriate reading tests of the Iowa Test of Basic Skills and Metropolitan Achievement Battery were administered to all fourth and sixth-grade students in all elementary schools of an urban school district near New York City. Although white pupils earned higher scores than nonwhite pupils on both tests, the Metropolitan produced significantly greater differences between the races than the Iowa, at both grade levels. Factorial analysis of variance confirmed the statistical significance of these differences. Implications of Race X Test (suggesting S.E.S. X Test) interaction effects for program evaluation and instruction are briefly discussed. 相似文献

11.

EFFECTS OF AGE,SEX, AND STATUS ON PERCEPTION OF THE UTILITY OF EDUCATIONAL PARTICIPATION

Edward E. Marcus 《Educational gerontology》2013,39(4):295-319

This study investigated the influence of age, sex, and socioeconomic status on the perception of participants in adult education that their participation is useful. Two forms of utility were postulated: instrumental and expressive. An instrument containing scales of perceived utility, needs, goals, time orientation, and enjoyment was administered to selected classes at various educational institutions in the Chicago metropolitan area and, for comparison, a class in Florida. The results permitted inferences that needs, goals, and time orientation partially determine perception that participation is instrumentally useful and that age, status, and femaleness tend to favor perception of expressive utility. The findings supported previous research indicating that adult educational participation is complex behavior involving more than subject matter interests and motivational orientations and opened a new line of attack on the problem. 相似文献

12.

EFFECTS OF SEX,APTITUDES, AND ATTITUDES ON THE ACADEMIC ACHIEVEMENT OF COLLEGE FRESHMEN1

Donald J. Veldman 《Journal of Educational Measurement》1968,5(3):245-249

相似文献

13.

DEMONSTRATING THE UTILITY OF THE STANDARDIZATION APPROACH TO ASSESSING UNEXPECTED DIFFERENTIAL ITEM PERFORMANCE ON THE SCHOLASTIC APTITUDE TEST

NEIL J. DORANS EDWARD KULICK 《Journal of Educational Measurement》1986,23(4):355-368

The standardization method for assessing unexpected differential item performance or differential item functioning (DIF) is introduced. The principal findings of the first five studies that have used this approach on the Scholastic Aptitude Test are presented. 相似文献

14.

PERFORMANCE ON THE RAVEN PROGRESSIVE MATRICES AS A FUNCTION OF AGE,EDUCATION, AND SEX

Ruth Guttman 《Educational gerontology》2013,39(1):49-55

The Raven Progressive Matrices (RPM) were administered to 408 individuals in 100 family groups. Subjects’ ages ranged from 8 to 60. Scores on all five subtests were highest in the 18‐26 age group, decreasing with age. Males scored higher on each subtest in each age group. Performance on the RPM increased with additional years of education. Within each educational level, performance declined with age. Although decline with age appears to be invariant with education, changes in schools and educational methods may be factors operating in addition to aging. 相似文献

15.

INTERACTIONS BETWEEN ITEM CONTENT AND GROUP MEMBERSHIP ON ACHIEVEMENT TEST ITEMS

ROBERT L. LINN DELWYN L. HARNISCH 《Journal of Educational Measurement》1981,18(2):109-118

相似文献

16.

APPLICATION OF ITEM RESPONSE MODELS TO CRITERION-REFERENCED TEST ITEM SELECTION

RONALD K. HAMBLETON DATO N. M. DE GRUIJTER 《Journal of Educational Measurement》1983,20(4):355-367

相似文献

17.

MEASURING PROBLEM SOLVING ABILITY IN MATHEMATICS WITH MULTIPLE-CHOICE ITEMS: THE EFFECT OF ITEM FORMAT ON SELECTED ITEM AND TEST CHARACTERISTICS

ROBERT A. FORSYTH KEVIN F. SPRATT 《Journal of Educational Measurement》1980,17(1):31-43

相似文献

18.

EFFECT OF INCREASED TEST-TAKING TIME ON TEST SCORES BY ETHNIC GROUP, YEARS OUT OF SCHOOL, AND SEX

CHERYL L. WILD ROBIN DURSO DONALD B. RUBIN 《Journal of Educational Measurement》1982,19(1):19-28

相似文献

19.

THE EXTENT, CAUSES AND IMPORTANCE OF CONTEXT EFFECTS ON ITEM PARAMETERS FOR TWO LATENT TRAIT MODELS

WENDY M. YEN 《Journal of Educational Measurement》1980,17(4):297-311

相似文献

20.

SOCIAL CLASS AND PERFORMANCE ON AN INTELLIGENCE TEST

CLINTON I. CHASE RICHARD C. PUGH 《Journal of Educational Measurement》1971,8(3):197-202

Noting the wide differences in verbal abilities of middle and lower class children, the investigators proposed that two groups of children, one from the lower class, one from the middle class, who achieve comparable total scores on a group intelligence test, would get their scores by successfully completing different sets of items. In the first study children were placed in social classes based on their fathers' occupations, following guidelines from the Warner scale. Middle class children were matched with lower class children on total Otis scores. No item-social class interaction was found. The study was repeated using the occupational categories of the Dictionary of Occupational Titles as a guide to social class standing. Again no item-social class interaction appeared. If two social class groups are equated on total intelligence scores, one social class sample appears to succeed on essentially the same test items as does the other social class sample. A given score on an intelligence test appears to represent the same skills for one social class as it does for another social class. 相似文献