
Abstract

The data of the Collaborative Behavioral Teratology Study were reanalyzed. The reproducibility of the treatment effect across experiments was assessed as the magnitude of fluctuation in strength of association (η²) for the replication experiment. There was no evidence that treatment effects fluctuated more across experiments for behavioral measures than for nonbehavioral ones, which suggests that the conventional belief that low reproducibility of results in behavioral teratology reflects low reliability of the behavioral tests per se is unwarranted. Although the distributions of obtained treatment effects were almost symmetric and unimodal for most measures, the ranges were considerable. Because the experiments were conducted under strictly standardized conditions, these considerable ranges indicate that inconsistency in results across experiments cannot be adequately remedied by standardizing methods alone. Relying mainly on conventional significance tests, the researchers in the collaborative study concluded that excellent reproducibility of behavioral data had been demonstrated. The logic underlying the statistically nonsignificant interaction used in the original work was examined critically. A serious limitation of the significance test for assessing the reproducibility of results was pointed out, and a warning was issued against customary reliance on a significance test alone.
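As a rough illustration of the quantity the abstract refers to, the sketch below computes η² for each of several experiments and then reports the range of those values. It assumes the classical one-way definition, η² = SS_between / SS_total; the article's exact computation is not given in the abstract, and the function name, laboratory labels, and data values are all hypothetical.

```python
from statistics import mean


def eta_squared(groups):
    """Classical eta squared for a one-way layout: SS_between / SS_total.

    `groups` is a list of lists, one list of scores per treatment group.
    (Assumed definition; the article may have computed it differently.)
    """
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    return ss_between / ss_total


# Hypothetical scores: one [control, treated] pair per replication experiment.
experiments = {
    "lab_A": [[10.0, 12.0, 11.0, 13.0], [14.0, 15.0, 13.0, 16.0]],
    "lab_B": [[10.0, 13.0, 12.0, 11.0], [12.0, 13.0, 11.0, 14.0]],
    "lab_C": [[11.0, 12.0, 10.0, 13.0], [11.0, 13.0, 12.0, 12.0]],
}

# Strength of association per experiment, then its fluctuation (range).
effects = {name: eta_squared(groups) for name, groups in experiments.items()}
effect_range = max(effects.values()) - min(effects.values())

for name, value in effects.items():
    print(f"{name}: eta squared = {value:.3f}")
print(f"range across experiments = {effect_range:.3f}")
```

Describing reproducibility by the spread of η² across experiments, rather than by whether a treatment-by-experiment interaction reaches significance, keeps the size of the fluctuation visible even when a test lacks the power to detect it.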



Reproducibility across Experiments of Behavioral Results Obtained under Strictly Standardized Conditions: Another View of the Results of the Collaborative Behavioral Teratology Study
