Journal of Consulting Psychology
1967, Vol. 31, No. 6, 626-630

DRAW-A-PERSON TEST AS A MEASURE OF INTELLIGENCE IN PRESCHOOL CHILDREN FROM VERY LOW INCOME FAMILIES

LOIS-ELLIN DATTA 1
National Institute of Mental Health

Ethnic group and sex did not affect either the IQ-equivalent scores or the congruent validities of the figure-drawing test (Goodenough-Harris scoring) in a nationally representative sample of 956 children enrolled in full-year Head Start programs. Higher correlations were obtained for performance (Caldwell-Soule Preschool Inventory) than for verbal (PPVT) measures and for older (4-7 yr.) than for younger children. The obtained coefficients (.3-.5) compare favorably with those previously reported for kindergarten and 1st-grade children from less impoverished families. Both PPVT and DAP mean IQ equivalents indicated, however, substantially lower performance for Head Start than for normative groups.

Recent interest in special educational programs has drawn attention to problems in measuring intellectual abilities and changes in performance in preschool children from low income families. The question of "culture-fairness" is thus added to the already considerable task of obtaining reliable measures at an age when the behavioral repertoire is limited.

The figure-drawing test has been widely used as a measure of intelligence in children (Sundberg, 1960). It is simple to administer and score and is considered to have predictive and congruent validity coefficients that, while relatively low, compare favorably with those reported for other standardized intelligence and achievement tests (Shipp & Loudon, 1964; Vane & Kessler, 1964). Dennis (1966) concluded that Draw-A-Person performance reflects experience with representational art rather than parental education or literacy.
The availability of data from a nationwide sample of children enrolled in Project Head Start centers provided an opportunity to estimate the congruent validity of the figure-drawing test for younger children from very low income families. This was measured by comparing the results of the Draw-A-Person test (DAP) with the results obtained from the Peabody Picture Vocabulary Test (PPVT) and the Caldwell-Soule Preschool Inventory (PSI).2 The PPVT (Dunn, 1965) is a widely used measure of verbal intelligence; the PSI has been developed as a "culture-fair" measure of intelligence in preschool children. The DAP requires less equipment, administration time, and examiner training than does the PPVT. The PSI is similar to the WISC in terms of equipment, examiner training, administration and scoring time, and the apparent contribution of verbal and nonverbal skills to test performance.

For the DAP, psychometrically desirable characteristics of a culture-fair test would include (a) a mean standard score of about 100 and (b) correlations between the DAP and the PPVT and the DAP and the PSI at least similar in magnitude to validity coefficients typically reported for the DAP (Harris, 1963).

1 This study used data collected by the Planning Research Corporation for the Office of Economic Opportunity under Contract No. OEO-1308, 1966. The author wishes to thank Ann Drake, for her assistance in data analysis, and Ruth Ann O'Keefe, for her contributions to every phase of the study.
2 B. Caldwell and D. Soule, The Preschool Inventory. Unpublished paper, Project Head Start, Office of Economic Opportunity, Contract 514, 1966.

METHOD

Seventy-two Project Head Start centers were selected to provide a sample representative of the population of 1966 full-year program centers in terms of geographic distribution and program length. From each center, 12-15 children were selected at random from an identification number list for inclusion in the survey.

The DAP, PPVT, and PSI were administered individually by college graduates with special training in examining disadvantaged children. Sixty-five children, who were predominantly of Mexican origin, were tested in Spanish. Since the equivalence of the Spanish and English versions of the PPVT was not determined, data from children tested in Spanish were analyzed separately from data of children tested in English.

Draw-A-Person (Machover, 1948) rather than Draw-A-Man instructions were used for the figure-drawing test. Data on sexual identification will be reported in a later paper. Bliss and Berger (1954) have concluded that the two forms of the test yield substantially the same results. Unless the drawing was identified as a woman by the child, ambiguous figures were scored by the Goodenough-Harris criteria for drawings of men (Harris, 1963). Of the 956 drawings, 239 were not recognizable figures (Class A), 111 were scored by Draw-A-Woman criteria, and 606 were scored by Draw-A-Man criteria. Interrater reliabilities among the four scorers ranged from .59 to .99 for samples of 14-50 drawings. (For a detailed report of sampling, selection, and testing procedures, see Commins, Cort, Henderson, & O'Keefe, 1967.)

RESULTS AND DISCUSSION

Mean Standard Scores

The DAP and PPVT raw scores were converted to standard scores; the mean standard score at each age is set at 100 for the normative samples for both tests. Table 1 shows that regardless of age, sex, or ethnic group, the average performance on both the DAP (overall mean standard score, 77.22) and the PPVT (overall mean standard score, 82.02) was substantially lower than the mean for the normative samples.

The low PPVT standard scores are consistent with the poor performance on verbal tasks frequently reported for children from lower class and minority group families (Deutsch, 1965). The low DAP standard scores were to some extent unexpected.
Previous studies have indicated that at least four relatively disadvantaged groups have achieved mean standard scores of about 100 on the DAP. Such means were reported for white and Negro kindergarten children in New York City public schools (Vane & Kessler, 1964), for white and Negro 5-year-old children in a New York City day care center (Anastasi & D'Angelo, 1952), and for a representative sample of 300 Negro first-grade children from southeastern states (Kennedy & Lindner, 1964). Bowers and Giles (1966) found an increase in DAP scores as socioeconomic status increased among 6- to 12-year-old children in Evanston, Illinois, but the mean DAP standard scores for the lowest socioeconomic groups, regardless of sex or age, were about 100.

TABLE 1
DAP, PPVT, AND PSI CORRELATIONS BY SEX, ETHNIC GROUP, AND AGE IN A SAMPLE OF 956 PRESCHOOL CHILDREN ENROLLED IN PROJECT HEAD START

                                       M standard score    Raw-score correlation
Group                  N     M age a   PPVT      DAP       DAP/PPVT  DAP/PSI  PPVT/PSI
Total                  956   60.3      82.02     77.22     .46       .56      .73
Sex, ethnic group b
  Boys, white          188   63.3      89.26     76.57     .40       .51      .69
  Boys, Negro          273   56.4      80.64     76.02     .52       .56      .72
  Girls, white         166   62.1      85.18     78.14     .51       .60      .77
  Girls, Negro         264   58.2      79.32     77.45     .44       .54      .65
Spanish-speaking       65    71.8      69.77     80.66     .53       .60      .72
Age b
  3                          44.1      80.01     76.76     .38       .39      .48
  4                    397   54.9      82.23     78.76     .22       .26      .69
  5                    335   63.8      84.32     74.21     .31       .44      .62
  6                    87    76.0      83.02     79.53     .52       .57      .80

a In months.
b Includes only children tested in English; too few children were tested in Spanish to compute data for sex and age subgroups.

The norms for younger children on the DAP are not geographically and economically representative of the national population at these ages; the present sample differs from the normative samples and from the four cited lower income groups in geographic distribution and in degree of economic deprivation. Eisenberg and Conners (1966) have reported a DAP mean standard score of about 81 for 712 children entering Baltimore Head Start classes; of these children, 44% came from families with an annual income of less than $3,000, 30% were supported by welfare, 64% of the fathers were unskilled, and about 60% of both parents had less than a tenth-grade education. In the present sample, the mean annual income was $3,771 for a living group which averaged 6.7 persons, approximately $560 per year per person.

In comparison, the median educational level completed by the parents of the New York City children (Anastasi & D'Angelo, 1952) was the eleventh grade, all of the mothers were employed, the median number of siblings was 1, and only 13% of the fathers were unskilled. The Baltimore children thus appear to differ from the New York City children primarily in the severity of economic deprivation; they appear to differ from the present sample primarily in geographic location and urban/rural distribution. Although the data do not permit estimations of the independent contributions of income, city size, geographic area, or of their interactions, factors associated with very low income rather than factors associated with geographic or urban/rural distributions seem to be responsible for the differences in DAP performance between Head Start children and the previously reported samples.

The low mean DAP standard scores indicate that, with the present norms, the test would not provide a "culture-fair" measure of individual attainment in an economically heterogeneous group. Within the Head Start sample, however, DAP standard scores were not affected by ethnic or sex differences, while the PPVT standard scores were affected by factors associated with both sex and ethnic group.
Results of a 2 × 2 unweighted means analysis of variance (Winer, 1962) indicated that the DAP standard scores of girls and boys and of Negro and white children did not differ significantly. Ethnic group and sex F ratios significant at p ≤ .01 were found for PPVT standard scores: The PPVT performance of white children was higher than the performance of Negro children (F = 31.43, p ≤ .001), and boys, regardless of ethnic group, achieved higher PPVT scores than did girls (F = 4.37, p ≤ .01). The PPVT mean standard score for Spanish-speaking children (69.77) was significantly lower than the PPVT means for other groups; the DAP mean standard score for Spanish-speaking children (80.66) did not differ significantly from the DAP mean scores for other groups. The DAP may thus be relatively insensitive to factors affecting the PPVT scores, and among these factors may be those related to cultural influences.

It is difficult to estimate the extent to which the low mean standard scores on both the DAP and the PPVT are due to cognitive as contrasted to emotional or motivational associates of deprivation. Some evidence of the importance of cognitive factors may be found in the report that culturally deprived children were not reliably lower on all measured aspects of psycholinguistic functioning, but were primarily handicapped in the areas of auditory word comprehension and auditory vocal automatic decoding (Barrett, Semmel, & Weener, 1965). On the other hand, relatively minor changes in testing conditions have been associated with substantial improvement in performance (Riessman, 1962). Despite agreement on the importance of optimum testing conditions, there have been few systematic studies comparing directive ("Think again; you can do better than that"), standard-neutral, and supportive attitudes for deprived and privileged preschool and elementary children.
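As a rough illustration of how far the overall means reported above fall below the normative mean of 100, the short sketch below expresses each mean in standard-deviation units and rechecks the per-person income figure. The standard deviation of 15 is an assumption introduced here for illustration; the paper does not state the standard deviation of either scale, and the helper name `sd_units_from_norm` is hypothetical.

```python
# Sketch only: distance of the Head Start means from the normative mean.
NORM_MEAN = 100.0   # normative mean stated in the text for both tests
ASSUMED_SD = 15.0   # ASSUMPTION: not given in the paper; illustrative value


def sd_units_from_norm(mean_score, norm_mean=NORM_MEAN, sd=ASSUMED_SD):
    """Return (mean - normative mean) expressed in SD units."""
    return (mean_score - norm_mean) / sd


# Overall mean standard scores reported in the text and in Table 1.
overall_means = {"DAP": 77.22, "PPVT": 82.02}
deficits = {test: sd_units_from_norm(m) for test, m in overall_means.items()}

for test, d in deficits.items():
    print(f"{test}: {d:+.2f} SD relative to the normative mean")

# Mean annual income divided by mean living-group size, as in the text.
per_person_income = 3771 / 6.7
print(f"Income per person: about ${per_person_income:.0f} per year")
```

Under the assumed standard deviation, the DAP mean lies about 1.5 and the PPVT mean about 1.2 standard deviations below the norm. The income check gives roughly $563 per person, in line with the rounded figure of $560 in the text.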
Congruent Validity

The raw-score product-moment correlations among the DAP, PPVT, and PSI shown in Table 1 are all significant at p ≤ .01; t comparisons among the correlations (r to z transformations) indicated that age, sex, and ethnic group did not significantly affect the congruent validity of the DAP. The correlations for all subgroups compare favorably with the .4 typically reported for groups of about 100 normal kindergarten and first-grade children and with the .39 DAP/PPVT raw-score correlation obtained in a sample of 5- to 6-year-old Head Start children (Eisenberg & Conners, 1966). The DAP/PPVT standard-score product-moment correlations ranged from .19 to .52; these correlations, while lower than the raw-score correlations, were significant at p ≤ .01. The single exception was .22 obtained for 3-year-old children.

Test bias has been defined (Educational Testing Service, 1966) as the consistent over- or underprediction of a criterion in one subgroup as compared to another subgroup, so that equally high predictive validity within subgroups would indicate a lack of bias. If this definition is extended to congruent validity, the value of the DAP as an estimate of general intelligence appears to be as high among children between 4 years, 0 months and 6 years, 11 months from very low income families as it is among children of this age or slightly older from less economically deprived backgrounds.

Age and validity. The validity of the DAP for school children has previously been reported to decrease with age, being higher for children in kindergarten and the first grade than for children older than 9 years (Ellis, 1953; Kennedy & Lindner, 1964; Pringle & Pickup, 1963; Vane & Kessler, 1964). As sample size increases, correlation magnitudes tend to decrease. If sample size is considered in this preschool sample, the DAP/PPVT raw-score correlations tend to increase with age. The lower congruent validity of the DAP for the younger children suggests that the value of the DAP as a measure of intelligence in children may be curvilinear with respect to age, increasing from 3-5 years and decreasing after about 8 years of age.

Performance and verbal measures. The congruent validity of the DAP was higher for performance (.56) than for verbal (.45) abilities, regardless of age, sex, or ethnic group.
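The r to z comparisons mentioned in this section can be sketched as follows for two independent subgroups. This is a minimal illustration using the common normal approximation, not a reconstruction of the author's exact t procedure. The function names are hypothetical, and using the subgroup N values from Table 1 as the number of paired scores behind each correlation is an assumption.

```python
import math


def fisher_z(r):
    """Fisher r-to-z transformation (inverse hyperbolic tangent)."""
    return math.atanh(r)


def compare_independent_correlations(r1, n1, r2, n2):
    """Compare two correlations from independent samples.

    Returns the z statistic and a two-sided p value based on the
    normal approximation, with standard error sqrt(1/(n1-3) + 1/(n2-3)).
    """
    z_diff = fisher_z(r1) - fisher_z(r2)
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))
    z = z_diff / se
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal tail area
    return z, p


# DAP/PPVT raw-score correlations from Table 1:
# Negro boys (.52, N = 273) versus white boys (.40, N = 188).
# ASSUMPTION: N is taken as the number of pairs for each correlation.
z, p = compare_independent_correlations(0.52, 273, 0.40, 188)
print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

With these inputs the difference does not approach the .01 level, which agrees with the statement that ethnic group did not significantly affect the DAP/PPVT correlation.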
Similar results have been reported by Pringle and Pickup (1963) and Harris (1959). These correlations were, however, considerably lower than the PPVT/PSI correlation of .73; the PSI would appear to have more reliable variance associated with a verbal than with a performance measure of intelligence.

SUMMARY AND CONCLUSIONS

The DAP, the PPVT, and the PSI were administered to a nationally representative sample of 956 children attending 1966 Project Head Start full-year classes. Among children from 4 years, 0 months to 6 years, 11 months, the DAP/PPVT and DAP/PSI correlations compared favorably with validity coefficients previously reported for children from less deprived homes. Among younger children, the congruent validity of the DAP was lower. Neither sex nor ethnic group significantly affected DAP correlations; the DAP thus meets one criterion of a culture-fair measure.

On both the PPVT and the DAP, however, the mean standard scores were substantially lower than those reported for the norm groups. By the second criterion, the value of the DAP as a culture-fair measure of intelligence remains in question for children in samples heterogeneous for socioeconomic status, although within this very low income sample, the DAP was less affected than the PPVT by factors associated with ethnic group and sex.

REFERENCES

ANASTASI, A., & D'ANGELO, R. A comparison of Negro and white preschool children in language development and Goodenough Draw-A-Man IQ. Journal of Genetic Psychology, 1952, 81, 147-165.
BARRETT, L. S., SEMMEL, M. I., & WEENER, P. D. A comparison of the psycholinguistic functioning of "educationally deprived" and "educationally advantaged" children. Paper presented at the meeting of the American Educational Research Association, Chicago, February 1965.
BLISS, M., & BERGER, A. Measurement of mental age as indicated by the male figure drawings of the mentally subnormal using Goodenough and Machover instructions. American Journal of Mental Deficiency, 1954, 59, 73-79.
BOWERS, N. D., & GILES, G. C. Social class, sex and age differences in sex role identity as inferred from children's drawings of men and women. Paper presented at the meeting of the American Psychological Association, New York, September 1966.
COMMINS, W. D., CORT, H. R., HENDERSON, N. H., & O'KEEFE, R. A. A study of the 1966 full-year Head Start programs. Preliminary Report D1268, 1967, Planning Research Corporation, Contract OEO-1308, Office of Economic Opportunity.
DENNIS, W. Goodenough scores, art experience, and moderation. Journal of Social Psychology, 1966, 68, 211-228.
DEUTSCH, M. The role of social class in language development and cognition. American Journal of Orthopsychiatry, 1965, 35, 78-88.
DUNN, L. M. Peabody Picture Vocabulary Test: An expanded manual. Minneapolis: American Guidance Service, 1965.
EDUCATIONAL TESTING SERVICE. Are aptitude tests unfair to Negroes? ETS investigates two kinds of bias. Developments, 1966, 14, 1-4.
EISENBERG, L., & CONNERS, C. K. The effect of Head Start on developmental processes. Paper presented at the 1966 Joseph P. Kennedy, Jr., Foundation Scientific Symposium on Mental Retardation, Boston, April 1966.
ELLIS, R. Comparison of scores on Goodenough Draw-A-Man, revised Stanford-Binet and Wechsler Intelligence Scale for Children, obtained from a group of children seen at a psychiatric clinic. Unpublished master's colloquium paper, University of Minnesota, 1953.
HARRIS, D. B. A note on some ability correlates of the Raven Progressive Matrices (1947) in the kindergarten. Journal of Educational Psychology, 1959, 50, 228-229.
HARRIS, D. B. Children's drawings as measures of intellectual maturity. New York: Harcourt, Brace & World, 1963.
KENNEDY, W. A., & LINDNER, R. S. A normative study of the Goodenough Draw-A-Man test on southeastern Negro elementary school children. Child Development, 1964, 35, 33-62.
MACHOVER, K. Personality projection in the drawing of the human figure. Springfield, Ill.: Charles C Thomas, 1948.
PRINGLE, M. L., & PICKUP, K. T. The reliability and validity of the Goodenough Draw-A-Man test. British Journal of Educational Psychology, 1963, 33, 297-306.
RIESSMAN, F. The culturally deprived child. New York: Harper & Row, 1962.
SHIPP, D. E., & LOUDON, M. L. The Draw-A-Man test and achievement in the first grade. Journal of Educational Research, 1964, 57, 918-921.
SUNDBERG, N. Clinical assessment problems and procedures: A national survey. American Psychologist, 1960, 15, 404.
VANE, J., & KESSLER, R. The Goodenough Draw-A-Man test: Long term reliability and validity. Journal of Clinical Psychology, 1964, 20, 487-488.
WINER, B. J. Statistical principles in experimental design. New York: McGraw-Hill, 1962.

(Received April 3, 1967)
