Timed Writing Assessment as a Measure of Writing Ability: A Qualitative Study

By Arthur Lau
Discussions
2013, Vol. 9 No. 2

Throughout the American education system, the assessment of writing skill and general academic performance through timed essay examinations has become increasingly pervasive, contributing to the determination of grades and course placements and ultimately affecting college admissions through their use in standardized tests. In March 2005, the College Board introduced a new writing section for the SAT that incorporates a 25-minute impromptu essay component, as well as traditional multiple-choice questions on grammar and usage (Hass; “SAT Test Sections”). Likewise, timed writing assessment holds a prominent position in the ACT, which features an optional 30-minute essay section that is mandatory for students applying to some institutions, and in the College Board’s Advanced Placement program, whose English Literature examination requires three essays written over a two-hour period (“The ACT Plus Writing”; “English Literature: The Exam”). As Nancy Hass reports in the New York Times, the introduction of timed writing in the SAT has generated substantial public controversy, with many colleges deciding not to consider the essay scores in the admissions process. At the same time, a number of universities have elected to utilize the essay section results, not only for admissions, but also for the determination of placement in composition courses, sometimes provoking passionate opposition from their own writing faculty members (Isaacs and Molloy 518-20).

Employing the SAT essay section as an illustration of the debate surrounding timed essay examinations, this paper seeks to investigate the accuracy, instructional usefulness, and social implications of the widespread use of timed writing assessment as a measure of writing ability at the high school and collegiate levels. To supplement a review of the published literature, this study integrates material from interviews conducted by the author with five experienced instructors in composition and literature programs at Stanford University and the University of California (UC), Davis. Both in standardized examinations and in the classroom setting, timed writing assessment can offer a rough and imprecise, but most often fairly accurate, prediction of a student’s performance on longer, traditional writing assignments. Nevertheless, as this paper will attempt to demonstrate, the imposition of severe time constraints induces an altogether different mode of writing, called an “assessment genre” by one of the instructors, that renders questionable the comparability of the writing skills displayed in timed and untimed contexts. Given this finding, teachers and institutional administrators should carefully consider the potentially objectionable social values and attitudes toward writing communicated by the choice of timed writing as an assessment technique, especially when used to identify excellence rather than to certify basic competence.

In recent decades, the accuracy and appropriateness of timed writing assessment as a measure of writing ability have been subject to progressively rising doubts from English instructors and scholars of composition. As Kathleen Yancey discusses in an article on the history of writing assessment, in the period between 1950 and 1970, the evaluation of writing was frequently conducted through ‘objective,’ multiple-choice tests on grammar and vocabulary (485). In the 1970s, trends in standardized writing assessment gradually shifted to the holistically scored essay test, and by the 1980s and 1990s, many universities had again changed their evaluation methods to adopt broader, multi-element writing portfolios for such purposes as assigning course placement (Yancey 487-94). Yancey observes that beneath these fluctuations were evolving views on the relative importance of reliability and validity, the two central concepts of psychometric theory. Reliability is associated with objective assessments, while validity is associated with ‘direct’ tests of writing skill that provide samples of actual writing (487-94). An assessment is reliable to the extent that it yields consistent scores across readers and test administrations, while it is valid insofar as it successfully measures the particular characteristic, in this case writing ability, that it purports to measure (Yancey 487; Moss 6).

Current scholarship opposing the use of timed writing assessment continues to voice the same concerns about the validity of essay tests originally raised by those advocating the introduction of portfolios. For example, in her 1991 paper encouraging the further spread of the newly developed portfolio assessment method, Sarah Freedman argues that timed essay examinations involve “unnatural” writing conditions, with students producing work that has no function except for external evaluation, on topics that may not interest them (3). Similarly, as Ann Del Principe and Jeanine Graziano-King contend in their 2008 article, the testing environment created by timed writing assessment undermines the authenticity of essay examinations because it inhibits the complex processes of thinking and articulation that enable students to produce quality writing (297-98). Of course, many more specific arguments can be adduced under the general heading of authenticity in assessments. For Kathy Albertson and Mary Marwitz, the dangers of high-stakes standardized writing examinations are dramatically exemplified by students who seek security in uninspired, formulaic essays and students who unknowingly imperil their prospects of reader approval by engaging with challenging topics through more sophisticated pieces that they lack the time to complete (146-49). Finally, in an inversion of the usual call for more valid assessment techniques, Peter Cooper reprises the common contention that a single writing sample on a particular topic cannot fully represent a student’s abilities, only to present this consideration as evidence in support of multiple-choice writing tests (25).

In defense of timed writing assessment, noted composition theorist Edward White invokes the historical attractions of objective tests of writing, which remain in use in a large proportion of colleges (32). As he writes in his widely referenced article “An Apologia for the Timed Impromptu Essay Test,” the scoring of timed essays provides significant cost savings relative to the labor-intensive evaluation of portfolios, allowing institutions that would otherwise employ even less expensive multiple-choice assessments to include some form of direct writing evaluation in reviewing student performance (43-44). He also remarks on the utility of timed in-class writing in preventing plagiarism and in helping students to focus on the task of composition (34-36). Importantly, in response to concerns about a lack of opportunities for revision, White asserts that, even though an impromptu essay test may encourage first-draft writing, a first draft still constitutes a form of writing: thus the use of timed writing rather than multiple-choice tests emphasizes the value of writing (35-38). Extending this reasoning further, Marie Lederman argues that, despite the rightful focus on revision and the writing process in the curriculum, only the final product of that process holds any significance or communicative potential for the reader, lending legitimacy to the product-oriented nature of timed writing assessment (40-42).

A second major line of thought in favor of timed essay tests, separate from the pragmatic and conceptual arguments surveyed above, relates to their empirical capacity to predict the future academic performance of students. College Board researchers claim that, of all the components of the SAT, the writing section most accurately predicts students’ grade point average in the first year of college, demonstrating its validity as an assessment instrument (Kobrin et al. 1, 5-6). A crucial weakness in this argument, however, is its assumption that a strong correlation between scores and later academic success can, by itself, establish the validity of an assessment. For as Rexford Brown insisted as early as 1978, in the course of his opposition to objective writing tests, the fact that parental income and education might also correlate with writing ability and predict performance in college does not mean that they should form the basis for judging students’ aptitude (qtd. in Yancey 490-91). Validity requires that an assessment measure what it is intended to measure, so the question remains whether timed writing examinations truly reflect writing ability.

In order to explore this issue in greater detail, one can turn to the material gathered in interviews with Brenda Rinard, a member of the University Writing Program at UC Davis, and postdoctoral fellows Roland Hsu, Barbara Clayton, Patricia Slatin, and Jeffrey Schwegman, all teaching in Stanford’s Introduction to the Humanities (IHUM) program. Though all interviewees had experience with timed essay tests in their respective courses, it is a limitation of this study that the four participating IHUM fellows, unlike Dr. Rinard, were seeking to evaluate student examinations not explicitly in terms of writing quality but rather in terms of content. All of these instructors nevertheless offered valuable information while answering questions regarding the accuracy and social implications of timed writing assessment and their motivations for using it in their courses (see the Appendix).

The interviewees were first asked about the accuracy of timed writing assessment as a measure of writing ability, where the standard for writing skill is assumed to be students’ performance in producing traditional argumentative papers. All subjects reported that timed essay tests generally provided a fairly accurate indication of students’ writing ability as demonstrated in regular paper assignments, although they all mentioned some exceptions or variations in accuracy as well. In particular, Dr. Clayton stated that students who had previously submitted papers of lower quality would sometimes show a surprising level of proficiency on essay tests, perhaps on account of additional preparation for the examination. In contrast, Dr. Schwegman noted that the students most skilled in composing extended papers would not usually produce the highest-quality timed essays in the class. Dr. Rinard emphasized the adverse effects of the testing environment for students with test anxiety and students for whom English was not their first language. Interestingly, Dr. Slatin and Dr. Rinard both affirmed, when asked, that timed writing assessment could offer only an imprecise measurement of writing ability, one that would not accommodate fine distinctions in skill or provide for the display of the full range of variation in writing ability. Their observations agree in this respect with the conjecture of Leo Ruth and Sandra Murphy that “short, timed writing tests are likely to truncate severely the range of performance elicited,” as suggested by surveys indicating that more sophisticated writers often consider time allocations inadequate due to their use of a greater amount of time for planning their work (151-54).

At this point, given that timed writing assessment does not seem grossly inaccurate in evaluating broader writing skill, one might be inclined to accept White’s contention that the use of standardized essay tests is justified by their practical efficiency and the fact that they at least require first-draft writing. Once again, however, this conclusion can be warranted only by a demonstration of the validity of timed writing examinations in measuring the same sort of writing ability that manifests itself in regular paper assignments, not simply by a correlation between the two forms of writing. From this standpoint, the true importance of the notion that timed writing is first-draft writing becomes evident: it embodies the idea that timed writing is fundamentally similar to the writing involved in extended composition. Only if timed writing is sufficiently continuous with, and therefore comparable to, writing without such time constraints can the validity of timed writing assessment be maintained. Indeed, as Murphy notes, assessment specialist Roberta Camp has argued that standardized writing tests implicitly assume that timed, impromptu writing can be considered representative of writing in general and that writing involves a uniform set of skills regardless of its purpose or circumstances (Murphy 38). One of the criticisms offered by Dr. Rinard challenges the core assumptions underlying the use of timed essays to determine writing ability. In particular, she believes that the timed writing on standardized examinations constitutes a distinct “assessment genre” with its own unique rhetorical situation, implying that judgments of writing skill obtained using timed writing may not be generalizable to writing in other contexts.

One must now resolve the question of whether timed writing should be regarded as representative of all academic writing or should instead be classified as a narrow and artificial “assessment genre.” Insight on this topic is supplied by the other interviewees’ remarks on their motivations for employing timed essay examinations. With a notion of timed writing as essentially continuous with other forms of writing, one might expect that they would conceive of essay examinations as simply compressed versions of regular papers, assigned because they require less time to grade and offer greater protection against plagiarism. On the contrary, the four IHUM fellows tended not to express any of these practical motivations for using timed essay examinations. The exceptions to this trend were Dr. Hsu, who cited the necessity of ensuring that work submitted was a student’s own, and Dr. Schwegman, who briefly remarked on the issue of time available for grading, but even these two instructors spoke at length about other reasons for employing the essay test format. Dr. Hsu, for instance, contended that timed essays were useful for encouraging students to construct a “less developed synthesis” of the material, meaning, as he explained, that they would not be influenced by the interchange of ideas with the teacher or other students and would therefore need to “take ownership” of their work in a way not facilitated by traditional papers. On the other hand, Dr. Slatin emphasized the importance of timed essay examinations as another mode of evaluation different from longer paper assignments, contributing to the diversity of assessment measures and thus ensuring fairness to all students in grading. Likewise, Dr. Schwegman found his principal motivation in the idea of achieving fairness by employing a broad spectrum of assessment methods, each engaging a distinct skill set and a different type of ability. All of these perspectives on the utility of timed writing assessment crucially presuppose a fundamental dissimilarity between timed writing and the extended composition demanded by regular papers.

Moreover, a majority of the interviewees indicated that the writing produced on the essay tests that they had used generally failed by a large margin to satisfy the standards of a decent first draft for any other assignment. This finding, in addition to the previously developed suggestion of a divergence in the skills and processes involved in timed writing and other forms of writing, further challenges White’s assertion that timed writing should be regarded as first-draft writing. Dr. Schwegman, for instance, freely admitted that the writing submitted for final examinations was often “atrocious” in quality, and Dr. Clayton related that she would expect students to spend far more time than that permitted in essay examinations on the first draft of even a short paper. For a piece comparable to one on the AP tests, where a student might receive approximately 40 minutes per essay, Dr. Rinard estimated that a student might require anywhere from one to three hours to produce a draft of reasonable quality.
