{"id":1065,"date":"2019-08-01T11:44:30","date_gmt":"2019-08-01T15:44:30","guid":{"rendered":"https:\/\/nonpartisaneducation.org\/blog1\/?p=1065"},"modified":"2019-08-01T11:44:40","modified_gmt":"2019-08-01T15:44:40","slug":"should-we-switch-from-mandated-standardized-tests-to-mandated-performance-tests","status":"publish","type":"post","link":"https:\/\/nonpartisaneducation.org\/blog1\/2019\/08\/should-we-switch-from-mandated-standardized-tests-to-mandated-performance-tests\/","title":{"rendered":"Should we switch from mandated \u201cstandardized\u201d tests to mandated \u201cperformance\u201d tests?"},"content":{"rendered":"<p>Sandra Stotsky, August 1, 2019<\/p>\n<p>According to many education writers in this country, there are no tests in Finnish schools, at least no \u201cmandated standardized tests.\u201d That phrase was carefully hammered out by <em>Smithsonian Magazine<\/em> to exclude the many no- or low-stakes \u201cnorm-referenced\u201d tests (like the Iowa Test of Basic Skills, or ITBS) that have been given for decades across this country especially in the elementary grades to help school administrators to understand where their students\u2019 achievement fell under a \u201cnormal curve\u201d of distributing test scores. <a href=\"https:\/\/thefederalist.com\/2014\/09\/24\/top-ten-things-parents-hate-about-common-core\/\">https:\/\/thefederalist.com\/2014\/09\/24\/top-ten-things-parents-hate-about-common-core\/<\/a> <a href=\"https:\/\/www.smithsonianmag.com\/innovation\/why-are-finlands-schools-successful-49859555\/\">https:\/\/www.smithsonianmag.com\/innovation\/why-are-finlands-schools-successful-49859555\/<\/a><\/p>\n<p>Yet, a prominent Finnish educator tells us that Finnish teachers regularly test their upper grade students. <a href=\"https:\/\/pioneerinstitute.org\/news\/the-serpent-in-finlands-garden-of-equityessay-review-of-finnish-lessons-what-can-the-world-learnfrom-educational-change-in-finland-by-pasi-sahlberg\/\">https:\/\/pioneerinstitute.org\/news\/the-serpent-in-finlands-garden-of-equityessay-review-of-finnish-lessons-what-can-the-world-learnfrom-educational-change-in-finland-by-pasi-sahlberg\/<\/a> As Finnish educator, Pasi Sahlberg, noted (p. 25), teachers assess student achievement in the upper secondary school at the end of each six to seven-week period, or five or six times per subject per school year. There are lots of tests in Finnish schools, it seems, but mainly teacher-made tests (not state-wide tests) of what they have taught. There are also \u201cmatriculation\u201d tests at the end of high school (as the Smithsonian article admits)\u2014for students who want to go to a Finnish university. They are in fact voluntary; only students who want to go on to university take them. Indeed, there are lots of tests for Finnish students, just not where American students are heavily tested (in the elementary and middle grades) and not constructed by a testing company.<\/p>\n<p>Why should Americans now be even more interested in the topic of testing than ever before? Mainly because there seems to be a groundswell developing for \u201cperformance\u201d tests in place of \u201cstandardized\u201d tests. And they are called \u201cassessments\u201d perhaps to make parents and teachers think they are not those dreaded tests mandated by state boards of education for grades 3-8 and beyond as part of the Every Student Succeeds Act (ESSA). Who wouldn&#8217;t want a test that \u201caccurately measures one or more specific course standards\u201d? And is also \u201ccomplex, authentic, process and\/or product-oriented, and open-ended.\u201d Edutopia\u2019s writer, Patricia Hilliard, doesn\u2019t tell us in her 2015 blog \u201cPerformance-Based Assessment: Reviewing the Basics\u201d whether it also brushes our hair and shines our shoes at the same time. <a href=\"https:\/\/www.edutopia.org\/blog\/performance-based-assessment-reviewing-basics-patricia-hilliard\">https:\/\/www.edutopia.org\/blog\/performance-based-assessment-reviewing-basics-patricia-hilliard<\/a><\/p>\n<p>It\u2019s as if our problem was simply the type of test that states have been giving, not what is tested nor the cost or amount of time teachers and students spend on them. It doesn\u2019t take much browsing on-line to discover that two states have already found out there were deep problems with those tests, too: Vermont and Kentucky.<\/p>\n<p>An old government publication (1993) warned readers about some of the problems with portfolios: \u201dUsers need to pay close attention to technical and equity issues to ensure that the assessments are fair to all students.\u201d <a href=\"https:\/\/www2.ed.gov\/pubs\/OR\/ConsumerGuides\/admuses.html\">https:\/\/www2.ed.gov\/pubs\/OR\/ConsumerGuides\/admuses.html<\/a> It turns out that portfolios are not good for high stakes assessment\u2014for a range of important reasons. In a nutshell, they are costly, time-consuming, and unreliable. Quoting one of the researchers\/evaluators in the Vermont initiative, it indicates: \u201cThe Vermont experience demonstrates the need to set realistic expectations for the short-term success of performance-assessment programs and to acknowledge the large costs of these programs.\u201d The authors state elsewhere in their own blog that the researchers \u201cfound the reliability of the scoring by teachers to be very low in both subjects&#8230; Disagreement among scorers alone accounts for much of the variance in scores and therefore invalidates any comparisons of scores.\u201d <a href=\"https:\/\/www.ernweb.com\/educational-research-articles\/preliminary-results-of-a-large-scale-portfolio-assessment-program\/\">https:\/\/www.ernweb.com\/educational-research-articles\/preliminary-results-of-a-large-scale-portfolio-assessment-program\/<\/a> <a href=\"https:\/\/eric.ed.gov\/?id=EJ598325\">https:\/\/eric.ed.gov\/?id=EJ598325<\/a><\/p>\n<p>Validity and reliability are the two central qualities needed in a test. Indeed, the first two chapters of the testing industry&#8217;s &#8220;bible,&#8221; The <em>Standards for Educational and Psychological Testing<\/em> are devoted to those two topics. <a href=\"https:\/\/www.apa.org\/science\/programs\/testing\/standards\">https:\/\/www.apa.org\/science\/programs\/testing\/standards<\/a><\/p>\n<p>We learned even more from a book chapter by education professor George K. Cunningham on the \u201cfailed accountability system\u201d in Kentucky. <a href=\"http:\/\/education-consumers.org\/pdf\/Cunningham2.pdf\">http:\/\/education-consumers.org\/pdf\/Cunningham2.pdf<\/a> One of Cunningham\u2019s most astute observations is the following:<\/p>\n<p>Historically, the purpose of instruction in this country has been increasing student academic achievement. This is not the purpose of progressive education, which prefers to be judged by standards other than student academic performance. The Kentucky reform presents a paradox, a system structured to require increasing levels of academic performance while supporting a set of instructional methods that are hostile to the idea of increased academic performance (pp. 264-65).<\/p>\n<p>That is still the dilemma today\u2014skills-oriented standards assessed by \u201cstandardized\u201d tests that require, for the sake of a reliable assessment, some multiple-choice questions.<\/p>\n<p>Cunningham also warned, in the conclusion to his long chapter on Kentucky, about using performance assessments for large-scale assessment (p. 288). \u201cThe Performance Events were expensive and presented many logistical headaches.\u201d In addition, he noted:<\/p>\n<p>The biggest problem with using performance assessments in a standards-based accountability system, other than poor reliability, is the impossibility of equating forms longitudinally from year to year or horizontally with other forms of assessment. In Kentucky, because of the amount of time required, each student participated in only one performance assessment task. As a result, items could never be reused from year to year because of the likelihood that students would remember the tasks and their responses. This made equating almost impossible.<\/p>\n<p>Further details on the problems of equating Performance Events may be found in a technical review in January 1998 by James Catterall and four others for the Commonwealth of Kentucky Legislative Research Commission. Also informative is a 1995 analysis of Kentucky\u2019s tests by Ronald Hambleton et al. It is a scanned document and can be made searchable with Adobe Acrobat Professional.<\/p>\n<p><a href=\"https:\/\/legislature.ky.gov\/LRC\/OEA\/Documents\/MEASUREMENT%20QUALITY%20FINAL%20REPORT%2091-94.pdf\">https:\/\/legislature.ky.gov\/LRC\/OEA\/Documents\/MEASUREMENT%20QUALITY%20FINAL%20REPORT%2091-94.pdf<\/a><\/p>\n<p>A slightly optimistic account of what could be learned from the attempt to use writing and mathematics portfolios for assessment can be found in a recent paper by education analyst Richard Innes at Kentucky\u2019s Bluegrass Institute. <a href=\"http:\/\/www.freedomkentucky.org\/images\/d\/d4\/KERAReport.pdf\">http:\/\/www.freedomkentucky.org\/images\/d\/d4\/KERAReport.pdf<\/a><\/p>\n<p>For more articles on the costs and benefits of student testing, see the following:<\/p>\n<p>Phelps, R. P. (2002, February). Estimating the costs and benefits of educational testing programs. Briefings on Educational Research, Education Consumers Clearinghouse, 2(2). <a href=\"http:\/\/www.education-consumers.com\/briefs\/phelps2.shtm\">http:\/\/www.education-consumers.com\/briefs\/phelps2.shtm<\/a><\/p>\n<p>Phelps, R. P. (2000, Winter). Estimating the cost of systemwide student testing in the United States. Journal of Education Finance, 25(3) 343\u2013380. <a href=\"http:\/\/www.jstor.org\/discover\/10.2307\/40704103?uid=3739896&amp;uid=2134&amp;uid=2&amp;uid=70&amp;uid=4&amp;uid=3739256&amp;sid=21106063737141\">http:\/\/www.jstor.org\/discover\/10.2307\/40704103?uid=3739896&amp;uid=2134&amp;uid=2&amp;uid=70&amp;uid=4&amp;uid=3739256&amp;sid=21106063737141<\/a><\/p>\n<p>Phelps, R. P., et al. (1993). Student testing: Current extent and expenditures, with cost estimates for a national examination. GAO\/PEMD-93-8, U.S. General Accounting Office, U.S. Congress.<\/p>\n<p>Concluding Remarks:<\/p>\n<p>Changing to highly subjective \u201cperformance-based assessments\u201d removes any urgent need for content-based questions. That was why the agreed-upon planning documents for teacher licensure tests in Massachusetts (which were required by the Massachusetts Education Reform Act of 1993) specified more multiple-choice questions on content than essay questions in their format (they all included both) and, for their construction, revision, and approval, required content experts as well as practicing teachers with that license, together with education school faculty who taught methods courses (pedagogy) for that license. With the help of the president of the National Evaluation Systems (NES, the state\u2019s licensure test developer) and others in the company, the state was able to get more content experts involved in the test approval process. What Pearson, a co-owner of these tests, has done since its purchase of NES is unknown.<\/p>\n<p>For example, it is known that for the Foundations of Reading (90), a licensure test for most prospective teachers of young children (in programs for elementary, early childhood, and special education teachers), Common Core\u2019s beginning reading standards were added to the test description, as were examples for assessing the state\u2019s added standards to the original NES Practice Test. It is not known if changes were made to the licensure test itself (used by about 6 other states) or to other Common Core-aligned licensure tests or test preparation materials, e.g., for mathematics. Even if Common Core\u2019s standards are eliminated (as in Florida in 2019 by a governor\u2019s Executive Order), their influence remains in some of the pre-Common Core licensure tests developed in the Bay State\u2014tests that contributed to academically stronger teachers for the state.<\/p>\n<p>It is time for the Bay State\u2019s own legislature to do some prolonged investigations of the costs and benefits of \u201cperformance-based assessments\u201d before agreeing to their possibility in Massachusetts and to arguments that may be made by FairTest, a Bay State-based company, or others who are eager to eliminate \u201cstandardized\u201d testing but implement expensive and unreliable performance tests.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Sandra Stotsky, August 1, 2019 According to many education writers in this country, there are no tests in Finnish schools, at least no \u201cmandated standardized tests.\u201d That phrase was carefully hammered out by Smithsonian Magazine to exclude the many no- &hellip; <a href=\"https:\/\/nonpartisaneducation.org\/blog1\/2019\/08\/should-we-switch-from-mandated-standardized-tests-to-mandated-performance-tests\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":10,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_s2mail":"yes","footnotes":""},"categories":[80,210,33,113,14],"tags":[],"class_list":["post-1065","post","type-post","status-publish","format-standard","hentry","category-common-core","category-curriculum-instruction","category-reading-writing","category-sandra-stotsky","category-testingassessment"],"_links":{"self":[{"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/posts\/1065","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/comments?post=1065"}],"version-history":[{"count":3,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/posts\/1065\/revisions"}],"predecessor-version":[{"id":1070,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/posts\/1065\/revisions\/1070"}],"wp:attachment":[{"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/media?parent=1065"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/categories?post=1065"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nonpartisaneducation.org\/blog1\/wp-json\/wp\/v2\/tags?post=1065"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}