Reign of Error: The Hoax of the Privatization Movement and the Danger to America's Public Schools

Hanushek suggested that there were three ways to get this dramatic improvement in teacher quality. One was to recruit higher-caliber teachers; another was to improve the skills of current teachers. But he maintained that both these methods had been tried and found inadequate. Instead, he recommended “deselection” of the bottom teachers based on their performance, defined as the test scores of their students. But school districts and states would need to change their policies, he believed, to attract and retain the kinds of teachers who could produce amazing test scores:

Another reason it is hard to prove the theory is that teachers are not factory workers who can be shifted from spot to spot as if they were on an assembly line. The teacher who is highly effective in one school may not be equally effective in another. But we can’t know for sure, because no one has tried to move teachers around to prove the theory that three great teachers in a row will close the achievement gap for an entire school or district. Not yet, anyway.

While it seems certain that some teachers are excellent and others are not, the theory is based on some wobbly claims. The very concept of value-added assessment reflects the mind-set of statisticians and economists who measure productivity gains. A farmer plants corn of a certain variety in a certain type of soil, treats it with certain conditions, and then measures the growth of the crop to determine the worthiness of the treatment. In the context of value-added assessment, the teacher is the treatment. If the teacher is effective, the corn grows to a certain height. If the teacher is not, the corn does not grow or grows very little.

But children are not corn. They are not seeds or plants with fixed characteristics. Children’s lives are not static. They have crises and ups and downs in their home lives and their personal lives. Maybe their parents got divorced. Maybe a parent lost her job. Maybe a student broke up with her boyfriend or totaled the family car. Maybe a family member died. Maybe the family moved to a new home. Maybe they were evicted from their home. These changes affect motivation, attention, and school performance. Children are not crops. They are not empty vessels waiting to be filled by a teacher.

• school factors such as class sizes, curriculum materials, instructional time, availability of specialists and tutors, and resources for learning (books, computers, science labs, and more)

• home and community supports or challenges

• individual student needs and abilities, health, and attendance

• peer culture and achievement

• prior teachers and schooling, as well as other current teachers

• differential summer learning loss, which especially affects low-income children

• the specific tests used, which emphasize some kinds of learning and not others, and which rarely measure achievement that is well above or below grade level ¹⁷

Darling-Hammond concluded that the teacher ratings “largely reflect whom a teacher teaches, not how well they teach. In particular, teachers show lower gains when they have large numbers of new English-learners and students with disabilities than when they teach other students. This is true even when statistical methods are used to ‘control’ for student characteristics.”

Why punish teachers for choosing to teach the students with the greatest needs or for being assigned to a class with such students?

If the goal of teacher evaluation is to help teachers improve, this method doesn’t work. It doesn’t provide useful information to teachers or show them how to improve their practice. It just labels and ranks them in ways that teachers find demeaning and humiliating. Darling-Hammond noted that Houston used a value-added method to fire a veteran who had been the district’s teacher of the year. Another teacher in Houston said: “I teach the same way every year. [My] first year got me pats on the back. [My] second year got me kicked in the backside. And for year three, my scores were off the charts. I got a huge bonus. What did I do differently? I have no clue.”

Stated as politely as possible, value-added assessment is bad science. It may even be junk science. It is inaccurate, unstable, and unreliable. It may penalize those teachers who are assigned to teach weak students and those who choose to teach children with disabilities, English-language learners, and students with behavioral problems, as well as teachers of gifted students who are already at the top.

CHAPTER 11

The Facts About Teachers and Test Scores