The Testing Effect

Monday, October 6th, 2014

Sometimes, when we open a test, we see familiar questions on material we’ve studied — and yet we still do badly. Why does this happen?

Psychologists have studied learning long enough to have an answer, and typically it’s not a lack of effort (or of some elusive test-taking gene). The problem is that we have misjudged the depth of what we know. We are duped by a misperception of “fluency,” believing that because facts or formulas or arguments are easy to remember right now, they will remain that way tomorrow or the next day. This fluency illusion is so strong that, once we feel we have some topic or assignment down, we assume that further study won’t strengthen our memory of the material. We move on, forgetting that we forget.

Often our study “aids” simply create fluency illusions — including, yes, highlighting — as do chapter outlines provided by a teacher or a textbook. Such fluency misperceptions are automatic; they form subconsciously and render us extremely poor judges of what we need to restudy or practice again. “We know that if you study something twice, in spaced sessions, it’s harder to process the material the second time, and so people think it’s counterproductive,” Nate Kornell, a psychologist at Williams College, said. “But the opposite is true: You learn more, even though it feels harder. Fluency is playing a trick on judgment.”

The best way to overcome this illusion is testing, which also happens to be an effective study technique in its own right. This is not exactly a recent discovery; people have understood it since the dawn of formal education, probably longer. In 1620, the philosopher Francis Bacon wrote, “If you read a piece of text through twenty times, you will not learn it by heart so easily as if you read it ten times while attempting to recite it from time to time and consulting the text when your memory fails.”

Scientific confirmation of this principle began in 1916, when Arthur Gates, a psychologist at Columbia University, created an ingenious study to further Bacon’s insight. If someone is trying to learn a piece of text from memory, Gates wondered, what would be the ideal ratio of study to recitation (without looking)? To interrogate this question, he had more than 100 schoolchildren try to memorize text from Who’s Who entries. He broke them into groups and gave each child nine minutes to prepare, along with specific instructions on how to use that time. One group spent 1 minute 48 seconds memorizing and the remaining time rehearsing (reciting); another split its time roughly in half, equal parts memorizing and rehearsing; a third studied for a third and recited for two-thirds; and so on.

After a sufficient break, Gates sat through sputtered details of the lives of great Americans and found his ratio. “In general,” he concluded, “best results are obtained by introducing recitation after devoting about 40 percent of the time to reading. Introducing recitation too early or too late leads to poorer results.” The quickest way to master that Shakespearean sonnet, in other words, is to spend the first third of your time memorizing it and the remaining two-thirds of the time trying to recite it from memory.

Continue reading the main story
In the 1930s, a doctoral student at the State University of Iowa, Herman F. Spitzer, recognized the broader implications of this insight. Gates’s emphasis on recitation was, Spitzer realized, not merely a study tip for memorization; it was nothing less than a form of self-examination. It was testing as study, and Spitzer wanted to extend the finding, asking a question that would apply more broadly in education: If testing is so helpful, when is the best time to do it?

He mounted an enormous experiment, enlisting more than 3,500 sixth graders at 91 elementary schools in nine Iowa cities. He had them study an age-appropriate article of roughly 600 words in length, similar to what they might analyze for homework. Spitzer divided the students into groups and had each take tests on the passages over the next two months, according to different schedules. For instance, Group 1 received one quiz immediately after studying, then another a day later and a third three weeks later. Group 6, by contrast, didn’t take one until three weeks after reading the passage. Again, the time the students had to study was identical. So were the quizzes. Yet the groups’ scores varied widely, and a clear pattern emerged.

The groups that took pop quizzes soon after reading the passage — once or twice within the first week — did the best on a final exam given at the end of two months, marking about 50 percent of the questions correct. (Remember, they had studied their peanut or bamboo article only once.) By contrast, the groups who took their first pop quiz two weeks or more after studying scored much lower, below 30 percent on the final. Spitzer’s study showed that not only is testing a powerful study technique, but it’s also one that should be deployed sooner rather than later. “Achievement tests or examinations are learning devices and should not be considered only as tools for measuring achievement of pupils,” he concluded.

The testing effect, as it’s known, is now well established, and it opens a window on the alchemy of memory itself. “Retrieving a fact is not like opening a computer file,” says Henry Roediger III, a psychologist at Washington University in St. Louis, who, with Jeffrey Karpicke, now at Purdue University, has established the effect’s lasting power. “It alters what we remember and changes how we subsequently organize that knowledge in our brain.”

Posted in Education, Science | 1 Comment »

Comments

Carl says:

October 7, 2014 at 11:14 am

There’s a big difference between thinking you understand something and actually understanding something. You don’t actually know that you don’t know something until you have to explain it to someone else.

gaikokumaniakku: Tangentially related: https://simplicius76.sub stack.com/p/3m22-zircon- debunking-misconceptions At the link, a military expert criticizes Russian hypersonic weapons.
bob sykes: LBJ also kept a cooler of beer in that Caddy, and he usually was drinking a beer while he drove around.
Jim: Design really is everything.
Jim: Superb excerption.
Jim: I predicted thith.
Phileas Frogg: “Interestingly, working-class Americans are more likely to read local news, while the wealthy and highly educated favor national and global news.” I wonder how much of this is social norms vs self-perception. Do the wealthy and educated feel in touch with (or that they should be in touch with) national and global events, or is it mere mimicry? How about the working-class? The two are necessarily mutually exclusive of course, but I wonder if there’s a different primary...
Phileas Frogg: As advances in weapon’s have improved the range and effectiveness of the individual soldier’s impact on the battlefield, the number of soldiers has increased in importance relative to the quality of any individual soldier. It’s the Thermopylae Principle in reverse because of weapons advancements. Reminds me of RTS balancing, where to effectively implement melee units developers need to give them either unrealistic speed or durability relative to the ranged capabilities of...
Bob Sykes: Calling Mohamed Farrah Aidid’s militia a mob is a bit much, but they were lightly armed and poorly trained. However, they defeated the US/UN mission to Somalia, or at least fought it to a draw. We’re still fighting Aidid’s grandchildren, and we still haven’t won. The war is now in its 4th decade, with no end in sight. Settling aside our Indian Wars (1607 to 1918). Somalia is the longest US war.
McChuck: Anybody who has ever played a wargame can tell you that your defenses can handle a certain amount of opposition, but when enemy numbers, regardless of quality, exceed that number, you get overrun. The mobs in Mogadishu back in 1993 weren’t well organized, weren’t well equipped, weren’t well trained, and weren’t well led. But there sure were a whole lot of them shooting at the Rangers.
Russell: New article about Thorp https://archive.ph/fq5JC
Phileas Frogg: The class divide in the US is as deep and wide as any as has existed in history, but instead of acknowledging the divide, much less trying to bridge it, we have chosen moralization and mutual animosity, which will be the death of us.
Freddo: https://www.zerohedge.co m/geopolitical/us-drones -are-expensive-and-error -prone-so-ukraine-turns- china
Jim: The principle applies whatever the source of the wall-penetrating radio waves in question.
Phileas Frogg: In conjunction with the Rob Henderson excerpt one can safely conclude that neither drug use, nor sexual immorality, can be meaningfully correlated with class. “But poverty causes crime!” Does it?
Bruce: Green energy is a patronage fraud, and Terraform Industries is working with it. But their tech isn’t a fraud.
Handle: What is Henderson taking about? I was under the impression that it was a broadly believed meme and one frequently portrayed in popular entertainment for several generations before Henderson was born that college kids got drunk and enjoyed / experimented with recreational drugs quite a lot. Maybe Rob’s poorer friends couldn’t afford to watch those episodes of South Park or didn’t hear about George W Bush’s “youthful indiscretions”? Man, the left really is...
Bob Sykes: How effing stupid are we supposed to be. The energy input to make the methane will be several times the energy of combustion when the methane is burned. And, of course, all the methane will become exactly the amount of carbon dioxide originally removed from the atmosphere. Net, no carbon dioxide removal. And considering all the carbon produced during the manufacture of the Terraform system and the solar/wind systems, this proposal actually increases atmospheric CO2. And the cost claims are...
Gaikokumaniakku: If it’s “synthetic,” they could just call it “synthetic methane,” right? Calling it “synthetic natural gas” makes my brain hurt because I like to think that natural things are not synthetic. I love methane tech because I was raised on Mad Max: Beyond Thunderdome. Typically I like to see it as a way of dealing with dung from agriculture, but I am a fan of methane tech regardless of how they make it.
Bruce: Very glad to see @TerraformIndies here. This could be really big. For example, why not put a Terrform Indies converter on every nuclear power plant in front of those electric heaters in the cooling towers? Closer to where people need natural gas than oil wells where they just burn the natural gas off because it’s not worth transport costs. And if CO2 is bad, like the climate change people say, it’s great to have a way to turn CO2 to something useful. And worse case, it’s a...
Namur: Well, it’s all about EROI (Energy Returned on Invested). Can this machine produce more energy in the form of natural gas than it consumes in the first place? Also, the energy to build and operate it also counts. Now, let’s say that it works: we are still far away for something like it that works for Diesel, the holy grail of our energy society.

Isegoria

The Testing Effect

Comments

Leave a Reply

Search

Recent Comments

Categories

Archives