Desirable Difficulties: What the Research Gets Right (and Wrong)
Desirable difficulties are study strategies that feel harder during practice but produce stronger long-term retention and better transfer. This article summarizes what the research gets right and wrong — covering retrieval practice, spacing, and interleaving — and offers practical guidance for applying these tactics to high-stakes exams.
Desirable Difficulties: What the Research Gets Right (and Wrong)
Introduction
Desirable difficulties are learning tasks that feel harder in the moment but produce stronger long‑term retention and better transfer to new problems. Research on this idea was popularized by Elizabeth and Robert Bjork and has been replicated across labs and domains (Bjork & Bjork, 2011). Yet not every “hard” study tactic is helpful — some simply add effort and frustration without improving learning. This guide explains what the science gets right, where it overreaches, and, most importantly, how to apply the effective strategies in a predictable, practical way for high‑stakes exams.
The Science (Why It Works)
At the heart of the literature is the distinction between performance (how well you do during practice) and learning (durable changes that support later recall and transfer). Conditions that boost immediate performance — massed practice, blocked repetition, rereading — often raise short‑term retrieval strength but not the deeper storage strength needed for retention (Bjork & Bjork, 2011).
Three mechanisms explain why desirable difficulties work:
-
Retrieval practice / generation: Attempting to retrieve or generate answers strengthens memory traces and makes future retrieval easier. Tests are powerful learning events, not only assessments (Roediger & Karpicke research summarized by Bjork & Bjork; see also applied work in medical education) (Bjork & Bjork, 2011; Nelson & Eliasz, 2022).
-
Spacing and forgetting: Allowing some forgetting between sessions forces more difficult retrieval; successful retrieval after forgetting produces larger gains than easy repeated exposure (spacing effect) (Bjork lab and many replications summarized in review articles).
-
Interleaving and varied practice: Mixing related topics forces learners to discriminate and select the correct strategy for each problem, encouraging deeper encoding and transfer rather than brittle context‑bound performance (Kornell & Bjork; skill literature) (Bjork & Bjork, 2011).
These mechanisms are robust, but they have limits. Desirable difficulties are only “desirable” when learners have enough background knowledge and support to succeed with effort. If the difficulty overwhelms working memory or denies necessary scaffolding, it becomes an undesirable difficulty that blocks learning (de Bruin, 2024; cognitive load theory perspectives summarized in Nelson & Eliasz, 2022).
The Protocol (How To Do It)
Below is a prescriptive, step‑by‑step protocol for integrating evidence‑based desirable difficulties into exam preparation. Use it as a template and adapt timing to your course calendar.
-
Set goals and baseline
- Define the exam scope and target performance (types of problems, formats).
- Do an initial diagnostic: a timed practice test or a difficult set of problems to identify knowledge gaps.
-
Build a spaced schedule
- Break study into short sessions over weeks, not one long cram session.
- Use an expanding schedule: review topics after 1 day, 3 days, 1 week, 2+ weeks.
- Prioritize weaker topics with slightly shorter spacing; leave stronger topics on a wider spacing.
-
Prioritize retrieval practice
- Replace passive rereading with active retrieval: practice tests, flashcards that force answer generation, closed‑book problems.
- When using multiple‑choice, try to answer before looking at options; explain why distractors are wrong.
-
Interleave related topics
- Mix problems from different but related areas (e.g., in finance: valuation, risk measures, financial statement analysis).
- Avoid long blocks focusing on the same subtopic. Switch topics every 20–40 minutes or every practice set.
-
Vary practice conditions
- Change environmental cues (room, device), formats (essay, MCQ, worked example), and input modalities (read, listen, draw).
- For skills, vary problem parameters (different fact patterns, contexts) to build generalizable procedures.
-
Use generation + feedback
- Attempt answers before consulting solutions. If incorrect, study the solution and immediately try again to strengthen encoding.
- Space corrective feedback across sessions so you must retrieve corrected knowledge later.
-
Scaffold difficulty
- Start with partially guided retrieval (cue‑based prompts) then progress to full, unsupported retrieval.
- Align difficulty with current knowledge — don’t force maximal difficulty before readiness.
-
Monitor and calibrate metacognition
- After each session, record confidence and objective performance.
- If confidence is high but delayed test performance is low, increase spacing and retrieval frequency for that topic.
-
Repeat with reflection
- Periodically run full, mixed practice exams under test conditions to detect context‑dependence and consolidate transfer.
Common Pitfalls
-
Mistaking fluency for learning. Fast, smooth practice (massed repetition) feels good but predicts poor long‑term retention (Bjork & Bjork, 2011). Don’t use ease as your cue for mastery.
-
Starting with overly hard tasks. If learners lack prerequisites, desirable difficulties become discouraging — this is the classic boundary where difficulty becomes undesirable (de Bruin, 2024; Nelson & Eliasz, 2022).
-
Poorly implemented retrieval practice. Looking at answers before trying, or using recognition‑only tasks, removes the generative benefit. Force active recall first.
-
Interleaving chaos. Randomly mixing unrelated domains without clear structure can add extraneous load. Interleave strategically among related topics to promote discrimination.
-
Ignoring feedback and reflection. Retrieval without corrective feedback or delayed re‑retrieval won’t correct errors and can reinforce misconceptions.
-
Abandoning strategies because they feel hard. Students frequently stop spaced or interleaved practice because it feels less productive in the short run; instructor support and explicit explanation are necessary to sustain use (de Bruin, 2024).
Example Scenario: Applying This to a Finance/Law Exam
Goal: Prepare for a law‑oriented finance exam covering securities regulation, valuation, and contract remedies with essay and problem questions.
-
Baseline: Take a timed mixed test with one essay and two problem sets. Score and identify weak areas (e.g., contract remedies).
-
Schedule: Plan 8 weeks. Block out 4–6 short study sessions weekly. For each topic: initial study, retrieval session after 1 day, revisit after 4 days, then after 12 days.
-
Session design:
- Start with 15 minutes retrieval: closed‑book problem on securities regulation.
- Next 20 minutes interleaved practice: one valuation problem, one remedies problem, one quick statutory interpretation Q.
- Finish with 10 minutes of reflection and jotting where confidence mismatched performance.
-
Variation:
- Alternate essay practice (argument construction) with problem practice (numerical valuation).
- Practice under different settings: library desk, café, simulated exam hall once per week.
-
Feedback loop:
- Use model answers to check essays only after attempting first; rewrite a corrected outline from memory the next day.
- For valuation, generate intermediate steps (calculations) and test later without formula sheets.
-
Final weeks:
- Run three full mixed practice exams spaced 1 week apart. After each, redistribute study based on delayed recall rather than immediate performance.
This approach leverages retrieval, spacing, interleaving, and variation and avoids the performance‑illusion traps that come from blocked cramming (Bjork & Bjork, 2011; APS Observer summary).
Key Takeaways
- Performance ≠ learning. Short‑term fluency can be misleading; measure learning by delayed, mixed tests.
- Use three core strategies: retrieval practice, spacing, and interleaving.
- Keep difficulties desirable: calibrate difficulty to prerequisite knowledge; provide feedback and scaffolding.
- Implement in small, repeatable habits: short sessions, scheduled spacing, forced generation.
- Educate learners about the paradox — understanding the mechanism increases willingness to persist (de Bruin, 2024).
- Avoid overcomplicating: poorly planned difficulty becomes extraneous cognitive load and wastes time.
Useful Resources
- Bjork, E. L., & Bjork, R. A. — Creating Desirable Difficulties to Enhance Learning (PDF): https://bjorklab.psych.ucla.edu/wp-content/uploads/sites/13/2016/04/EBjork_RBjork_2011.pdf
- Psychological Science Observer — Desirable Difficulties (APS article summarizing Bjorks’ talk): https://www.psychologicalscience.org/observer/desirable-difficulties
- De Bruin, A. (2024). Dealing with Desirable Difficulties: Supporting Students to … (PMC): https://pmc.ncbi.nlm.nih.gov/articles/PMC10858853/
- Allen, M. M. — Pearls: Desirable Difficulty—Make Learning Harder on Purpose (PMC): https://pmc.ncbi.nlm.nih.gov/articles/PMC10723876/
- Nelson, A., & Eliasz, K. L. — Desirable Difficulty: Theory and application (Wiley): https://onlinelibrary.wiley.com/doi/abs/10.1111/medu.14916
End of article.