How to Make Effective Flashcards (Cognitive Science Edition)

TL;DR

A flashcard is a forced retrieval question. If you can answer it by recognising the back of the card instead of producing the answer yourself, it's not a flashcard, it's a recognition test. Three rules cover 90% of the difference between a deck that works and a deck that doesn't: one fact per card, the prompt is a question, and the answer is short enough to say out loud.

You've probably seen a deck of someone else's flashcards and thought "I should make those." Two weeks later, you have 400 cards, none of them have been reviewed in five days, and the ones you have reviewed feel weirdly easy in the moment and weirdly impossible on the exam. The deck isn't bad because of the schedule. It's bad because of the cards.

Most flashcard advice online is about which app to use. The app matters less than people think — Anki, Queazy, Quizlet, paper, all work fine. What determines whether a deck actually moves your retention is the design of each individual card. Below: the three rules that matter, what they look like in practice across different subjects, and the common failure modes.

Rule 1: One fact per card

If you can split the answer into two unrelated sentences and they'd both still be true alone, it's two cards.

A card that violates this: "What are the four causes of metabolic acidosis with anion gap and which two are most common in alcoholics?" The answer contains two pieces of information that don't logically depend on each other in the moment of retrieval. When you review it, you'll get one half right and the other wrong, and the card scores as a miss — but you've actually mastered half of it. The schedule wastes your time re-testing the part you knew.

Split: "What are the four common causes of high anion-gap metabolic acidosis?" and "In an alcoholic patient with anion-gap metabolic acidosis, what are the two most likely causes?" Each card now has a single retrieval target. The schedule can promote each one independently.

The pragmatic version of this rule: if the answer is longer than three sentences, you wrote two cards.

Rule 2: The prompt is a question, not a topic

"Hyperkalemia" is a topic. "What ECG changes appear in hyperkalemia, in order of severity?" is a question.

Topic-fronted cards encourage a vague "tell me everything you know" response. That's not retrieval — that's free association. The card grades itself on whether the response contains the correct material, which means you can ramble for 90 seconds, mention the right thing in passing, and call it a hit.

Question-fronted cards force a specific answer. You either produced "peaked T waves → prolonged PR → widened QRS → sine wave" or you didn't. The grade is binary and the retrieval is real.

A test for whether your prompt is a question: read it out loud. Does someone hearing it for the first time know exactly what answer you want? If they'd respond "what about it?", you have a topic.

Rule 3: The answer is sayable

Out loud. In one breath. Without slides.

This is the rule most students break. They put a five-bullet definition on the back of the card, the front asks "what is X?", and they review by reading both sides and feeling like they learned something. They haven't, because:

The answer was too long to retrieve from memory in a single pass — they had to read it instead of recalling it.
"Read both sides" is the failure mode where the card becomes a re-reading exercise.

Operational rule: the back of the card should be answerable as a sentence or a short list (≤5 items). If your concept genuinely needs a five-bullet answer, split it: one card per bullet, with the prompt narrowed enough to elicit that single bullet.

What good cards look like — by subject

Three subjects, three styles. The rules are the same; the texture differs.

Anatomy / pharmacology (fact-dense)

Bad: Front: "Cranial nerve VII" / Back: "Facial nerve. Innervates muscles of facial expression. Exits at stylomastoid foramen. Branches: temporal, zygomatic, buccal, mandibular, cervical."

This is six facts on one card. Test for recognition, not recall. You'll feel familiar with all of them and remember none.

Good (split into 3 cards):

"Which cranial nerve innervates the muscles of facial expression?" → "VII (facial)"
"At what foramen does the facial nerve exit the skull?" → "Stylomastoid foramen"
"Five terminal branches of the facial nerve, in order from superior to inferior?" → "Temporal, zygomatic, buccal, mandibular, cervical (To Zanzibar By Motor Car)"

The third one bundles five items because the mnemonic makes them a unit — and the prompt explicitly asks for "five, in order," which forces production.

Conceptual subjects (mechanisms, causation)

Bad: Front: "Mechanism of cardiac action potential" / Back: a 200-word paragraph describing all 5 phases.

Good (split):

"Phase 0 of the cardiac action potential — which channel opens, what happens to membrane potential?" → "Fast Na+ channels open; rapid depolarisation from -90 to +20 mV"
"Phase 2 of the cardiac action potential — what causes the plateau?" → "Balance between Ca²⁺ influx (slow L-type channels) and K+ efflux"
"In phase 3, which channel is responsible for repolarisation?" → "Delayed rectifier K+ channel (IKr)"

Each card tests one transition. You can promote phase 2 to monthly review while keeping phase 4 in daily review if phase 4 is the one giving you trouble.

Languages (vocabulary, sentence patterns)

Bad: Front: "der Tisch" / Back: "the table"

This is fine as a starting card but it's testing recognition. You'll see "der Tisch" and feel familiar with it. You won't be able to produce "der Tisch" when you need to say "the table" in German.

Better: Front: "How would you say 'the table is in the kitchen' in German?" / Back: "Der Tisch ist in der Küche."

Now you're producing a full sentence with article agreement, preposition case, and word order — three lessons in one card with a single retrieval cue. Cloze-deletion (fill-in-the-blank) is even better for vocabulary acquisition: "Der ___ ist in der Küche." (Answer: Tisch.)

Quantitative subjects (formulas, problem patterns)

Bad: Front: "Quadratic formula" / Back: "x = (-b ± √(b² - 4ac)) / 2a"

This tests memorising a formula. Memorising the formula is fine but it doesn't help you apply it.

Good (paired):

"State the quadratic formula." → "x = (-b ± √(b² - 4ac)) / 2a"
"x² - 5x + 6 = 0 — solve using the quadratic formula. Show the discriminant first." → "Discriminant: 25 - 24 = 1. x = (5 ± 1) / 2 = 3 or 2."

The first card builds the recall. The second card is the actual exam skill — applying it to a specific problem. You need both.

Common failure modes

The "definition trap" Front: "Define X." Back: a textbook definition. The card tests whether you can recognise the definition you copied, not whether you understand X. Replace with: "What's the simplest sentence you could explain X to a first-year student?" The answer is in your own words, which forces actual understanding.

The "I'll figure out what I meant later" card Front: "the thing about hyperaldosteronism" / Back: "K+ low, BP high, urinary K+ wasting." Three weeks later you have no idea what version of the question your past self had in mind. Write the prompt as if a stranger is reading it. Always.

The cloze that includes the answer in the prompt "Hyperkalemia causes ___ T waves on ECG." You can guess "peaked" from context without retrieving anything. Better: "What's the earliest ECG change in hyperkalemia?" → "Peaked T waves."

The card you make right after first reading the chapter You don't yet know what's actually testable. Wait until you've done one problem set or seen one exam question on the topic — then make cards. Otherwise you make 30 cards on definitions and zero on the mechanisms that show up on the exam.

The card you copied from someone else's deck and don't understand The mental shortcut "Anking has 30,000 cards, they're all good" is wrong. About a third of any large community deck has at least one of the failure modes above. Make your own cards, even if a community deck exists — the act of making the card is half the retrieval practice.

How many cards is too many

A working number for the average student: 15-20 new cards per day, ~80-120 reviews. If you're full-time studying for a board exam (USMLE, Bacalaureat, MCAT), you can push new cards to 30/day and reviews to 200, but only because you have the time to review without skipping.

If your review queue is over 300/day and you're not full-time studying, your deck is too large. Suspend the lowest-priority cards (the ones from chapters that haven't been tested) until your queue stabilises.

What to do tomorrow

Open the deck you have. Pick five random cards. For each, ask:

Does the answer have more than three sentences? → split it.
Is the prompt a topic rather than a question? → rewrite.
Can you answer it by recognising rather than producing? → reformat or delete.

If all five cards fail one of those tests, you have a recognition deck, not a recall deck. Rebuild the rules into your next 20 cards before adding any more.

FAQ

Should I use images on flashcards?

Yes, for visual-spatial subjects (anatomy, histology, geography, architecture). Mayer (2009) showed dual-coding — text plus image — produces better retention than text alone for spatial material. For pure-fact subjects, images often distract more than they help.

Cloze-deletion or front/back?

Cloze-deletion is better for vocabulary, sentence patterns, and any case where context matters. Front/back is better when the answer is genuinely a standalone fact. You can mix both in the same deck.

How long should I spend making each card?

90 seconds to two minutes. More than that and you're over-engineering the wording — at which point you're writing notes, not flashcards. If a card needs a four-paragraph back, it's not a flashcard; it's a summary that belongs in your notes.

Is it worth making flashcards if I'll only use them once before an exam?

Honestly, no. Active recall during card-making gives you some encoding, but flashcards earn their keep through repeated retrieval over days. If you're cramming the night before, just do practice problems or blurt the topic from a blank page — same retrieval benefit, no deck-management overhead.

Sources

Karpicke, J. D., & Blunt, J. R. (2011). Retrieval practice produces more learning than elaborative studying with concept mapping. Science, 331(6018), 772-775. https://doi.org/10.1126/science.1199327
Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science, 17(3), 249-255. https://doi.org/10.1111/j.1467-9280.2006.01693.x
Mayer, R. E. (2009). Multimedia Learning (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511811678
Bjork, R. A., & Bjork, E. L. (2011). Making things hard on yourself, but in a good way: Creating desirable difficulties to enhance learning. https://bjorklab.psych.ucla.edu/wp-content/uploads/sites/13/2016/07/BjorkBjork2011.pdf
Dunlosky, J., Rawson, K. A., Marsh, E. J., Nathan, M. J., & Willingham, D. T. (2013). Improving students' learning with effective learning techniques. Psychological Science in the Public Interest, 14(1), 4-58. https://doi.org/10.1177/1529100612453266