Evidence Ledger

Each global claim links here. Rows include domain, measurement, effect size, window, sources, and confidence level.

How to read a row

Effect size always in percentage points for the target behavior, not proxies
Window is explicit (for example D0–D7)
Sources link to a case write‑up and raw data when possible

ID	Claim	Domain	Effect size	Window	Confidence	Sources
BS-0001	Behavior-first validation during planning improves initiative success compared with assumption-first approaches.	Cross-industry	Qualitative pattern across the case library (no single published effect size)	Project-specific	Working	Case
BS-0002	Shorter time to first target behavior correlates with higher medium-term retention.	Consumer and SaaS	Product- and cohort-specific (validate per segment; no single published effect size)	Baseline to 90 days (or product-specific)	Working	Method
BS-0003	At-scale nudge-unit RCTs show small average effects (~1.4 pp), far below academic journal averages (~8.7 pp).	Public sector and enterprise	~1.4 pp average (vs ~8.7 pp in academic journals)	126 RCTs; 23 million individuals	High	Study
BS-0004	Opt-out (default) organ donation regimes do not reliably increase transplantation; complementary system investments drive outcomes.	Healthcare policy	No reliable increase in deceased donation after opt-out policy shifts; some evidence of decreased living donation (−29%)	Cross-national, multi-year comparisons	High	Review, Study, Study
BS-0005	Pivoting from low-fit behaviors to high-fit behaviors drives rapid adoption (Instagram: check-ins → photo sharing).	Technology	25K users day 1; 100K in first week; 7M in 9 months (company-reported)	Launch window; first 60–90 days	Working	Interview, Podcast, Interview
BS-0006	Behavior-first internal tool pivots (Slack) yield high early adoption and strong cohort retention when they solve real team behaviors.	Enterprise software	8k users day-1; 93% team retention after 2+ weeks (company-reported)	First 30–90 days of public beta	Working	Interview
BS-0007	Mobile money infrastructure (M-PESA) enables financial behaviors at population scale when environmental bottlenecks are removed.	Finance / Emerging markets	~194,000 households (2%) lifted out of poverty in Kenya (study estimate); improved risk-sharing and consumption smoothing	Multi-year (Kenya; study periods vary)	High	Study, Study
BS-0008	Personalized automation that removes choice overload (Spotify Discover Weekly) increases sustained discovery behaviors.	Technology / Media	>20% → >30% of listening from recommendations after feature launch (reported)	First 12–24 months	Working	Company post, Case write-up
BS-0009	Behavior scaffolding (micro-lessons, progress visibility, immediate feedback) enables durable learning behaviors (Duolingo).	Education / EdTech	Peer-reviewed evaluations report meaningful language learning gains for beginners using Duolingo over the study period	Study-specific (weeks to months)	Working	Study
BS-0010	Frictionless access at scale (Zoom) drives massive increases in meeting participation when environmental constraints spike.	Enterprise / Communication	10M → 300M daily participants (Dec 2019–Apr 2020); TTFB near-instant	Q1–Q2 2020	Working	Company post
BS-0011	Meta-analyses of choice architecture interventions show small-to-medium average effects with substantial heterogeneity and publication bias.	Cross-domain meta-analysis	d ≈ 0.43–0.45 (small-to-medium)	214 publications; 455 effect sizes	High	Meta-analysis
BS-0012	Commuter carpooling RCTs show context-bound effects; system design (safety, coordination, incentives) matters more than default prompts.	Transportation / Public policy	Varies; contingent on study design and system supports	Trial-specific	Working	RCT
BS-0013	Home energy reports (Opower) generate small, persistent behavior changes; effects are modest and context-dependent.	Energy / Residential	Small average reductions; persistence with decay (10-20% decay/year after reports stop)	12–24 months	High	RCT
BS-0014	Eliminating transaction fees (Robinhood) removes economic friction and enables first-time investing behaviors for new segments.	Finance	Large platform growth; industry-wide fee elimination followed	2013–2019	Working	Industry
BS-0015	Reputation and trust systems (Airbnb) enable high-stakes sharing behaviors; conversion improves when trust cues are salient.	Technology / Marketplaces	Conversion deltas vary by study; generally positive	12–36 months	Working	Experiment
BS-0016	Behavioral public strategy connects micro-foundations (individuals, teams, tools) to meso-level performance; simple nudges are brittle without system design.	Public policy	Not applicable; conceptual synthesis	Not applicable	High	Article
BS-0017	Temporal design of measurement (windows, cadence) materially affects whether change is detected; precise temporal hypotheses improve detection.	Methods / Measurement	Not applicable; meta-analysis of temporal aspects	Not applicable	High	Meta-analysis, Review
BS-0018	Self-identity explains additional variance in behavioral intentions beyond attitudes, subjective norms, and perceived behavioral control (TPB).	Psychology / Theory	r+ = 0.47; ΔR² ≈ 6% (intention) and 9% (identity) beyond TPB components	40 independent tests; N = 11,607	High	Meta-analysis
BS-0019	Minimum-component (bottleneck) logic: the lowest-scoring determinant constrains behavior; addressing bottleneck components is necessary for reliable change.	Methods / Theory	Not applicable	Not applicable	High	Model
BS-0020	Values-affirmation (identity-aligned) interventions yield durable academic benefits versus reminders alone.	Education	Context-dependent; durable effects reported in field studies	Multi-year	Working	Field
BS-0021	Psychological targeting (personality-tailored messaging) outperforms generic messaging for persuasion and conversion.	Marketing / Persuasion	Positive treatment effects; varies by context	Campaign-specific	Working	Experiment
BS-0022	In CB Insights’ startup post‑mortem analysis, “no market need” is a top failure reason (#2).
Product / Strategy	35% (ranked #2; CB Insights)	Various; cross-company	High	Report
BS-0023	The Fogg Behavior Model (B=MAP) established that behavior occurs when Motivation, Ability, and Prompt converge at the same moment.	Behavior Design / Theory	Foundation for 100+ behavior design applications	Since 2009	High	Conference paper
BS-0024	Three distinct brain systems govern behavior: Pavlovian (hardwired), habitual (automatic, outcome-insensitive), and goal-directed (flexible, outcome-sensitive).	Behavioral Neuroscience	Distinct circuits identified via fMRI and lesion studies	Established body of research	High	Review, Review
BS-0025	Goal-directed behaviors are outcome-sensitive (change with reward devaluation); habits are not. Most meaningful behaviors are goal-directed, not habitual.	Behavioral Neuroscience	Robust dissociation across human and animal studies	Established body of research	High	Computational model, Review
BS-0026	Habit formation research (66 days median) applies only to simple, automatic behaviors like drinking water, not to complex behaviors.	Habit Formation	Median 66 days (range 18-254)	84-day study	High	Longitudinal study
BS-0027	Bias-corrected analyses find an aggregated nudge effect (d = 0.270) that drops to near zero (d = 0.004) after publication-bias adjustment.	Nudge Effectiveness	d = 0.270 aggregate; d = 0.004 after publication-bias adjustment	13 articles (14 meta-analyses); 1,638 primary studies; ~30M participants	High	Meta-analysis, Second-order meta-analysis
BS-0028	Implementation intentions (if-then planning) improve goal achievement with medium-to-large effect size.	Behavior Planning	d = 0.65 (medium-large)	94 studies, 8,000+ participants	High	Meta-analysis
BS-0029	Intentions explain less than one-third of behavior variance; the intention-behavior gap is substantial and systematic.	Behavior Theory	TPB meta-analysis: ~27% of behavior variance explained; intention-change interventions shift intentions (d = 0.66) more than behavior (d = 0.36)	Meta-analyses across 185+ studies (TPB) and 47 experimental tests (intention → behavior)	High	Meta-analysis, Meta-analysis, Meta-analysis, Review
BS-0030	Behavior is predicted by attitudes, subjective norms, and perceived behavioral control (Theory of Planned Behavior).	Behavior Theory	R² = 0.27-0.39 for intentions; varies for behavior	Foundational theory since 1991	High	Theory paper
BS-0031	Habits are triggered by context cues, not goals; stable contexts produce stable behaviors.	Habit Formation	Strong context effects for simple, repeated behaviors	Review synthesis	High	Review
BS-0032	Automatic enrollment with escalation dramatically increases retirement savings behavior.	Finance / Behavior Design	Savings rates increased from 3.5% to 13.6% over 40 months	Multi-year longitudinal	High	Field study
BS-0033	Self-control is a limited resource that depletes with use; environmental design sustains behavior better than willpower.	Self-Regulation	Significant depletion effects (though effect sizes debated in replications)	Meta-analytic synthesis	Working	Review
BS-0034	General cognitive ability (GCA) predicts job performance with corrected validity around r=0.22 in 21st-century samples.	Cognitive Ability / Work	Observed r = 0.16; corrected r = 0.22	153 samples; N = 40,740 (21st-century)	High	Meta-analysis
BS-0035	Academic performance is jointly predicted by cognitive ability and Big Five traits; cognitive ability accounts for most of the explained variance; conscientiousness is the largest trait contributor.	Education / Psychometrics	Combined explains 27.8% variance; cognitive ability 64% of explained variance; conscientiousness 28%	267 samples; N = 413,074; 228 studies	High	Meta-analysis
BS-0036	Big Five traits predict performance across criteria; conscientiousness shows the most consistent positive association; other trait effects vary by performance category.	Personality / Performance	Overall: conscientiousness ρ = 0.19; extraversion ρ = 0.10; agreeableness ρ = 0.10; neuroticism ρ = −0.12; openness ρ = 0.13. Category-specific: −0.13 to 0.24	54 meta-analyses; k = 2,028; N = 554,778	High	Meta-analysis
BS-0037	Big Five measures do not fully capture HEXACO scale variance; using Big Five instead of HEXACO entails a large loss of information.	Personality / Psychometrics	Large deficiency in capturing HEXACO variance (authors conclude loss comparable to discarding one Big Five scale)	Multiple Big Five measures compared with HEXACO-PI-R scales	High	Psychometrics
BS-0038	Honesty-Humility is meaningful as a distinct factor; separating it from classic agreeableness-related facets improves prediction of deceit-related variables.	Personality / Psychometrics	Improved prediction reported for deceit-without-hostility variables after realignment	Lexical Big Five + Five-Factor Model facet realignment	High	Psychometrics
BS-0039	Personality traits show high rank-order stability in adulthood; stability increases with age and reaches ~0.74 in later adulthood.	Personality / Psychometrics	r ≈ 0.74 (ages 50–70; ~6.7-year interval at peak)	152 longitudinal studies; 3,217 test–retest coefficients	High	Meta-analysis
BS-0040	Exercise/physical-activity identity correlates with physical activity behavior across studies.	Health Behavior / Physical Activity	r = 0.44	Meta-analysis across 62 datasets	High	Meta-analysis
BS-0041	In an exercise RCT, increases in exercise identity predict physical activity at follow-up beyond treatment condition.	Health Behavior / Maintenance	b = 16.76 minutes/week per +1 identity point (95% CI: 2.36–31.15)	16-week RCT + follow-up; N = 130	Medium	RCT
BS-0042	Habit and identity are strongly correlated in health behaviors, consistent with a mutual reinforcement pathway for maintenance.	Health Behavior / Habit	r = 0.55 (95% CI: 0.49–0.60)	19 articles; N = 13,340	High	Meta-analysis
BS-0045	Conscientiousness is consistently associated with healthier behavior patterns across health behavior domains.	Personality / Health Behavior	r ≈ 0.05 (physical activity) to r ≈ −0.28 (drug use), directionally consistent across domains	194 studies	High	Meta-analysis
BS-0046	Personality traits can change via intervention, but average effects are modest and typically require sustained effort.	Personality / Change	Average trait change d ≈ 0.37 across interventions	207 intervention studies; 20,653 participants	High	Meta-analysis
BS-0047	Personality–life outcome associations replicate at high rates in a preregistered, high-powered replication project (with expected direction and moderate effect-size shrinkage).	Personality / Replication	87% of replications significant in expected direction; replication effect sizes averaged 77% of original effects	78 replications across 4 samples; total N = 6,106	High	Preregistered replication
BS-0048	Traits can be modeled as density distributions of states: people show substantial within-person variability, yet their average trait levels (distribution means) are highly stable/reliable within experience-sampling windows.	Personality / Psychometrics	Location stability (Study 1, 2-week ESM): Extraversion 0.90, Agreeableness 0.94, Conscientiousness 0.87, Emotional Stability 0.90, Intellect 0.94	Experience sampling over ~2 weeks; Study 1 N = 127	High	Experience sampling
BS-0049	Conscientiousness is associated with lower mortality risk across longitudinal cohorts.	Personality / Health	r = −0.11 (95% CI: −0.13 to −0.09)	Meta-analysis of 20 independent samples	High	Meta-analysis
BS-0050	General mental ability is a strong predictor of performance in job training programs.	Cognitive Ability / Training	r ≈ 0.56 for job training performance	Meta-analytic synthesis (Schmidt & Hunter, 1998)	High	Meta-analysis
BS-0051	A preregistered, 39-lab replication did not support the classic induced-compliance (high-choice > low-choice) attitude-change effect.	Social Psychology / Replication	Primary high-choice vs low-choice prediction not supported; counterattitudinal vs neutral contrast supported	39 labs; 19 countries; N = 4,898	High	Multilab preregistered replication
BS-0052	A modern archival analysis concludes that key elements of the classic ‘When Prophecy Fails’ account were substantially inaccurate and involved serious ethical problems.	History / Research Integrity	Not applicable	Archival analysis published 2026	Medium	Historical analysis
BS-0053	A widely cited behavioral-ethics intervention paper on signature placement and honesty was retracted.	Behavioral Ethics / Retraction	Not applicable	Retraction published 2021 (original 2012)	High	Retraction
BS-0054	Automatic enrollment dramatically increases 401(k) participation among new hires and anchors many participants to default contribution rates and asset allocations.	Finance / Defaults	Participation at 3–15 months tenure: 86% (automatic enrollment) vs 37% (opt-in). Default contribution adherence declines from 92% to 57% over 15 months.	3–15 months of tenure; up to 15 months post-hire	High	Field study, Working paper
BS-0055	Automatic enrollment can raise participation, but long-run retirement wealth effects can be modest after accounting for turnover and withdrawals.	Finance / Defaults	+0.6% of income in retirement savings; effects substantially smaller once accounting for job changes and cash-outs	Long-run model with turnover and withdrawals	Medium	Working paper
BS-0056	Experimental attempts to induce habits in humans often fail; extensive training may not produce outcome-insensitive (habitual) responding.	Habit Formation / Neuroscience	Five experiments reported failures to induce habitual responding in humans	Five experiments	High	Journal article
BS-0057	Acorns increases saving/investing contributions by piggybacking on an existing behavior (spending) via automatic round-ups.	FinTech / Personal finance	Average round-ups: ~$43 per customer per month (reported); >$150 invested in first 4 months from round-ups alone (company-reported)	Monthly; first 4 months post-enablement	Working	Company documentation, Company post, Press
BS-0058	Active allocation behaviors (assigning dollars before spending) can be more durable than passive awareness tools for budgeting (YNAB vs Mint).	FinTech / Personal finance	Mint: 1.5M users at 2009 acquisition (reported) and later shutdown (2024). YNAB: sustained subscription model anchored in repeated allocation behavior.	2007–2024	Working	Company documentation, Press, Press
BS-0059	ClassPass built a viable model around variety-seeking fitness behavior instead of forcing single-gym commitment.	Fitness / Marketplace	Reported partner-side pattern: high share of visits are to new studios early; usage concentrates after initial exploration (company/partner reporting varies).	2012–2018 (model evolution)	Working	Press, Case write-up, Press
BS-0060	Gym membership contracts often monetize overconfidence: people pay flat fees while under-attending and delaying cancellation.	Behavioral Economics / Fitness	Members overestimated visits (~9.5 forecast vs ~4.2 actual/month); flat-fee members attended ~4.3x/month and paid an effective price >$17/visit (field data).	3 years; N=7,752 members (3 US health clubs)	High	Journal article
BS-0061	Peloton grew by matching at-home exercise constraints with instructor-led classes and social reinforcement; adoption shifted as context shifted post-pandemic.	Fitness / Consumer subscription	Hardware-subscriber engagement spiked during COVID and normalized afterward (reported).	2018–2023	Working	Press, Press
BS-0062	Social reinforcement (kudos) can modestly increase running frequency; Strava leverages social feedback and competition to sustain exercise behaviors.	Fitness / Social platforms	Receiving kudos is associated with increased running frequency (study of running clubs; magnitude context-specific); Strava reports 180M+ athletes in 185+ countries.	Study-specific (running clubs; longitudinal social network analysis)	Working	Journal article, Company
BS-0063	Couch to 5K enables novice running by reducing capability barriers through progressive overload (walk/run intervals).	Fitness / Program design	Completion in one study: 27.3% (n=110); UK NHS Couch to 5K: millions of downloads and completions (reported).	9-week program; app reporting (2016–2024)	Working	Study, Government
BS-0064	Discord achieved adoption by formalizing an existing behavior chain (team voice + text coordination) for gaming groups and then expanding to broader communities.	Technology / Communication	200M+ global monthly active users (company-reported, 2025); expansion beyond gaming (reported).	2015–2025	Working	Company, Reference
BS-0065	Figma won by matching real collaborative design workflows: real-time multiplayer editing removed ‘handoff’ friction and enabled team co-creation behavior.	Technology / Work tools	Real-time multiplayer editing reduced coordination overhead by eliminating export/sync/email file behaviors (mechanism; quantitative varies).	2016–2023	Working	Company post, Press
BS-0066	TikTok outcompeted Vine by reducing creator friction and increasing distribution predictability via algorithmic recommendations and longer-format flexibility.	Technology / Social media	Mechanism-level: recommendation feed makes reach less dependent on follower graphs; video-length flexibility expands viable content behaviors.	2016–2024	Working	Company post
BS-0067	YouTube succeeded by pivoting from a low-fit behavior (video dating) to a higher-fit one (upload/share any video), then enabling creator incentives (Partner Program).	Technology / Creator platforms	At acquisition (reported): ~100M daily video views; ~65K new videos uploaded daily.	2005–2007	High	Filing, Press
BS-0068	Netflix won by matching viewing behavior with subscriptions and eliminating late-fee friction; Blockbuster’s model depended on behavior (returns) people often failed to perform.	Media / Subscription	Blockbuster late fees were a significant revenue component (reported); Netflix growth followed subscription + no late fees model (reported).	1999–2010	Working	Press, Press
BS-0069	Google Wave failed because it required too much behavior change at once (new collaboration behaviors from everyone, unclear use case, high learning cost).	Technology / Collaboration tools	Creators reported misalignment between their needs and mass-market learning costs; early active usage was low relative to invites (reported).	2009–2012	Working	Interview, Reference
BS-0070	Quibi failed because its core context assumption (commute ‘in-between moments’) collapsed at launch and the target behavior could not compete with free social video.	Media / Failures	Trial-to-paid conversion reported ~8% for early cohorts (Sensor Tower estimate); shutdown announced ~6 months post-launch.	Apr–Oct 2020	Working	Press, Press, Press
BS-0071	Meditation apps often suffer severe early attrition because the target behavior is effortful with delayed rewards; retention is typically low without strong fit and scaffolding.	Health / Digital therapeutics	Meta-analytic attrition ~25% (higher in larger studies); median 30-day retention for mindfulness apps reported ~4.7% (app analytics).	30 days and longer; study-dependent	Working	Meta-analysis, Study
BS-0072	Digital health apps lose many users early; onboarding and early time-to-first-benefit strongly shape whether behavior persists.	Health / Digital products	Health app dropout reported ~43%; median abandonment over time can exceed 70% (study-dependent).	First weeks to ~100 days (study-dependent)	Working	Study
BS-0073	Simplifying HIV treatment behaviors (single-tablet regimens; long-acting injectables) improves adherence and clinical outcomes compared to higher-burden regimens.	Healthcare / Adherence	Meta-analysis: discontinuation 36.3% (single-tablet) vs 48.8% (multi-tablet); ~21% higher odds of undetectable viral load (reported).	Meta-analysis (study periods vary)	High	Meta-analysis, Meta-analysis, Report
BS-0074	Meal kit subscriptions often churn because the target behavior (plan, cook, and manage deliveries weekly) has low fit for many households; effort and context constraints dominate.	Consumer subscription / Food	Blue Apron reported ~50% continuation after two weeks and ~10% after six months (press reporting; varies by cohort).	First 6 months	Working	Press, Press
BS-0075	Waze scaled crowdsourced traffic reporting by making micro-contributions low-friction and rewarding, improving navigation outcomes.	Technology / Navigation	At acquisition: ~50M users (reported); incident reporting and map edits at large scale (reported).	2009–2013 and beyond	Working	Press, Case write-up, Press
BS-0076	Open-plan offices can backfire: face-to-face interaction drops sharply and digital messaging increases when noise and interruption costs rise.	Workplace design	Face-to-face interaction decreased ~70%; email increased ~56%; IM increased ~67% after open office adoption (field study).	Before/after move to open plan (Fortune 500 HQ)	High	Journal article, Review
BS-0077	Responsible beverage service (RBS) training reduces over-service and alcohol-related harms by targeting the gatekeeper behavior that can change in the moment.	Public health / Alcohol harms	Oregon server-training mandate associated with a 23% reduction in fatal single-vehicle nighttime crashes; community reviews report reduced intoxicated driving and related harms.	1986–1994 (Oregon); multi-study reviews	High	Systematic review, Study
BS-0078	Reducing time-to-first-value in SaaS onboarding increases activation: Proposify improved onboarding by guiding users to the core workflow quickly.	SaaS / Sales enablement	Before redesign, only ~14% of trial users completed onboarding steps (company-reported).	Pre/post onboarding redesign (company-reported; dates vary)	Working	Company-reported, Company post

How to add a row

Add an entry to /_data/evidence_ledger.yml.
Use {% include evidence-ref.html id="BS-XXXX" %} next to the claim on the relevant page.
Keep denominators explicit and link to case write‑ups.

How to read a row (example)

ID	Claim	Domain	Effect size	Window	Confidence	Sources
BS‑0001	TTFB < 5 min increased completion of account setup among new users	B2C mobile	Δ‑B +14 pp (22→36)	D0‑D7 post‑exposure	Medium	Case write‑up, Raw data

Jason Hreha· Updated January 31, 2026

On this page

Evidence Ledger

How to read a row (example)

See also