Does anything beat buy & hold on Gold (IAU)?
Every setup we tested on Gold (IAU) — ranked out-of-sample, corrected for multiple testing, and forward-tracked in public from the day this page published. The honest answer is the headline.
No setup beat simply holding once tested honestly. We say so plainly.
The best setup trailed buy-and-hold out-of-sample. Buy-and-hold benchmark: +11.1% CAGR over 21.4 years (+16.8% CAGR in the out-of-sample window).
Educational research from historical backtests — not investment advice. Past performance does not predict future results.
Gold: Nothing Beat Buy-and-Hold, and That Is the Honest Answer
Widely traded instruments like Gold are where indicator strategies go to disappoint. We ran 742 setups against IAU, and none cleared the bar once scored honestly, on data the strategy never saw. The best of the batch, Stochastic (20,5) on the weekly timeframe, posted an out-of-sample Sharpe of 1.27, short of the 1.44 hurdle we require before calling anything real. IAU is a physically backed gold trust rather than a basket of single names, so there is no diversification story to lean on here: the result simply leaves buy-and-hold's +11.1% annualized return as the number nothing managed to beat.
Read these figures with the selection problem in mind. Test 742 setups, keep the best, and the winner looks impressive by construction, which is exactly why the hurdle exists instead of applause for a lucky draw. Here, only 1.5% of eligible setups had positive out-of-sample alpha, and the top candidate produced -2.0% annual alpha over 6.4 unseen years on 23 trades. Over its full history it made 77 trades with a 59.7% win rate and a -35.7% maximum drawdown. That pattern reads as noise, not signal. Markets also change, so even a genuine past edge can fade. This page documents what failed, which is useful to know before assuming something works.
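The selection problem can be illustrated with a small simulation. The sketch below is not the site's engine: it assumes 742 setups with zero true edge, each producing independent Gaussian weekly returns over roughly 6.4 years, and reports the best and median annualized Sharpe. The function name, the 2% weekly volatility, and the seed are illustrative assumptions.

```python
import random
import statistics


def noise_sharpes(n_setups=742, years=6.4, bars_per_year=52, seed=0):
    """Annualized Sharpe ratios of setups whose true edge is exactly zero."""
    rng = random.Random(seed)
    n_bars = int(years * bars_per_year)
    sharpes = []
    for _ in range(n_setups):
        # Pure noise: mean-zero weekly returns (2% weekly volatility is an assumption).
        rets = [rng.gauss(0.0, 0.02) for _ in range(n_bars)]
        mean = statistics.fmean(rets)
        stdev = statistics.stdev(rets)
        sharpes.append(mean / stdev * bars_per_year ** 0.5)
    return sharpes


sharpes = noise_sharpes()
best = max(sharpes)
median = statistics.median(sharpes)
print(f"best of {len(sharpes)} noise setups: Sharpe {best:.2f}")
print(f"median noise setup: Sharpe {median:.2f}")
```

Under these assumptions the median setup sits near zero while the best one posts a Sharpe that would look like skill in isolation. That gap is what a selection hurdle is meant to absorb.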
Every figure above is computed from our own backtests — nothing is estimated or invented. Hypothetical results; not investment advice.
The least-bad setups — shown with their failure numbers
Nothing here earned a verdict — these are the best of a losing field, published so you can see exactly how "best" still failed.
Stochastic (20,5)
Mechanical rule (exactly as backtested): stochastic (20,5) variant; long while %K leads %D. Signals are evaluated at weekly-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only (no leverage, no shorting).
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.27 · alpha -2.0% · 23 trades over 6.4 yrs.
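The rule above can be sketched in a few lines. This is a minimal illustration, not the engine's code. It assumes that (20,5) means a 20-bar %K smoothed by a 5-bar simple average for %D, that "leads" means %K is above %D, and that "position changes on the NEXT bar" means the signal at one close sets the position held over the following close-to-close bar. The function names are hypothetical.

```python
def stochastic_positions(highs, lows, closes, k_len=20, d_len=5):
    """Return a 0/1 position per bar; positions[t] is decided at the close of bar t-1."""
    n = len(closes)
    k = [None] * n
    for i in range(k_len - 1, n):
        hi = max(highs[i - k_len + 1 : i + 1])
        lo = min(lows[i - k_len + 1 : i + 1])
        # %K: where the close sits inside the trailing high/low range (0..100).
        k[i] = 100.0 * (closes[i] - lo) / (hi - lo) if hi > lo else 50.0
    d = [None] * n
    for i in range(k_len + d_len - 2, n):
        # %D: simple moving average of the last d_len values of %K.
        d[i] = sum(k[i - d_len + 1 : i + 1]) / d_len
    signal = [1 if d[i] is not None and k[i] > d[i] else 0 for i in range(n)]
    # Shift by one bar so a signal never trades on the bar that produced it.
    return [0] + signal[:-1]


def net_returns(closes, positions, cost_per_side=0.0008):
    """Close-to-close returns of a long/flat position, net of 0.08% per side."""
    rets = []
    prev_pos = 0
    for t in range(1, len(closes)):
        bar_ret = closes[t] / closes[t - 1] - 1.0
        pos = positions[t]
        cost = cost_per_side * abs(pos - prev_pos)  # charged on entry and on exit
        rets.append(pos * bar_ret - cost)
        prev_pos = pos
    return rets
```

The one-bar shift is the part that matters most: without it, the backtest would earn the return of the same bar whose close generated the signal.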
Order-Flow Reversion
Mechanical rule (exactly as backtested): Net-liquidity reversion — fades a 2-sigma price stretch only when signed-volume order-flow shows sellers are exhausted (imbalance z-score), exits on the mean. Needs real volume, so it abstains on feeds that don't report it. Signals are evaluated at daily-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.19 · alpha -9.4% · 19 trades over 6.4 yrs.
Stochastic Slow (21,5)
Mechanical rule (exactly as backtested): Variant — slow stochastic; long while %K leads %D. Signals are evaluated at weekly-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.18 · alpha -3.1% · 26 trades over 6.4 yrs.
Since publication — including if it loses
The forward record is just getting started. The gap between the setup and buy-and-hold is the honest score. Marked to market nightly from real prices, with rules frozen at publication, as of 2026-06-29. Currently FLAT.
We tested 742 setups (indicator × parameters × timeframe) on Gold (IAU). Only setups with ≥30 trades qualify (617 did). Setups are ranked by out-of-sample Sharpe — the last ~30% of history, which standard-parameter rules never saw during selection. Because picking the best of 742 tries mines even the holdout, the VALIDATED verdict additionally requires the top setup’s OOS Sharpe to clear a selection hurdle of 1.44 (√(2 ln N)/√T) AND positive alpha in both windows. Of the eligible setups, 1.5% had positive out-of-sample alpha (median OOS Sharpe 0.62) — the table below is truncated, but this summary covers all of them. Full recipe: methodology · the engine’s contract lives in the repo as STRATEGY_METHODOLOGY.md.
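The hurdle formula can be checked directly. The sketch below assumes N is the 742 setups tried and T is the 6.4 out-of-sample years, a reading that reproduces the published 1.44. It also assumes the "both windows" are the full-history and out-of-sample alpha columns of the table. The function names are illustrative and not taken from STRATEGY_METHODOLOGY.md.

```python
import math


def selection_hurdle(n_tried, oos_years):
    """sqrt(2 ln N) / sqrt(T): roughly the best Sharpe expected from N pure-noise tries."""
    return math.sqrt(2.0 * math.log(n_tried)) / math.sqrt(oos_years)


def is_validated(oos_sharpe, alpha_full, alpha_oos, n_tried=742, oos_years=6.4):
    """VALIDATED needs the OOS Sharpe to clear the hurdle AND positive alpha in both windows."""
    hurdle = selection_hurdle(n_tried, oos_years)
    return oos_sharpe >= hurdle and alpha_full > 0 and alpha_oos > 0


hurdle = selection_hurdle(742, 6.4)
print(f"hurdle: {hurdle:.2f}")

# Top-ranked setup from the table: OOS Sharpe 1.27, alpha -5.8% (full) and -2.0% (OOS).
print("Stochastic (20,5) validated:", is_validated(1.27, -5.8, -2.0))
```

The top setup fails on both counts: its out-of-sample Sharpe is below the hurdle and its alpha is negative in each window.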
Top 20 of 617 eligible setups
Ranked by out-of-sample Sharpe. Full-history and out-of-sample columns, costs included. Hypothetical.
| # | Setup | TF | Total ret | Sharpe | Max DD | Win | Trades | α vs B&H | OOS Sharpe | OOS α | OOS trades |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Stochastic (20,5) | Weekly | +199.6% | 0.48 | -35.7% | 59.7% | 77 | -5.8% | 1.27 | -2.0% | 23 |
| 2 | Order-Flow Reversion | Daily | +115.1% | 0.48 | -16.4% | 74.2% | 66 | -7.5% | 1.19 | -9.4% | 19 |
| 3 | Stochastic Slow (21,5) | Weekly | +164.1% | 0.43 | -37.8% | 58.0% | 81 | -6.4% | 1.18 | -3.1% | 26 |
| 4 | Williams Alligator | Weekly | +327.3% | 0.59 | -45.5% | 66.7% | 54 | -4.1% | 1.16 | +0.3% | 12 |
| 5 | Fractal Adaptive MA | Weekly | +364.6% | 0.67 | -30.4% | 40.8% | 125 | -3.7% | 1.13 | -3.3% | 39 |
| 6 | Random Walk Index | Weekly | +392.3% | 0.60 | -38.4% | 51.9% | 52 | -3.4% | 1.09 | -0.2% | 10 |
| 7 | Random Walk Index | Weekly | +392.3% | 0.60 | -38.4% | 51.9% | 52 | -3.4% | 1.09 | -0.2% | 10 |
| 8 | DPO (10) | Weekly | +404.5% | 0.63 | -39.7% | 50.9% | 53 | -3.2% | 1.09 | -0.9% | 10 |
| 9 | Positive Volume Index | Daily | +328.7% | 0.59 | -37.2% | 35.0% | 60 | -4.1% | 1.08 | -0.0% | 15 |
| 10 | Balance of Power | Weekly | +429.9% | 0.62 | -38.6% | 55.1% | 49 | -3.0% | 1.08 | -0.3% | 11 |
| 11 | Premier Stochastic | Weekly | +435.9% | 0.65 | -32.1% | 51.3% | 39 | -2.9% | 1.08 | -0.9% | 11 |
| 12 | Vortex | Weekly | +400.4% | 0.61 | -38.4% | 49.1% | 53 | -3.3% | 1.06 | -0.7% | 10 |
| 13 | Chande Kroll Stop | Weekly | +605.4% | 0.66 | -39.4% | 49.1% | 57 | -1.5% | 1.05 | +0.1% | 14 |
| 14 | Accelerator Oscillator | Weekly | +236.0% | 0.52 | -34.4% | 54.7% | 64 | -5.3% | 1.05 | -3.6% | 19 |
| 15 | Triangular Hull MA | Weekly | +286.1% | 0.60 | -28.5% | 43.8% | 64 | -4.6% | 1.05 | -3.9% | 19 |
| 16 | Instantaneous Trendline | Weekly | +264.3% | 0.5 | -42.4% | 51.1% | 47 | -4.9% | 1.04 | -1.1% | 11 |
| 17 | QQE | Daily | +696.0% | 0.65 | -43.7% | 41.7% | 321 | -0.9% | 1.03 | +0.9% | 98 |
| 18 | Ease of Movement | Weekly | +420.1% | 0.63 | -49.4% | 46.8% | 47 | -3.1% | 1.03 | -1.3% | 12 |
| 19 | Cutler's RSI | Weekly | +562.5% | 0.71 | -33.0% | 40.3% | 62 | -1.9% | 1.02 | -1.4% | 11 |
| 20 | ROC (14) | Weekly | +562.5% | 0.71 | -33.0% | 40.3% | 62 | -1.9% | 1.02 | -1.4% | 11 |
Hypothetical backtests with 0.08%/side costs. Not investment advice — see the full disclaimer.
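The "α vs B&H" column can be sanity-checked against the "Total ret" column. The sketch below rests on two assumptions that the page does not state: each setup's total return spans the same 21.4 years as the benchmark, and alpha is the setup's CAGR minus the benchmark's +11.1% CAGR. The function names are illustrative.

```python
def cagr_pct(total_return_pct, years):
    """Convert a cumulative return in percent into a compound annual growth rate in percent."""
    growth = 1.0 + total_return_pct / 100.0
    return (growth ** (1.0 / years) - 1.0) * 100.0


def alpha_vs_benchmark(total_return_pct, years=21.4, benchmark_cagr_pct=11.1):
    """Annualized return of the setup minus the buy-and-hold CAGR (assumed definition)."""
    return cagr_pct(total_return_pct, years) - benchmark_cagr_pct


# (total return %, published alpha %) for three rows of the table.
rows = {
    "Stochastic (20,5)": (199.6, -5.8),
    "Chande Kroll Stop": (605.4, -1.5),
    "QQE": (696.0, -0.9),
}
for name, (total, published) in rows.items():
    print(f"{name}: implied {alpha_vs_benchmark(total):+.1f}% vs published {published:+.1f}%")
```

Under these assumptions the three rows land within rounding of the published figures, which suggests the column is a simple CAGR difference rather than a regression alpha.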
These are historical backtests of mechanical rules. They are educational research, not investment advice, not a recommendation, and not tailored to you. Hypothetical backtested results; past performance does not guarantee future results. Trading involves risk of loss.