Does anything beat buy & hold on Consumer Disc. (XLY)?
Every setup we tested on Consumer Disc. (XLY) — ranked out-of-sample, corrected for multiple testing, and forward-tracked in public from the day this page published. The honest answer is the headline.
No setup beat simply holding once tested honestly. We say so plainly.
Its best setup only beat buy-and-hold in one window — a regime artifact, not a strategy. Buy-and-hold benchmark: +9.6% CAGR over 27.4 years (+10.9% CAGR in the out-of-sample window).
Educational research from historical backtests — not investment advice. Past performance does not predict future results.
Consumer Disc.: Nothing Beat Buy-and-Hold, and That Is the Honest Answer
Broad, diversified instruments like Consumer Disc. are where indicator strategies go to disappoint. We ran 749 setups against XLY, and none cleared the bar once scored honestly — on data the strategy never saw. The best of the batch, Parabolic SAR on the daily timeframe, posted an out-of-sample Sharpe of 1.01, short of the 1.27 hurdle we require before calling anything real. For an index fund this is the expected result: whatever inefficiency exists in single names tends to average away in the basket, leaving buy-and-hold's +9.6% annualized return as the number nothing here managed to beat.
Read these figures with the selection problem in mind. Test 749 indicators, keep the best, and the winner looks impressive by construction — which is exactly why the hurdle exists instead of applause for a lucky draw. Here, only 3.3% of setups outperformed buy-and-hold even in-sample, and the top candidate produced +4.8% annual alpha over 8.2 unseen years, across 332 trades with a 42.8% win rate and a -59.4% drawdown. That pattern reads as noise, not signal. Markets also change, so even a genuine past edge can fade. This page documents what failed — useful to know before assuming something works.
Every figure above is computed from our own backtests — nothing is estimated or invented. Hypothetical results; not investment advice.
The least-bad setups — shown with their failure numbers
Nothing here earned a verdict — these are the best of a losing field, published so you can see exactly how "best" still failed.
Parabolic SAR
Mechanical rule (exactly as backtested): Long while price is above the Parabolic SAR (0.02, 0.2) trailing dots. Signals are evaluated at daily-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.01 · alpha +4.8% · 95 trades over 8.2 yrs.
T3 (Tillson)
Mechanical rule (exactly as backtested): Long while price is above the smooth, low-lag Tillson T3 moving average. Signals are evaluated at daily-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.01 · alpha +3.9% · 138 trades over 8.2 yrs.
Fractal Adaptive MA
Mechanical rule (exactly as backtested): Long while price is above Ehlers' Fractal Adaptive MA (adapts to fractal dimension). Signals are evaluated at weekly-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 0.89 · alpha +0.3% · 42 trades over 8.3 yrs.
Since publication — including if it loses
The forward record is just getting started — the gap between the two is the honest score. Marked to market nightly from real prices, rules frozen at publication, as of 2026-07-02. Currently LONG.
We tested 749 setups (indicator × parameters × timeframe) on Consumer Disc. (XLY). Only setups with ≥30 trades qualify (657 did). Setups are ranked by out-of-sample Sharpe — the last ~30% of history, which standard-parameter rules never saw during selection. Because picking the best of 749 tries mines even the holdout, the VALIDATED verdict additionally requires the top setup’s OOS Sharpe to clear a selection hurdle of 1.27 (√(2 ln N)/√T) AND positive alpha in both windows. Of the eligible setups, 3.3% had positive out-of-sample alpha (median OOS Sharpe 0.41) — the table below is truncated, but this summary covers all of them. Full recipe: methodology · the engine’s contract lives in the repo as STRATEGY_METHODOLOGY.md.
Top 20 of 657 eligible setups
Ranked by out-of-sample Sharpe. Full + out-of-sample columns, costs included. Hypothetical.
| # | Setup | TF | Total ret | Sharpe | Max DD | Win | Trades | α vs B&H | OOS Sharpe | OOS α | OOS trades |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Parabolic SAR | Daily | +164.7% | 0.31 | -59.4% | 42.8% | 332 | -6.0% | 1.01 | +4.8% | 95 |
| 2 | T3 (Tillson) | Daily | +170.0% | 0.32 | -55.7% | 41.3% | 520 | -5.9% | 1.01 | +3.9% | 138 |
| 3 | Fractal Adaptive MA | Weekly | +291.6% | 0.48 | -50.8% | 52.1% | 167 | -4.3% | 0.89 | +0.3% | 42 |
| 4 | Elder-Ray | Weekly | +33.9% | 0.24 | -31.4% | 53.4% | 103 | -8.4% | 0.89 | -6.1% | 31 |
| 5 | HalfTrend | Daily | +74.2% | 0.21 | -58.4% | 41.3% | 223 | -7.5% | 0.86 | +1.4% | 60 |
| 6 | Half Trend | Daily | +73.5% | 0.21 | -58.4% | 41.3% | 223 | -7.5% | 0.86 | +1.4% | 60 |
| 7 | Supertrend (7,2) | Daily | +104.0% | 0.25 | -60.7% | 46.3% | 201 | -6.9% | 0.84 | +1.4% | 53 |
| 8 | LSMA 10/30 Cross | Daily | +99.7% | 0.25 | -45.6% | 46.9% | 288 | -7.0% | 0.84 | +1.1% | 79 |
| 9 | Ichimoku (fast) | Daily | +53.2% | 0.18 | -69.2% | 47.7% | 323 | -8.0% | 0.84 | +0.7% | 89 |
| 10 | Hull MA Trend | Daily | +24.6% | 0.13 | -73.4% | 42.3% | 383 | -8.8% | 0.83 | +0.9% | 104 |
| 11 | Chaikin Oscillator | Weekly | +389.4% | 0.47 | -38.7% | 47.4% | 76 | -3.5% | 0.82 | +1.0% | 20 |
| 12 | A/D Oscillator | Weekly | +389.4% | 0.47 | -38.7% | 47.4% | 76 | -3.5% | 0.82 | +1.0% | 20 |
| 13 | Zero-Lag EMA Cross | Daily | +127.1% | 0.28 | -56.1% | 42.9% | 331 | -6.5% | 0.79 | +0.4% | 92 |
| 14 | ZLEMA 10/30 Cross | Daily | +127.1% | 0.28 | -56.1% | 42.9% | 331 | -6.5% | 0.79 | +0.4% | 92 |
| 15 | Hull MA 10/40 Cross | Daily | +72.3% | 0.21 | -54.7% | 43.8% | 313 | -7.6% | 0.79 | +0.2% | 81 |
| 16 | TEMA 10/30 Cross | Daily | +180.6% | 0.33 | -46.9% | 43.5% | 292 | -5.7% | 0.78 | +0.3% | 81 |
| 17 | Zero-Lag LSMA | Weekly | +338.4% | 0.47 | -46.7% | 56.6% | 99 | -3.9% | 0.78 | -0.0% | 27 |
| 18 | Supertrend Fast (10,2) | Daily | +85.2% | 0.22 | -63.0% | 43.6% | 202 | -7.3% | 0.77 | +0.3% | 55 |
| 19 | Supertrend (10,2) | Daily | +85.2% | 0.22 | -63.0% | 43.6% | 202 | -7.3% | 0.77 | +0.3% | 55 |
| 20 | ALMA 10/30 Cross | Daily | +0.3% | 0.07 | -76.1% | 40.6% | 318 | -9.6% | 0.77 | -0.3% | 85 |
Hypothetical backtests with 0.08%/side costs. Not investment advice — see the full disclaimer.
These are historical backtests of mechanical rules. They are educational research, not investment advice, not a recommendation, and not tailored to you. Educational information only — not investment advice. Hypothetical backtested results; past performance does not guarantee future results. Trading involves risk of loss.