Does anything beat buy & hold on Consumer Staples (XLP)?
Every setup we tested on Consumer Staples (XLP) — ranked out-of-sample, corrected for multiple testing, and forward-tracked in public from the day this page published. The honest answer is the headline.
No setup beat simply holding once tested honestly. We say so plainly.
Its best setup trailed buy-and-hold out-of-sample. Buy-and-hold benchmark: +6.6% CAGR over 27.6 years (+8.4% CAGR in the out-of-sample window).
Educational research from historical backtests — not investment advice. Past performance does not predict future results.
Consumer Staples: Nothing Beat Buy-and-Hold, and That Is the Honest Answer
Broad, diversified instruments like Consumer Staples are where indicator strategies go to disappoint. We ran 746 setups against XLP, and none cleared the bar once scored honestly — on data the strategy never saw. The best of the batch, Williams %R on the weekly timeframe, posted an out-of-sample Sharpe of 1.09, short of the 1.26 hurdle we require before calling anything real. For an index fund this is the expected result: whatever inefficiency exists in single names tends to average away in the basket, leaving buy-and-hold's +6.6% annualized return as the number nothing here managed to beat.
Read these figures with the selection problem in mind. Test 746 indicators, keep the best, and the winner looks impressive by construction — which is exactly why the hurdle exists instead of applause for a lucky draw. Here, only 0.2% of setups outperformed buy-and-hold even in-sample, and the top candidate produced -1.0% annual alpha over 8.3 unseen years, across 33 trades with a 78.8% win rate and a -27.0% drawdown. That pattern reads as noise, not signal. Markets also change, so even a genuine past edge can fade. This page documents what failed — useful to know before assuming something works.
Every figure above is computed from our own backtests — nothing is estimated or invented. Hypothetical results; not investment advice.
The least-bad setups — shown with their failure numbers
Nothing here earned a verdict — these are the best of a losing field, published so you can see exactly how "best" still failed.
Williams %R
Mechanical rule (exactly as backtested): Buy when %R(14) crosses up from below -80, exit above -20. Signals are evaluated at weekly-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 1.09 · alpha -1.0% · 13 trades over 8.3 yrs.
Order-Flow Reversion
Mechanical rule (exactly as backtested): Net-liquidity reversion — fades a 2-sigma price stretch only when signed-volume order-flow shows sellers are exhausted (imbalance z-score), exits on the mean. Needs real volume, so it abstains on feeds that don't report it. Signals are evaluated at daily-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 0.97 · alpha -1.1% · 38 trades over 8.2 yrs.
Detrended Price Osc.
Mechanical rule (exactly as backtested): Long while the Detrended Price Oscillator (20) is positive. Signals are evaluated at weekly-bar close, the position changes on the NEXT bar, 0.08% cost per side, long/flat only — no leverage, no shorting.
Out-of-sample (last ~30% of the window, never used to pick this setup): Sharpe 0.84 · alpha +0.4% · 57 trades over 8.3 yrs.
Since publication — including if it loses
The forward record is just getting started — the gap between the two is the honest score. Marked to market nightly from real prices, rules frozen at publication, as of 2026-06-29. Currently FLAT.
We tested 746 setups (indicator × parameters × timeframe) on Consumer Staples (XLP). Only setups with ≥30 trades qualify (657 did). Setups are ranked by out-of-sample Sharpe — the last ~30% of history, which standard-parameter rules never saw during selection. Because picking the best of 746 tries mines even the holdout, the VALIDATED verdict additionally requires the top setup’s OOS Sharpe to clear a selection hurdle of 1.26 (√(2 ln N)/√T) AND positive alpha in both windows. Of the eligible setups, 0.2% had positive out-of-sample alpha (median OOS Sharpe 0.14) — the table below is truncated, but this summary covers all of them. Full recipe: methodology · the engine’s contract lives in the repo as STRATEGY_METHODOLOGY.md.
Top 20 of 657 eligible setups
Ranked by out-of-sample Sharpe. Full + out-of-sample columns, costs included. Hypothetical.
| # | Setup | TF | Total ret | Sharpe | Max DD | Win | Trades | α vs B&H | OOS Sharpe | OOS α | OOS trades |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Williams %R | Weekly | +132.6% | 0.43 | -27.0% | 78.8% | 33 | -3.5% | 1.09 | -1.0% | 13 |
| 2 | Order-Flow Reversion | Daily | +145.9% | 0.45 | -23.2% | 72.8% | 103 | -3.4% | 0.97 | -1.1% | 38 |
| 3 | Detrended Price Osc. | Weekly | +160.7% | 0.4 | -29.0% | 57.2% | 173 | -3.1% | 0.84 | +0.4% | 57 |
| 4 | Projection Bands | Weekly | +126.9% | 0.43 | -24.7% | 68.4% | 38 | -3.6% | 0.83 | -2.5% | 13 |
| 5 | MACD-V | Daily | +33.4% | 0.16 | -41.2% | 42.3% | 286 | -5.7% | 0.81 | -0.8% | 73 |
| 6 | Zero-Lag MACD | Weekly | +122.8% | 0.38 | -23.0% | 59.2% | 125 | -3.7% | 0.76 | -1.2% | 34 |
| 7 | Zero-Lag MACD | Weekly | +122.8% | 0.38 | -23.0% | 59.2% | 125 | -3.7% | 0.76 | -1.2% | 34 |
| 8 | DEMA 10/30 Cross | Daily | +32.0% | 0.15 | -50.3% | 44.2% | 233 | -5.7% | 0.76 | -1.6% | 61 |
| 9 | Zero-Lag LSMA | Weekly | +134.8% | 0.4 | -29.7% | 52.4% | 105 | -3.5% | 0.75 | -1.7% | 26 |
| 10 | Heikin-Ashi Trend | Weekly | +67.9% | 0.25 | -34.1% | 48.7% | 263 | -4.7% | 0.75 | -1.8% | 77 |
| 11 | Fractal Adaptive MA | Weekly | +55.4% | 0.24 | -36.4% | 53.1% | 175 | -5.0% | 0.74 | -2.4% | 50 |
| 12 | Bollinger Mean-Reversion | Daily | +173.4% | 0.45 | -23.2% | 71.1% | 121 | -3.0% | 0.72 | -1.6% | 41 |
| 13 | Fibonacci Bands | Daily | +173.4% | 0.45 | -23.2% | 71.1% | 121 | -3.0% | 0.72 | -1.6% | 41 |
| 14 | Fisher Center-of-Gravity | Weekly | +27.3% | 0.16 | -33.1% | 58.8% | 153 | -5.7% | 0.71 | -3.7% | 41 |
| 15 | Donchian 10 Break | Daily | +153.0% | 0.4 | -26.0% | 50.8% | 130 | -3.3% | 0.7 | -2.0% | 36 |
| 16 | Price Momentum Oscillator | Daily | +76.9% | 0.27 | -33.6% | 45.9% | 207 | -4.6% | 0.68 | -2.3% | 59 |
| 17 | Demand Index | Weekly | +279.1% | 0.5 | -28.2% | 68.3% | 82 | -1.7% | 0.66 | -1.4% | 23 |
| 18 | Connors RSI | Weekly | +166.0% | 0.47 | -26.7% | 74.2% | 62 | -3.0% | 0.65 | -2.8% | 23 |
| 19 | Stochastic RSI | Weekly | +160.1% | 0.55 | -18.6% | 70.5% | 44 | -3.1% | 0.63 | -4.8% | 14 |
| 20 | QQE | Weekly | +351.6% | 0.51 | -39.1% | 51.2% | 80 | -1.0% | 0.62 | -0.5% | 22 |
Hypothetical backtests with 0.08%/side costs. Not investment advice — see the full disclaimer.
These are historical backtests of mechanical rules. They are educational research, not investment advice, not a recommendation, and not tailored to you. Educational information only — not investment advice. Hypothetical backtested results; past performance does not guarantee future results. Trading involves risk of loss.