Methodology
Suburb Scoring Tools Australia — What Separates a Backtested Methodology from a Guess
Every suburb scoring tool claims to be data-driven. Most of them publish a number, label it a score, and move on. Very few show you the test results — how accurate the score was on suburbs with known outcomes, how many false positives it generated, and what the gap was between genuine booms and missed signals.
Before you trust any tool with a property decision, you should know the answers to three questions: Is the methodology backtested against real outcomes? What were the accuracy results? And where does the underlying data come from? This post answers all three for BoomAU — and explains what you should be asking any other tool you consider using.
The First Question: Is It Backtested?
A suburb score without a backtest is an opinion with extra steps. Backtesting means taking historical suburb data, running the formula on it, and checking whether it correctly identified suburbs that actually boomed — versus suburbs that went nowhere. The formula has to prove itself against real outcomes, not just look logical on paper.
Why does this matter so much? Because many metrics that sound convincing don’t survive contact with real data. Infrastructure spending sounds like a logical boom driver. Population growth sounds like it should move prices. Building approvals sound like a supply signal. We tested all of them. They delivered 55% accuracy — a coin flip. The metrics that passed backtesting are different from the ones experts talk about at property meetups.
What to ask any tool
“What was your accuracy on a held-out sample of suburbs with known outcomes?” If the answer is a methodology PDF with no numbers, or a vague reference to “proprietary data,” that’s your answer.
What Good Accuracy Actually Looks Like
Accuracy alone doesn’t tell the full story. A formula that correctly labels every suburb “not booming” would hit very high accuracy just by being conservative — and would never surface a genuine opportunity. The number that matters alongside accuracy is the false positive rate: how often did the formula flag a suburb as booming when it wasn’t?
There’s also the question of sample size. Testing on 10 suburbs proves nothing. A formula optimised on a small sample will look brilliant on that sample and fall apart everywhere else. BoomAU’s current formula was validated across 78 suburbs — 28 that genuinely boomed and 50 controls that didn’t.
BoomAU v2.3 backtest results — 78 suburbs
The 20.2-point separation gap is the figure we pay most attention to. It measures the average score difference between suburbs that genuinely boomed and those that generated false signals. A tight gap means the formula is barely distinguishing real booms from noise. A 20-point gap means it’s identifying booms with conviction, not just getting lucky on edge cases.
Zero false positives matters for a different reason: it means the Strong Buy label can actually be trusted. If a formula flags dozens of suburbs as booming and half of them aren’t, the label is worthless. The value of a rating system lives and dies on the signal quality at the top tier.
393 suburbs. 5 components. Fortnightly.
BoomAU scores every suburb under $800K that passes the growth filters — using only the signals that survived backtesting. Join the wishlist.
The 5 Components of the Formula
BoomAU’s detection formula has five scored components. Each was chosen because it measures something real about market dynamics — buyer urgency, seller behaviour, rental demand, growth trajectory, or affordability position. The weights reflect how much each component contributed to separation between real booms and false signals in the backtest.
| Component | Weight | What it measures |
|---|---|---|
| Momentum | 0.30 | Whether price growth is accelerating or slowing |
| Growth Strength | 0.25 | Annual growth rate scored directly |
| Tightness | 0.20 | Days on market + vacancy rate |
| Sustainability | 0.15 | Rental yield + vacancy trend direction |
| Headroom | 0.10 | Suburb price vs the capital city median |
Momentum carries the most weight because price growth acceleration is the clearest signal that a boom is actively underway — not just that prices rose last year. A suburb can have strong annual growth but decelerating momentum, which is a very different situation from one where growth is still picking up speed.
Tightness is the combination of days on market and vacancy rate. When properties sell fast and rental vacancies are low, demand is outstripping supply at both ends — buyers and tenants are competing. That combination has consistently marked active boom conditions in the backtest data.
Headroom is weighted lowest in the scored components (0.10), but it plays a bigger role than that number suggests. It’s also the basis of the hard price filter. Every boom in the 78-suburb backtest was led by a suburb priced well below its capital city median. That finding was consistent enough that we made it a structural constraint, not just a scored variable.
Hard Filters: What Gets Excluded Before Scoring
Before a suburb gets scored at all, it must pass four hard filters. These aren’t soft preferences — they’re gates. Failing any one of them means the suburb doesn’t receive a score, regardless of what its other numbers look like.
Hard filters — all four must pass
- 1.Annual growth ≥ 5% — below this threshold, there’s no boom signal to detect
- 2.Days on market ≤ 45 — above 45 days, the market isn’t tight enough to support a boom reading
- 3.Vacancy rate ≤ 2% — higher vacancy suggests demand pressure isn’t sustained
- 4.Median price ≤ $800K — every boom in the backtest was in a suburb below the city median; this filter removes suburbs where the affordability story has already played out
Of the 8,417 suburbs in the national dataset, 393 currently pass all four filters and receive a score. That’s a meaningful filter — it concentrates attention on the suburbs where boom conditions are actually present, rather than applying a score to every suburb in the country and leaving investors to sort through noise.
The budget bands within the 393: 35 suburbs are under $400K, 149 are under $600K, and 204 are under $800K. The filter naturally concentrates on the price ranges where the affordability signal is strongest.
What the Tiers Show You
A suburb score is only useful if the tiers it produces actually correspond to different outcomes. If Strong Buy suburbs don’t outperform Pass suburbs in any meaningful way, the entire scoring exercise is cosmetic. The walk-forward backtest measured this directly across 12,360 postcode-months.
| Tier | Excess return | Beat market | n |
|---|---|---|---|
| Strong Buy | +7.5pp | 71% | 2,103 |
| Buy | +1.3pp | 55% | 3,349 |
| Watch | −0.7pp | 47% | 5,788 |
| Pass | −6.4pp | 28% | 1,120 |
Walk-forward backtest, 12,360 postcode-months. No lookahead. Excess return = suburb 12-month growth minus market median growth. Full methodology →
The result is perfectly monotonic: every tier step down corresponds to a worse outcome. Strong Buy outperforms the market by 7.5 percentage points and beats it 71% of the time. Pass underperforms by 6.4 percentage points with only 28% of suburbs beating the market. That’s a 13.9-point spread between the top and bottom tier — in the same market, the same year.
The monotonic separation is the validation that matters most. A scoring tool that produces Strong Buy and Pass labels is only useful if those labels point to genuinely different investment outcomes. These do.
Strong Buy suburbs outperform by +7.5pp. Pass suburbs underperform by -6.4pp.
BoomAU produces fortnightly Strong Buy / Buy / Watch / Pass labels filtered to your budget band. Join the wishlist.
Where the Data Comes From
A suburb scoring tool is only as reliable as its data sources. The key questions to ask: Are the sources independently reputable? Is the coverage national? And how fresh is the data?
BoomAU draws from four sources, all publicly available. This is deliberate — it means the underlying numbers can be checked against sources you already use.
YIP — Your Investment Property Magazine
Backed by CoreLogic data. Provides annual and quarterly growth figures, days on market, rental yield, median price, and annual sales volume at suburb level. This is the primary source for growth and tightness components.
SQM Research
Free postcode-level vacancy rate charts going back 16 years of monthly history. SQM is the only source with genuine long-run suburb-level vacancy data — which is why it’s irreplaceable for the sustainability component.
Domain
Active for-sale listings and sold rate at suburb level. Used to cross-check market absorption and confirm that listing activity matches the tightness picture from YIP.
htag.com.au
The national suburb sweep that seeds the watchlist. Covers 8,417 suburbs across Australia, allowing BoomAU to start from a complete national dataset rather than a pre-selected shortlist.
One caveat worth knowing: thin-market suburbs — those with fewer than roughly 30 annual sales — produce unreliable days on market figures. When a suburb only transacts a handful of times per year, a single fast sale pulls the median DOM down to 10 days, and a single slow listing pushes it to 150. Below about 15 annual sales, the DOM figure is not meaningful. BoomAU’s filters implicitly screen most of these out because thin markets rarely pass the growth threshold, but it’s worth understanding the limitation when reading any suburb-level data.
The Signal That Matters Most: Affordability Headroom
The most important finding from backtesting wasn’t about the formula itself — it was about which single variable best predicted whether a suburb would outperform its peers after cancelling the broader market tide.
The answer is affordability headroom: how a suburb’s median price compares to the capital city median. Every boom in the 78-suburb backtest was led by suburbs priced well below their city median. Above 1.5 times the city median, suburbs consistently underperformed. Below the city median, they consistently outperformed. The effect holds across states, across market conditions, and across time periods in the dataset.
How to check it yourself
Domain publishes capital city median house prices quarterly. Compare that figure to the suburb’s median (also on Domain or YIP). If the suburb sits below the city median, it has headroom. If it’s above, the backtested signal says outperformance is unlikely — regardless of what else looks good about the suburb.
This is also why boom size is era-dependent in the data. Pre-2015, the median boom in the backtest produced 1.3% excess gains. Post-2020, the median was 16.2%. Affordability headroom acts differently depending on the rate at which the city median is moving. A suburb’s relative position matters more than its absolute price.
The timing dimension comes from boom detection: BoomAU measures how much of the affordability gap has already closed. Suburbs that are early in a detected boom — with less than 30% of the headroom consumed — capture the most remaining upside. Detection typically catches booms 6 to 12 months after they start, which still leaves 60–85% of total gains on the table. You don’t have to be first. You have to be early enough.
How to Evaluate Any Suburb Scoring Tool
The framework is simple. Any suburb scoring tool worth using should be able to answer these questions directly:
1. What was the accuracy on a sample of known-outcome suburbs?
Not a training sample. A held-out validation sample with real-world outcomes. The answer should include a specific number, a sample size, and a false positive rate alongside the accuracy figure.
2. Do the tiers produce different outcomes?
Ask for the excess return and beat-market rate by tier. If Strong Buy and Pass suburbs performed similarly in the backtest, the score doesn’t discriminate. You want a monotonic spread — each tier should have materially different outcomes from the next.
3. Which metrics are in the formula — and which ones failed?
A credible methodology publishes what didn’t work as clearly as what did. Infrastructure, population growth, and building approvals all sounded reasonable and failed at 55% accuracy. A tool that still uses those metrics without acknowledging their limits is telling you something about how seriously it takes backtesting.
4. Where does the data come from?
Named, independently verifiable sources — CoreLogic, SQM Research, Domain — are better than “proprietary data.” You want to be able to cross-check figures yourself. If the tool won’t name its sources, you have no way of knowing whether the underlying numbers are fresh, complete, or accurate.
BoomAU’s full backtest methodology, the 78-suburb validation results, and the walk-forward tier discrimination data are published on the proof page. No sign-up required. The maths is there to check.
The bottom line
A suburb scoring tool is only as useful as its track record on real outcomes. 85.7% accuracy, 0% false positives, a 20.2-point separation gap, and a 13.9-point spread between the top and bottom tier — across 12,360 postcode-months. That’s the bar we hold ourselves to, and it’s the bar worth applying to any tool you use.
Join the Wishlist
We'll email you when BoomAU launches — starting with the budget range you care about.
Be first in line
- ✓Fortnightly Strong / Good / Fair / Weak signal labels per suburb
- ✓Filtered to your budget band
- ✓Built on a backtest of 12,360 postcode-months