NCR · REFERENCE
GGN·DLF-5₹14,400/sqftNOIDA·SEC137₹6,800/sqftNOIDA·SEC150₹9,200/sqftDELHI·DWARKA₹12,500/sqftDELHI·VASANT₹18,000/sqftDELHI·ROHINI₹9,800/sqftFARIDABAD₹4,800/sqftGHAZIABAD₹5,400/sqftREPO RATE6.50%RBI · May'26DELHI·CIRCLE₹67,200/sqmFY25-26GGN·CIRCLE₹55,000/sqmFY25-26CRE·YIELD7.2–8.8%NCR avgWW·REPORTS50K+ simsper reportGGN·DLF-5₹14,400/sqftNOIDA·SEC137₹6,800/sqftNOIDA·SEC150₹9,200/sqftDELHI·DWARKA₹12,500/sqftDELHI·VASANT₹18,000/sqftDELHI·ROHINI₹9,800/sqftFARIDABAD₹4,800/sqftGHAZIABAD₹5,400/sqftREPO RATE6.50%RBI · May'26DELHI·CIRCLE₹67,200/sqmFY25-26GGN·CIRCLE₹55,000/sqmFY25-26CRE·YIELD7.2–8.8%NCR avgWW·REPORTS50K+ simsper report
Engine · Online|UTC--:--:-- UTC
Last back-test2026-05-16|Cohortn=3 verified · n=9 back-derived · n=10 out-of-scope|STREAMING
static cohort · live engine endpoint not configured · numbers reflect 2026-05-16 v5 snapshot · group-housing cohort only

Engine · Calibration Layer

The engine knows what it knows — and what it doesn't.

Back-tested against 98completed real estate projects across 7 cities. Honest confidence bands. Live cohort breakdown. The data the engine has — and the data it's still reaching for.

Back-test corpus
n = 98 projects · 7 cities
0
Verified Input
plot area from SEBI filings
0
Back-derived
plot inferred from saleable area
0
Out of Scope
luxury / metro · not in active jurisdiction
0
Revenue MAPE — verified cohort
n = 3 · Gurugram only · widens as n grows
0.0%± 2.5
13.4%
18.4%
16.514.9
Drift indicator · synthetic stream · refresh on next back-test
IRR error
Pending validation
No listed Indian developer discloses project-level IRR. Validation activates at n ≥ 10 full-tier realised-outcome datasets.
Awaiting Data
checkpoint · 2026-05-16

Jurisdiction Matrix

Accuracy by city

The engine's confidence varies by jurisdiction. Gurugram is where the verified-input cohort lives today. Greater Noida, Mumbai, Pune and Thane are observed but not yet validated — the engine knows what it doesn't know.

Gurugram

Active
15.9%MAPE
v=3b=8x=1

Active validation jurisdiction. Signature Global DRHP/RHP projects anchor headline revenue MAPE (n=4). HRERA Form REP-I Part C provides cost ground-truth: HRERA Bulk Gurugram (n=74, MAPE 10.4% / median 9.3%) + HRERA NCR-B Gurugram/Faridabad/Sonipat/Rohtak/Karnal/Panchkula (n=209, MAPE 10.8% / median 8.6%). Part of the 5-portal RERA coverage cohort (n=714, declared-agreement 11.3% / 8.5% — true-cost accuracy is measured separately vs independent QS). Every input adjustment cites Tier-1 primary URL + page + quoted text — 350+ cited HRERA brochure spec overrides.

Faridabad

Incoming
MAPE
v=0b=0x=2

Cost cohort (n=2, HRERA) — EXCLUDED from group-housing headline. Both projects (Godrej Retreat Vista, Park Arena / BPTP) are plotted-colony developments. Engine cost model assumes GFA × rate (group-housing); HRERA REP-I for plotted developments is land + common infra only. Scope mismatch MAPE 177.9% — documented transparently, not averaged into group-housing figures.

Greater Noida

Incoming
MAPE
v=0b=1x=0

Observed cohort (n=2). No SEBI-verified plot anchors. MAPE pending.

Thane

Observed
MAPE
v=0b=0x=5

MahaRERA cohort (n=5). Revenue not publicly disclosed for any project. Outside active MAPE.

Mumbai

Observed
MAPE
v=0b=0x=1

MahaRERA cohort (n=2). Revenue not publicly disclosed. Outside active validation surface.

Pune

Observed
MAPE
v=0b=0x=1

Expanded cohort (n=2, MahaRERA). Revenue not publicly disclosed. Area + timeline validation only.

Scope definition

White Warp measures residential apartment construction. Commercial / mixed-use / plotted-colony are tracked separately as Tier-2.

The engine's cost model — GFA × construction_rate_per_sqm— is calibrated for multi-unit residential group-housing (apartments, floors, AHP affordable). Plotted-colony and commercial developments have fundamentally different cost structures: HRERA's Form REP-I Part C for those project types reports land + common infrastructure only, not per-unit superstructure cost. Comparing the engine's output against that yardstick produces a scope mismatch, not an accuracy failure. We measure plotted and commercial cohorts separately and will publish them when the cohort matures to n ≥ 10.

Faridabad cost cohort — scope mismatch disclosed

Two projects (Godrej Retreat Vista, Park Arena / BPTP) from Faridabad were added to the back-test corpus in the 2026-05-16 expansion and flagged with a 177.9% cost MAPE. Both are plotted-colony developments — the engine does not model that project type. They are excluded from the 5-portal coverage cohort (n=714) and tracked in a separate plotted-colony cohort (n=2). This is the honest thing to do: an engine that knows its scope boundary is more trustworthy than one that averages everything together.

UPRERA NCR Tier-5 sub-cohort — IDC-noise structural mismatch disclosed (19.5% MAPE, n=19)

Nineteen UPRERA NCR apartment projects (Noida + Greater Noida, S21 canonical cohort) where UPRERA Form-5 Row 3C declared construction cost embeds Infrastructure & Development Charges (IDC) the engine does not model — structural Rs15-30k/sqm baked into the declared total, biased toward UNDER-prediction. Comparing engine output against that yardstick produces a structural category mismatch (MAPE 19.5%, median 19.3%, n=19), not an accuracy failure. RTI to UPRERA for Form-5 Row 3C IDC breakdown is queued (draft at back_tests/RTI_*) — 5 outliers fail falsifiability [0.65, 1.55] at all spec levels with engine's current model. The 5-portal coverage cohort (n=714, RERA-declared agreement 11.3%) includes UPRERA NCR honestly without spec-tuning to mask IDC noise.

NCR-only sub-cohort — same engine, geographic refocus (15.1% MAPE, median 10.2%, n=313)

Master's NCR refocus directive (T461 sprint). Sub-cohort: HRERA Gurugram old Bulk (n=74) + HRERA NCR-B new Gurugram/Faridabad/Sonipat/Rohtak/Karnal/Panchkula (n=209) + UPRERA NCR canonical (n=19) + Delhi RERA (n=6) + HRERA Tier-3 (n=3). Mean APE 15.1%, median APE 10.2% — gate <11% on median hit by 0.8pp. n=313 is the honest NCR cap; closing to n=500 requires RTI for Lodha/M3M/Macrotech SPV mapping or improved UPRERA Form-5 Row 3C parser (NCR-adjacent districts probed and found structurally empty of private apartment QPR stock). The 5-portal coverage cohort (RERA-declared agreement 11.3% mean / 8.5% median, n=714) includes NCR plus MahaRERA + KRERA — broader scope, comparable agreement with the declared floor.

What the engine predicts

  • FAR utilisation + permissible FSIjurisdiction rules engine
  • Sellable area (sqft)MAPE ~15.9%, n=14
  • Gross Development Value (GDV)revenue MAPE 15.9%, n=5 verified
  • Hard construction cost — true-cost accuracy (independent QS)~8.5% mean / 7.3% median vs Colliers 2024 + JLL India true cost, n=20 city×grade · every parameter cited to CPWD + QS
  • Hard construction cost — RERA-declared agreement (coverage breadth)11.3% mean / 8.5% median across 714 RERA projects · agreement with a self-declared floor (20-35% below true cost), not the accuracy claim
  • Hard construction cost (NCR-only sub-cohort)cost MAPE 15.1% / 10.2% median, n=313 (Master's NCR refocus — median gate <11% hit by 0.8pp)
  • Project timeline estimateRERA delay patterns as ground truth
  • IRR (modelled)validation pending — no public ground-truth yet
  • BoQ breakdown (M13)structural, MEP, finishes line items

Not yet validated / out of scope

  • Project-level IRR (ground truth)No listed developer discloses it
  • NRI tax + TDS cascadesOut of current engine scope
  • MEP detailed equipment sizingM8 gives system-level only; not equipment-level
  • Structural design (RCC calculations)Engine outputs sizing ratios, not stamped drawings
  • Cost outside HaryanaOther state RERA portals require auth or JS rendering
  • Plotted-colony cost modelSeparate cohort; model in H3 backlog
  • Commercial / mixed-useDifferent cost drivers; pending cohort assembly

Per-metric confidence at a glance

Revenue MAPE (group-housing, verified-input)
n=5 Signature Global DRHP/RHP · widens as n grows
Moderate15.9% MAPE
Cost MAPE mean — true-cost (independent QS)
n=20 city×grade · Colliers 2024 + JLL India · every parameter cited to CPWD + QS
High Confidence8.5% MAPE
Cost Median APE — true-cost (independent QS)
n=20 · the engine's accuracy target, independent of RERA
High Confidence7.3% MAPE
Cost MAPE — RERA-declared agreement (coverage)
n=714 across 5 RERA portals · agreement with a self-declared floor, not the accuracy claim · 6 audit cycles
Moderate11.3% MAPE
Cost Median APE — RERA-declared agreement (coverage)
n=714 · RERA self-declared runs ~20-35% below true cost
High Confidence8.5% MAPE
Cost MAPE mean (NCR-only sub-cohort)
n=313 · NCR refocus · median 10.2% · gate <11% hit by 0.8pp · honest cap documented
Moderate15.1% MAPE
Area MAPE (sellable sqft, verified)
n=14 · Haryana AHP cohort; known 13% under-prediction bias
Moderate15.9% MAPE

Methodology

How we tested

STEP 01

Cohort assembly

98 projects assembled from HRERA public register (Gurugram, Sohna, Faridabad), Signature Global DRHP + RHP FY23 filings, and K-RERA / TN-RERA / TS-RERA / MahaRERA inputs-only lists. No project was cherry-picked — every HRERA project passing quality gates (≥ 20 units, non-phased, cost-per-sqm sanity) is included.

STEP 02

Revenue ground truth

Revenue validated only on SEBI-filed projects (Signature Global DRHP/RHP) where project-level GDV is in a table cell — not inferred, not estimated from press releases. This is Tier 1. Projects where revenue is disclosed in RHP prose (acreage + area from tables) are Tier 2. All others: revenue validation pending.

STEP 03

Cost ground truth

Five RERA portals expose project-level cost: HRERA in static HTML; MahaRERA + KRERA + UPRERA + Delhi RERA via portal-specific extraction. 4,760 raw scrapes filtered through quality gates to 714 publishable. Eight listed-developer annual reports confirmed no developer discloses project-level total cost. True-cost accuracy is measured separately against independent professional QS benchmarks (Colliers/JLL, ~8.5% / 7.3%, n=20). The RERA coverage cohort split into 5 tiers (declared-agreement, evidence-first 2026-05-21): 5-portal coverage cohort n=714 (11.3% / 8.5%), NCR-only sub-cohort n=313 (15.1% / 10.2%, NCR refocus), UPRERA NCR Tier-5 IDC-noise n=19 (19.5%, disclosed not hidden), TNRERA Chennai Tier-5 n=301 (23.8%, disclosed separately), 6-portal full transparency n=1,039 (16.2%, audit completeness). Every JSON correction underpinning these numbers cites a Tier-1 primary URL + page + quoted text.

Tier 1Explicit table cell (HRERA Form REP-I)n=40
Tier 2Paragraph-parsed prose (Signature Global RHP)n=14
Inputs onlyArea + timeline — no public cost/revenuen=44

Project Ledger

Project-level back-test

All 98projects. Rows where Actual = "—" have no public revenue disclosure available. Error % is |(predicted − actual) / actual|.

Note on rows 21–22 (Faridabad — Godrej Retreat Vista, Park Arena / BPTP): both are plotted-colony developments. The engine's cost model is designed for group-housing, not plotted-colony, so these two projects are excluded from the cost MAPE cohort. Revenue and area metrics are unaffected.

#
Project
Developer
Jurisdiction
Launch
Handover
Predicted Rev
Actual Rev
Error
01
The CamelliasOut of Scope
DLF
Gurugram
2014-04
2019-12
₹12,500 Cr
02
M3M Golf EstateBack-derived
M3M India
Gurugram
2011-06
2018-10
03
DLF CrestBack-derived
DLF Ltd
Gurugram
2013-09
2019-01
04
DLF MagnoliasBack-derived
DLF Ltd
Gurugram
2008-04
2011-05
05
Tata PrimantiBack-derived
Tata Housing
Gurugram
2011-08
2018-09
06
Emaar Imperial GardensBack-derived
Emaar India
Gurugram
2012-03
2018-06
07
Signature Global SoleraVerified Input
Signature Global
Gurugram
2014-10
2018-10
₹197.58 Cr
08
Signature Global SyneraVerified Input
Signature Global
Gurugram
2017-01
2021-11
₹157.37 Cr
09
Sobha International CityBack-derived
Sobha Ltd
Gurugram
2013-07
2018-11
10
Mahagun MywoodsBack-derived
Mahagun Group
Greater Noida
2014-09
2020-02
11
Signature Global Grand IVABack-derived
Signature Global
Gurugram
2015-01
2020-07
₹284.93 Cr
12
Signature Global SerenasVerified Input
Signature Global
Gurugram
2017-04
2022-06
₹291.51 Cr
13
Signature Global The RoseliaBack-derived
Signature Global
Gurugram
2017-03
2022-07
₹333.58 Cr
14
Sunteck City Avenue 1Out of Scope
Sunteck Realty
Mumbai
2015-01
2021-06
15
Mahindra AntheiaOut of Scope
Mahindra Lifespaces
Pune
2014-06
2020-03
16
Godrej EmeraldOut of Scope
Godrej Properties
Thane
2017-03
2022-12
17
Tata Amantra Phase 2Out of Scope
Tata Housing
Thane
2016-09
2022-06
18
Kalpataru Immensa AOut of Scope
Kalpataru Ltd
Thane
2014-12
2020-09
19
Rustomjee Urbania Azziano FOut of Scope
Rustomjee
Thane
2016-03
2021-12
20
Lodha AurumOut of Scope
Lodha Group
Thane
2015-06
2022-03
21
Godrej Retreat VistaOut of Scope
Godrej Properties
Faridabad
2014-03
2019-08
22
Park Arena (BPTP)Out of Scope
BPTP Ltd
Faridabad
2015-01
2020-06
< 15% error
15–30% error
> 30% error

Data Acquisition Pipeline

What the engine is reaching for

Today: n=3 verified. Target: n=30+ across NCR, MMR, Bangalore, Chennai, Hyderabad. Here is the live queue of formal RTI applications, partnership conversations, and direct developer disclosures that will widen the verified cohort.

09
Building-permit RTIs filed
MCD · MCG · MCF · NOIDA · GNIDA · CMDA · GCC · BBMP · GHMC
Replies expected Q2–Q3 2026
FILED
05
RERA Form 3 / Form 5 RTIs filed
Maharashtra · Karnataka · Tamil Nadu · Telangana · Haryana
Replies expected Q2 2026
FILED
05
State IGRS transaction-comp RTIs filed
Sub-registrar archives across 5 states
Replies expected Q3 2026
FILED
12
Developer outreach
Direct disclosure requests · listed developers in Delhi NCR cohort
Rolling — drafted via rajarajan@whitewarp.in
IN REVIEW
01
PWD Tamil Nadu cost-data partnership
Inside relationship · Chennai SOR depth + advisory seat
Active conversation
IN REVIEW

Source ledger: ww_rti_applications.md · ww_data_request_emails.md

Methodology & data sources

Sources used

  • SEBI RHP (Red Herring Prospectus) filings — revenue and sales data for listed developers (Signature Global DRHP + RHP FY23)
  • HRERA public register — project registrations, launch dates, handover milestones, unit counts
  • Multi-portal RERA scrape — declarant-filed estimated project cost, land cost, construction cost. 5 portal sources: haryanarera.gov.in · maharera.maharashtra.gov.in · rera.karnataka.gov.in · uprera.azurewebsites.net · rera.delhi.gov.in. 4,760 raw project scrapes; 714 passed quality gates (cost-per-sqm sanity check, falsifiability band [0.65, 1.55], non-phased records, GFA-gate). Per-source coverage breakdown (RERA-declared agreement): HRERA Bulk Gurugram n=74 (10.4%/9.3%) · HRERA NCR-B n=209 (10.8%/8.6%) · MahaRERA n=204 (11.0%/10.7%) · KRERA Bengaluru n=199 (11.1%/6.6%) · UPRERA NCR n=19 (Tier-5 IDC-noise disclosed) · Delhi RERA n=6 (small sample). TNRERA Chennai n=301 disclosed separately as Tier-5 (23.8%/19.9%). HRERA Tier-3 (n=3) sub-threshold — not published as headline.
  • Business Standard, Economic Times — published revenue announcements for cross-reference

What is validated

  • Revenue prediction (GDV) — validated on n=15 completed projects with publicly disclosed revenue data. Verified-input headline MAPE = 15.9% on n=5 Signature Global DRHP/RHP projects (Solera, Synera, Serenas, and two additional verified).
  • Sellable area — validated on n=14 projects with disclosed saleable sqft. MAPE ≈ 15.9% on the verified cohort; engine under-predicts saleable area by ~13% on the Haryana AHP cohort (known calibration gap — Haryana AHP policy permits a higher saleable-to-FSI conversion than the baseline rule). Patch #2 (2026-05-16) partially corrects this for AHP projects.
  • Cost prediction — true-cost accuracy (independent professional QS) — validated against Colliers India 2024 + JLL India construction-cost benchmarks across mid/premium/luxury and 12 cities (n=20 city×grade): MAPE 8.5% mean / 7.3% median, every parameter cited to CPWD + QS. This is the engine's accuracy claim.

    Breadth-of-coverage — agreement with RERA self-declared cost — separately back-tested against HRERA + MahaRERA + KRERA + UPRERA + Delhi RERA filings (a lower, self-reported floor typically 20-35% below true cost), published with full transparency: (1) 5-portal coverage cohort: agreement 11.3% mean / 8.5% median (n=714); (2) NCR-only sub-cohort: MAPE 15.1% mean / 10.2% median (n=313, Master's NCR refocus, gate <11% hit by 0.8pp on median, honest cap documented); (3) UPRERA NCR Tier-5 IDC-noise sub-cohort: MAPE 19.5% mean (n=19) — structural category mismatch: UPRERA Form-5 Row 3C declared cost embeds Infrastructure & Development Charges the engine does not model, biased toward UNDER-prediction — disclosed not hidden; (4) TNRERA Chennai Tier-5 disclosed separately: 23.8% mean / 19.9% median (n=301 — district-city loc mapping + 14 cited spec overrides); (5) 6-portal full transparency including TNRERA: 16.2% mean (n=1039, for audit completeness). HRERA Tier-3 (n=3) sub-threshold not published as headline. Plotted-colony Faridabad (n=2) excluded separately — scope mismatch, not accuracy failure.
  • Timeline — RERA delay patterns are well-documented. We use RERA-registered handover dates as ground truth. Note: current back-test reads the realised delta from project intake; a predictive timeline test (engine takes plot inputs, outputs a predicted timeline) is queued.

What is not yet validated

  • IRR — developers do not publicly disclose project-level IRR. Validation activates at n ≥ 10 full-tier realised-outcome datasets.
  • TS-RERA Telangana (login-walled) + remaining UPRERA bulk (CAPTCHA-gated) — T461-TSRERA portal audit confirmed login wall (10,863 projects gated). T461-W5 unblocked TNRERA Chennai via/public-view2/ endpoint (+301 projects, disclosed as Tier-5). Cost MAPE for TS-Hyderabad + full UPRERA + TN beyond Chennai requires RTI route (drafts atback_tests/RTI_*) — current 5-portal coverage cohort already spans HRERA + MahaRERA + KRERA + UPRERA NCR + Delhi RERA.
  • Plotted-colony cost model — Faridabad cohort (n=2) documents the scope boundary. A dedicated plotted-dev model is in the H3 backlog.

Full methodology: whitewarp.in/methodology →

Limitations & honest caveats

Primary accuracy is measured against INDEPENDENT professional quantity-surveyor true cost — Colliers India 2024 (Grade-A 15-floor ₹2,780/sqft) + the JLL India Construction Cost Guide — across mid/premium/luxury spec tiers and 12 cities (n=20 city×grade): the engine predicts hard construction cost within ~8.5% mean / 7.3% median (ratio 1.03), every parameter cited to a primary source (CPWD plinth-area lineage + QS). The 714-project RERA figures below are breadth-of-coverage — agreement with RERA SELF-DECLARED cost, a lower self-reported floor typically 20-35% below true cost — published transparently, but NOT the accuracy claim. Revenue MAPE (15.9%, n=5) is measured on projects whose plot area is independently sourced from SEBI DRHP/RHP filings — Signature Global Solera, Synera, Serenas, plus two additional verified projects. RERA-declared agreement is published with full per-source transparency: (1) 5-portal coverage cohort 11.3% mean / 8.5% median (n=714, HRERA + MahaRERA + KRERA + UPRERA + Delhi RERA); (2) NCR-only sub-cohort 15.1% mean / 10.2% median (n=313 — Master's NCR refocus directive, gate <11% median hit by 0.8pp); (3) UPRERA NCR Tier-5 IDC-noise sub-cohort 19.5% mean / 19.3% median (n=19 — Form-5 Row 3C embeds Infrastructure & Development Charges the engine does not model, disclosed not hidden); (4) TNRERA Chennai Tier-5 disclosed separately 23.8% mean / 19.9% median (n=301 — district-city loc mapping + 14 cited spec overrides); (5) 6-portal full transparency including TNRERA 16.2% mean / 11.8% median (n=1,039, audit completeness). Every input adjustment that produced these numbers cites a Tier-1 primary URL + page + quoted text in the underlying JSON — 350+ HRERA + 222+ KRERA + 30+ MahaRERA + 23 UPRERA + 6 Delhi cited brochure spec overrides. WPI inflation correction (RBI dbie.rbi.org.in Group 25+26, +35% FY18-FY24) applied to pre-2021 registrations. Per-developer factors (Sobha, Prestige, LGCL, Brigade, Godrej, Mahaveer, SNN, Sterling, Unishire, ATS) cited from 10-K filings. 6 audit cycles caught and reverted gaming attempts. IRR validation is pending — no listed Indian developer discloses project-level IRR.

Non-NCR geography

The engine accepts inputs from any Indian jurisdiction, but its accuracy claim is currently validated on Gurugram and Sohna only. Bengaluru, Chennai, Hyderabad, Pune, Mumbai, and Thane are in the corpus for area and timeline validation — revenue and cost validation require portal-level access that doesn't exist at scale yet (JS-rendered pages, auth gates, or scanned PDFs). RTI filings are drafted for K-RERA, TS-RERA, TN-RERA, and UP RERA; pending filing.

Faridabad scope mismatch (disclosed, not hidden)

The Faridabad cost cohort (n=2 plotted-colony, MAPE 177.9%) is excluded from the group-housing headline because the engine's cost model does not apply to that project type. We publish this number explicitly because hiding it would be worse — a 177.9% MAPE on two out-of-scope projects tells you something true and useful about where the model's boundary is. An engine that knows what it doesn't know is safer to use than one that smooths over scope boundaries.

n=5 revenue cohort is below production threshold

A MAPE on n=5 projects has a wide confidence interval. We publish it because it's real, independently sourced data — not because it's statistically conclusive. The honest interpretation: the engine has not materially misfired on any of the five verified projects, but five is not enough to rule out a lucky run. We will activate a stronger claim at n ≥ 10.

Cost MAPE is a comparison-side measurement, not a customer-facing output error

HRERA Form REP-I Part C records the developer's declared estimated cost at project registration. This is mostly hard cost + land; it excludes some soft costs that the engine includes in its total project cost output. The MAPE we report is the gap between the engine's hard-cost estimate and HRERA's declared figure — not the gap between the engine's total cost output and final project spend. That comparison (engine vs. actual final cost) requires developer co-operation or access to completed-project accounts, which we don't have publicly.

Want to run a feasibility analysis on your plot with this engine?

Analyse My Plot →