Methodology
How Keystone Court Data's research and intelligence reports are built, and what their known limitations are.
1. Data origin
Keystone Court Data sources property-related court filings directly from county court records, which are public information. Filings are collected on a daily schedule in our active coverage area (Indiana, North Carolina, Pennsylvania, Connecticut, New Jersey).
Before any filing enters the dataset that powers these reports, it passes through a verification stage that confirms the current owner-of-record and screens out two kinds of filings: those tied to entity owners (LLCs, trusts, banks, corporations), and those where the named party is not the current owner of the property. The resulting dataset represents the owner-occupied, individual-owned, current-record subset of court filings. It is not the full raw scrape, and percentages in the reports are computed against this verified subset only.
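The screening rule above can be illustrated with a minimal sketch. Keystone's actual verification techniques are proprietary and not described here, so everything in this example is an assumption made for illustration: the `Filing` fields, the owner-type labels, and the simple name comparison are hypothetical stand-ins, not the real pipeline.

```python
from dataclasses import dataclass

# Hypothetical labels for the entity-owner categories named in the text.
ENTITY_OWNER_TYPES = {"llc", "trust", "bank", "corporation"}


@dataclass(frozen=True)
class Filing:
    """Illustrative record; field names are assumptions, not a real schema."""
    county: str
    named_party: str
    owner_of_record: str
    owner_type: str  # "individual" or one of ENTITY_OWNER_TYPES


def is_verified(filing: Filing) -> bool:
    """Keep a filing only if an individual owns the property and the
    named party matches the current owner of record."""
    if filing.owner_type.lower() in ENTITY_OWNER_TYPES:
        return False
    # Assumption: a case-insensitive exact match stands in for real
    # ownership verification, which is certainly more involved.
    return filing.named_party.strip().lower() == filing.owner_of_record.strip().lower()


def verified_subset(filings: list[Filing]) -> list[Filing]:
    """Return the filings that pass the screening rule."""
    return [f for f in filings if is_verified(f)]


def share_of_verified(count: int, verified_total: int) -> float:
    """Percentage computed against the verified subset, not the raw scrape."""
    if verified_total <= 0:
        raise ValueError("verified_total must be positive")
    return 100.0 * count / verified_total
```

The point of the sketch is the denominator: a percentage in a report divides by the size of the verified subset, never by the number of filings originally collected.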
2. Volume thresholds and small-N policy
Reports are published only when sample sizes support meaningful claims:
- County reports require at least 75 verified filings in the dataset for the county.
- State reports require at least 100 verified filings statewide.
- Individual breakouts (e.g. a single monthly bucket, value bucket, or ZIP) with fewer than 5 filings are shown for transparency but not used as the basis for headline claims.
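As a rough sketch, the thresholds above translate into two checks. The numeric values come from the policy as stated; the function and constant names are hypothetical and chosen only for this example.

```python
# Thresholds taken from the policy above; names are illustrative.
COUNTY_MIN_FILINGS = 75
STATE_MIN_FILINGS = 100
SMALL_N_CUTOFF = 5


def can_publish(scope: str, verified_count: int) -> bool:
    """Return True when a county or state report meets its minimum sample size."""
    minimums = {"county": COUNTY_MIN_FILINGS, "state": STATE_MIN_FILINGS}
    if scope not in minimums:
        raise ValueError(f"unknown scope: {scope!r}")
    return verified_count >= minimums[scope]


def headline_eligible(breakouts: dict[str, int]) -> dict[str, bool]:
    """Flag which breakouts (month, value bucket, ZIP) may support headline claims.

    Breakouts below the cutoff are still displayed; they are only
    excluded as the basis for headline claims.
    """
    return {name: count >= SMALL_N_CUTOFF for name, count in breakouts.items()}
```

Note that the small-N rule does not hide anything: a bucket with 4 filings still appears in the report, flagged as ineligible for headline use.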
3. Known limitations
- Time window. Daily scrape coverage began in early 2026 in most counties, so year-over-year comparisons are not yet available. Trend claims use month-over-month comparisons within the available window.
- Coverage depth varies by county. Not every county we cover has the same scrape start date or operational depth, so "Top counties by filing volume" tables reflect both real underlying activity and our coverage depth.
- Property value and equity enrichment is populated for only a subset of filings. Aggregate claims about value use that subset, and the coverage rate is disclosed in each report.
- Case lifecycle tracking (filing-to-served days, settlement rates, etc.) is currently limited to a subset of cases with active docket monitoring. When lifecycle stats appear in a report, the sample they're based on is disclosed.
- Pre-2026 historical data is not available.
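Two of the limitations above come down to simple arithmetic: month-over-month change and the enrichment coverage rate. The sketch below shows one plausible way to compute both. It assumes monthly counts keyed by "YYYY-MM" strings with no missing months; the function names are hypothetical and the reports may calculate these figures differently.

```python
from typing import Optional


def month_over_month(counts: dict[str, int]) -> dict[str, Optional[float]]:
    """Percent change in filings from each month to the next.

    Keys are "YYYY-MM" strings, which sort chronologically. A month whose
    predecessor had zero filings gets None, since the change is undefined.
    """
    months = sorted(counts)
    changes: dict[str, Optional[float]] = {}
    for previous, current in zip(months, months[1:]):
        base = counts[previous]
        if base == 0:
            changes[current] = None
        else:
            changes[current] = 100.0 * (counts[current] - base) / base
    return changes


def coverage_rate(enriched: int, total: int) -> float:
    """Percentage of filings that carry value or equity enrichment."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * enriched / total
```

For example, monthly counts of 100, 120, and 90 give changes of +20% and then -25%. The first month in the window has no entry because there is nothing earlier to compare it with.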
4. What we don't publish
- Individual case data. No case numbers, defendant names, or exact property addresses. Court records are public information, but Keystone aggregates rather than republishes them. Reports show counts, percentages, and distributions only.
- Our business metrics. No subscriber counts, revenue, or conversion rates. Reports cover filing activity in the underlying market, not Keystone's commercial state.
- Internal pipeline details. The specific vendors, scoring models, and verification techniques used to build the dataset are proprietary and not described here. The general principle (court-direct collection, ownership verification, aggregate-only publication) is documented; the implementation is not.
5. Update cadence
Reports are regenerated on the 3rd of each month from the live dataset. Each page carries the date on which it was generated. Research notes carry their publication date and are updated only when the underlying audit is re-run.
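Given this schedule, the generation date a reader should expect on a report page can be worked out from the calendar alone. The helper below is an illustrative sketch with a hypothetical name; it assumes generation always lands exactly on the 3rd, with no delays or reruns.

```python
from datetime import date

GENERATION_DAY = 3  # reports regenerate on the 3rd of each month


def latest_generation_date(today: date) -> date:
    """Return the most recent scheduled generation date on or before `today`."""
    if today.day >= GENERATION_DAY:
        return today.replace(day=GENERATION_DAY)
    # Before the 3rd: the latest run was on the 3rd of the previous month.
    if today.month == 1:
        return date(today.year - 1, 12, GENERATION_DAY)
    return date(today.year, today.month - 1, GENERATION_DAY)
```

A page viewed on June 15 would therefore be expected to show June 3 as its generation date, while a page viewed on July 2 would still show June 3.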
6. Citation and corrections
Each report carries a suggested citation line at the bottom of the page. The general form is:
Keystone Court Data, "<Report Title>," <generation date>, <URL>
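For anyone assembling citations programmatically, the general form can be filled in with a small helper. This is a sketch only: the function name is hypothetical, and the ISO date format is an assumption based on the "Updated" stamp on this page. The citation line printed on each report remains the authoritative version.

```python
from datetime import date


def format_citation(report_title: str, generation_date: date, url: str) -> str:
    """Fill in the general citation form with a report's details."""
    # Assumption: dates are written as YYYY-MM-DD.
    return f'Keystone Court Data, "{report_title}," {generation_date.isoformat()}, {url}'


# Placeholder title and URL, for illustration only.
print(format_citation("Example County Report", date(2026, 6, 3), "https://example.com/report"))
```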
Corrections or questions about a specific number on any report: carson@keystonecourtdata.com.
Updated 2026-06-03