Introducing the WebPKI Observatory


For as long as I have been in this industry, the WebPKI compliance conversation has run on impressions. People with long memories and regular conference attendance have built up a picture of which CAs are well-run, which are struggling, and where the oversight gaps are. That picture has generally been accurate. It has also been almost entirely unmeasured.

The WebPKI Observatory at webpki.systematicreasoning.com, a project from Systematic Reasoning, is an attempt to change that. It’s a public dashboard covering 1,690 compliance incidents drawn from Mozilla Bugzilla between 2014 and 2025, cross-referenced with CCADB membership data, certificate issuance volumes from CT logs, root program trust store compositions, and the complete history of CA distrust events. The goal was simple: replace the shared intuition with actual data, and see what the data shows that intuition missed.

Some of it confirmed what most people in this space already suspected. Some of it was genuinely surprising.

The finding that reframes everything else is detection. When a compliance incident occurs, who finds it? Root programs find 52% of incidents. Automated external tools — CT log monitors, certificate linters, community scanning infrastructure — find 14%. CAs find their own problems in 9% of cases.

That number deserves more attention than it typically gets. One in eleven. CAs have full access to their own issuance systems, their own audits, their own CPSs, their own disclosure obligations, and they are the least effective detection mechanism in the ecosystem. Root programs alone, with no privileged access to CA systems, find nearly six times as many incidents as the CAs themselves (52% against 9%); add automated external tools and the factor is more than seven. The compliance monitoring function has been effectively outsourced to external parties by default, and mostly without anyone deciding that was the right architecture.

Everything else in the data follows from that.

The failure classes that have grown are instructive. Technical misissuance has declined as a share of incidents over the past decade. What has grown is the process layer. In 2019, governance failures represented 21% of all incidents. By 2025 that figure was 60%. Policy violations, CPS failures, disclosure deadline misses. These are by definition things internal compliance programs should be catching. The 260 incidents tagged policy-failure or disclosure-failure in the dataset are a direct indictment of internal compliance operations. A CA that violates its own documented policy is not being surprised by an external attacker.

The oversight picture is also worth examining. In 2017, Mozilla engaged with 79% of Bugzilla compliance bugs. Chrome had no formal root program yet and was near zero. By 2025 the picture had reversed and degraded simultaneously. Chrome now contributes the dominant share of oversight engagement but covers only 18% of incidents. Mozilla covers 8%. The total corpus has roughly doubled since 2017 while combined meaningful oversight coverage has fallen by two-thirds. The Chrome Root Program launched in 2021, and its effect on the governance landscape is visible in the data — Chrome has made 239 substantive oversight comments in recent years versus Mozilla’s 158 over the same period. The center of gravity in CA compliance governance has shifted to the browser with 78% market share. That is structurally significant. Microsoft, which operates the largest trust store by root count at 346 trusted roots, has made zero recorded governance comments across all 1,690 incidents spanning 11 years.

The distrust history is also clarifying. The common mental model is that CAs get removed for catastrophic technical failures. The data does not support that model. 14 of 16 distrust events involve compliance operations failures. The behavioral taxonomy matters: negligent noncompliance, willful circumvention, demonstrated incompetence, and argumentative noncompliance. In 10 of the 16 cases, the distrust event was preceded by a documented pattern of prior incidents. The median runway from the first incident to distrust is 3.2 years. The failures were not hidden. They were in Bugzilla the whole time. The CA just was not resolving them systematically.

That means distrust is largely predictable given sufficient data. The indicators show up well before the outcome. That is a sobering observation about past oversight and a useful one for anyone thinking about what the compliance monitoring function should actually do.

The Observatory is a measurement tool, not a verdict. The dataset has limits — Bugzilla under-represents incidents that never reach public disclosure, CT-derived issuance volumes reflect only unexpired certificates at the time of measurement, and the behavioral taxonomy applied to distrust events involves judgment calls. But the patterns are robust enough to be useful.

For CA operators, the detection data alone should prompt hard questions about internal monitoring coverage. For root programs, the oversight gap data quantifies a scaling problem that is currently being absorbed by Chrome without anyone having explicitly decided that is the right architecture. For the policy community, the shift from technical to governance failures as the dominant incident class has direct implications for what audit frameworks should actually measure.

The dashboard is live at webpki.systematicreasoning.com, updated daily. The methodology is documented. Pull requests are welcome.

Signed, Auditable, Offline-Tolerant, PQ Secure QR Codes



A few months ago I wrote about what it would take to make a QR code verifiable in a post-quantum world. That post was mostly conceptual. In this post I want to explore what it would look like to build one that is genuinely verifiable: not just signed, but auditable, offline-tolerant, and ready for a post-quantum world. A conversation with Bruno Couillard last week nudged me to put down the thoughts I had been carrying about exactly that.

The design draws heavily on the draft for Merkle Tree Certificates, which is working through the IETF right now. MTC is aimed at TLS, but the core insight is that you can replace per-certificate signatures with compact Merkle inclusion proofs against a periodically updated signed root, and that insight translates directly to QR codes once you think carefully about the offline constraint. If you haven’t read it, the draft is at datatracker.ietf.org/doc/draft-davidben-tls-merkle-tree-certs.

The result of applying that idea to the QR problem is MTA-QR, a working implementation of what I’ve been calling Merkle Tree Assertions for QR codes. The demo is live at mta-qr.peculiarventures.com, and the full source is at github.com/PeculiarVentures/mta-qr-demo. There are Go and TypeScript implementations, a browser-only demo that generates and verifies without any backend, and an interoperability test matrix that exercises all three signing algorithms against both runtimes in every combination.

To be clear, this isn’t a production-ready library, but building it helped me identify things I had missed while whiteboarding it in my head.

The size problem is real but solvable

The original post flagged signature size as the central constraint. An ML-DSA-44 signature is 2,420 bytes. A Version 40 QR code, the largest the standard defines, holds at most 2,331 bytes at medium ECC and 1,273 bytes at the highest ECC level. Those numbers don’t fit in the same sentence without a solution.

The solution is separating what goes in the QR from what you need to verify it. The QR carries the assertion content, a Merkle inclusion proof, and coordinates pointing to a signed checkpoint. The checkpoint itself contains the issuer signature, lives outside the QR, and gets cached on the verifier’s device, typically during a charge cycle before the device ever sees a QR code. Once cached, verification is fully offline.

The proof is the interesting part. A two-level tiled Merkle tree, with an inner batch tree and an outer parent tree, caps the total proof at eight hashes regardless of how large the log grows. Eight hashes is 256 bytes. That’s the ceiling, forever. The QR version stays fixed. The code never gets denser as the issuer accumulates millions of entries.
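To make the proof arithmetic concrete, here is a minimal Python sketch of building and checking a Merkle inclusion proof. It is a simplification under stated assumptions, not the MTA-QR format: it uses a single power-of-two tree rather than the two-level tiled structure, SHA-256 with RFC 6962-style domain-separation prefixes, and function names that are purely illustrative.

```python
import hashlib

def leaf_hash(data: bytes) -> bytes:
    # RFC 6962-style prefix 0x00 for leaves (an assumption for this sketch).
    return hashlib.sha256(b"\x00" + data).digest()

def node_hash(left: bytes, right: bytes) -> bytes:
    # RFC 6962-style prefix 0x01 for interior nodes.
    return hashlib.sha256(b"\x01" + left + right).digest()

def build_levels(entries):
    # Returns every level of the tree, leaves first, root last.
    # Assumes len(entries) is a power of two to keep the sketch short.
    levels = [[leaf_hash(e) for e in entries]]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append([node_hash(prev[i], prev[i + 1])
                       for i in range(0, len(prev), 2)])
    return levels

def inclusion_proof(levels, index):
    # Collect the sibling hash at each level on the path to the root.
    proof = []
    for level in levels[:-1]:
        proof.append(level[index ^ 1])
        index >>= 1
    return proof

def verify_inclusion(entry, index, proof, root):
    # Recompute the root from the entry and its sibling path.
    h = leaf_hash(entry)
    for sibling in proof:
        h = node_hash(sibling, h) if index & 1 else node_hash(h, sibling)
        index >>= 1
    return h == root

entries = [f"assertion-{i}".encode() for i in range(256)]
levels = build_levels(entries)
root = levels[-1][0]
proof = inclusion_proof(levels, 37)
proof_bytes = sum(len(h) for h in proof)
ok = verify_inclusion(entries[37], 37, proof, root)
tampered = verify_inclusion(b"forged", 37, proof, root)
```

With 256 leaves the path is eight hashes of 32 bytes each, which is where the 256-byte figure comes from. In this flat sketch the proof would grow with the tree; the tiled design described above is what keeps it capped.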

In practice, a Mode 1 QR carrying bearer claims and a Merkle inclusion proof fits comfortably within a Version 10 to 15 code at medium ECC, well under 500 bytes total. ML-DSA-44 doesn’t appear in the QR at all. The issuer signature lives in the checkpoint that the verifier fetched during its last charge cycle.

ML-DSA-44 won’t fit in a single QR in Mode 0, the fully embedded mode where the signature is in the QR itself. Mode 0 is the bootstrap mode: it works on air-gapped verifiers, on paper QR codes printed before any checkpoint infrastructure exists, and for scenarios where prefetch is operationally impractical. It’s not a niche failure case; it’s the starting condition for any new deployment. Mode 0 with PQC will require waiting for NIST to finalize smaller-signature algorithms, or accepting larger QR codes. Mode 1 is the practical path to PQC today.

Offline tolerance is mostly a framing problem

There’s a habit of treating offline verification as binary: either the device has connectivity at scan time, or it doesn’t. That framing creates a false constraint.

Every verifier with a battery has a window where it is stationary, connected, and idle. That’s when it charges. Fetching a checkpoint during a charge cycle is trivially cheap compared to everything else happening during that window. The relevant question isn’t whether the device has connectivity at scan time. It’s whether the assertion being scanned was issued before the verifier’s last checkpoint fetch.

For the common case, the answer is yes. A concert ticket issued last week, a prescription filled this morning, a badge issued at enrollment, all of these predate the verifier’s cached checkpoint by hours or days. Verification is fully offline because the relevant checkpoint was already there.

The narrow failure case is an assertion issued and scanned within the same charge cycle, before any checkpoint fetch. That falls back to a single cache-miss network call, which then covers every subsequent scan of the same batch. One round trip, then fully offline for the rest of the operational period.
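The prefetch-then-fallback behavior can be sketched as a small cache. This is an illustration only: `Checkpoint`, `CheckpointCache`, and the idea that a checkpoint covers every entry below its `tree_size` are assumptions made for the example, and signature and proof checks are omitted.

```python
from dataclasses import dataclass

@dataclass
class Checkpoint:
    tree_size: int  # number of log entries this checkpoint covers (assumed)
    root: bytes     # Merkle root; signature verification is omitted here

class CheckpointCache:
    def __init__(self, fetch):
        self._fetch = fetch      # callable that performs the network fetch
        self._cached = None
        self.network_calls = 0

    def refresh(self):
        # Called during a charge cycle, or on a cache miss at scan time.
        self._cached = self._fetch()
        self.network_calls += 1

    def checkpoint_for(self, entry_index):
        # Offline path: the cached checkpoint already covers the entry.
        if self._cached is None or entry_index >= self._cached.tree_size:
            self.refresh()       # single cache-miss round trip
        if entry_index >= self._cached.tree_size:
            raise LookupError("entry not covered by any checkpoint yet")
        return self._cached

log_size = [10]                                  # toy stand-in for the issuer's log
cache = CheckpointCache(lambda: Checkpoint(log_size[0], b""))
cache.refresh()                                  # prefetch while charging
cache.checkpoint_for(5)                          # issued earlier: fully offline
log_size[0] = 20                                 # issuer keeps issuing
cache.checkpoint_for(15)                         # cache miss: one fetch
cache.checkpoint_for(17)                         # same batch: offline again
```

After this sequence `cache.network_calls` is 2: one prefetch and one cache miss, with every other scan served from the cached checkpoint.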

Witnessing is where the transparency guarantee actually lives

The issuer’s signature proves the assertion came from a specific key. That’s useful, but it doesn’t prevent a compromised issuer from presenting different views of the log to different verifiers. Split-view attacks are subtle and hard to detect after the fact.

Witnesses solve this. A witness cosigns a checkpoint only after verifying it extends the previous one they saw, establishing a consistency guarantee across the full history of the log. Once multiple independent witnesses have cosigned a checkpoint, the issuer cannot retroactively rewrite or fork the log without those witnesses catching it.

The witness protocol comes from c2sp.org/tlog-cosignature, the same infrastructure underpinning the transparency.dev witness network. I worked on that witness network during my time at Google, so it was never far from my mind when designing this. Connecting MTA-QR to it means the issuance of every assertion can be monitored by parties with no relationship to the issuer. That’s the difference between a signed QR and an auditable one.

The implementation uses Ed25519 for witness cosignatures regardless of what algorithm the issuer uses for checkpoints. That’s not a design choice I made; it’s what the spec requires. It means an issuer can use ML-DSA-44 for the checkpoint signature while the witness infrastructure stays on stable, widely deployed Ed25519 keys. The two concerns are separated cleanly, and that separation matters. The quantum threat to the issuer signature and the operational threat to the witness network are different problems on different timelines.

What I had wrong in the original post

The earlier post mentioned UOV and SQISign as especially promising for QR codes because of their smaller signature sizes. That framing isn’t wrong exactly; smaller signatures do help with the size constraint, and both algorithms are genuinely interesting work. But the NIST competition covering them isn’t finished, which means neither is practical for anything you’d want to deploy or standardize against today. More importantly, once you separate the checkpoint from the payload, signature size matters only for the checkpoint, which isn’t size-constrained anyway. The Merkle structure removes the problem that UOV and SQISign were addressing. They may still have a role in Mode 0 once the standards are settled, but they’re not the lever that makes the design work.

What’s still missing

The spec has a revocation mechanism based on index ranges that a verifier checks at scan time, but the format for distributing and authenticating those revocation lists isn’t fully defined yet. This is the most operationally significant open item. An unsigned revocation list is vulnerable to a stale-list attack at the network layer. An adversary who can delay or suppress list delivery can extend the validity of a revoked assertion. The natural fix is issuer-signed lists using the same key that signs checkpoints, but that format isn’t written yet. Until it is, revocation is a weak link in any deployment that takes revocation seriously.
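As a rough illustration of what a scan-time check against index ranges could look like, here is a small Python sketch. The in-memory representation (sorted, non-overlapping inclusive ranges), the function names, and the freshness rule are all hypothetical; the distribution and authentication format is exactly the part that is not yet defined, and the timestamp would only be trustworthy once the list is signed.

```python
import bisect

def is_revoked(index, ranges):
    # ranges: sorted, non-overlapping (start, end) tuples, both inclusive.
    starts = [start for start, _ in ranges]
    pos = bisect.bisect_right(starts, index) - 1
    return pos >= 0 and ranges[pos][0] <= index <= ranges[pos][1]

def list_is_fresh(issued_at, now, max_age_seconds):
    # Rejects lists older than the verifier is willing to tolerate, which
    # bounds how long a suppressed update can keep a revoked entry alive.
    # Only meaningful if issued_at is covered by an issuer signature.
    return 0 <= now - issued_at <= max_age_seconds

revoked_ranges = [(100, 199), (500, 500), (1000, 1049)]
```

For example, `is_revoked(150, revoked_ranges)` is true and `is_revoked(200, revoked_ranges)` is false. A verifier would refuse to rely on the list at all when `list_is_fresh` returns false.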

Type 0x02 key assertions, where the QR proves possession of a private key rather than just embedding bearer claims, are defined in the log entry format but the challenge-response protocol isn’t specified. Two implementations can’t interoperate on key assertions without it.

The C2SP tlog-checkpoint format needs registrations for ECDSA and ML-DSA before those algorithms can interoperate with standard tlog-checkpoint parsers. Ed25519 is fully specified today. ECDSA and ML-DSA work in the reference implementation but aren’t interoperable with external tooling yet. This is a practical blocker for adoption by anyone not using the reference implementation, and it’s the right next conversation to have with the C2SP and MTC communities.

Try it

The browser demo runs entirely in-page with no backend. It generates Ed25519 or ML-DSA-44 keys in your browser, issues assertions, builds the Merkle tree, produces QR codes, and runs the full 15-step verification trace. The tamper panel lets you flip proof bytes, corrupt the TBS, zero the proof, or truncate the payload, and watch exactly which verification step catches each failure. It’s a useful way to build intuition for what the protocol is actually checking and why each step is there.

The repo is at github.com/PeculiarVentures/mta-qr-demo. Pull requests welcome, especially on the open items.

When Compliance Records Become the Only Honest Signal


I’ve been spending a lot of time lately building Systematic Reasoning with my long-time friend Vishal. The core premise is straightforward. Organizations reveal their true operational character through how they design to prevent failure, how they plan to handle it when it happens, and how they actually do. That signal deserves to be tracked, structured, and acted on. We’re building an agentic compliance platform to do exactly that.

Systematic Reasoning won’t be limited to any single domain, but we decided to start with the Web PKI. The reasoning was simple. It’s high impact in a way that’s hard to overstate. Every internet user depends, whether they know it or not, on a relatively small number of Certificate Authorities getting things right. The margin for error is zero. If that trust layer breaks, it breaks for everyone.

DigiNotar is the canonical example. A small Dutch CA, compromised so thoroughly that attackers could impersonate any website on the web, and did. That capability was used to spy on Iranian dissidents, intercepting communications that people believed were private and secure. The trust infrastructure that was supposed to protect them was turned into a weapon against them. DigiNotar isn’t an edge case or a cautionary tale from a more naive era; it’s a demonstration of the actual ceiling of what can go wrong. And it isn’t the only one. State-affiliated certificate authorities have been caught performing man-in-the-middle attacks on their own citizens’ traffic, something the Baseline Requirements explicitly prohibit, but prohibition only matters if it’s enforced. The web’s trust model works right up until the moment someone decides it’s more useful as surveillance infrastructure.

At the core of Systematic Reasoning is a belief I’ve held for a while. Compliance can be a vital sign of organizational security, but only if it’s continuous. The reality today is that it isn’t. Code ships daily. Audits happen annually. The gap between those two rhythms is where things go quietly wrong.

I’ve written before about why I have limited faith in the current audit regime. Auditors are engaged by the organizations they assess. Their product is a clean seal; their incentive is to keep the client. They operate on point-in-time sampling with auditee-selected scope, and they’re often compliance professionals rather than engineers, which means they’re checking whether a policy exists more than whether the system actually behaves correctly. That’s if you’re lucky. Sometimes the audit is scoped against a version of the Baseline Requirements that was superseded over a year ago.

The same incentive shapes how certificate authorities write their governance documents. A CP/CPS that relies heavily on incorporation by reference, that omits specifics about what the organization actually does and what constraints it operates under, is easier to audit against than one that makes precise, testable commitments. Vagueness isn’t always carelessness. Sometimes it’s a design choice. The same thing happens in incident reports. A report that attributes a failure to “organic process evolution” or “human error” without describing the actual control gap is easier to close than one that names the broken system and commits to a specific fix. In both cases the document gets the box checked without creating accountability. References establish authority. Commitments establish accountability.

The audit gap isn’t compensated for by strong internal monitoring either. The majority of significant compliance failures are not caught internally. They are caught by external researchers, root program staff, or community tooling. A broken validation endpoint runs for five years and the organization finds out because someone posted a 404 error in a public issue tracker. A validation race condition exists undetected for seven and a half years not because it was well hidden but because nobody was looking. The absence of an internal alarm is not evidence that the system is healthy. It is often evidence that the monitoring itself is missing.

So public incident reports and governance documents become some of the most signal-rich material available. Policy documents tell you what an organization claims it will do. Incident reports tell you what happened when reality diverged from that claim. Together they create a longitudinal picture that neither document produces alone.

Building a system to reason over that data surfaced a problem I didn’t fully anticipate. When you’re working from the outside, with no access to internal systems and no way to verify what actually changed, the public record is almost all you have. The question isn’t whether to treat it with skepticism. It’s how much skepticism to build in by default.

The temptation is to give the benefit of the doubt. Organizations are required to describe the blast radius of an incident. Not every localized bug is a symptom of something systemic. But accepting minimizing language at face value is its own failure.

“Only” is doing a lot of work when the bug it’s describing went undetected for seven and a half years. “No compromise of end-entities” is doing a lot of work when what it really means is that nobody found the gap before you did. Framing survival as security isn’t reporting, it’s PR. And if an organization believes an incident is no big deal, you can predict with reasonable confidence that the root cause analysis will be shallow and the remediation will be a band-aid.

ForgeIQX, our first offering, tracks those signals longitudinally across both policy documents and incident reports. Not to prosecute organizations for their language choices, but to notice when a commitment made in a CP/CPS quietly disappears in the next version, or when a promised fix is nowhere to be found when the same failure mode surfaces years later. That’s commitment decay, the slow evaporation of a promise made under pressure, and it’s only visible if you’re tracking across multiple documents and incidents over time rather than treating each one in isolation.

The calibration problem is real and doesn’t have a clean answer. Get it wrong in one direction and you build a system that cries wolf. Get it wrong in the other and you build a system that launders PR-speak into clean signals, which is just automating the thing we already do too much of.

There’s a third failure mode that took me longer to see. A system like this can be gamed. Swap “we got lucky” for “our monitoring detected no active exploitation.” Replace “only thirty certificates” with a more clinical impact scoping statement that says the same thing in language that sounds like engineering rigor. The words change; the institutional posture doesn’t. A system that can be satisfied by better prose isn’t measuring operational maturity, it’s measuring communications sophistication.

That means the system has to be built with structural pessimism. Not cynicism for its own sake, but a deliberate prior that clean language is not the same as clean operations, and that the absence of red flags is not the same as the presence of green ones. We can’t verify that an organization fixed what it said it would fix. What we can do is watch whether the same failure mode surfaces again and whether the pattern of shallow root cause analyses continues or breaks. The historical record doesn’t tell us what’s true inside these organizations. It tells us what they were willing to say in public, under pressure, over time. Given the alternatives, that may be the most honest signal available.

A certificate authority with genuine operational maturity should want this kind of scrutiny applied to itself. Not because it will always produce a clean result, but because it surfaces the gaps before an external party does. ForgeIQX gives organizations a way to continuously monitor their own compliance posture, so their practices and code keep pace with their commitments. The same is true for auditors who want their findings to mean something beyond a checkbox. The problem with the current regime isn’t that the people in it are careless. It’s that the incentive structures don’t reward rigor, and the tooling to demonstrate it continuously doesn’t exist. That’s what we’re building.

The Web PKI is where we started because the stakes are concrete and the public record is unusually rich. But any regulated industry where compliance is measured annually, where governance documents are written to satisfy auditors rather than inform relying parties, and where incident reports are drafted with one eye on legal exposure, has the same gap between what the paper says and what the organization actually does. We started here. We don’t intend to stop here.

The Signal They Chose to Ignore


Two prior posts worked through the statistics of the SB 6346 sign-in data. In the first I established the methodology and the finding. After applying a birthday-corrected collision test to separate organic participation from anomalous windows, roughly 90,000 legitimate CON participants remain against roughly 9,100 legitimate PRO participants. In the second I addressed the legislature’s claim that duplicate names make the dataset unreliable. The finding runs the other way. A genuine sample drawn from a real community produces name collisions at a predictable rate. People share surnames, people hit submit twice, households have two people with the same name. The PRO overnight batch produced zero collisions across 934 draws, where the statistical minimum expected is around 30. The anomaly is suspicious precisely because it has too few duplicates, not too many. Real participation is messy. This was not.
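For readers who want the intuition behind "too few duplicates", the sketch below shows the general shape of a birthday-style collision estimate. It is not the birthday-corrected test from the earlier posts; it assumes independent draws and a Poisson approximation for the chance of seeing no collisions, and the function names are illustrative.

```python
import math

def expected_collisions(draws, name_probs):
    # Expected number of colliding pairs among independent draws:
    # C(draws, 2) * sum(p_i^2), where p_i is the share of each name.
    pairs = draws * (draws - 1) / 2
    return pairs * sum(p * p for p in name_probs)

def prob_zero_collisions(expected):
    # Poisson approximation: P(no collisions) ~ exp(-expected).
    return math.exp(-expected)

# Classic birthday setting as a sanity check: 23 draws, 365 equally likely values.
birthday_expected = expected_collisions(23, [1 / 365] * 365)

# Using the figure quoted above: about 30 collisions expected, zero observed.
p_clean_batch = prob_zero_collisions(30)
```

Under those assumptions, a batch expected to produce around 30 collisions comes back perfectly clean with probability below one in a trillion.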

This post is not about those results. It is about what legislators said about them at a February 24 media availability, and whether their positions are statistically defensible.

They are not.

The Math Problem With “Not Helping Us Make Decisions”

“It’s not like we are making decisions not to pass a bill because of a sign in… they’re not really helping us make decisions in terms of amendments to bills or whether to pass it out of committee or not. We rely on people who actually come and testify in person.”

— Senator Manka Dhingra, February 24 media availability

That is a statistical claim. It asserts that the sign-in data has no decision-relevant information. For that to be true, one of two things must hold. Either the signal is too noisy to be meaningful, or legislators have better information that makes it redundant.

Neither holds.

The 10:1 ratio across 90,000 legitimate responses is not ambiguous. The margin of error at that sample size is roughly a third of a percentage point. The ratio does not wobble under any standard statistical treatment. Even applying the most aggressive self-selection correction anyone has proposed, assuming CON participants are twice as motivated to engage as PRO participants, the adjusted ratio is still 5:1. The signal does not disappear. Calling it noise is not a statistical judgment. It is a refusal to do the math.

As for better information, what would that be? Testimony at a two-hour hearing. Phone calls. Letters. The intuitions of members who have held their seats for multiple cycles. None of those are more statistically rigorous than 90,000 data points. Most are orders of magnitude less rigorous. If a senator’s read of the room outweighs a dataset this large at a ratio this clear, that is not superior methodology. That is substituting anecdote for evidence.

Dhingra’s preferred alternative, people who show up in person, has its own problem. The photo below is from the February 6 Senate hearing. The room is full of people in matching purple shirts and teal sashes. That is coordinated turnout, organized in advance, by people with the resources and flexibility to get to Olympia on a weekday. It is the physical equivalent of a sign-in campaign, except it requires taking a day off work and driving to the state capitol.

That standard also systematically excludes the people most affected by legislation. A small business owner in Spokane worried about a new tax on their income cannot easily testify on a Wednesday. A nurse working a shift cannot. A retired teacher in Yakima cannot. The sign-in system exists precisely because geographic and economic barriers make in-person participation inaccessible to most Washingtonians. Dismissing sign-ins in favor of in-person testimony is not a quality upgrade. It is a substitution of one self-selected sample for a smaller, more organizationally filtered one.

What Statistically Relevant Engagement Actually Looks Like

A standard poll commissioned to gauge public opinion on a major policy question uses around 1,000 respondents. That produces a margin of error of roughly 3.1% at 95% confidence. Those numbers drive legislation, inform campaign strategy, and get cited on the floor. Nobody demands methodology disclosure before a senator cites a Crosscut poll. That is simply the accepted evidentiary standard for constituent sentiment.

The sign-in dataset, after deduplication, contains roughly 90,000 legitimate CON responses. While strict margin of error calculations require randomized polling rather than opt-in data, the mathematical gravity at this scale is inescapable: a random sample of this size would carry a margin of error of approximately 0.33%. This dataset is ninety times larger than what legislators already treat as a reliable signal, with precision roughly ten times tighter.
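The two margin-of-error figures follow from the standard formula for a simple random sample, z * sqrt(p(1 - p) / n), at the worst case p = 0.5 and z = 1.96 for 95% confidence. A short Python check, with the caveat already noted that opt-in data is not a random sample:

```python
import math

def margin_of_error(n, p=0.5, z=1.96):
    # Half-width of a 95% confidence interval for a proportion,
    # assuming a simple random sample of size n.
    return z * math.sqrt(p * (1 - p) / n)

poll_moe = margin_of_error(1_000)      # typical commissioned poll
signin_moe = margin_of_error(90_000)   # sign-in dataset at face value
tightening = poll_moe / signin_moe     # equals sqrt(90)
```

The results are about 3.1% and 0.33%. Precision scales with the square root of the sample size, so ninety times the sample gives a margin roughly 9.5 times tighter.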

Washington has approximately 5.5 million registered voters. Ninety thousand responses represent roughly 1.6% of that population engaging with a single bill in committee. In political science research on constituent contact, engagement rates on individual pieces of legislation are typically measured in fractions of a percent. At 1.6%, this dataset is not a rounding error above that baseline. The prior record for sign-ins on any Washington bill was reportedly around 45,000, itself considered extraordinary. This dataset doubled it, and the legislative website crashed under the volume because nothing in the system’s design anticipated engagement at this scale.

The infrastructure of participation failed because the signal exceeded its design limits. That is not a data quality problem. That is evidence of something real happening in the electorate.

To put 90,000 in electoral terms: Washington has 49 legislative districts. Distributed statewide, that averages roughly 1,800 CON sign-ins per district. The 2024 state Senate race in the 10th district was decided by 153 votes. The House race in the 17th district was decided by fewer than 200. Several competitive seats turned on margins smaller than the number of people in those districts who showed up to oppose this bill. Legislators are not dismissing a fringe signal. They are dismissing a constituency that is, in several of their districts, larger than their margin of victory.

Consider how the same legislators would respond to a poll of 1,000 Washingtonians showing 10:1 opposition to a bill. That finding would be treated as dispositive. It would be cited in floor speeches, appear in press releases, and be described as a clear signal of constituent sentiment. This dataset shows the same ratio at ninety times the sample size, with a margin of error roughly ten times tighter, with an audit trail, with a reproducible methodology, and after removing anomalous windows on both sides.

The legislators who called it noise do not apply that standard to anything else they use.

You Don’t Need to Read the Bill

“I don’t think everyone who’s signing in in support or opposition is actually reading the bill. So I think you got to take it for what it’s worth.”

— Senator Yasmin Trudeau, February 24 media availability

For a technical bill where the title might mislead, that would be a legitimate point. SB 6346 is not that kind of bill. Washington has not had an income tax in nearly a century. Voters have rejected it ten times. For most constituents signing in CON, reading the bill is beside the point. They already know where they stand. The question SB 6346 raises for them is not what the rate structure looks like. It is whether Washington should have an income tax at all, and on that question they have a consistent ninety-year answer. Beyond that settled position, the architects of this legislation documented their strategy in writing years before the bill was introduced.

In April 2018, Senator Jamie Pedersen sent an email to a former Democratic legislator explaining the real value of passing a capital gains tax. The major use of revenue, he wrote, was secondary. The more important benefit was on the legal side. Passing a capital gains tax would give the Supreme Court the opportunity to revisit its decisions that income is property, and would “make it possible to enact a progressive income tax with a simple majority vote.” Those emails were obtained through public records and published by the Washington Policy Center, which also documented the three-step sequence Pedersen described. Pass the capital gains tax to break the legal seal. Pass a millionaires tax to build the administrative infrastructure. Then lower the threshold to capture the middle class.

The capital gains excise passed in 2021. Pedersen also promised the revenue would reduce property and sales taxes. The state collected $1.8 billion in capital gains revenue from 2022 to 2024. Not a dollar went to reducing property or sales taxes. New spending absorbed everything. The Supreme Court upheld the excise in Quinn v. State in 2023, doing precisely what Pedersen predicted. A surcharge was added in 2025. SB 6346 arrived in 2026 as the simple majority vote Pedersen described eight years earlier.

A constituent who signs in CON without reading SB 6346 but who knows this history is not pattern-matching by instinct. They are responding accurately to a documented legislative strategy, now in its final stage, by an architect who wrote down the plan. Trudeau’s concern assumes the sign-in reflects ignorance. The record complicates that assumption.

The federal income tax was introduced in 1913 with a top rate of 7% on incomes above $500,000. It has not stayed that limited since. A constituent who has watched Washington’s capital gains excise follow the same arc, introduced with tax relief promises that were never kept and expanded within four years, is not being paranoid. They are reading the pattern correctly.

A constituent signing in CON on this bill is not evaluating the mechanics of a 9.9% rate on income above one million dollars. They are evaluating a mechanism with a documented history and a stated long-term purpose. That is not noise. That is the signal working as designed.

The Participation Double Standard

“As a general rule, I always warn my members, you shouldn’t really pay attention to that kind of dialogue… maybe focus less on numbers and more on quality of engagement.”

— Speaker Laurie Jinkins, February 24 media availability

“Quality of engagement” implies that organized participation is lower quality than spontaneous participation. Applied consistently, that standard would disqualify most of what the same legislators celebrate as democratic infrastructure.

Get-out-the-vote campaigns are organized, at scale, through forwarded links, text banking, social media mobilization, and door knocking. They systematically encourage people to act on issues they may not have independently researched. Nobody argues that a voter who was reminded to register by a campaign text is less legitimate than one who showed up spontaneously. Nobody demands that turnout in heavily canvassed precincts be discounted because the participation was encouraged rather than organic.

The asymmetry is hard to explain on principled grounds. Get-out-the-vote efforts are explicitly designed to shape electoral outcomes, which directly determine who holds legislative power. Organized sign-in campaigns are designed to inform legislators of constituent sentiment on a specific bill, which Jinkins then warns her members not to pay attention to anyway. If one is legitimate democratic infrastructure and the other warrants skepticism, that distinction requires an explanation nobody has offered.

The Self-Selection Argument Does Not Save Them

The legitimate version of the dismissal is astroturfing risk. Organized campaigns can mobilize sign-ins that do not reflect organic sentiment. Two problems follow.

First, the statistical work already addresses it. The anomalies I flagged in those prior posts run against the PRO side, not CON. The CON signal carries the messy collision fingerprint consistent with real people. The organized manipulation concern, applied rigorously and symmetrically, strengthens the CON signal rather than undermining it.

Second, self-selection disqualifies nothing legislators already use. Every constituent signal they rely on is self-selected. Calls. Letters. Town hall attendance. Donations. None represent a random sample of the electorate. The sign-in system is being held to an evidentiary standard that almost no feedback mechanism in democratic practice has met, and that standard is applied to nothing else.

What makes the sign-in data different from those signals is not that it is less reliable. It is that it is more systematic. It produces a record. It is auditable. It generated enough volume to run statistical tests on. The methodology applied here would hold up in a peer-reviewed context. The “I talked to my constituents” alternative would not.

For the underlying sentiment to be actually close to even, CON participants would need to be systematically ten times more motivated to engage through this specific channel than PRO participants. That is not a bias correction. That is a complete reversal of the observed signal. No one has offered a mechanism that produces that result.
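The arithmetic behind that claim can be written down directly. This is a minimal sketch under a simple assumed model in which each side signs in at its own constant rate; the function name and the alternative PRO shares are illustrative, not figures from the dataset.

```python
def required_response_ratio(observed_con_to_pro: float, true_pro_share: float) -> float:
    """How many times more likely a CON supporter must be to sign in than a PRO
    supporter for the observed ratio to arise from the given true PRO share."""
    true_con_share = 1.0 - true_pro_share
    # observed = (true_con * r_con) / (true_pro * r_pro)  =>  solve for r_con / r_pro
    return observed_con_to_pro * true_pro_share / true_con_share

for pro_share in (0.5, 0.4, 0.3):
    ratio = required_response_ratio(10.0, pro_share)
    print(f"true PRO share {pro_share:.0%}: CON must be {ratio:.1f}x more likely to sign in")
```

With an even split in true sentiment, the required difference in motivation equals the observed ratio itself.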

The legislators dismissing this data are not applying a rigorous evidentiary standard. They are applying a selective one.

The Broader Pattern

In “Disdain or Design?” I wrote about what happens to constituent input in Washington when institutional actors have decided on an outcome. The user interface of democracy still renders. The buttons are there. What that piece examined is whether the backend those buttons connect to has been rewired.

The sign-in dismissal is that pattern made unusually explicit. When lawmakers assert that sign-in anomalies damage the ‘democratic process,’ the irony is staggering. The legislature already removed the actual democratic process from this bill by attaching an emergency clause, deliberately blocking the public’s ability to challenge it via referendum. They pre-emptively silenced the electoral signal; now legislative leaders are simply stating on camera that the only constituent participation left is not helping them make decisions.

Washington voters have rejected income taxation ten times through the constitutional amendment process. The legislature is effectively circumventing the initiative process that most recently codified that preference. Dismissing the largest constituent response in state legislative history as something members should not pay attention to is not a data science position.

It is a tell about whose input actually shapes the outcome.

The Question That Deserves an Answer

Every signal legislators use to read constituent sentiment is self-selected. Calls. Letters. Town halls. Protests. Donations. Sign-ins are just self-selection at scale, with a paper trail rigorous enough to audit.

It is a perfectly reasonable position to argue that 90,000 highly motivated people clicking a web form do not flawlessly represent the entire state of Washington. But if the legislature genuinely wanted a higher-fidelity democratic signal, they would not have attached an emergency clause to explicitly bypass the voters. And they would not be ignoring a century of bipartisan ballot results where Washingtonians have rejected this exact policy ten separate times.

Legislators are free to make that choice, but voters deserve transparency about it, not a smokescreen of statistical skepticism that the data itself dismantles. When the numbers speak this clearly, ignoring them isn’t methodology; it’s a deliberate unplugging of democracy’s earpiece.

Duplicates Are Not the Problem

The Washington House is now arguing that the sign-in dataset for SB 6346 is unreliable because it contains duplicate names. The claim is simple. If the same name appears more than once, you cannot trust the totals.

They are not wrong that duplicates exist. They are wrong about what duplicates mean and what to do about them.

Every real-world dataset contains noise. Names entered twice, typos, outliers, junk. This is not a scandal. It is a property of data collected from human beings at scale. The standard response is not to discard the dataset. It is to trim it. A trimmed mean, cutting the head or tail or both, is one of the oldest tools in data science. The presence of junk data is not a reason to abandon analysis. It is the reason analysis exists.
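For readers who have not met the tool, a trimmed mean fits in a few lines. The sketch below uses made-up hourly counts purely for illustration; it is not the analysis applied to the sign-in export.

```python
def trimmed_mean(values, trim_fraction=0.1):
    """Mean after dropping trim_fraction of the values from each end."""
    if not 0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * trim_fraction)   # number dropped from each tail
    kept = ordered[k:len(ordered) - k]
    return sum(kept) / len(kept)

# Made-up hourly counts with one junk spike, for illustration only.
hourly_counts = [40, 42, 38, 41, 39, 43, 40, 37, 44, 900]

print(sum(hourly_counts) / len(hourly_counts))   # plain mean, dragged up by the spike
print(trimmed_mean(hourly_counts, 0.1))          # spike and lowest value removed
```

The point of the example is only that one junk value distorts the plain mean badly while the trimmed figure stays close to the typical hour.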

The birthday-corrected collision test applied in the previous post is a more principled version of exactly that. Rather than arbitrarily cutting a fixed percentage off the tail, it uses the population model to identify which specific windows are statistically anomalous and removes only those. The legislature is being offered a choice between principled trimming and throwing the whole dataset away. One of those is data science. The other is a talking point.

Why Duplicates Happen

Before getting to the test, it is worth being precise about why duplicates appear in the first place, because the innocent explanations are more common than the fraudulent ones.

Approximately 800 people named John Smith live in Washington state. These are real, distinct individuals.

The first is demographics. According to the U.S. Census Bureau, Smith is the most common surname in America, occurring roughly 828 times per 100,000 people. There are an estimated 32,000 people named John Smith in the United States, approximately 800 in Washington state alone. But national averages miss how name frequency actually works in practice. It clusters by community. Redmond and Bellevue have dense South Asian tech worker populations where Patel and Singh recur at rates far above the state average. Tukwila and south King County have large East African and Somali communities where Mohammed appears with predictable frequency. South Seattle and the Puget Sound corridor have substantial Vietnamese communities where Nguyen, already the most common surname in Vietnam, concentrates heavily. Name frequency is never random. It reflects religion, culture, and family tradition. Mohammed is among the most common names in the world because naming a son after the prophet is an act of Islamic devotion practiced across generations. That is not a data quality problem. The same full name appearing two or three times in 80,000 records is not evidence of anything. It is census math applied to a state that looks nothing like the national average.

The second reason duplicates appear is the sign-in form itself. It does not confirm that your submission was received. Anyone who has filled out a web form and stared at the screen knows what comes next. You submit again. Someone might also change their mind and resubmit to correct their position. Two people named Michael Johnson in the same household might both sign in independently. None of that is fraud. Both causes are real, and a serious analysis accounts for both.

Beyond that, even if we removed all of the duplicates, it would not even move the needle on the ultimate message being sent. With that said, it is worth noting that CON has more removals in absolute terms because it has ten times as many submissions, which is what we would expect based on the collision test.

On Rapid Submissions

A related claim is that submissions arriving within seconds of each other indicate bot activity. The timing observation is real. The interpretation is not supported by the data available.

Rapid same-name pairs are primarily a function of submission volume. When hundreds of people are submitting per hour, two people who share a name will statistically land within seconds of each other by chance alone. The chart below plots same-name rapid pairs against hourly submission rate for both sides. Both follow the same curve. The PRO overnight Feb 20 hours, at roughly 190 submissions per hour, fall below where the trend predicts they should be, which is consistent with what the collision test found. The timing argument does not add new evidence against PRO. It describes a mathematical property of any high-volume submission window.
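A back-of-the-envelope model shows why volume alone drives this. The sketch assumes submissions arrive independently and uniformly within an hour, and it uses a placeholder probability that two submissions share a name; neither assumption is taken from the export, and the function name is hypothetical.

```python
from math import comb

def expected_rapid_same_name_pairs(per_hour: int, window_s: float, p_same_name: float) -> float:
    """Expected same-name pairs landing within window_s seconds of each other,
    assuming arrival times are independent and uniform over one hour."""
    hour = 3600.0
    # Probability two uniform arrival times fall within window_s of each other.
    p_close = 2 * window_s / hour - (window_s / hour) ** 2
    return comb(per_hour, 2) * p_close * p_same_name

# P_SAME is a placeholder, not a value estimated from the export.
P_SAME = 0.001
for rate in (100, 200, 400, 800):
    print(rate, round(expected_rapid_same_name_pairs(rate, 10, P_SAME), 3))
```

Because the number of possible pairs grows with the square of the hourly rate, doubling the volume roughly quadruples the expected count of rapid same-name pairs under this model.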

The public export contains no IP addresses. Without them, there is no way to distinguish among three completely different explanations for rapid sequential submissions. The first is a single person double-submitting because the form gave no confirmation. The second is two people in the same household on the same connection. The third is two distinct people with different IPs whose submissions happened to land close together during a busy window.

The tool that would actually resolve this is IP address logs from the server. A same-IP rapid duplicate is strong resubmission evidence. A different-IP rapid duplicate from a residential ISP is two real people. A cluster of submissions from a datacenter or known VPN range is a different finding entirely. None of that analysis is possible from the public CSV, which is the only data anyone outside the AG’s office has seen.
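If server logs were available, the triage described above could be expressed as a simple rule. This is a hypothetical sketch: the two boolean inputs stand in for fields the public CSV does not contain, and the labels are shorthand for the explanations in the text.

```python
def classify_rapid_pair(same_ip: bool, from_datacenter_or_vpn: bool) -> str:
    """Map a rapid same-name pair to the explanation the text associates with it."""
    if from_datacenter_or_vpn:
        # Datacenter or known VPN ranges are treated as a separate finding.
        return "needs separate investigation"
    if same_ip:
        return "likely resubmission"
    return "likely two distinct people"

print(classify_rapid_pair(same_ip=True, from_datacenter_or_vpn=False))
print(classify_rapid_pair(same_ip=False, from_datacenter_or_vpn=False))
print(classify_rapid_pair(same_ip=False, from_datacenter_or_vpn=True))
```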

This matters because the “within seconds” framing is being used to support a conclusion the available data cannot reach. The previous post noted that IP logs should be preserved before they age out. That recommendation stands. Until that analysis is done, timing alone is not evidence of anything specific.

It is also worth noting what the PRO overnight pattern does not look like. It shows zero name collisions and below-trend rapid pairs, the opposite of what cheap automation produces. What that pattern is consistent with is a large list of pre-generated unique names submitted at a controlled rate. CAPTCHA does not stop that. Each submission looks like a distinct human from the name and timing perspective. The fix legislators might reach for does not address the threat model the data actually points to.

What the Test Is Measuring

The birthday problem tells you that a room of 23 people has a 50% chance of containing a shared birthday. The same math gives you the expected number of name collisions in any random sample drawn from a community of known size. If you have 9,000 PRO supporters and draw 934 names from that pool, some names will repeat by chance. Not because anyone cheated. Because Jennifer Lee exists in multiples, and because some of them hit submit twice when the page did not respond.

The expected number of collisions for that sample is approximately 60. Not zero. Sixty. The test does not flag duplicates. It asks whether the duplication rate is consistent with what a genuine community would produce.
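The underlying math is easy to check. The sketch below reproduces the classic birthday figure and then computes a uniform baseline for the sign-in case, assuming every one of 9,000 participants is equally likely to be drawn. That baseline is not the population model used in the analysis; real name frequencies are uneven and some people resubmit, both of which push the expected count above what a uniform pool gives.

```python
def birthday_collision_probability(people: int, days: int = 365) -> float:
    """Probability that at least two of `people` share a birthday (uniform days)."""
    p_all_distinct = 1.0
    for i in range(people):
        p_all_distinct *= (days - i) / days
    return 1.0 - p_all_distinct

def expected_colliding_pairs(draws: int, pool_size: int) -> float:
    """Expected number of pairs of draws that hit the same person, assuming each
    draw picks uniformly from a pool of distinct, equally likely people."""
    return draws * (draws - 1) / (2 * pool_size)

print(f"{birthday_collision_probability(23):.3f}")    # just over one half
print(f"{expected_colliding_pairs(934, 9_000):.1f}")  # uniform-pool baseline
```

Even the most conservative version of the model, a perfectly uniform pool, expects dozens of collisions in 934 draws.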

For the Senate PRO February 20th overnight window, the observed collisions were zero. Not fewer than expected. Zero. Across 10,000 simulations drawing from the actual PRO participant pool, the minimum produced was around 30. The overnight batch produced none.
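A simulation of this kind can be sketched in a few lines. The version below draws from a uniform pool of 9,000 stand-in participants rather than the actual PRO participant pool, so its numbers will differ from the ones reported above; the function name and seed are illustrative.

```python
import random

def simulate_repeat_counts(pool_size: int, draws: int, n_sims: int, seed: int = 0) -> list[int]:
    """For each simulated window, count draws that repeat an earlier draw."""
    rng = random.Random(seed)
    counts = []
    for _ in range(n_sims):
        sample = rng.choices(range(pool_size), k=draws)  # sampling with replacement
        counts.append(draws - len(set(sample)))
    return counts

# Uniform pool of 9,000 stand-in participants; the post drew from the real pool.
# 2,000 runs keep the sketch quick; the post reports 10,000.
counts = simulate_repeat_counts(pool_size=9_000, draws=934, n_sims=2_000)
print("min:", min(counts), "mean:", sum(counts) / len(counts), "max:", max(counts))
print("runs with zero repeats:", sum(c == 0 for c in counts))
```

What matters is not the exact minimum but whether any simulated window comes close to zero repeats.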

The CON overnight windows tell the opposite story. More collisions than expected across several nights, consistent with resubmission, common names appearing organically, households submitting together. The kind of messy that real participation produces.

What This Means for the Dataset

The argument that duplicates make the dataset unreliable cuts in exactly the wrong direction. The PRO overnight batch from February 20th is anomalous precisely because it has too few duplicates, not too many. A genuine sample from a real community, one that includes people named John Smith and people who hit submit twice, does not produce zero collisions in 934 draws. For practical purposes, it is statistically impossible.
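To put a rough number on how unlikely zero collisions is, the sketch below computes the chance that 934 draws from 9,000 equally likely participants are all distinct. The uniform pool is an assumption made here for simplicity; uneven name frequencies and resubmissions would make zero collisions even less likely.

```python
import math

def probability_no_repeats(draws: int, pool_size: int) -> float:
    """Probability that `draws` uniform picks from `pool_size` people are all distinct."""
    # Sum logs instead of multiplying to keep the tiny product numerically stable.
    log_p = sum(math.log1p(-i / pool_size) for i in range(draws))
    return math.exp(log_p)

p_zero = probability_no_repeats(934, 9_000)
print(f"P(zero collisions) ~ {p_zero:.1e}")
```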

Raw duplicate counts, without correcting for population name frequency and sample size, are not a meaningful metric. The legislature is being asked whether these sign-in totals reflect genuine public sentiment, and that is a statistical question with a statistical answer. The answer is not “the dataset has duplicates, therefore we cannot know.” The methodology was built specifically to separate expected duplication from anomalous duplication, and the findings hold.

Discarding the dataset because it contains duplicates is not data analysis. It is avoiding data analysis.

None of this is perfect. IP address analysis would not be definitive because VPNs, shared connections, and mobile carriers complicate attribution. The collision test rests on a population model that is an estimate, not a census. The rapid pairs chart fits a trend to noisy data. Statistical inference is always probabilistic, and anyone who tells you otherwise is selling something.

But the question legislators are actually asking is not whether this dataset is perfect. It is whether the sign-in totals are a reasonable signal of public sentiment, and whether the anomalies identified are significant enough to warrant skepticism about specific windows. For that question, the methodology does not need to be perfect. It needs to be fit for purpose.

A 10:1 ratio that survives deduplication, symmetrical trimming, and a collision test that was explicitly designed to tolerate legitimate duplication is a robust signal. The PRO overnight Feb 20 anomaly does not need to be proven beyond a reasonable doubt to be disqualifying for that window. The standard here is not a criminal conviction. It is whether legislators can treat the aggregate numbers as a directional guide to constituent sentiment. On that standard, the analysis is more than sufficient.

On Impersonation

Named officials discovering their identities appeared in the dataset without their consent is a real incident worth investigating. But the sign-in system was never designed to verify identity or attribute positions to specific individuals. Names are collected not to create a record of who voted, but because a completely anonymous system would be trivially manipulable. A name field is the minimal friction that makes aggregate analysis possible at all.

The relevant question for legislators is not “did John Smith actually sign this?” but “does the distribution of sign-ins reflect genuine public sentiment?” This is a survey mechanism, not a ballot. Washington has 7.8 million residents. Even a perfectly clean dataset with 100,000 CON sign-ins represents a small fraction of the population. Legislators have always understood these numbers as a directional signal, not a binding count. Treating impersonation as the central finding, rather than asking whether the aggregate signal survived manipulation, mistakes the instrument for the measurement.

The numbers behind the impersonation claim deserve scrutiny. Invest in Washington Now reported roughly 100-200 confirmed cases across 123,289 records, less than 0.2% of the dataset. Even tripling that estimate to account for unreported cases, it does not move a 10:1 ratio in any meaningful direction. And if you apply their own deduplication logic symmetrically, removing every name that appears more than once from both sides, CON drops from roughly 110,000 to 91,000 and PRO drops from roughly 10,000 to 9,000. The ratio is still 10:1. Their argument, applied consistently to both sides, does not change the conclusion.
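The arithmetic can be checked with the rounded figures quoted above. The sketch adds one assumption of its own: a worst case in which every tripled impersonation case is removed from the CON side only.

```python
TOTAL_RECORDS = 123_289
confirmed_high = 200                      # upper end of the reported 100-200 range

share = confirmed_high / TOTAL_RECORDS
print(f"confirmed cases: {share:.2%} of records")

# Rounded side totals from the text, before and after symmetric deduplication.
con_before, pro_before = 110_000, 10_000
con_after, pro_after = 91_000, 9_000

# Worst case for CON (an assumption): triple the cases and remove them from CON only.
con_worst = con_before - 3 * confirmed_high

print(f"raw ratio:        {con_before / pro_before:.1f} to 1")
print(f"tripled removals: {con_worst / pro_before:.1f} to 1")
print(f"deduplicated:     {con_after / pro_after:.1f} to 1")
```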

Those confirmed cases were identified because victims self-reported. Public officials monitor mentions of their names, noticed the discrepancy, and came forward. That is the easiest fraud to find. It tells you nothing about what the rest of the dataset contains. Self-reported impersonation is the floor of what happened, not the ceiling, which is precisely why aggregate statistical analysis exists.

It is also worth considering what those confirmed cases likely represent. Some are probably legitimate resubmissions. Someone signed in, was not sure it worked, signed in again, and now appears twice. Some are probably trolling. Actual coordinated impersonation may be in there too, but the self-report mechanism cannot distinguish between the three. Treating 200 high-visibility cases driven by public figures monitoring their own names as representative of the full 123,000-record dataset is not a statistical argument. It is a press conference.

So What Does All of This Mean?

The answer to that is simple. The dataset has duplicates. The timing raised questions. Some names were submitted without consent. None of those observations, examined carefully, change what the data shows: roughly ten Washington residents opposed this bill for every one who supported it in committee. That signal has survived every test applied to it.

Teach to the Median, Punish the Variance

Factories exist to produce consistent, cost-effective products. That is the point. The relentless optimization of cost of goods sold is not a side effect of industrial production. It is the mandate. And it works, until it doesn’t. The reason products do not last as long as they did twenty years ago is not that we forgot how to make durable things. It is that durability lost the cost argument. Quality is expensive. Variance is expensive. The system optimizes both out. What survives is the median product, built to a price, reliable enough to ship, and no more.

Modern schooling often behaves the same way. It batches children by age, sequences content for throughput, and optimizes for a predictable median. Sir Ken Robinson made this observation twenty years ago, and the metaphor stuck, not because it is clever but because it names incentives, not architecture. When a system must operate at scale under budget, policy, and staffing constraints, variance becomes expensive. The median becomes the target. Outliers become the problem.

That is how you get the loop so many families recognize.

A child with a spiky profile, gifted and struggling at the same time, or simply learning in a different sequence, is hard for a production line to interpret. The system cannot see internal state. It can only see outputs it knows how to count. Pacing, compliance, turn-in rates, standardized measures, and classroom friction. When it cannot measure what is actually happening, it collapses complexity into a label. Lazy. Defiant. Behind. Broken. Sometimes worse. The misclassification is not incidental. It is structural. The factory cannot afford to treat every student as a special case, so it treats special cases as defects.

Twice-exceptional programs were a serious attempt to address exactly this failure mode. 2e was not supposed to be a vibe. It was an operational category, a way to route support without denying capability.

Institutions rarely attack reforms head-on. They metabolize them. The common move is not to announce that everyone is 2e. It is more subtle. Fold 2e into the general program, justify the change as an opportunity for all, and quietly remove the differentiated pathways, expertise, and accountability that made 2e real. The label survives. The function does not. The specialist becomes a roaming consultant, the pull-out becomes a generic intervention block, and the documentation becomes a checkbox.

Spencer Silver at 3M spent years trying to make a strong adhesive and produced one that was too weak to hold permanently. By factory logic it was a failed batch. It sat in the lab for years because the system had no category for a glue that did not stick properly. A colleague with a different problem recognized the variance as the feature. The factory almost never found out what it had.

This pattern is familiar in M&A. Companies are often acquired to address a capability, culture, or talent gap. The acquirer gets what it wanted on paper, and then the organization takes over. Microsoft bought Hotmail to compete in web-based email. Hotmail ran on FreeBSD and Solaris. Microsoft ported it to Windows, the product degraded, and what had been acquired to solve a problem became an example of the problem. The engineers who built Hotmail watched what they had created get dismantled and left. The institution did not transform around the acquisition. The acquisition transformed into the institution, and the talent that made it valuable walked out with their badges.

The proof a program still exists is not whether the brochure mentions it. It is whether the supports remain distinct, staffed, and enforceable. When a category stops changing what adults do, the system reverts to default settings. Teach to the median, punish variance, treat the casualties as defects.

You can see the same dynamic in curriculum fights. When a system cannot reliably lift the floor, the path of least resistance is to lower the ceiling and call it equity. This is not cynical in intent. It is cynical in effect. Acceleration does not disappear. It moves off the books. Tutoring, test prep, schedule hacking, summer programs, parent advocacy. The families who can afford those channels use them. The families who cannot are left with the official story that the ceiling was lowered for their benefit. The median experience is preserved. The gap widens. Official metrics improve because the ceiling has been redefined.

None of this is morally mysterious. It is operational. What makes it damning is that schooling runs this population optimization model without the measurement and accountability that would make it legitimate.

Medicine is honest about something uncomfortable. Treatments have side effects. They do not affect everyone equally. Approval assumes some negative outcomes are acceptable in exchange for a greater good. But medicine only earns the right to make that utilitarian bargain because it is paired with surveillance and accountability. Trials, defined endpoints, adverse event reporting, label changes, and sometimes recalls. When a drug underperforms or causes unacceptable harm, the system has mechanisms to withdraw it.

Schooling borrows the utilitarian posture and skips the legitimacy conditions. There is no adverse event tracking for predictable harms like anxiety spirals, learned helplessness, disengagement, or the systematic grinding down of nonstandard profiles. When you ask what the rollback criteria are, you get a blank stare, because the system does not think in rollback terms. It thinks in throughput terms.

Here is a small, concrete example. One of my children has an accommodation plan tied to a documented set of specific needs. A teacher recently told us the plan would not be needed anymore because the child does not show ADHD signs. There is no ADHD diagnosis, and the plan is not based on ADHD. The teacher was not acting maliciously. They were acting normally inside a system that treats supports as vibes. In a system with real measurement, you do not withdraw support based on a vibe. You tie withdrawal to documented criteria, with a rollback plan if the criteria are wrong. This is not exotic engineering. It is basic change management. Define the hypothesis, define success, define failure, and pre-commit to the revert.

Schooling routinely does the opposite, and the response when things go sideways is not to revisit the decision. It is to escalate.

More pressure. More compliance. More labeling. The system treats opt-out as a containment breach rather than a performance signal, because enrollment and funding are coupled together. The institution has no incentive to register failure. It has strong incentives to frame failure as the students’.

So why does this cycle finally have a credible exit?

Because AI breaks the monopoly on instruction.

For most of modern history, if you wanted a coherent explanation, feedback loops, sequenced practice, and the ability to revisit a concept from a different angle without embarrassment, you needed the institution or you needed money. Those are the same thing for most families. AI makes those pieces abundant. It makes it cheaper to learn in a different order. It makes it cheaper to revisit a concept from five angles without being punished for needing a sixth. It reduces the penalty for variance in a way that nothing else in the past century has.

This is why models like Alpha School are worth watching, whatever you think of their specific implementation. They are proof that you can architect learning around mastery and coaching rather than batching and seat time. They are not just a new school brand. They are evidence that instruction is no longer scarce, and that the existing system’s grip on the delivery layer is loosening.

The tradeoff is real and worth being honest about. The devil you know versus the one you do not.

The existing system’s harms are normalized, which means they are mostly invisible. The new world introduces different risks. Dependency on opaque tools, misinformation at scale, AI-driven learning environments that are even more coercive than human ones because they optimize metrics nobody agreed to, and a widening gap between families who can navigate the options and those who cannot.

The credential layer will be the next fight. Institutions that lose control of instruction will shift to defending legitimacy. Seat time requirements, accreditation barriers, and the bureaucratic right to define what counts for the purposes of the next gate. If instruction becomes abundant, the last monopoly is not learning. It is recognition.

But the direction of travel is hard to reverse. Bureaucracy protects the status quo long past the point where the quo has lost its status. AI accelerates the expiration date. The more schooling responds to exits with escalation rather than adaptation, the more it will be outcompeted by systems that treat variance as a signal rather than a defect.

I keep coming back to the medicine analogy, but with a sharper edge. In medicine, adverse events are data. In schooling, adverse events become discipline referrals and bad grades. One system updates on failure. The other system records the failure as the student.

AI is not a magic cure. But it is the first credible exit from a century-old loop: a factory that mistakes difference for defect, and calls the casualties the cost of scale.

The Data Doesn’t Support the Narrative

What This Is About

SB 6346 would create Washington’s first personal income tax in nearly a century. It would impose a 9.9% rate on income above $1 million and is projected to raise $3.4 billion annually. It passed the Senate 27-22 on party lines and is now in the House. Washington voters have rejected income taxation ten times at the ballot over the last hundred years. This is the most contested piece of legislation the state has seen in a generation.

Washington’s legislature runs an online sign-in system for committee hearings. Anyone can go to the legislative website and register their position on a bill, pro or con, without testifying. Legislators see the totals. The system is designed to give ordinary people a voice even if they can’t show up in Olympia. On SB 6346, it may be the only meaningful voice many residents have: the legislature designated the bill an emergency measure, which prevents a voter referendum. There is no ballot option. For most Washington residents opposed to this bill, signing in here, or calling their representative directly, is the entire menu.

When those numbers get manipulated, the perception gets manipulated. On a bill this significant, in a state with a century of voter resistance to income taxes, that is not a minor data quality problem. It is a distortion of the democratic signal legislators and journalists are using to understand where the public actually stands.

Which is why getting the analysis right matters. And why it matters that GeekWire got it wrong.

GeekWire reported Monday (Added 2/24/26: and apparently the Seattle Times too) that fraudulent sign-ins were used to inflate CON opposition to SB 6346. Named public officials confirmed their identities appeared without their consent. The framing was clear: the anti-tax side cheated.

The data tells a different story.

The story was built on analysis provided by Invest in Washington Now, a PRO-tax advocacy group that examined CON submissions and held a press conference. They reported a true incident, but failed to do the basic symmetric analysis needed to justify the narrative they attached to it.

I downloaded the full legislative sign-in export at 5:51 PM Pacific on February 23rd, 123,289 records, and ran the same tests on both positions. Here is what it shows.

The data confirms fraud. It does not confirm that fraud explains the opposition. Those are different claims, and the difference matters enormously on a bill this significant.

Who Actually Showed Up

“Legitimate” here means a unique name that appears at least once during daytime hours (7 AM to 11 PM PT). The export does not verify identity.

CON: roughly 90,000 legitimate unique participants.

CON was underrepresented during overnight Pacific hours, which is exactly what you would expect from Washington residents who are asleep at 2 AM. By every geographic measure in the data, CON’s daytime participation pattern is consistent with Washington residents showing up. CON does have an overnight anomaly, roughly 2,800 names that appear only in overnight windows and never in five days of daytime sign-ins, which reduces the legitimate unique count to roughly 90,000. More on this below.

PRO: roughly 9,100 apparent legitimate unique participants.

The PRO side has 9,919 unique names across the full hearing. But on February 20th, 934 submissions arrived in a single five-hour overnight window (1:00–5:59 AM Pacific). That window accounts for about 8.4% of all PRO submissions across the entire hearing period. Of those 934 submissions, 807 carried names that never appear in the daytime record. Set those aside, and you have roughly 9,100 apparent legitimate PRO participants.

That is a roughly 10-to-1 ratio (90,408 CON vs 9,112 PRO in the export, after removing overnight-only names as defined here). A margin like that, on a contested tax bill in a session where the legislature removed opportunities for feedback other than this survey, is not, on its own, statistically implausible.

Here is what the data actually shows. Both sides have anomalous overnight submissions that do not match the daytime participation signature. On the CON side, roughly 2,800 names, about 3% of their total, appear only overnight and never during five days of daytime sign-ins. On the PRO side, 807 names, about 8% of their total, appeared in a single five-hour overnight window. Both anomalies are real. But after removing them, roughly 90,000 legitimate CON participants remain against roughly 9,100 legitimate PRO participants. The fraud did not manufacture the opposition. The fraud, such as it is, was larger in proportional terms on the side that was already losing 10 to 1.

The story treated fraud as the explanation for the margin. The data shows the margin survived the fraud. On every test, the more anomalous signal sits on the PRO side, not the CON side.

The Geography Test

The most straightforward test requires no statistics at all. Washington residents sleep on Pacific time. If you plot submissions by hour of day, genuine local participation should cluster during waking hours and drop off after midnight.

CON activity drops overnight, consistent with local participants sleeping on Pacific time.

PRO shows a pronounced spike on February 20th between 1 and 5 AM, running at close to 190 submissions per hour for five straight hours, while Washington residents were asleep and the CON side was quieter than usual. The spike is not a few night owls.

The Community Test

Here is where it gets harder to explain away.

Across five full days of daytime sign-ins, we have an observable picture of who the PRO community actually is. Roughly 9,100 people engaged during normal waking hours. Five days is a long window. If you are a genuine PRO supporter, the probability that you appeared in that record at least once is high.

Name communities have statistical fingerprints. Any two groups drawn from the same population will share common names at a predictable rate, the same demographic mix, the same frequency of “James Kim” or “Sarah Johnson.” So even if the overnight submitters were entirely different individuals from the daytime crowd, you would still expect their names to collide with the daytime pool at a rate consistent with drawing from the same community. The longer the daytime window, the higher that rate gets.

You can test this directly. Draw 934 random names from the known PRO pool and ask how many appear somewhere in five days of daytime submissions. Across 10,000 simulations, the answer is about 86%.

The observed overnight overlap was 13.6%. Nearly nine out of ten overnight names had never appeared in five days of daytime PRO submissions.

Every one of the 10,000 simulations produced more overlap than the overnight batch did. The minimum was 82%.

The same test applied to CON tells a similar structural story. CON’s overnight names also show only 21–25% overlap with the daytime pool across five nights, against an expected 90–94%. Both positions have overnight participants who are largely absent from the five-day daytime record. The difference is in magnitude and concentration. CON’s anomaly spreads 2,800 names across five nights. PRO’s concentrates 807 names into a single five-hour overnight window. The PRO signal is sharper, but the underlying pattern of overnight names that don’t match the daytime community appears on both sides.

The Name Collision Test

In any large population, names repeat. If you pull 934 people at random from Washington state, some of them will be named James Kim. Some will be named Sarah Johnson. That is not fraud, that is just how names work. The question is whether the names repeat at the rate you would expect given the size and demographic makeup of the community you are drawing from.

934 overnight PRO submissions produced zero repeated names. Not fewer than expected. Zero.

From a genuine community of roughly 9,100 PRO supporters, you would expect around 60 name collisions in a sample that size, just by chance. We ran 10,000 simulations drawing from the actual PRO participant pool. Every single one produced collisions. The lowest was around 30. The observed overnight batch produced none.
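For readers who want to reproduce the shape of this test, here is a minimal sketch. It assumes the simulation draws n submissions with replacement from the list of daytime submission names, so that common names are drawn more often; that is one reasonable reading of the procedure, not a transcript of the code I ran. The function names and the toy pool are hypothetical.

```python
import random

def count_collisions(names):
    """Collisions = submissions minus distinct names among them."""
    names = list(names)
    return len(names) - len(set(names))

def simulate_collisions(submission_names, n, trials=10_000, seed=0):
    """Draw n names with replacement, `trials` times, and count collisions in each draw.

    submission_names: one entry per daytime submission, so a name that was
    submitted three times appears three times and is three times as likely
    to be drawn (an assumption about how the pool is weighted).
    """
    rng = random.Random(seed)
    names = list(submission_names)
    return [count_collisions(rng.choices(names, k=n)) for _ in range(trials)]

# Toy illustration only: 1,000 equally likely made-up names, 100 draws each.
toy_pool = [f"name-{i}" for i in range(1000)]
toy_results = simulate_collisions(toy_pool, n=100, trials=200)
print(min(toy_results), sum(toy_results) / len(toy_results))
```

The comparison that matters is where the observed collision count falls relative to the simulated minimum and maximum, not the exact mean.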

The most consistent explanation is that someone built a list and specifically made sure no name appeared twice. That is not what organic participation looks like. That is what a curated submission operation looks like.

The CON side shows the opposite pattern. Several overnight windows had more name repeats than expected, consistent with people resubmitting or households submitting together. Messy, in other words. The kind of messy that real participation produces.

934 submissions. 934 unique names. Zero repeats. That is the number that should be in the headline.

On CAPTCHA

The form uses CAPTCHA verification. This comes up because some coverage implies it is a meaningful protection.

It is not, against this class of problem. CAPTCHA distinguishes automated bots from humans. It provides no protection against human click farms, which are operations that pay workers in other countries to solve CAPTCHAs manually and complete form submissions by hand. This is a commercial industry. Services are publicly listed, priced at $1 to $3 per 1,000 submissions.

The pattern observed on February 20th is most consistent with coordinated human submissions using a curated name list: overnight timing, near-zero overlap with the daytime community, and zero name collisions across 934 submissions. A click-farm-style mechanism is one plausible explanation. The public export cannot prove attribution without server-side logs.

At those rates, the 934 anomalous overnight PRO submissions represent a trivial cost against a bill projecting $3.4 billion in annual revenue.

On the Audit Trail

The public data export does not include IP addresses. Whether internal server logs exist and whether they have been preserved is unknown. That is a question the AG’s investigation should answer before those logs age out.

Even with full IP logs, naive geolocation proves little. Click farm operations commonly route through VPNs, and an IP address alone does not establish geographic origin without infrastructure-level analysis of the autonomous system it belongs to. What you want to know is not which city the IP is registered to. You want to know whether it belongs to a residential ISP, a datacenter, or a known VPN provider range. Those are different findings with different implications.

What This Means

Neither finding resolves cleanly without a real investigation. The named official impersonations on the CON side are real and the AG should pursue them. But confirming specific named victims is the easiest fraud to find because the victims can self-report. That is not a statistical audit, and it does not address what the data shows on the other side.

Both findings warrant investigation. The system made both possible.

There is a useful analogy here. Risk-Limiting Audits are the gold standard for post-election verification. The premise is that you do not need to check every ballot to establish confidence in the outcome. You need to bound the probability that the anomalies are large enough to change the result. Advocates of RLAs often argue, correctly, that statistical evidence is sufficient to certify an election without requiring individual identity verification for every voter.

That is precisely what this analysis does. It does not identify every fraudulent submission. It asks whether the fraud on either side was large enough to manufacture the margin. The answer is no. After removing every overnight anomaly on both sides, roughly 90,000 legitimate CON participants remain against roughly 9,100 legitimate PRO participants. If statistical sampling is rigorous enough to certify an election, it is rigorous enough to evaluate a legislative sign-in system.

The sign-in infrastructure was built for access. Low friction, no identity binding, no rate limiting that held against coordinated submission. I have written before about how Washington has accumulated individually defensible choices that collectively produce systems incapable of defending their own integrity. The legislature is now trying to adjudicate participation fraud on infrastructure that was never designed to be auditable.

The question that does not get asked in any of the coverage: why did Washington build a public participation system with no ability to verify, audit, or forensically reconstruct what happened, and what would it take to build one that can?

Methodology

All analysis was run on the public CSV export of sign-in records for SB 6346, downloaded at 5:51 PM Pacific on February 23rd, 123,289 records total. Every test was applied symmetrically to both positions using the same parameters. The analysis does not attempt attribution. It bounds the probability of innocent explanation under stated assumptions.

Geographic analysis. Submissions were binned by Pacific hour. Each position’s hourly share was compared to that position’s overall base rate across the full hearing. CON activity drops overnight relative to daytime hours, consistent with participants sleeping on Pacific time. PRO showed a concentrated spike on February 20th between 1 and 5 AM, sustaining close to 190 submissions per hour across five consecutive hours. The 1 to 5 AM PT window corresponds to mid-day hours in parts of Asia and the Middle East.
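A minimal sketch of the binning step, under stated assumptions: records are (position, timestamp) pairs whose timestamps are already in Pacific local time, and "hourly share versus base rate" is read as the position's share of all submissions in a given date-and-hour bucket divided by its share of all submissions overall. The record layout, the function name, and the toy rows are hypothetical, not the export's actual schema.

```python
from collections import Counter
from datetime import datetime

def hourly_lift(records, position):
    """Return {(date, hour): bucket share / overall share} for one position.

    A value near 1.0 means the position is represented in that hour at its
    usual rate; values well above 1.0 flag hours where it is overrepresented.
    """
    records = list(records)
    total_by_bucket = Counter()
    position_by_bucket = Counter()
    for pos, ts in records:
        bucket = (ts.date(), ts.hour)
        total_by_bucket[bucket] += 1
        if pos == position:
            position_by_bucket[bucket] += 1
    position_total = sum(position_by_bucket.values())
    if position_total == 0:
        raise ValueError(f"no records for position {position!r}")
    base_rate = position_total / len(records)
    return {
        bucket: (position_by_bucket[bucket] / total) / base_rate
        for bucket, total in total_by_bucket.items()
    }

# Toy rows for illustration only; these are not values from the export.
toy = [
    ("PRO", datetime(2026, 2, 20, 2, 15)),
    ("PRO", datetime(2026, 2, 20, 2, 40)),
    ("PRO", datetime(2026, 2, 20, 14, 5)),
    ("CON", datetime(2026, 2, 20, 14, 10)),
    ("CON", datetime(2026, 2, 20, 14, 20)),
    ("CON", datetime(2026, 2, 20, 14, 30)),
]
lift = hourly_lift(toy, "PRO")
```

Bucketing by date and hour, rather than hour of day alone, keeps a single anomalous night from being averaged away across the hearing.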

Name overlap test. This test requires no statistical model and is not sensitive to assumptions about name distributions. For each overnight window with at least 20 submissions, the unique names were compared against that position’s daytime submissions (7 AM to 11 PM) across the full hearing. Overlap fraction equals names appearing in both sets divided by total overnight unique names.

To establish expected overlap, 10,000 random samples of size n were drawn without replacement from that position’s full-hearing participant pool, and the overlap fraction with the daytime set was computed for each draw. On February 20th, the PRO observed overlap of 13.6% fell below the minimum of all 10,000 simulations. The lowest simulated value was about 82%. CON overnight overlap ranged from 21–25% across five nights, against a bootstrap expectation of 90–94%, also below every simulation on every night. Both positions show overnight communities that are largely disconnected from their daytime pools.
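The overlap fraction and its simulated baseline can be sketched in a few lines. This is an illustration of the procedure described above, not the exact code I ran: the function names are hypothetical, names are assumed to be already normalized strings, and the pool is treated as the set of distinct names for a position.

```python
import random

def overlap_fraction(overnight_names, daytime_names):
    """Share of distinct overnight names that also appear in the daytime set."""
    overnight = set(overnight_names)
    if not overnight:
        raise ValueError("no overnight names to compare")
    return len(overnight & set(daytime_names)) / len(overnight)

def simulate_overlap(pool, daytime_names, n, trials=10_000, seed=0):
    """Draw n distinct names from the pool, `trials` times, without replacement.

    Returns the overlap fraction with the daytime set for each draw, which
    gives the distribution the observed overnight overlap is compared to.
    """
    rng = random.Random(seed)
    pool = sorted(set(pool))       # sorted so a fixed seed is reproducible
    daytime = set(daytime_names)
    results = []
    for _ in range(trials):
        sample = rng.sample(pool, n)
        results.append(sum(name in daytime for name in sample) / n)
    return results

# Toy illustration only: 100 made-up names, 90 of which appear in daytime.
toy_pool = [f"person-{i}" for i in range(100)]
toy_daytime = toy_pool[:90]
toy_sim = simulate_overlap(toy_pool, toy_daytime, n=20, trials=200)
print(min(toy_sim), sum(toy_sim) / len(toy_sim))
```

An observed overlap that falls below the minimum of every simulated draw is the condition reported above for both positions.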

Birthday-corrected collision analysis. Raw name duplication rates are not meaningful without correcting for sample size. In any large sample, some names will repeat by chance regardless of how the data was generated. The expected number of name collisions for a sample of size n drawn from a pool of N_effective distinct names follows the occupancy problem:

expected unique names = N_effective × (1 − e^(−n/N_effective))

expected collisions = n − expected unique names

N_effective was estimated separately for each position from that position’s own daytime submissions using the method-of-moments estimator: N_effective = u² / (2s − u), where u is the number of unique names and s is total daytime submissions. This assumes the overnight community draws from the same underlying name distribution as the daytime community. That assumption is explicit and falsifiable. CON showed collision excesses across multiple nights, with effect sizes of 1.8%, 2.3%, and 4.1% on the three most anomalous nights. PRO worst night (February 20th): 0 observed collisions, approximately 60 expected, deficit of ~60. Statistical significance was assessed using the Poisson distribution, upper-tail for excess and lower-tail for deficit.
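The formulas above translate directly into code. The sketch below is illustrative: the function names are mine, and the N_effective of 7,500 in the usage lines is a hypothetical value chosen only to show the arithmetic, not a figure taken from the export.

```python
import math

def effective_pool_size(unique_names, total_submissions):
    """N_effective = u^2 / (2s - u), the estimator as stated in the text."""
    u, s = unique_names, total_submissions
    return u * u / (2 * s - u)

def expected_collisions(n, n_effective):
    """Occupancy formula: n minus the expected number of distinct names."""
    expected_unique = n_effective * (1 - math.exp(-n / n_effective))
    return n - expected_unique

def poisson_cdf(k, mean):
    """P(X <= k) for X ~ Poisson(mean), summed term by term."""
    term = math.exp(-mean)
    total = term
    for i in range(1, k + 1):
        term *= mean / i
        total += term
    return total

def poisson_tails(observed, expected):
    """Return (lower tail P(X <= observed), upper tail P(X >= observed))."""
    lower = poisson_cdf(observed, expected)
    upper = 1.0 if observed == 0 else 1.0 - poisson_cdf(observed - 1, expected)
    return lower, upper

# Hypothetical pool size, for illustration only.
expected = expected_collisions(934, 7_500)
lower_tail, upper_tail = poisson_tails(0, expected)
print(expected, lower_tail)
```

The lower tail is the one that matters for a deficit: it is the probability of seeing that few collisions, or fewer, if the overnight batch had been drawn from the same name distribution.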

Sensitivity. The collision deficit finding holds unless the PRO overnight community drew from a pool of at least approximately 200,000–300,000 distinct name combinations, roughly 20–30 times the total observed PRO participant base across the full hearing. The entire PRO participant pool across five days is 9,919 unique names. A reader who disputes this should specify what pool size they would defend, and explain why that entire community was absent from every daytime window across the hearing period.
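To see where a range like that comes from, one can ask how likely zero collisions would be for different pool sizes. The sketch below assumes the same occupancy formula and a Poisson approximation, under which P(zero collisions) is e raised to minus the expected collision count. The function name and the pool sizes tried are illustrative choices.

```python
import math

def prob_zero_collisions(n, pool_size):
    """Approximate P(no repeated names) for n draws from pool_size names.

    Uses expected collisions from the occupancy formula and treats the
    collision count as Poisson, so P(0) = exp(-expected collisions).
    """
    # -expm1(-x) equals 1 - e^(-x) but stays accurate when x is tiny.
    expected_unique = pool_size * -math.expm1(-n / pool_size)
    return math.exp(-(n - expected_unique))

for pool in (9_919, 50_000, 200_000, 300_000):
    print(f"{pool:>7,}  {prob_zero_collisions(934, pool):.3g}")
```

Under these assumptions, zero collisions only stops being a rare event once the pool is in the hundreds of thousands of distinct names, which is the point the sensitivity statement makes.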

Duplicate submissions. The same name appearing multiple times is a separate question and not the subject of this analysis. Some duplication is expected in any real participation dataset; people resubmit, households share names, and common names genuinely recur. The relevant question is whether duplication rates deviate from what the population would predict. The overnight CON windows showed collision excesses consistent with resubmission or household participation. That is a different signal from the overnight PRO deficit, and it points in a different direction.

What this analysis cannot determine. The geographic origin of submissions, the identity of any operator or coordinator, whether the system maintains server-side logs, and the mechanism behind any anomalous pattern. Attribution of intent from behavioral data alone is not supportable. These findings bound the probability of innocent explanation. They do not establish what the non-innocent explanation is.

I ran this analysis quickly after the GeekWire story published. There may be subtle issues in the methodology I have not caught. I am confident it is directionally correct. If you find an error, I will correct it.

When Building Gets Cheap, Distribution Becomes Destiny

“Distribution is the new moat.” You can find some version of that sentence in almost any startup discussion from the last year. It circulates as a take, gets liked, gets reshared, and then gets reproduced by someone else who arrived at the same conclusion independently. The observation has become cheap to make precisely because it is true. What is harder, and what most of those takes skip, is understanding why the structural mechanics behind it matter and what they actually require you to do differently.

For decades, venture capital rewarded the ability to build. In the AI era, building is no longer scarce. Distribution is.

There was a time when building complex software required deep teams, long timelines, and substantial capital. Engineering was the constraint. Infrastructure was the constraint. Expertise was the constraint. That constraint justified venture scale returns.

AI is dissolving that constraint, not all at once, and not uniformly across every domain, but steadily and in ways that are already measurable.

This is not a cliff. It is a slope.

The companies founded today still face real execution challenges. The ones founded three years from now will face fewer. The ones founded ten years from now will operate in an environment where the cost of building sophisticated systems is a fraction of what it is today. We are in the early middle of this shift, not at the end of it. That matters because the temptation is to look at current valuations, current outcomes, and current M&A multiples and conclude that nothing has changed. Something has changed. It is just moving at the pace of markets and human institutions, not at the pace of model releases.

The Repricing of Expertise

We are watching a repricing of expertise, a slow one, with uneven edges.

Not at the foundational layer. Paradigm-shifting breakthroughs still matter. The rare intellectual leap that unlocks a new architecture or a new computational primitive remains valuable and durable. But most companies are not those breakthroughs. Most companies sit on top of them.

I have written before about how AI is repricing skill at the individual level, injecting liquidity into what was once a slow-moving market for technical expertise. What is happening at the venture level is the same dynamic playing out across entire product categories. When fifty startups can build near-equivalent products in twelve months, product differentiation compresses. Expertise becomes assisted. Execution becomes accelerated. Barriers to entry fall.

It is worth being direct about what that means. AI does not just flatten products. It flattens people. The scarcity that once justified premium human expertise, the advisor with the rare insight, the consultant who had seen this problem before, is narrowing. That edge does not disappear, but it compresses fast unless the expertise is embedded in distribution, in relationships and customer context that cannot be replicated from a prompt.

There is an important exception. In data-rich verticals, proprietary datasets create compounding advantages that AI amplifies rather than erodes. Healthcare, finance, legal, infrastructure – in these markets the data is not just an asset, it is a moat that gets stronger as it grows. AI makes that data more useful, not less defensible. The dynamic in these verticals is different. The scarcity is not building capability or even distribution in the generic sense. It is the data itself, and the domain-specific judgment required to use it correctly. This connects to a broader point worth sitting with: when you rent the capability layer, you rent the moat. In AI-native verticals, whoever owns the model behavior owns the product – and that is a different kind of lock-in than anything cloud computing created.

The result is predictable. A wave of companies will launch in every attractive AI-adjacent category. Many will grow quickly. Many will look venture-scale in their first 24 to 36 months. Most will not become venture-scale businesses.

They will explode and then flatten.

Not because they were poorly run. Not because the founders lacked talent. But because it became too inexpensive to create what they created. The winner-take-most dynamic compresses margins and growth for everyone except the few that secure durable control.

Cheap building creates crowded categories. Crowded categories destroy the middle of the return distribution.

The venture math here deserves to be stated plainly. Cheap building means more competitors. More competitors cap market power. Capped market power caps exit multiples. In a crowded AI category where any competent team can replicate the core product, the venture model itself compresses. Not because the market is small, but because structural dominance becomes harder to achieve and sustain. Many of these companies are structurally unlikely to become venture-scale businesses. The category economics will not support multiple large players once replication costs collapse, and most founders do not have the distribution infrastructure to be the one that survives. Asymmetric outcomes remain possible. They are just harder to achieve and harder to sustain in categories where the product itself can be reproduced quickly.

What This Does to Venture Capital

This has structural consequences for venture capital, though they will play out over years, not quarters.

If building is cheap and competition is abundant, returns concentrate harder and faster. You get more rockets. Fewer reach orbit.

Investors will demand signal sooner. Growth becomes the proxy for distribution dominance. Capital is deployed to test whether the company can win quickly, not whether it can build elegantly. The tolerance for long, patient build cycles without distribution proof shrinks. Capital releases in stages tied to evidence of emerging control.

This is reshaping round structure too. When building is cheap, large upfront rounds are harder to justify – you no longer need $20M to construct the product. Seed rounds compress because the build cost does not warrant more. But growth rounds are becoming larger and more heavily tranched, with capital tied to distribution milestones rather than product ones. Channel proof. Embedded customer cohorts. Pipeline velocity. The structure of the round starts to reflect the new scarcity. Capital flows in proportion to what is actually hard, and what is actually hard is no longer building the thing.

The traditional power-law model assumed a long tail of moderate outcomes. In a world of rapid replication, the moderate outcome becomes harder to sustain.

Meanwhile, IPO pathways have narrowed. The regulatory intent was investor protection. The outcome was exclusion. By making it harder for companies to go public early, regulators locked retail investors out of the steepest part of the value curve, the years when a company moves from promising to dominant. Secondary markets expanded to fill the gap, but access to those markets is not democratic. Private capital captures what public markets used to offer to a broader population. Venture starts to look less like broad-based growth capital and more like concentrated private allocation, closer to family offices, less like 1990s expansion funds. AI will likely accelerate that dynamic. The companies creating the most value will stay private longer, and the people with access to them will be a narrower group than before.

Selectivity increases. Portfolio sizes shrink or become more strategically concentrated. The “grow at all costs, you’ll get more later” model becomes harder to justify when many fast-growing companies are structurally incapable of sustaining dominance. Capital no longer buys uniqueness. It buys speed – the time and resources to build a distribution funnel, execute against it, and reach durable entrenchment before a competitor replicates the product and races to the same buyers.

Built for Acquisition, But It Is Not a Spreadsheet Decision

There is another dynamic that becomes more visible in this environment. Some startups are designed not to become category winners, but to slot perfectly into one specific incumbent. Not strategic fit in the abstract sense. Deliberate adjacency to a single buyer. The product is built to complete a portfolio gap. The roadmap mirrors a specific weakness in a specific acquirer’s product line. Some founders are not optimizing for market dominance. They are optimizing for perfect adjacency to one buyer, and shaping every decision around what makes that buyer say yes.

This is not new. But the calculus around it is shifting.

When technology is easier to replicate, the premium for strategic fit increases relative to the premium for raw IP. At the same time, the value of acquiring technology alone diminishes. If a product can be rebuilt internally in 12 to 18 months, the acquisition multiple compresses. The technology becomes a starting point for an internal conversation, not a reason to write a check.

What remains valuable in M&A is harder to replicate. Embedded distribution. Contractual entrenchment. Regulatory positioning. Customer relationships. Data gravity.

In regulated verticals, this goes further. A company that has already navigated the compliance requirements to operate in a market – secured the certifications, built the audit trails, established the regulatory relationships – has compressed years of a buyer’s time to market into something acquirable. Compliance readiness is not a cost center. It is a distribution accelerator. Vertical access and compliance readiness are part of the distribution story, not separate from it. For an acquirer trying to enter a regulated market, the fastest path is often not to build the product. It is to buy the company that already has permission to operate. That shifts what gets priced into an acquisition and why some targets command premiums that pure technology analysis cannot explain.

Technology without distribution is just an expensive prototype.

But what gets lost in that clean analysis is that acquisition decisions are not made by spreadsheets. They are made by people, in rooms, often under time pressure, with incomplete information and competing organizational interests.

A founder who has built real relationships inside a strategic buyer has a fundamentally different acquisition outcome than one who has not, even if the products are comparable. The internal champion who has watched you execute, who trusts your judgment, who has gone to bat for you in internal budget conversations, is not a nice-to-have. They are often the reason a deal happens at all.

Perception compounds this. Acquirers pay for confidence as much as capability. A company perceived as the category leader, even in a crowded category, commands a premium that may not be fully justified by its metrics. Market positioning, analyst coverage, conference presence, and the quality of your reference customers all shape the narrative in an acquirer’s boardroom. The story they can tell internally about why they did this deal matters enormously. Acquisitions have to survive internal politics.

Timing is almost never purely rational either. Companies get acquired when a buyer is scared, or ambitious, or has capital to deploy, or is about to lose a competitive advantage they can feel slipping. Being visible and credible at that moment, not just when you need a buyer, is what closes deals.

None of this means product and metrics do not matter. They do. But they matter as the floor. Above the floor, acquisition outcomes are determined by relationships, reputation, and the story someone is willing to tell on your behalf inside an organization that does not know you.

The Irony of Automating Your Own Moat

Customer management is one of the domains AI is aggressively trying to automate. AI SDRs. AI account managers. Synthetic personalization. Automated follow-up. Generated relationship intelligence.

In a world where distribution is the scarce resource and relationships drive acquisition outcomes, the industry is racing to replace human relationship infrastructure with synthetic substitutes.

This is not irrational. Automation increases efficiency. Most sales and account management processes have enormous amounts of low-value activity that could and should be automated.

But in high-value markets, buyers are not just purchasing functionality. They are purchasing risk reduction. They are purchasing accountability. They are purchasing confidence. And confidence is built through consistent human judgment over time, through the accumulation of trust that comes from someone showing up, delivering, and being present when things go wrong.

There is a related dynamic at the talent level. I have written about how AI is eliminating the on-ramp for early-career engineers, absorbing the low-context work that once let junior developers accumulate the judgment and institutional knowledge that makes senior engineers valuable. The same problem applies to the people who build enterprise relationships. The craft of reading a room, navigating a stalled deal, and managing a difficult renewal compounds over years of real exposure. Automating the entry-level work in sales and customer success is not just an efficiency play. It shapes who gets the chance to develop the judgment the role ultimately requires.

Assistive automation increases efficiency. Primary automation risks eroding the very thing that becomes the last defensible moat.

The counterargument is that AI can also accelerate distribution itself. Faster outreach. Better targeting. Smarter personalization at scale. That is true as far as it goes. But it confuses distribution tactics with distribution durability. AI can help you reach more people faster. It cannot manufacture the trust that makes them stay, the embeddedness that makes switching costly, or the relationship capital that makes an acquirer’s internal champion go to bat for you. Speed without stickiness is just faster noise.

In a world saturated with synthetic output, authentic relationships appreciate in value. The companies that understand this distinction, between automating the low-value repetitive work and preserving the high-value human judgment, will have a structural advantage over those that optimize purely for efficiency.

Forward-deployed engineers become strategic assets. Customer success becomes competitive infrastructure. Enterprise sales become durable leverage.

This will not be obvious in year one. It will be obvious in year five.

Overgrowth Risk

Cheap building combined with abundant capital creates another problem. When capital is deployed to chase an early signal, companies scale headcount and burn before structural dominance is secured. If they are not the winner in their category, they are left with a cost structure built for orbit and a trajectory that never left the atmosphere.

They grew too fast for a market that would not support multiple large players.

This risk increases when categories are crowded and replication is easy. AI does not eliminate business fundamentals. It amplifies their consequences.

The Structural Shift

The AI era does not eliminate venture capital, entrepreneurship, or breakthrough innovation.

It shifts the locus of scarcity, gradually, unevenly, and irreversibly.

Foundational intellectual leaps remain rare and valuable. But most startups are not foundational leaps. When building was expensive, builders won. When building becomes cheap, distribution becomes destiny.

This transition is already underway. It is not complete. The companies founded in the next few years will discover its contours the hard way, either because they adapted early or because they did not.

The founders who understand what is happening will optimize differently. They will invest in buyer access before polishing perfection. They will treat relationships as infrastructure. They will see funnel design as a core product, not a marketing afterthought. They will build the internal champions inside their strategic targets before they need them.

And they will move fast on all of it. When building is cheap, the window to establish distribution before a competitor replicates the product is shorter than it has ever been. Timing has always mattered in startups. In this environment, it compounds differently – being six months earlier into a key account, a channel partnership, or a strategic relationship can be the difference between owning the category and being one of the many that flattened. Speed used to be about shipping. Now it is about embedding.

The VCs who understand it will underwrite differently. They have always asked whether the product is impressive and whether the founders are domain experts worth betting on. Those questions do not go away. But distribution used to be a problem you could punt on, something a strong team would figure out in year two or three. That tolerance is shrinking. Investors will put more weight on whether the company already has a credible path to controlling the channel, and be less willing to assume it will materialize later.

Because in a world where fifty companies can build the same thing, the only one that matters is the one that owns the channel and has convinced someone on the inside that betting on them was the right call.

Technology used to be the moat.

Now the moat is access. And access is built by people, over time, in ways that are harder to automate than we would like to admit.

Domain Control Validation Grew Up. It Only Took Thirty Years.

Let’s Encrypt announced DNS-PERSIST-01 support this week. That is worth noting on its own. But the announcement landed in a way that made me want to trace the longer arc, because what DNS-PERSIST-01 represents is not just a new ACME method. It is the last piece of a transition that took the ecosystem roughly three decades to complete.

That transition was simple in concept and genuinely hard in practice. Stop guessing who answers the phone and start proving who controls the namespace.

What “domain control validation” actually meant in the early days

If you were issuing or auditing certificates in the early web era, domain control validation was less a cryptographic proof than an act of institutional faith. The certificate authority (CA) would send a challenge to webmaster@, admin@, or hostmaster@ at the subject domain, or sometimes look up a fax number in WHOIS and send something there. If a human responded, the certificate got issued.

The model made a bet, a bet that there was a stable, security-relevant human role behind each domain, reachable through a stable channel, and that the person on the other end was both authorized and paying attention.

That bet was always shakier than it looked. What actually happened over time was that the alias went to a ticketing system, or an outsourcer, or a shared mailbox that someone forgot to audit, or just the wrong person entirely. The certificate still got issued. The CA had checked the box. No one had actually verified control of anything.

The worst failures in this period were not exotic cryptographic breaks. They were governance failures and operational drift. The “webmaster takeover” class of problem. The role stopped being real long before the method stopped being allowed. The Baseline Requirements, the industry rules governing what certificate authorities are allowed to do, carried these validation approaches forward because nobody volunteers to own the deprecation, and someone always depends on the thing you want to kill. SC-080 and SC-090 are essentially the CA/Browser Forum (CABF) writing down, in balloted form, what practitioners had already known for years: being reachable at a business address does not demonstrate domain control.

The thing that made the real fix possible

It is easy to look at ACME, the protocol that powers automated certificate issuance, and treat it as a purely technical improvement. It was. But the reason it became viable as a default assumption had as much to do with deployment reality as with protocol design.

In 2014, roughly 30% of web traffic was HTTPS. Mozilla telemetry puts it above 80% globally by late 2024, with North America around 97%. Chrome’s numbers show the same shape, climbing from the low 30s in 2015 to 95-99% by 2020 and plateauing since.

That matters because ACME’s endpoint-based methods depend on actually reaching the endpoint. HTTP-01 proves control by serving a CA-issued token, bound to the ACME account key, over HTTP at a well-known path on port 80. TLS-ALPN-01 proves control by completing a TLS handshake on port 443 using a dedicated protocol extension and a special validation certificate, with no HTTP handling required. That distinction matters in practice; TLS-ALPN-01 exists specifically for hosting providers, CDNs, and TLS-terminating load balancers who want to validate at the TLS layer without routing validation traffic through to their backends. If port 80 is blocked or you are terminating TLS before HTTP ever reaches your application, TLS-ALPN-01 is the right tool. If you have a publicly reachable web server and port 80 is open, HTTP-01 is simpler.
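A minimal sketch of what HTTP-01 actually serves, assuming the RFC 8555 and RFC 7638 definitions: the response body is a key authorization, meaning the CA's token joined to a SHA-256 thumbprint of the account's public key. The JWK values and the token below are made-up placeholders, and the helper names are illustrative rather than taken from any ACME client.

```python
import base64
import hashlib
import json


def b64url(data: bytes) -> str:
    """Base64url without padding, the encoding ACME uses for binary values."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def jwk_thumbprint(jwk: dict) -> str:
    """RFC 7638 thumbprint: hash only the required members, sorted, no whitespace."""
    required = {"EC": ("crv", "kty", "x", "y"), "RSA": ("e", "kty", "n")}[jwk["kty"]]
    canonical = json.dumps(
        {name: jwk[name] for name in required},
        sort_keys=True,
        separators=(",", ":"),
    )
    return b64url(hashlib.sha256(canonical.encode("utf-8")).digest())


def key_authorization(token: str, account_jwk: dict) -> str:
    """Token from the CA, a dot, then the account key thumbprint."""
    return f"{token}.{jwk_thumbprint(account_jwk)}"


def http01_resource(token: str, account_jwk: dict) -> tuple[str, str]:
    """Return the (path, body) pair the web server must answer with on port 80."""
    return f"/.well-known/acme-challenge/{token}", key_authorization(token, account_jwk)


# Placeholder account key and token, for illustration only.
EXAMPLE_JWK = {"kty": "EC", "crv": "P-256", "x": "placeholder-x", "y": "placeholder-y"}
path, body = http01_resource("example-token", EXAMPLE_JWK)
```

Nothing in that body is secret, and the channel it travels over is plain HTTP. The proof is only that whoever holds the account key can also place content at that path.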

Both are bootstrap proofs: you can establish domain control without DNS write automation, which matters for the long tail of deployments where DNS is locked down or outsourced in ways that make safe automation difficult. In 2014, assuming you could reach a public endpoint was optimistic. By 2024, the population of sites that cannot serve a response over HTTP or TLS is small enough to be the exception. The web converged on HTTPS fast enough that endpoint-based validation became the reasonable default.

HTTP-01 is also, almost certainly, the last insecure-by-design method that will survive long term, and it will survive for structural rather than technical reasons. There is a bootstrap problem – TLS-ALPN-01 requires that TLS already be deployed and configurable at the edge, but if you are getting a certificate because you do not yet have TLS, you cannot use TLS-ALPN-01 to get it. HTTP-01 is how you break out of that loop. More durable than the bootstrap problem, though, is the org chart problem. In large organizations, the web team controls the servers, the network team owns port policies, the DNS team owns the zone, and security owns the TLS infrastructure decisions. None of them individually has the full set of permissions to deploy any other method without coordination. But the web team can serve a token file over port 80 without asking anyone. HTTP-01 wins by default, not because it is the right answer, but because it is the answer that requires the fewest cross-team conversations. That dynamic is unlikely to change, which means HTTP-01 will probably remain the method of last resort indefinitely, insecure channel and all.

DNS-01, and why scale broke it

DNS-01 changed the question from “who answers this email” to “who can write to this DNS zone.” That is a meaningfully better question. DNS is not a signal that you control the domain. It is the domain.

The operational reality, though, is that DNS automation means DNS API credentials distributed across issuance pipelines, renewal workflows, and whatever tooling you are running at the edge. At modest scale that is manageable. At high volume, across large platforms, IoT deployments, and multi-tenant environments, the recurring DNS write per renewal starts to look like both a performance constraint and a credential sprawl problem.

The CNAME delegation pattern that became common was a partial answer: point _acme-challenge.<domain> at a zone you control more tightly, and do the proof there. It worked. It also created a new problem: multiple independent solvers fighting over a shared label.
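To make the moving parts concrete, here is a rough sketch of the DNS-01 record and the delegation pattern, assuming the RFC 8555 definition of the TXT value (the base64url-encoded SHA-256 digest of the key authorization). The validation zone name and the way the delegation target is built are hypothetical choices for illustration.

```python
import base64
import hashlib


def dns01_txt_value(key_authorization: str) -> str:
    """TXT value for DNS-01: base64url(SHA-256(key authorization)), unpadded."""
    digest = hashlib.sha256(key_authorization.encode("ascii")).digest()
    return base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")


def challenge_name(domain: str) -> str:
    """Wildcard orders validate at the base name, so strip a leading '*.'."""
    base = domain[2:] if domain.startswith("*.") else domain
    return f"_acme-challenge.{base}"


def delegated_records(domain: str, validation_zone: str, key_authorization: str) -> list[str]:
    """Zone-file style lines for the CNAME delegation pattern."""
    name = challenge_name(domain)
    target = f"{name}.{validation_zone}"  # hypothetical naming scheme
    return [
        f"{name}. CNAME {target}.",  # written once, in the production zone
        f'{target}. TXT "{dns01_txt_value(key_authorization)}"',  # rewritten per order
    ]


records = delegated_records("*.example.com", "validation.example.net", "token.thumbprint")
```

The CNAME is static, but every solver delegated this way lands on the same `_acme-challenge` label, which is where the collisions come from.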

DNS-ACCOUNT-01, which solved the CNAME collision and nothing else

DNS-ACCOUNT-01 exists to solve that specific problem. By scoping the validation label to the ACME account rather than leaving it shared, multiple delegated pipelines can coexist without colliding. Two independent issuance systems, two different cloud providers, parallel solvers during a migration. They all get their own label and can run without coordinating.

It is intentionally narrow. It does not change the underlying rhythm of fresh proof per issuance. The label is persistent, the proof is still ephemeral. A new token per order, a new DNS write per renewal. The change is only where the proof lives, so delegation can scale cleanly. DNS churn remains, because that was not the problem DNS-ACCOUNT-01 was trying to solve.
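A sketch of the account-scoping idea, with loud caveats: the derivation below (SHA-256 of the account URL, first 10 bytes, base32, lowercased, prefixed with an underscore) is one reading of the draft and should be treated as an assumption, not the normative algorithm. The account URLs are placeholders.

```python
import base64
import hashlib


def account_label(account_url: str) -> str:
    """Derive a per-account DNS label (assumed derivation, check the current draft)."""
    digest = hashlib.sha256(account_url.encode("utf-8")).digest()
    # 10 bytes is 80 bits, which base32 encodes as exactly 16 characters, no padding.
    return "_" + base64.b32encode(digest[:10]).decode("ascii").lower()


def account_challenge_name(account_url: str, domain: str) -> str:
    """Each ACME account gets its own validation name under the same domain."""
    return f"{account_label(account_url)}._acme-challenge.{domain}"


# Two hypothetical pipelines, each with its own ACME account.
pipeline_a = account_challenge_name("https://ca.example/acme/acct/1", "example.com")
pipeline_b = account_challenge_name("https://ca.example/acme/acct/2", "example.com")
```

The two pipelines end up with different names under the same domain, so neither can overwrite the other's TXT record. What they write there is still a fresh value per order.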

In hindsight, that narrowness reflects the world it was designed for. Certificate validity was still measured in years, then in 398 days. Renewals were infrequent enough that requiring fresh DNS proof per issuance was a manageable cost. The credential distribution problem existed, but it was not yet acute. If DNS-ACCOUNT-01 had been designed in a world where certificates expire every 47 days, which is where the CABF is now taking us, it almost certainly would have looked a lot more like DNS-PERSIST-01 from the start. That is not a criticism. You cannot see the 47-day problem from inside a 398-day world.

DNS-PERSIST-01, which the short-validity world actually requires

The CA/Browser Forum’s ongoing push to shorten maximum certificate validity, from years down to 398 days and now trending toward 47, makes the recurring-proof model increasingly painful for everyone, not just operators running at high volume. At 398-day validity, a DNS write per renewal is a minor operational cost. At 47 days, you are writing to DNS eight times a year per certificate, across every certificate in your fleet, with API credentials that have to live somewhere in that pipeline. That is not a scaling problem. That is a design problem.
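The arithmetic is worth spelling out. This is a back-of-the-envelope sketch, assuming one DNS write per renewal; the `renew_at_fraction` parameter and the fleet size are illustrative assumptions, since real clients renew some time before expiry.

```python
import math


def dns_writes_per_year(validity_days: int, renew_at_fraction: float = 1.0) -> int:
    """Renewals (and so DNS writes) needed to keep one certificate valid for a year.

    renew_at_fraction=1.0 renews at the last possible moment, which gives a lower bound.
    """
    interval_days = validity_days * renew_at_fraction
    return math.ceil(365 / interval_days)


def fleet_writes_per_year(certificates: int, validity_days: int,
                          renew_at_fraction: float = 1.0) -> int:
    """Total DNS writes per year across a fleet of certificates."""
    return certificates * dns_writes_per_year(validity_days, renew_at_fraction)


writes_398 = dns_writes_per_year(398)                 # 1
writes_47 = dns_writes_per_year(47)                   # 8
writes_47_early = dns_writes_per_year(47, 2 / 3)      # 12
fleet_47 = fleet_writes_per_year(10_000, 47)          # 80000
```

Eight is the floor. A client that renews two-thirds of the way through a 47-day lifetime is closer to twelve writes per certificate per year.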

The more important point is that DNS-PERSIST-01 is simply the better tool for anyone who has DNS access and a CA that supports it, regardless of volume. It subsumes what DNS-01 and DNS-ACCOUNT-01 each solve – the CNAME collision problem goes away because each account’s standing authorization is already scoped, and the credential churn problem goes away because there is no recurring write.

The useful analogy here is passwords versus passkeys. Passwords require you to re-prove the secret on every authentication. Passkeys establish a cryptographic binding once and derive proof from it. Every DNS-based ACME method before DNS-PERSIST-01 worked like a password: prove control again, on this order, right now. DNS-PERSIST-01 works like a passkey; the binding is established, scoped, and cryptographically tied to your ACME account key. You do not re-prove the same thing on every renewal. You prove you still hold the key.

Instead of proving control on every renewal cycle, you establish a standing authorization record bound to your ACME account and the CA. Set it once. Reuse it across renewals. The CABF formalized this direction in SC-088v3, which added the DNS TXT Record with Persistent Value method to the BRs.

This is not a shortcut. The standing authorization is scoped, can carry expiration, and is explicitly tied to an ACME account key. The attack surface moves from the repeated DNS transaction to the account key itself, which is the right place for it. That is why Let’s Encrypt is being deliberate: Pebble (the reference ACME test server) support is in place, client support is in progress, and the staged rollout is planned for 2026. The scope controls around wildcard policy and authorization lifetime are part of the design, not afterthoughts.
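The scoping is easier to see as a check a CA might run. This is a conceptual sketch only: the `StandingAuthorization` type and its field names are hypothetical, they do not reproduce the record syntax in the draft or in SC-088v3, and the sketch covers only the exact name and its wildcard.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class StandingAuthorization:
    """Hypothetical in-memory view of a persistent validation record."""
    domain: str                        # name whose zone holds the record
    issuer: str                        # CA the record authorizes
    account_uri: str                   # ACME account the record is bound to
    allow_wildcard: bool = False       # wildcard policy carried by the record
    expires_at: Optional[int] = None   # Unix seconds; None means no expiry set


def authorizes(record: StandingAuthorization, *, issuer: str, account_uri: str,
               requested_name: str, now: int) -> bool:
    """Return True only if every scope on the standing record is satisfied."""
    if record.issuer != issuer or record.account_uri != account_uri:
        return False  # bound to a different CA or a different account
    if record.expires_at is not None and now >= record.expires_at:
        return False  # the standing authorization has lapsed
    if requested_name == record.domain:
        return True
    if requested_name == f"*.{record.domain}":
        return record.allow_wildcard
    return False


record = StandingAuthorization(
    domain="example.com",
    issuer="ca.example",
    account_uri="https://ca.example/acme/acct/1",
    expires_at=2_000_000_000,
)
```

Nothing in the check involves a fresh DNS write. What changes from renewal to renewal is only whether the requester can still prove possession of the account key named in the record.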

What it eliminates is the recurring DNS write requirement that turned high-volume issuance into a credential distribution problem. In a world trending toward 47-day certificates, that is not a nice-to-have. It is the method that makes the new validity regime operationally survivable for anyone running at real scale.

What actually changed

The webmaster era died because the webmaster role died. The person who answered webmaster@ in 1995 was plausibly the person responsible for the domain. By 2010, that alias might go anywhere. By 2020, it was a cassette tape. Technically still a format, functionally forgotten.

This is the same pattern that gave us a decade of SIM-swapping attacks. SMS was a convenient channel, so the industry conscripted it into an authentication role it was never designed for, and held it there long after the threat model had outgrown the assumption. Nobody decided email-to-webmaster or SMS were the right security primitives for what they were being asked to do. They were just there, they mostly worked, and changing them had cost. The failures were predictable in retrospect and ignored in practice until the losses became undeniable.

The ACME methods work because they measure what they claim to measure. HTTP-01 proves you can respond at the endpoint. DNS-01 proves you can write to the zone. TLS-ALPN-01 proves you can complete a handshake. Technical controls, not institutional proxies.

DNS-PERSIST-01 is the mature form of that idea, a standing proof of control that does not require re-proving the same thing every 90 days at the cost of DNS churn and credential distribution. It is also the method that answers the question the old system was never actually asking. The old system broke because standing assumptions about institutional stability turned out not to hold. The new system makes the standing assumption explicit, scoped, bound to a cryptographic identity, and revocable.

That is not the same mistake. That is the lesson applied.

Start here when choosing a method. If you cannot touch DNS and port 80 is open, HTTP-01 is the simplest path. If port 80 is blocked or you are terminating TLS before HTTP reaches your application, TLS-ALPN-01 validates at the TLS layer without touching HTTP handling. If you need wildcard coverage or your edge is not publicly reachable at all, DNS-01 is the right tool. If you are running multiple independent pipelines against the same domain and CNAME delegation is creating label collisions, DNS-ACCOUNT-01 solves that without changing anything else. And if you are renewing at volume in a world trending toward 47-day validity, DNS-PERSIST-01 is the method that does not eventually break you, not because the others are wrong, but because repeated proof per renewal was designed for a renewal cadence that no longer exists.
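The paragraph above reads naturally as a small decision function. The following is one possible encoding of it, not a definitive rule set: the `Deployment` fields are hypothetical names for the conditions described, and the order of the checks is an interpretation.

```python
from dataclasses import dataclass


@dataclass
class Deployment:
    """Hypothetical description of the environment requesting a certificate."""
    can_write_dns: bool = False
    port_80_open: bool = True
    tls_terminated_before_http: bool = False
    publicly_reachable: bool = True
    needs_wildcard: bool = False
    colliding_pipelines: bool = False        # CNAME delegation label collisions
    short_validity_at_volume: bool = False   # renewing at volume, 47-day world
    ca_supports_persist: bool = False


def choose_method(d: Deployment) -> str:
    """Pick an ACME validation method following the prose decision tree."""
    if d.can_write_dns:
        if d.short_validity_at_volume and d.ca_supports_persist:
            return "DNS-PERSIST-01"
        if d.colliding_pipelines:
            return "DNS-ACCOUNT-01"
        if d.needs_wildcard or not d.publicly_reachable:
            return "DNS-01"
    elif d.needs_wildcard or not d.publicly_reachable:
        # Endpoint methods cannot cover wildcards or unreachable edges.
        raise ValueError("this deployment needs a DNS-based method")
    if not d.port_80_open or d.tls_terminated_before_http:
        return "TLS-ALPN-01"
    return "HTTP-01"


default_choice = choose_method(Deployment())
```

With every field left at its default, a reachable server with port 80 open and no DNS access, the function returns HTTP-01, the same place the org chart tends to land.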

In practice, large organizations often find themselves in a catch-22 that makes the decision for them. TLS-ALPN-01 requires TLS to already be deployed and configurable at the edge, but you need the certificate to deploy TLS in the first place. DNS-01 requires writing to the zone, but DNS is owned by a different team, and the change process takes weeks. DNS-PERSIST-01 requires standing up ACME account management, but that is a security infrastructure decision that needs approval. Meanwhile, the web team controls the servers and can serve a token file over port 80 today. So HTTP-01 it is, not because anyone evaluated the options and chose it, but because it was the only method where a single team had all the permissions needed to complete validation without a cross-functional project. The decision tree above describes the technically correct path. The org chart usually picks a different one.

Like most security improvements, the arc from fax-based DCV to persistent cryptographic authorization took longer than it should have. The gap between knowing something is broken and replacing it is always larger than it looks from the outside. But the trajectory is now clear: domain control validation means proving control, not guessing at it.

Disdain or Design?

Washington State is not in the middle of a single policy dispute.

It is moving through a structural sequence that has been building for nearly a century, and the density of constraint-adjacent actions has increased.

The millionaire income tax proposed in 2026, effective 2028 if enacted. The capital gains excise enacted in 2021 and upheld in 2023. The surcharge added to that excise in 2025. Pending legislation that would decouple Washington from federal Qualified Small Business Stock treatment for the first time. The repeated invalidation of voter initiatives on procedural grounds across two decades. The legislative adoption of initiatives followed by amendment within the same cycle. Dozens of emergency clauses attached to fiscal legislation in non-crisis sessions. Bills introduced to raise the cost of signature gathering. A parental rights initiative adopted unanimously by the legislature, modified the following year, and now the subject of a new citizen petition to restore it.

Individually, each of these events can be defended as constitutionally permissible. The question is not whether each action is likely legal. They probably are. The question is whether the aggregate reflects normal constitutional evolution or a consistent pattern in which available institutional tools have reduced the practical force of voter constraint while preserving formal legality.

Two patterns in particular repeat with enough consistency to warrant examination on their own terms. Voters approved caps on vehicle license fees three times across two decades. None held. Voters rejected graduated income taxation through the constitutional amendment process ten times across generations. Each time, new approaches emerged that produced similar policy outcomes while fitting within (or reclassifying) the existing constraint. These are not isolated grievances. They are the same structure applied to the same category of voter preference, across different subject areas and different decades.

Think of it this way: the user interface of democracy is still intact. The buttons are there: the ballot measure process, the referendum window, the initiative pathway. They look functional. What this piece examines is whether the backend they connect to has been rewired.

A benign model exists. Initiatives are procedurally brittle; courts enforce guardrails that exist for good reasons; emergency clauses are constitutionally authorized; legislatures must govern. On this reading, each outcome here reflects institutions functioning as designed. This piece argues that the density and directional consistency of these outcomes now exceeds what that model predicts. Normal constitutional operation produces friction in both directions. What the record below shows runs consistently one way.

This is not a claim of conspiracy or bad faith. It is a claim about system behavior. No single actor controls this. The pattern emerges from how available tools interact across courts, legislature, executive, and agencies, each operating within its own institutional logic. When multiple lawful mechanisms consistently reduce the practical force of voter constraint, legitimacy can erode even if every individual step is constitutionally defensible.

That question cannot be answered by looking at any single event. It requires watching the sequence play out over time, seeing which direction it moves, and asking whether the velocity and consistency of that direction tells you something the individual events do not. The examples that follow are not offered as an exhaustive dataset. They are offered as a representative sample of a pattern: documented, sequenced, and structurally related, not the only instances that exist.

Foundation

The Floor That Held for Ninety Years

Washington’s Constitution, ratified in 1889, imposed strict requirements on property taxation through Article VII. Section 1 mandates uniformity within property classes. Section 2 caps property tax levies. These were not minor provisions. They were structural commitments about who controls the revenue system and under what constraints.

In 1932, voters approved Initiative 69, enacting a graduated income tax.

In 1933, the Washington Supreme Court invalidated it in Culliton v. Chase, 174 Wash. 363, 25 P.2d 81 (1933). The Court held that income constitutes property under Article VII. Because property taxes must be uniform and capped, a graduated income tax violated the Constitution without amendment. The ruling was categorical and the doctrinal foundation it established was durable: income equals property; property taxes must be uniform; graduated income taxes are unconstitutional absent constitutional change.

What followed was a pattern of voter resistance that spans generations. Ballot measures to amend the Constitution and authorize a graduated income tax failed in 1936, 1938, 1940, 1942, 1944, 1970, 1973, 1975, 1982, and 2010. Ten attempts, none successful. That is not ambiguity about voter preference. That is a structural signal.

For nearly ninety years, the Culliton baseline held.

That baseline was the structural floor. Voters had set it, courts had confirmed it, and ten subsequent attempts to change it through the amendment process had each failed. The constraint was real and it was tested.

What follows is what happened as institutions pursued the same policy goals within the constraint set that voters had repeatedly declined to change.

The Procedural Record, 1999–2015

The Car Tabs: Three Votes, Zero Results

The late 1990s introduced a different kind of friction: voter-approved initiatives colliding with judicial enforcement of procedural rules.

In 1999, Initiative 695 passed with 56 percent approval. It capped vehicle license fees at $30. The Washington Supreme Court struck it down in 2000 for violating the single-subject rule and for a defective ballot title. Amalgamated Transit Union Local 587 v. State, 142 Wn.2d 183, 11 P.3d 762 (2000).

In 2002, Initiative 776 passed with 51 percent approval, again targeting vehicle license fees. The Court upheld core provisions but preserved preexisting Sound Transit taxes, limiting the measure’s practical reach. Pierce County v. State, 159 Wn.2d 16, 150 P.3d 86 (2006).

In 2019, Initiative 976 passed with 53 percent approval, once more capping car tabs. The Supreme Court invalidated it unanimously in 2020, citing single-subject violations and misleading ballot language. Garfield County Transportation Authority v. State, 196 Wn.2d 814, 479 P.3d 1169 (2020).

From a judicial standpoint, these rulings enforced procedural safeguards embedded in Article II, Section 19. They are defensible on those grounds.

From a voter standpoint, materially similar policy outcomes were approved by majorities three times across two decades. None endured.

Durability is not owed. But repeated invalidation of the same voter preference, for reasons that feel technical to laypeople, creates a predictable legitimacy gap. Neither interpretation is irrational. But the perception gap (between what voters believe they decided and what institutional processes allowed to persist) became part of Washington’s governance environment. That gap does not close simply because the legal analysis is correct. It accumulates.

The car tabs sequence established one pattern. The same years produced another.

Property Taxes: The Pattern Extends

In 2000, Initiative 722 passed with 57 percent approval, establishing a two percent limit on property tax levy increases. The Supreme Court struck it down in 2001 for embodying unrelated subjects in violation of Article II, Section 19. City of Burien v. Kiga, 144 Wn.2d 819, 31 P.3d 659 (2001).

In 2001, Initiative 747 passed with 58 percent approval, reducing the general limit on property tax levy increases from six percent to one percent. The Supreme Court invalidated it in 2007 for violating Article II, Section 37, which requires amendatory laws to set forth the amended law at full length. Washington Citizens Action of Washington v. State, 162 Wn.2d 142, 171 P.3d 486 (2007).

These rulings applied established constitutional provisions. Yet they represent another instance where voter-approved fiscal constraints were nullified on procedural grounds, extending the pattern to property taxation.

Judicial review was not the only institutional lever available.

The Executive Lever

In Washington State Grange v. Locke, 153 Wn.2d 475, 105 P.3d 9 (2005), the Supreme Court upheld Governor Locke’s veto of sections within Engrossed Senate Bill 6453. That bill had enacted a “top two” primary system alongside a “Montana-style” alternative. The veto eliminated the top-two option, leaving the alternative in place amid challenges under Article III, Section 12 and Article II, Sections 19 and 38.

The Court sustained the exercise of executive discretion.

The structural effect was that a legislative choice about election architecture was altered by veto without returning the question to voters. Whether the outcome was correct as policy is separate from what it illustrates about the mechanics available to institutional actors when they want to shape structural outcomes.

The Emergency That Wasn’t

Initiative 960, approved in 2007, required supermajority legislative approval or voter ratification for tax increases and established advisory votes to give citizens a nonbinding voice on tax matters. It took effect under Laws of 2008, Chapter 1.

In 2023, the Legislature repealed the advisory vote requirement through Senate Bill 5082. The bill included an emergency clause, making it effective immediately and blocking referendum.

Article II, Section 1(b) of the Washington Constitution permits emergency clauses. Courts have granted broad deference to legislative declarations of emergency. CLEAN v. State, 133 Wn.2d 455, 928 P.2d 1054 (1997). The constitutional standard for challenging an emergency designation is high.

The structural point is not that the emergency clause was illegal. It was not.

The structural point is that a voter-enacted fiscal accountability mechanism, advisory votes, which gave citizens a nonbinding voice on legislative tax decisions, was repealed under emergency designation. There was no crisis. No disaster. No immediate fiscal collapse that required blocking the 90-day referendum window.

The emergency clause did not just accelerate the bill’s effective date. It foreclosed citizen review of the decision to remove a citizen-review mechanism.

Judicial invalidation continued alongside these new tools.

Charter Schools: The Pattern Beyond Taxes

In 2012, Initiative 1240 passed with 51 percent approval, authorizing up to forty charter schools and designating them as common schools eligible for dedicated funding under Article IX.

The Supreme Court struck down the Act in 2015, holding that charters lacked the voter-elected boards required for common schools, rendering their funding unconstitutional. League of Women Voters of Washington v. State, 184 Wn.2d 607, 355 P.3d 1131 (2015). Because common school funds were central to the Act, the court invalidated it entirely.

Another voter-approved reform nullified on constitutional grounds, adding to the accumulating record of procedural barriers overriding popular mandates.

By 2015, the pattern of post-passage judicial invalidation was well established. What developed next was different. New mechanisms emerged that did not require judicial action at all. The distance between what voters authorized and what they received could now be generated before a court ever got involved.

Sound Transit: The Gap Inside the Framework

In 2015, the Legislature authorized Sound Transit to levy an increased Motor Vehicle Excise Tax as part of the Sound Transit 3 package. Voters approved ST3 in November 2016 with 54 percent approval. The package totaled $54 billion and was funded in part through an MVET increase from 0.3 to 1.1 percent.

The valuation method for the MVET was set by the authorizing legislation. Rather than using current fair market value, Sound Transit applied a 1996 depreciation schedule based on Manufacturer’s Suggested Retail Price, not updated since the 1990s. A ten-year-old vehicle might be assessed at 85 percent of original MSRP rather than its actual depreciated worth. Annual tabs came in hundreds of dollars higher than what many voters expected when they approved ST3.

A class-action lawsuit challenged the MVET on constitutional grounds. The Washington Supreme Court upheld the tax in 2020, finding no constitutional violation. Sound Transit’s counsel stated before the Court that no fraud or deception had occurred. The Court agreed.

The structural observation is narrower than the legal ruling. The valuation methodology was a technical implementation choice embedded in the authorizing legislation. Voters approved the ST3 package as presented. The tab amounts they encountered after passage reflected valuation assumptions not prominently disclosed in ballot materials. In 2021, the Legislature passed SB 5326 phasing in Kelley Blue Book valuations, reducing average tabs by approximately 30 percent, conditioned on delaying certain ST3 capital projects to offset lost revenue.

This instance does not fit the procedural invalidation or adopt-and-amend patterns. No initiative was struck down. No emergency clause was invoked. The mechanism is different: a voter-approved framework produced outcomes materially different from voter expectations through implementation choices made in enabling legislation, upheld by courts, and partially corrected through subsequent legislation conditioned on trade-offs.

A different mechanism was operating at the same time, working not through enabling legislation but through the initiative process itself.

Adopt and Amend

Initiative 940 passed in 2018 with 59 percent approval, mandating police de-escalation training and modifying standards for use of deadly force. The Legislature adopted it directly through the initiative-to-legislature process, avoiding a ballot campaign under the two-thirds amendment threshold. It became law under Laws of 2019, Chapter 1.

Shortly thereafter, the Legislature amended it through Engrossed Second Substitute House Bill 1064, modifying liability provisions.

Legislative authority to amend statutes is unambiguous. The legal analysis is clean.

The structural pattern is what matters here. An initiative qualifies. The Legislature adopts it rather than sending it to the ballot, a choice that prevents public referendum and bypasses the two-thirds threshold for same-session amendment. The Legislature then amends it. The citizen-initiated content is altered without the public engagement the initiative process was designed to enable.

When this cycle repeats, and in Washington it has repeated, it raises a governance design question: Is the initiative-to-legislature pathway functioning as a genuine alternative route for voter expression, or as a mechanism that routes popular measures through an institutional process more permeable to subsequent modification?

New Mechanisms Emerge, 2015–2023

The Capital Gains Pivot

This section requires close attention because it is the doctrinal hinge for everything that follows.

In 2021, the Legislature enacted ESSB 5096, a 7 percent tax on long-term capital gains above $250,000. Codified at RCW 82.87, it was structured not as an income tax but as an excise tax imposed on the sale or exchange of capital assets. That framing was not accidental. The bill’s sponsors stated explicitly that the excise structure was chosen to work within Washington’s constitutional constraints on income taxation rather than challenge them directly.

A Douglas County Superior Court struck it down in 2022, treating it as an unconstitutional income tax.

In March 2023, the Washington Supreme Court reversed in Quinn v. State, 1 Wn.3d 453, 526 P.3d 25 (2023), a 7-2 decision. The Court held that the tax was imposed on a transaction, the act of sale or exchange, rather than on the ownership or receipt of income. That made it an excise tax, not a property tax subject to Article VII’s uniformity and cap requirements. The Court preserved Culliton nominally while narrowing its practical scope. The U.S. Supreme Court denied certiorari in 2024, leaving the ruling intact.

This is a doctrinal realignment.

For nearly ninety years, the rule was clear: income is property; property taxation must be uniform; graduated rates are unconstitutional. After Quinn, at least some forms of realized income, specifically long-term gains upon sale of capital assets, are classified as excise events outside that framework.

The Court’s distinction is grounded in recognized excise doctrine. Washington has long upheld excise taxes on transactions. Real estate excise taxes. Business and occupation taxes. The doctrinal parallel is not invented.

But the boundary matters enormously.

The critical question is not whether Quinn was correctly decided. It is what principle limits the excise classification going forward. The court held that the taxable incident is the act of sale or exchange. But if the legislature can define the taxable incident with sufficient granularity, what prevents increasingly ordinary economic receipts from being labeled transactional events rather than income? How much transactionality is required? What keeps “receipt” from being reframed as “event”?

Quinn did not answer that. SB 6346, which taxes the “receipt of income” rather than a discrete sale, tests whether the current court will hold the line Quinn drew or treat the excise frame as scalable to ordinary income. The boundaries remain untested.

That ambiguity is not a flaw in this analysis. It is the structural vulnerability.

This is not a theoretical framework. It is a description of what happened with capital gains, and it frames what is now being attempted with the proposed millionaire income tax.

The Acceleration, 2023–2026

The Pattern Beyond Tax Doctrine

Two additional invalidations in this period extend the pattern beyond tax doctrine.

In 2023, Spokane voters approved Proposition 1 with 75 percent approval, banning camping near schools, parks, and playgrounds. The Washington Supreme Court invalidated it in 2025, ruling that it exceeded local initiative scope under RCW 35A.11.090 and violated single-subject requirements.

In 2024, Initiative 2066 passed statewide. It aimed to preserve natural gas access by rolling back energy code changes favoring heat pumps. A King County Superior Court invalidated it in March 2025 for single-subject violations under Article II, Section 19 and failure to include full statutory text of altered laws. The case advanced to the Supreme Court in 2025 with arguments heard but no final ruling as of this writing.

Both invalidations followed recognized procedural doctrines.

Both overrode clear popular majorities.

The accumulation of instances (car tabs, Spokane Prop 1, Initiative 2066) is not evidence of conspiracy. It is evidence of a recurring structural dynamic: voter-approved measures reaching courts on procedural grounds and failing. Whether that reflects rigorous constitutional enforcement or selective application is a question courts themselves may eventually have to address as the pattern becomes harder to characterize as coincidental.

Absorb Rather Than Fight

In 2024, Initiative 2111 was filed to prohibit state and local personal income taxes, defined through federal gross income. Rather than sending it to the ballot, the Legislature adopted it directly under the initiative-to-legislature framework. It passed with bipartisan support, House 76-21, Senate 38-11, and took effect June 6, 2024.

The adoption reflected durable political reality. Income tax prohibition remains one of the few consistent cross-partisan commitments in Washington electoral history. The Legislature, by adopting rather than fighting the initiative, avoided a ballot campaign it would likely have lost.

But adoption through the initiative-to-legislature pathway does something that ballot adoption does not. It converts a citizen-initiated measure into a regular statute, amendable by simple legislative majority in future sessions. The two-thirds threshold that would otherwise shield a voter-approved measure from amendment does not apply once the session in which the initiative was adopted has ended.

SB 6346 in 2026 proposes to amend Initiative 2111 directly.

Across decades, the pattern runs in one direction. Voters signal clear opposition to income taxes, repeatedly. The Legislature adopts the initiative rather than fight it. Then the Legislature proposes to amend what it just adopted.

That sequence can be read as institutional adaptation. It can also be read as tactical sequencing.

The Parents’ Bill: Adopted, Then Amended

Initiative 2081, the “Parents’ Bill of Rights,” was certified in January 2024 with over 449,000 signatures. It enumerated fifteen rights for parents of public school children including rights to review curriculum, receive notifications about student health matters, and opt out of certain instruction. The Legislature adopted it in March 2024, House 82-15, Senate 49-0, unanimous in the upper chamber, effective June 6, 2024.

Twelve months later, HB 1296 amended it.

Signed by Governor Bob Ferguson in May 2025, the amendment eliminated prior notification requirements for certain medical services and added gender-inclusive policy provisions. A companion bill, SB 5181, modified records access provisions. HB 1296 included an emergency clause, making it effective immediately and blocking referendum on the amendment.

The pattern here is not subtle. A measure that passed 49-0 in the Senate was adopted without a ballot fight, precisely because support that broad would have made a ballot fight futile. The following session, it was amended under emergency designation, foreclosing citizen review of that amendment.

Republicans including Representative Skyler Rude described it as a bait-and-switch: adopt to prevent a ballot supermajority threshold from triggering, then amend. That characterization is politically charged, but the mechanics beneath it are not in dispute. Adoption followed by emergency-shielded amendment within one legislative cycle compresses the window in which citizens can respond through normal democratic channels.

In 2026, Initiative IL26-001 was certified with over 418,000 signatures to restore the original I-2081. The Legislature has declined hearings. If not adopted, it proceeds to the November 2026 ballot.

The cycle continues.

Excise Expansion and the QSBS Gap

After Quinn, the capital gains framework did not remain static.

In 2025, ESSB 5813 added a 2.9 percent surcharge on capital gains exceeding $1 million, effective January 1, 2025, producing a combined 9.9 percent rate on excess amounts.

The federal Qualified Small Business Stock exclusion under IRC Section 1202 allows founders and early investors to exclude up to 100 percent of gains on qualifying stock held more than five years in a C-corporation with assets under $50 million at issuance. Startups are high-risk. Section 1202 is a mechanism to make that risk economically reasonable.

Washington currently conforms to the federal QSBS exclusion by design. Because Washington’s capital gains excise begins with federal net long-term capital gain, gains excluded federally under Section 1202 never enter the Washington tax base. The Washington Department of Revenue confirms this explicitly: qualifying QSBS gains excluded from federal net long-term capital gain are not subject to Washington’s excise. A founder who qualifies for the federal exclusion pays $0 in Washington state capital gains tax on that exit.

That is the current state of the law.

Senate Bill 6229 and companion House Bill 2292, introduced in January 2026, would change this. They propose requiring taxpayers to add back Section 1202 excluded gains when calculating Washington’s capital gains excise — effectively decoupling Washington from the federal QSBS treatment for the first time. At a January 2026 hearing, startup founders and venture capitalists testified against the bills. The bills remain in committee as of this writing.

If enacted, a founder with a $2 million qualifying exit would owe $0 in additional federal tax and $151,500 in Washington state capital gains tax on the same gain. Under current law, the state tax is also $0.
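The $151,500 figure can be reproduced with a short calculation. The sketch below is one reading of the arithmetic, not the statutory computation. It assumes a 7 percent base rate (the 9.9 percent combined rate less the 2.9 percent surcharge), a $250,000 standard deduction taken against the first tier, and a $1 million surcharge threshold measured on the gain before the deduction. The deduction amount and its ordering are assumptions chosen because they reproduce the number above; the function and constant names are illustrative.

```python
# Illustrative only: the values below are assumptions, not bill text.
BASE_RATE_BPS = 700            # 7.0% base excise (9.9% combined minus the 2.9% surcharge)
COMBINED_RATE_BPS = 990        # 9.9% on amounts above the surcharge threshold
SURCHARGE_THRESHOLD = 1_000_000
STANDARD_DEDUCTION = 250_000   # assumed; not stated in this piece


def wa_excise(gain: int, qsbs_added_back: bool) -> int:
    """Estimate the state excise, in whole dollars, on a qualifying QSBS gain."""
    if not qsbs_added_back:
        # Current law: the federally excluded gain never enters the state base.
        return 0
    # Assumed ordering: the deduction reduces the lower tier only.
    lower_tier = max(min(gain, SURCHARGE_THRESHOLD) - STANDARD_DEDUCTION, 0)
    upper_tier = max(gain - SURCHARGE_THRESHOLD, 0)
    # Integer basis-point arithmetic avoids floating-point rounding.
    return (lower_tier * BASE_RATE_BPS + upper_tier * COMBINED_RATE_BPS) // 10_000


print(wa_excise(2_000_000, qsbs_added_back=False))  # 0
print(wa_excise(2_000_000, qsbs_added_back=True))   # 151500
```

Changing how the deduction is ordered against the threshold changes the result, so treat the output as an illustration of the mechanism rather than a liability estimate.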

This is not a gap that has existed since the excise was enacted. It is a proposed expansion of the excise base to capture gains that federal policy deliberately exempts. It illustrates the doctrinal point precisely: the excise classification established in Quinn has proven expansible without additional constitutional challenge. The limiting principle that was left untested in Quinn has not constrained subsequent legislative proposals.

This matters for Washington’s competitiveness in technology and life sciences. It matters for founders and early-stage investors making location decisions. And it matters doctrinally because the same mechanism that produced the capital gains excise is now being proposed as the vehicle for taxing gains that Congress specifically chose not to tax.

Raising the Cost of the Petition

In 2025, SB 5382 would have required signature gatherers to personally certify the validity of each signature under penalty of perjury. Proponents argued it would reduce fraud. Opponents argued it would triple qualification costs, based on analogous experience in Oregon, and expose gatherers to legal liability without evidentiary basis. The bill died in committee.

In 2026, SB 5973 proposed to ban per-signature compensation, require hourly or salaried payment for gatherers, and mandate 1,000 qualifying signatures before the Secretary of State would issue a ballot title, a requirement that would force organizers to fund significant signature collection before even knowing the official title under which they were collecting. Lead sponsor Senator Javier Valdez described the legislation as targeting “aggressive, misleading tactics” incentivized by per-signature pay models. The bill died without a floor vote on February 17, 2026, the deadline for non-budget legislation. Senator Valdez indicated plans to revisit the restrictions in 2027.

The 2025 legislative session recorded approximately 47 bills carrying emergency clauses, a frequency that exceeds historical norms for non-crisis years. Among them were fiscal and policy bills where no immediate exigency was apparent from the record.

Senator Pedersen, as Senate Majority Leader, stated publicly that the Legislature would not pass the 2026 initiatives on parental rights restoration and girls’ sports, effectively signaling that those measures would bypass committee review and proceed directly to the ballot. House Speaker Laurie Jinkins provided similar signals. These are procedural choices that, individually, are within legislative discretion. As a pattern, they communicate something about how leadership views citizen-initiated policy as distinct from legislative policy.

The Millionaire Tax: Where It All Converges

SB 6346 proposes a 9.9 percent tax on income above $1 million, effective 2028. It passed the Senate on February 16, 2026, 27-22, along party lines. It is now in the House.

The bill explicitly amends Initiative 2111 to exempt this tax from the income tax prohibition. It projects approximately $3.4 billion annually, attributable to roughly 21,000 filers.

It includes an emergency clause.

Article II, Section 1(b) exempts from referendum laws deemed necessary for the immediate preservation of the public peace, health, or safety, or for the support of the state government and its existing public institutions, and courts grant broad deference to legislative declarations.

The millionaire income tax did not arise from a natural disaster, a public health emergency, or an imminent fiscal collapse. Washington is not in a state of fiscal crisis that forecloses a 90-day referendum window. The emergency clause on this bill removes the referendum option before the citizens who opposed the measure in Senate testimony (over 61,000 signed in against it during hearings) can exercise their constitutional right to challenge it at the ballot.

Here is what that bypass layer looks like structurally:

The emergency clause is almost certainly constitutionally authorized. Courts will likely defer. That is not the question.

The question is what purpose it serves when applied to a contested major tax bill with no emergency justification in the record. When emergency clauses become a routine tool for shielding controversial fiscal legislation from citizen review, the referendum power remains on paper while diminishing in function. The emergency clause is increasingly used as a referendum-avoidance mechanism on contested issues where the urgency is political rather than operational.

What the Sequence Reveals

The Routing Map

When all the institutional mechanisms described above operate in the same political environment, Washington’s governance system functions as a multi-layer routing engine. Citizen participation is not blocked; it is channeled through paths that are longer, more expensive, more procedurally vulnerable, and more readily preempted.

Every branch of this diagram is legally defensible. None requires misconduct. The system routes the way it routes because the mechanisms available to institutional actors are more numerous and more resilient than the mechanisms available to citizens.

The system still accepts input, but it has become increasingly resistant to correction.

The Sequence Assembled

Laid out chronologically, what has occurred is this:

In 1933, the Culliton court classified income as property and barred graduated income taxation absent constitutional amendment. Voters confirmed that barrier by rejecting amendment across a generation. In 1999, 2002, and 2019, voters approved caps on vehicle license fees. Courts invalidated all three on procedural grounds. In 2007, voters enacted supermajority requirements for tax increases and advisory votes. In 2018, voters approved a police accountability initiative that was adopted and amended within one cycle.

In 2021, the Legislature enacted a capital gains tax structured as an excise on transactions, a framing chosen, by the sponsors’ own account, to work within the Culliton constraint rather than challenge it. In 2023, the Supreme Court upheld the excise classification in Quinn, narrowing the Culliton barrier without overruling it.

In 2024, voters’ proxy initiative banning income taxes was adopted legislatively, and the bill that would amend it arrived two years later. Also in 2024, a parental rights initiative was adopted unanimously and amended the following session under emergency clause. Also in 2024, Initiative 2066 on natural gas access was approved by voters and invalidated at the superior court level in 2025 on procedural grounds, with Supreme Court review pending as of this writing. Spokane’s Proposition 1 followed the same arc.

In 2025 and 2026, bills to restrict initiative mechanics were introduced. In 2026, the Senate passed a millionaire income tax bill that amends the income tax prohibition initiative and includes an emergency clause to foreclose referendum.

Each step is individually defensible.

The accumulation moves consistently in one direction.

Cumulatively, the steps trace a directional line: nominal retention of the Culliton income barrier; excise classification expanding to absorb realized gains; a pending proposal to tax startup gains without federal alignment; voter initiatives repeatedly invalidated or amended post-adoption; initiative mechanics facing proposed restriction; emergency clauses shielding contested legislation from referendum review; and a velocity of activity in the 2023–2026 period that substantially exceeds the pace of prior decades.

That acceleration matters. The density of constraint-adjacent activity in recent years is not consistent with gradual constitutional evolution. It is consistent with institutional urgency.

The Counterarguments

This analysis has a responsibility to engage the strongest counterarguments.

On Quinn: The majority distinguished realized capital gains from general income on the grounds that the taxable incident is the transaction, not the ownership or receipt. That distinction has doctrinal support in Washington excise law and was applied carefully. Mahler v. Tremper, 40 Wn.2d 405, 243 P.2d 627 (1952), recognized excises on the exercise of property rights. The question is not whether Quinn was a plausible application of excise doctrine. It was. The question is whether its limiting principle, transactional framing as the determinative factor, is robust enough to contain what comes next. SB 6346 applies to the “receipt of income,” not to a discrete sale event. That distinction should, under Quinn’s own logic, make SB 6346 more vulnerable. Whether the current court treats it that way is an open question.

On emergency clauses: Courts defer broadly. CLEAN v. State established that standard clearly. The constitutional text provides no objective threshold for what constitutes an emergency. The remedy for abuse, if abuse is occurring, is political rather than judicial in most cases. The structural critique is not that courts will or should intervene (they probably won’t), but that institutional actors understand this and factor it into their choices. Routine use of emergency clauses for non-emergency policy is a rational institutional strategy precisely because it is judicially durable. In 2025 alone, approximately 47 bills carried such clauses. Washington averaged fewer than 15 emergency clauses per session in the decade prior to 2020. The frequency has more than tripled.

On initiative invalidation: The single-subject rule and ballot title requirements serve genuine constitutional purposes. Logrolling, combining unrelated provisions to build coalitions that wouldn’t otherwise exist, is a real concern that courts are right to police. There is also a defensible design rationale for applying these rules more strictly to citizen initiatives than to legislative enactments: initiatives are drafted outside the committee process, without the vetting that filters imprecision before a bill reaches the floor. Stricter procedural scrutiny of citizen-drafted law is not inherently arbitrary; it reflects a structural difference in how the two types of law are produced.

That rationale, however, does not dissolve the asymmetry; it explains its origin. The result in practice is that the same procedural standards that citizen measures must survive are not consistently applied to legislative alternatives that combine multiple subjects, attach emergency clauses to combined-purpose bills, or amend initiatives in ways that substantially alter their scope. The asymmetry may have a legitimate genealogy. Its cumulative effect on the balance between citizen and legislative authority is the same regardless.

On QSBS: Washington currently conforms to the federal QSBS exclusion implicitly, by starting from federal net long-term capital gain. The structural point is that SB 6229 would deliberately end that conformity, not to close an oversight but to expand the excise base. That is a different kind of action than closing a pre-existing gap.

The thesis is falsifiable. If emergency clauses were rare and tied to demonstrable urgency, if excise classification remained narrowly confined to discrete sales events, if initiative amendments were infrequent and substantively limited, and if initiative restrictions were introduced in periods of political calm rather than during active citizen contestation, the pattern described here would dissolve. The argument depends on the pattern’s density, velocity, and directional consistency.

Disdain or Design?

Disdain in constitutional governance rarely appears as open contempt. It does not require a recorded statement or an explicit intent to override voter will. It appears as patterned reliance on procedural mechanisms that reduce the practical force of direct democratic constraint while preserving formal legality.

The sequence described in this piece supports two interpretations.

The first: Washington’s institutions are navigating a genuine structural tension between a nineteenth-century constitutional framework and twenty-first-century fiscal demands. The excise classification in Quinn reflects legitimate doctrinal development. Emergency clauses are almost certainly constitutionally authorized. Initiative invalidations enforce procedural requirements. The Legislature has authority to amend what it adopts. None of these actions individually represents a departure from constitutional design.

The second: The aggregation of these mechanisms, particularly their density in the 2023–2026 period and their consistent directional effect of narrowing voter constraint, reflects something beyond coincidence. Whether that reflects explicit coordination or emergent institutional preference for policy control over consent legitimacy is a harder question. But the outcome is the same either way.

Others looking at this sequence have read it as evidence of intent. That reading is available. This analysis does not rely on it. The argument stands on mechanism and documented effect. Intent would make the pattern more troubling. The absence of intent would not make it less real.

What this piece asserts is that the pattern exists, that its velocity has increased, and that the mechanisms employed produce a consistent result: each one individually lawful, each one incrementally reducing the friction that direct democracy applies to legislative outcomes.

The Systemic Hazard

The risk this piece is concerned with is not taxation.

Washington will raise or lower taxes. Courts will continue to apply constitutional standards. Initiatives will qualify or fail. These are normal features of a functioning state.

The risk is something more durable and more difficult to reverse: legitimacy erosion.

This is the backend problem. The user interface of democracy still renders correctly. The initiative process exists. The referendum window exists. The constitutional amendment pathway exists. Citizens can still vote. Their votes still count. But the mechanisms that translate those votes into durable policy outcomes have been progressively routed around, reclassified, absorbed, or shielded, each step individually legal, the cumulative effect something different.

Constitutional systems depend on a belief that constraints are real: that when voters say no, it means something; that when citizens sign an initiative and see it pass, it will not be reliably nullified, adopted-and-amended, or shielded from referendum in the same cycle; that emergency powers are for emergencies; that doctrinal interpretation is disciplined rather than instrumental.

When lawful tools are repeatedly used in ways that make major decisions difficult to reverse through normal citizen checks, elected office can remain representative in form while becoming directive in function.

When that belief weakens, formal legality becomes insufficient. Citizens who believe that participation is performative rather than determinative do not simply become disengaged; they become available to alternatives that promise to bypass the institutions they no longer trust. That dynamic has played out in other contexts and at other scales. Washington is not immune to it.

The nearest documented precedent is California’s Proposition 13 in 1978. Voters passed a hard cap on property tax increases via ballot measure. The legislature responded with fees, bonds, and special assessment districts that produced equivalent revenue through mechanisms the cap did not reach. The cap remained on paper. The constraint it represented eroded in practice. Trust in the fiscal system declined. The state’s budget architecture became progressively less legible to ordinary citizens. That outcome was not the result of a single bad actor or a single bad decision. It was the cumulative product of institutions finding lawful paths around a voter-imposed constraint, each path individually defensible, the aggregate effect something the voters who passed Proposition 13 did not authorize and could not easily reverse.

The hazard is not the millionaire income tax in isolation. It is not Quinn. It is not any single initiative invalidation or emergency clause.

It is the accumulation of technical compliance in service of functional constraint erosion. This critique applies regardless of which side benefits in a given cycle. The hazard is the precedent and the tool normalization, mechanisms that outlast any particular majority and remain available to whoever holds them next.

Washington is not unique in this dynamic. It may be ahead of the curve. The same interaction between initiative processes, judicial review, and legislative absorption is visible in other states at earlier stages. What makes Washington worth examining now is the velocity: the compression of decades of incremental drift into a few legislative sessions.

Constitutional systems do not usually fail through overt violation. They fail through incremental, lawful actions that alter the practical balance of power without amending the text that describes it.

That is the unmitigated risk.

Not the bill.

Not the case.

The sequence.
