χ²(5) = 2.31, p = 0.80 on the joint party × sponsor interaction — by-party point-estimate dispersion in the subset regressions of robustness.md §3 is not statistically distinguishable from noise around the pooled β. The pooled +7.92 (OTHER baseline) is a sufficient summary; the apparent MDB null was a reduced-power artifact of subset regression, not a real party effect.

Confidence
green
Type
robustness
Design
Sample
estimulado-non-aggregate-match2 (31,186 rows, 641 sponsored, 8,431 candidates)
Specification
error ~ sponsored_by + opponent_sponsored + Σ_k (sponsored_by × I[party=k]) + log_sample_size + days_to_election + days² | candidate FE + pollster FE, cluster-robust SE at muni. Joint Wald: H0: all five party × sponsor interactions = 0.
Comparator
all polls (spec 2 baseline)
Cluster
muni
Weights
none
Script
source/analysis/an-009-party-interaction.py
Target
build/table/party_interaction.csv
Status
interpreted · 2026-06-02
Created
2026-06-02

Question

docs/briefs/robustness.md §3 reported substantial point-estimate dispersion in the spec-2 sponsor coefficient when the sample is split by candidate party: PSD (+7.16), UNIÃO (+9.10), OTHER (+8.29***) versus PL (+5.32, p=0.14), PP (+4.55, p=0.54), MDB (+0.02, p=0.99). Is that dispersion statistically distinguishable from noise around the pooled +7.98?

Results

By-party implied sponsor coefficient with joint Wald test

Table: Implied sponsor coefficient by party with joint Wald test

Party Implied β SE 95% CI n (sponsored)
PSD +9.40 3.11 [+3.30, +15.50] 159
PL +7.76 2.30 [+3.24, +12.27] 120
PP +3.87 5.28 [−6.48, +14.23] 67
MDB +6.50 3.52 [−0.39, +13.39] 60
UNIÃO +10.81 2.40 [+6.11, +15.52] 38
OTHER (baseline) +7.92 2.73 [+2.57, +13.27] 197

Joint Wald: χ²(5) = 2.31, p = 0.80.

(from build/table/party_interaction.csv)

Interpretation

Confidence rationale (green). The joint Wald test is clean and unambiguous, the interaction spec resolves the subset-regression power artifact directly, and the pooled coefficient survives as a sufficient summary. No interaction coefficient is individually close to significant, so the homogeneity verdict is robust.

Follow-ups

  1. Pollster × sponsor interaction (extension): same test structure with party_group → pollster_group (top-5 pollsters by sponsored-row count + OTHER). The hypothesis: pollster-specific sponsor-bias machines should generate detectable cross-firm heterogeneity if anything does. Suggested script: pollster_interaction.py.
  2. PP point estimate (puzzle, low priority): PP shows the smallest implied β (+3.87) and the smallest robustness.py subset coefficient (+4.55). The wide CI ([−6.48, +14.23]) means it's statistically uninformative, but a future paper run with the 2022 cycle data folded in would tighten PP's CI by ~30% and tell us whether the point estimate is real or coincidence. No standalone script — folded into the 2022 extension when that runs.
  3. Channel A × party interaction (blind spot): once the poll_methodology LLM extractor ships ([[project_poll_sponsor_bias_headline]]), re-run AN-009 with Channel-A controls and ask whether the already-tight pooled spec moves at all when methodology heterogeneity is netted out. That decomposition is the substantive next step for the longer paper.