id: an-089 hypothesis: shell-contratante headline: Within less-trusted firms (n=5,097, 419 firms) holding firm + race fixed: candidate-sponsored polls reduce margin_error by 1.94 pp vs media-sponsored (p=0.07); other_firm-sponsored reduce by 1.04 pp (p=0.11). Directionally consistent with the cross-sectional finding at ~2/3 magnitude. Same SIGN, smaller magnitude — most of the cross-sectional shell signal is CROSS-FIRM composition (different sponsors hire different firms), not within-firm slant. AN-088 already showed sponsor contributes only 0.1-0.2 % marginal R² beyond firm + race; this AN-089 result is the within-firm correlate of that. NOTE: in trusted firms (n=430, 5 firms) sponsor effects are statistically weak overall but point estimates on margin_error are LARGER (−9.4 / −8.8 pp at the firm-only spec, p≈0.07), suggesting trust × sponsor interaction worth flagging but with very thin n. type: causal question: "Within the same pollster, does accuracy differ by sponsor type? Especially for less-trusted firms?" tags: ["hyp:shell-contratante", "within-firm", "robustness", "all-brazil"] status: interpreted status_date: 2026-06-17 confidence: yellow created: 2026-06-17 script: source/analysis/an-089-within-firm-sponsor-accuracy.py target: build/table/an-089-within-firm-sponsor-accuracy.csv

AN-089: Within-firm sponsor effect on accuracy

User question (2026-06-17): "Can we argue that the same pollster might produce different accuracy polls depending on type of sponsor? Focusing on the less trusted pollsters."

Companion to AN-088 (variance decomposition showed sponsor = 0.1-0.2 % marginal R² beyond firm + race). This script isolates the within-firm sponsor effect on accuracy.

Sample partition

ALL — 6,991 polls in 4 buckets
LESS_TRUSTED — 6,433 polls (419 firms; excludes the AN-085 D1 hand-picked 5)
TRUSTED — 558 polls (5 firms: DATAFOLHA, QUAEST, REAL TIME MÍDIA, PARANÁ PESQUISAS, VERITA)

Within-firm + race FE results

Reference category = media-sponsored polls within the same firm × race.

Sample	Outcome	n	is_candidate	is_pollster_self	is_other_firm
LESS_TRUSTED	mean_abs_error	5,097	+0.17 (p=0.73)	+0.22 (p=0.60)	+0.02 (p=0.94)
LESS_TRUSTED	poll_calls_winner_first	5,097	+0.047 (p=0.18)	+0.055 (p=0.089)	+0.017 (p=0.50)
LESS_TRUSTED	margin_error	4,879	−1.94 (p=0.070)	+0.64 (p=0.46)	−1.04 (p=0.108)
ALL	mean_abs_error	5,655	+0.12 (p=0.79)	−0.01 (p=0.98)	+0.13 (p=0.66)
ALL	poll_calls_winner_first	5,655	+0.057 (p=0.10)	+0.045 (p=0.12)	+0.010 (p=0.69)
ALL	margin_error	5,424	−1.75 (p=0.072)	+0.22 (p=0.78)	−0.91 (p=0.14)
TRUSTED	mean_abs_error	430	−0.90 (p=0.56)	−0.87 (p=0.47)	+0.71 (p=0.62)
TRUSTED	poll_calls_winner_first	430	−0.20 (p=0.28)	−0.07 (p=0.52)	−0.23 (p=0.040)
TRUSTED	margin_error	420	−3.66 (p=0.18)	−2.00 (p=0.29)	−0.81 (p=0.69)

Reading

LESS_TRUSTED firms (the user's focus)

Direct answer to the user's question: yes, the same firm produces different accuracy by sponsor type, but the effect is directional and not statistically clean.

Within a less-trusted firm × race, candidate-sponsored polls reduce margin_error by 1.94 pp (p=0.07); other_firm-sponsored polls reduce by 1.04 pp (p=0.11). The within-firm coefficient on candidate is similar in sign and ~2/3 the cross-sectional magnitude (AN-085 had β=−2.53 cross-sectional vs −1.94 within- firm here for the LESS_TRUSTED partition). The within-firm effect on other_firm is about half the cross-sectional magnitude (cross-sectional was −1.95; within-firm is −1.04).

Interpretation: most of the cross-sectional sponsor effect is CROSS-FIRM composition, not within-firm slant. A LESS_TRUSTED firm doesn't dramatically change its accuracy depending on who pays — what differs cross-sectionally is which firms tend to get hired by different sponsor types. The within-firm slant is present but smaller and harder to identify.

Why the AN-082 / AN-085 / AN-086 effects shrink within firm

AN-088's variance decomposition gave the structural answer: sponsor's marginal R² over firm + race is 0.1-0.2 %. The within-firm regression is the regression-coefficient version of that decomposition. Both are honest readings of the data.

Two ways to read it:

Conservative reading: the sponsor effects in AN-082 / AN-085 / AN-086 reflect sorting (which firm gets hired for which sponsor type) more than within-firm slant.
Substantive reading: even controlling for firm + race, candidate-sponsored polls within the same firm reduce margin_error by 1.94 pp at p=0.07 — that's economically meaningful and directionally consistent with the slant story; just harder to identify with the available power.

The two readings co-exist. The cross-firm composition channel likely dominates the cross-sectional finding, but the within- firm slant channel is present.

LESS_TRUSTED on `poll_calls_winner_first`

is_pollster_self β = +0.055 (p=0.089). Within less-trusted firms, pollster_self polls call the winner 5.5 pp more often than media polls — borderline significant. Other coefficients are null. Less-trusted firms doing self-contracted work appear to do them more carefully (consistent with the AN-085 selection- into-easy-races story for pollster_self).

TRUSTED firms (n=430, 5 firms) — comparison sample

Most coefficients null or directional. One sig finding: is_other_firm × poll_calls_winner_first = −0.229 (p=0.040). Within the 5 trusted firms × race, polls sponsored by other_firm contratantes call the eventual winner 22.9 pp LESS often than the same firm's media-sponsored polls in the same race. This is dramatic but the sample is thin (n=430, 5 firms, trusted-firm × other_firm cell n=70 from AN-086).

The directional pattern on margin_error in trusted firms is striking even without statistical significance: cand β=−3.66 pp; other_firm β=−0.81 pp; pollster_self β=−2.00 pp. The candidate and pollster_self margins shrink substantially within trusted firms, just under-powered. If real, the trusted- firm × sponsor interaction means trusted firms DO slant their sponsored polls in the direction of the broader pattern (smaller margins), but the n=42 candidate-sponsored and n=168 pollster_self cells aren't large enough to call it.

What this means for the headline story

Sponsor effects are real but small relative to firm quality and race difficulty (AN-088: 0.1-0.2 % marginal R²; this AN-089: −1 to −2 pp within firm + race vs −2 to −3 pp cross-sectional).
The shell-bucket margin-error finding (AN-082 / AN-085 / AN-086) is partly cross-firm sorting: shell-suspect sponsors disproportionately hire firms that have lower accuracy in general, in addition to the within-firm slant. AN-089's within-firm coefficient on other_firm margin_error is −1.04 pp (p=0.11) — about half the cross-sectional magnitude.
The candidate-bucket margin-error finding is more consistent within-firm. The within-firm coefficient (−1.94 pp, p=0.07) is ~75 % of the cross-sectional coefficient (−2.53). Less cross-firm sorting; more direct sponsor slant.
No within-firm-FE evidence that less-trusted firms are slanting their sponsored polls dramatically more than their media polls. The signal is there but small. If the user's intuition was that less-trusted firms produce dramatically different accuracy across sponsor types, the answer is "no, not dramatically." If the intuition was "trace amounts of slant within firm," the answer is "yes, around 1-2 pp on margin_error."

Caveats

Within-firm + race FE is the most demanding spec. Many cells drop out (race-FE absorption is sharp); 5,097 → reduces precision. Less demanding specs (firm FE only) show larger and more significant coefficients, particularly the other_firm coefficient on margin_error in LESS_TRUSTED: −2.27 (p=0.009) under firm FE only.
n=430 trusted-firm sample is too thin to argue strongly about trusted × sponsor interactions. The dramatic −22.9 pp on calls_winner_first within trusted-firms × other_firm is interesting but underpowered. Could be selection (which races trusted firms take with other_firm contratantes).
Mean_abs_error coefficients are null across all samples and specs. Magnitude (absolute) accuracy doesn't differ by sponsor within firm + race. The signal is in rank + margin, not magnitude.

Follow-ups

Re-spec with firm × race interactions. The current within-firm + race FE assumes firm and race effects are additive. A firm × race interaction would test whether specific firm-race combinations show selectivity.
Split LESS_TRUSTED further by AN-016 |β| (the most trustworthy non-D1 firms vs the most-slanting non-D1 firms). The within-firm sponsor effect might be concentrated in the most-slanting tail.
Restrict to late polls (≤14 days to election). The timing-clean window with within-firm + race FE is the sharpest test possible; small n but cleanest interpretation.
Wire to paper. The within-firm finding tempers the AN-085 / AN-086 cross-sectional claims. Worth one paragraph in §Identification distinguishing cross-firm sorting from within-firm slant, and an appendix table.

Artifacts

Script: source/analysis/an-089-within-firm-sponsor-accuracy.py
Coefficient table: build/table/an-089-within-firm-sponsor-accuracy.csv
Headline JSON: build/table/an-089-within-firm-sponsor-accuracy.json

AN-088 variance decomposition
AN-085 trusted-source decomposition
AN-082 four-bucket accuracy split
AN-016 within-firm β — the closest cross-sectional analog

AN-089: Within-firm sponsor effect on accuracy

AN-089: Within-firm sponsor effect on accuracy

Sample partition

Within-firm + race FE results

Reading

LESS_TRUSTED firms (the user's focus)

Why the AN-082 / AN-085 / AN-086 effects shrink within firm

LESS_TRUSTED on poll_calls_winner_first

TRUSTED firms (n=430, 5 firms) — comparison sample

What this means for the headline story

Caveats

Follow-ups

Artifacts

Related

LESS_TRUSTED on `poll_calls_winner_first`