id: an-051 hypothesis: methodology-flexibility-a type: descriptive status: done status_date: 2026-06-02 confidence: green created: 2026-06-02 script: source/llm/pilot_questionario.py + source/analysis/an-051-questionnaire-rotation-by-sponsor.py target: build/table/an-051-questionnaire-rotation-by-sponsor.csv headline: Sponsored polls dramatically under-document candidate-name rotation in the questionnaire (5.4 % vs 26.1 % when rotation+random+rotated combined; McNemar p < 10⁻⁷). Sponsored polls file FEWER vote-intention scenarios on average (3.73 vs 4.49, Wilcoxon p = 0.002) — refutes the within-poll scenario-selection / cherry-pick hypothesis. [2026-06-02 update — see AN-052/AN-053:] The rotation contrast is a between-firm composition effect, not a within-firm sponsor-choice effect (AN-052 firm-FE LPM p = 0.37). The direct candidate-position priming test refutes the sponsor-exploitation reading (AN-053: sponsored polls list sponsor's candidate LATER, McNemar p = 0.001 in WRONG direction). The AN-051 finding is recontextualized as a firm-tier composition signal — sponsors choose low-discipline firms that don't document rotation — rather than a within-firm Channel-A lever. design: sample: 241 sponsored × independent pairs (of 244 in curated_pairs) with both questionnaires successfully extracted specification: within-pair McNemar tests on binary fields (name_rotation_documented, approval_question_present, rejection_question_present, nonresponse_instruction_present); paired Wilcoxon on n_vote_intention_scenarios; bias-carrier Mann-Whitney on differing-rotation subset comparator: independent poll within the matched pair notes: First quantitative output of the new questionario_pesquisa LLM extractor (PollQuestionario schema). Pilot was on 338 protocols (the curated-pair universe), $1-2 cost at gpt-4o-mini, 19 min wall.

AN-051: Does questionnaire-side name rotation and scenario filing differ between sponsored and matched independent polls?

Question

First quantitative output of the new poll_questionario LLM extractor. Sources two probes from the source-of-bias agenda that the registration narrative cannot reach: (1) within-poll scenario selection (cherry-pick hypothesis — does a sponsored poll file multiple estimulada scenarios and report the most favorable?), (2) name-order priming (do sponsored polls document randomization of candidate-name order across respondents, which neutralizes the documented 2-3 pp name-order effect?).

Design

Source: 244 curated sponsored × independent pairs in build/llm/curated_pairs/pairs_with_extractions.parquet. New PollQuestionario extraction ran on the 338 unique protocols across both sides (build/llm/questionario_pilot/extractions.parquet); 335 valid extractions, 3 image-only PDFs skipped. The 241 pairs with both sides successfully extracted form the analysis sample.

Schema fields tested:

Tests:

  1. Marginals per field, sponsored side vs independent side.
  2. Paired McNemar on the binary fields.
  3. Paired Wilcoxon signed-rank on n_vote_intention_scenarios.
  4. Bias-carrier MW on the differing-rotation subset.
  5. Between-pair MW asking whether sponsored polls that do document rotation have lower within-pair bias than those that don't.

Results

AN-051: name rotation, candidate order, and scenario count — sponsored vs independent

Headline marginals (n = 241 pairs):

Field Sponsored Independent Δ McNemar p
name_rotation_documented 5.4 % 20.3 % −14.9 pp < 10⁻⁵
Headline order ∈ {random, rotated} 2.5 % 18.3 % −15.8 pp
Combined randomization (any of the above) 5.4 % 26.1 % −20.7 pp ≈ 4 × 10⁻⁸
approval_question_present 26.1 % 32.8 % −6.6 pp 0.089
rejection_question_present 50.2 % 51.9 % −1.7 pp 0.78
nonresponse_instruction_present 1.2 % 1.7 % −0.4 pp 1.0
asks_party_first 0 % 0 % 0

The combined randomization contrast is the sharpest within-pair methodology asymmetry surfaced in any source-of-bias probe to date. Discordant cells: b = 12 (sp-only) vs c = 62 (ind-only) — sponsored polls are 5× less likely to document name rotation than the matched independent.

Scenario filing — refutes cherry-pick:

Statistic Sponsored Independent
Mean n_vote_intention_scenarios 3.73 4.49
Median 3 4
75th pct 5 6
Max 12 14

The cherry-pick / scenario-selection hypothesis is refuted on this sample: sponsored polls file fewer vote-intention scenarios on average, not more.

Bias-carrier test (the lever-mechanism question):

Subset n Mean contrast (sp_error − ind_error) MW p (vs other)
Pairs where sponsored documents rotation 13 +6.97 pp 0.54
Pairs where sponsored does NOT document rotation 228 +5.38 pp
Differing-rotation subset (sp-only) 12 +7.34 pp 0.33
Differing-rotation subset (ind-only) 48 +3.98 pp

The bias contrast does not decrease when the sponsored side documents rotation — direction is the opposite (more bias when rotation is documented). The within-pair carrier test is also null. Rotation status appears to be a within-pair signal without being a within-pair carrier of the slant.

Interpretation

2026-06-02 update. The two natural follow-ups (AN-052 and AN-053) have walked back the "first sharp Channel-A signal" framing below. AN-052 shows the contrast vanishes within firm (firm-FE LPM p = 0.37; 19 of 20 both-side firms have identical rotation rates). AN-053 shows sponsored polls list the sponsor's candidate later, not earlier (McNemar p = 0.001 in the wrong direction). The rotation gap is a firm-tier composition signal — sponsors choose low-discipline firms that don't document rotation — and not a within-firm sponsor lever. The cherry-pick refutation (sp fewer scenarios than ind, Wilcoxon p = 0.002) stands. Original interpretation preserved below for the audit trail.


This is the first source-of-bias probe to land a sharp Channel-A within-pair signal. Sponsored polls dramatically under-document candidate-name rotation in the questionnaire — by a factor of 4-5×. The lever is real (the name-order-priming literature documents a 2-3 pp effect of listing a candidate first), the direction is right, and the McNemar significance is extreme (p ≈ 4 × 10⁻⁸ on the combined randomization indicator).

But the carrier test is null, repeating the AN-042 selective-disclosure pattern: the documentation contrast is loud and the within-pair contrast doesn't track the slant. Two readings:

  1. Selective disclosure of methodology that doesn't carry bias. Sponsored polls may genuinely use fixed candidate order while independents randomize, but the resulting name-order priming doesn't dominate the slant — the +5-7 pp comes mostly from sample-shape levers (population frame, coverage class) that the AN-019 / AN-022 evidence pointed at.

  2. Underpowered carrier test with directional but noisy contrast. With only 60 differing-rotation pairs and 12 sp-only sponsored documenting, the bias-carrier test is statistically thin. The +3.98 pp (when independent rotates but sponsored doesn't) vs +7.34 pp (when sponsored rotates but independent doesn't) is directionally consistent with "no-rotation → more bias" but MW p = 0.33.

The cherry-pick / scenario-selection hypothesis is refuted on this sample. Sponsored polls file fewer scenarios than matched independents, not more (Wilcoxon p = 0.002 in the wrong direction for cherry-pick). This is consistent with two readings: sponsored polls have already locked in their preferred scenario at sample design, so they don't need to file alternatives; OR sponsored polls under-document multi-scenario filings the same way they under- document rotation. Either way, the cherry-pick lever is not the carrier.

The combined picture is now: of the six concrete-design candidates in the source-of-bias agenda, AN-019 (coverage class, +2 pp), AN-022 (methodology completeness, t-rejection), the +4.6 pp population-frame interaction, and now AN-051 (name rotation, sp-vs-ind 5.4 % vs 26.1 %) are all directionally consistent with Channel A but each individually under-carry the +7 pp headline. The size-mismatch problem persists; the agenda has identified a multi-lever Channel-A pattern rather than a single quantitative carrier.

Follow-ups

  1. Pollster fixed effects on the rotation contrast (puzzle, high priority). Independent polls that document rotation have higher mean bias than independents that don't (+1.95 vs −0.85 pp). This suggests rotation status correlates with firm characteristics — potentially firm-size: large pollsters (Datafolha, IBOPE, QUAEST) may use boilerplate that prints rotation instructions regardless of sponsor, and sponsored polls in this sample concentrate on smaller firms. A within-firm refit (restrict to the 31 AN-016 firms with ≥ 5 sponsored polls) would tell us whether the rotation gap is a sponsor- choice effect or a firm-size effect. Suggested script: source/analysis/an-NNN-rotation-within-firm.py.

  2. Universe-scale rotation extraction (extension). Confirm the 5.4 % vs 26.1 % gap at full universe. The questionnaire extractor is already wired; running on the ~14k mayoral poll universe is ~$50-100 at gpt-4o-mini. Would give a regression- grade test with statistical power for the carrier hypothesis.

  3. Candidate position of the sponsor's candidate (extension, the direct name-order test). The current AN tests whether rotation is documented; the sharper question is whether the sponsor's own candidate appears earlier in the printed roster than chance would predict. Requires joining the sponsor identity from the curated_pairs frame to the candidates_listed roster of the headline scenario. ~30 min of script work. Suggested script: source/analysis/an-NNN-candidate-position-by-sponsor.py.

  4. AN-051 finding into paper §5 (writeup task). The new "Size mismatch and ruled-out levers" paragraph in paper.tex currently lists deferral / mode / interviewer / nonresponse as ruled-out concrete-design levers. Add candidate-name rotation as the FIRST lever-confirmed-but-not-quantitatively-carrying finding from the questionnaire layer. Bumps the running multi- lever count.

  5. Promote PollQuestionario schema to the politica pipeline (infra). The schema and extractor live in pipelines/politica/source/llm/. If we want to run on the ~14k poll universe, add it to the production extraction orchestrator alongside the methodology / coverage / operations / sampling tasks.