Codebook

Decision rules and variable definitions for manual coding

This codebook governs Phase 2 coding. All coders should read it fully before beginning and refer to it when edge cases arise.

Author Expertise (Phase 2A)

Measures how well the paper’s topic aligns with each key author’s demonstrated expertise, based on their published record.

Code	Label	Criterion
`0`	Core	Paper domain exactly matches ≥ 1 key author’s primary domain
`1`	Adjacent	Related field or overlapping methods, but a different subarea
`2`	Distant	Outside both key authors’ documented publication history

Coding procedure:

Identify the two key authors — typically first and last/corresponding author.
Locate each author’s Google Scholar or institutional profile.
Record their primary domain(s) and five most-cited papers.
Classify the paper’s own primary domain.
Assign the most lenient match across the two authors (e.g., if one is Core, code 0).

Examples:

Classify the cited work’s primary field using the standardised list below. Assign the single best-fit label.

Label	Description
NLP / Language Models	Natural language processing, LLMs, text generation
Computer Vision	Image/video recognition, generation, segmentation
Robotics / Embodied AI	Physical systems, simulation, embodied agents
ML Theory / Optimization	Learning theory, convergence, gradient methods
Reinforcement Learning	RL algorithms, policy learning, reward modeling
Graphs / Networks	Graph neural networks, network analysis
Biomedical / Comp. Bio.	Genomics, clinical informatics, computational biology
Economics / Finance	Economic theory, empirical economics, finance
Political Science	Comparative politics, IR, political behavior
Sociology	Social structures, institutions, collective behavior
Psychology / Cog. Sci.	Cognition, behavior, psycholinguistics
Other	Does not fit above categories

What function does the citation serve in the paper?

Code	Label	Description
`B`	Background	General context or prior-work overview
`M`	Methods	A specific technique, algorithm, or tool adopted by the paper
`R`	Related Work	Direct comparison to a similar approach
`E`	Empirical	A dataset, benchmark, or empirical finding cited
`T`	Theoretical	A mathematical result, proof, or formal theorem

Code	Label	Criterion
`0`	Core	Citation domain matches the paper’s domain
`1`	Peripheral	Citation domain is outside the paper’s domain

Code	Label
`0`	Very Recent (0–1 yr before submission)
`1`	Recent (2–5 yr)
`2`	Moderate (6–10 yr)
`3`	Old (11+ yr)
`4`	Future — impossible date; strong hallucination signal

After completing the full dataset, randomly select 10% of coded papers (blind re-code).

Report Cohen’s κ for:

Target: κ ≥ 0.70 for both variables before proceeding to analysis.

No Google Scholar profile found: Search institutional faculty page and DBLP. If no publication record is locatable, code Expertise_Match = 2 (Distant) and note in the Notes field.
Citation spans multiple domains: Assign the primary domain and note secondary domain in Notes.
Co-authored paper with contradictory expertise: Use the most lenient code (closest match) and note both authors’ domains.
Citation appears verbatim elsewhere in literature: Code Is_Hallucinated = 0 (Real) — confirmation in CrossRef or another indexed source is sufficient.