Live corpus dashboard

Coverage, sessions, and the full corpus.

Engineering human rhythm in synthetic text · Vittoria Lanzo and Dico Angelo

50 papers 18 peer-reviewed 16 bridge papers 3 research sessions Regenerated 2026-06-21 20:12 UTC

Coverage

Coverage by axis

Axis	Papers	Bridge	Status
Q1 · Prompt-engineering ceiling	4	2	Gap Publishable on its own
Q2 · Learnable rhythm embedding	6	3	Strong Mechanism established for other style axes
Q3 · Burstiness across model generations	9	3	Strong Empirically confirmed in 2024-2026
Q4 · Burstiness LoRA / auxiliary steering	19	8	Strong Direct analogue: StyleVector contrastive
Q5 · Variance vs inter-token timing	9	2	Partial Variance OK, timing TTS-only
Q6 · Fingerprint vs idiosyncratic rhythm	8	6	Strong Stylometry + Bakkouche perception bridge
Q7 · Punctuation / paragraph proxies	1	0	Partial Targeted query needed
TTS · Raitio (Apple)	9	4	Confirmed Raitio + Suni + DiTTo-TTS
TTS · CTRL-P	1	0	Verify Likely = Raitio 2020 itself
TTS · Bakkouche perception	1	1	Located Bakkouche 2025
Bio · NN ↔ biological rhythm	4	1	Strong Caucheteux + eLife predictive coding
Arch · Vaswani extensions	1	1	Implicit Covered via Q2 / Q4 LM descendants

Q1 · Prompt-engineering ceiling

peer-reviewed Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs
· · PMC · 2025
Quantifies diversity collapse in LLM outputs — supports cross-generation flattening / 'average prosody' in text.
preprint bridge Benchmark of Stylistic Variation in LLM-Generated Texts
· · arXiv · 2025
COMPETING WORK for Q1 — benchmarks stylistic variation across 16 frontier models + prompt settings. Does NOT frame as control ceiling vs distributional target — wedge survives but must differentiate.
preprint Evaluating the Diversity and Quality of LLM Generated Content
· · arXiv · 2025
Effective semantic diversity = diversity among quality-passing outputs. Method gift: gate burstiness measurement on quality so high-variance incoherence isn't rewarded.
preprint bridge LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday People
· · arXiv · 2025
Prompt-only personalized-style imitation FAILS — dual support for Q1 (prompt ceiling) and Q6 (fingerprint hard by instruction). Strongest single motivation for model-level approach.

Q2 · Learnable rhythm embedding

peer-reviewed bridge Towards Controllable Speech Synthesis in the Era of LLMs
Lee et al. · EMNLP 2025 main · 2025
EMNLP 2025 unified TTS+LLM control framework. Cites DiTTo-TTS for speech rate via latent length.
peer-reviewed bridge Style Vectors for Steering Generative Large Language Models
Konen, Jentzsch, Diallo et al. · EACL 2024 Findings · 2024
Foundational steering template for a 'burstiness vector'
peer-reviewed bridge Controllable neural TTS using intuitive prosodic features
Raitio, Rasipuram, Castellani · Interspeech 2020 · 2020
The Raitio paper Vittoria named
preprint From Weights to Activations: Is Steering the Next Frontier of LLMs?
Turner et al. · arXiv · 2026
2025/2026 survey-style — situates activation steering as the next frontier
preprint LLMBraces: Straightening Out LLM Predictions
· · arXiv · 2025
Task-specific control via relevant sub-updates
preprint Low-Rank Adaptation for Foundation Models — Survey
· · arXiv · 2025
Comprehensive LoRA survey

Q3 · Burstiness across model generations

peer-reviewed Trusting AI to detect AI?
· · Computers in Human Behavior · 2026
Detection rates drop with newer LLM generations
peer-reviewed Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs
· · PMC · 2025
Quantifies diversity collapse in LLM outputs — supports cross-generation flattening / 'average prosody' in text.
peer-reviewed How is ChatGPT's Behavior Changing Over Time?
Chen, Zaharia, Zou · Harvard Data Science Review · 2024
Empirical longitudinal drift of LLM behavior
peer-reviewed Distinguishing ChatGPT-3.5 vs -4 vs human Japanese texts
· · PMC · 2023
Direct stylometric comparison across GPT versions
preprint bridge Can You Detect the Difference? Stylometric Comparison of Diffusion vs Autoregressive Text
Tarım & Onan · arXiv · 2025
COMPETING WORK — first systematic stylometric burstiness comparison. Cite, don't ignore.
preprint RedNote-Vibe: Temporal Dynamics of AI-Generated Text
· · arXiv · 2025
Dataset for temporal-dynamics detection
preprint bridge DivEye: Diversity Boosts AI-Generated Text Detection
· · arXiv · 2025
CUTTING-EDGE — surprisal-based features capturing how unpredictability FLUCTUATES across text. Most direct burstiness-adjacent 2025 work. Vittoria pole-position read.
preprint bridge Benchmark of Stylistic Variation in LLM-Generated Texts
· · arXiv · 2025
COMPETING WORK for Q1 — benchmarks stylistic variation across 16 frontier models + prompt settings. Does NOT frame as control ceiling vs distributional target — wedge survives but must differentiate.
preprint Evaluating the Diversity and Quality of LLM Generated Content
· · arXiv · 2025
Effective semantic diversity = diversity among quality-passing outputs. Method gift: gate burstiness measurement on quality so high-variance incoherence isn't rewarded.

Q4 · Burstiness LoRA / auxiliary steering

peer-reviewed bridge Style Vectors for Steering Generative Large Language Models
Konen, Jentzsch, Diallo et al. · EACL 2024 Findings · 2024
Foundational steering template for a 'burstiness vector'
peer-reviewed bridge FUDGE: Controlled Text Generation With Future Discriminators
Yang, Klein · NAACL 2021 · 2021
Foundation for boundary-FUDGE: future-attribute discriminator-guided decoding. We adapt it by using the computable burstiness metric as the future predictor applied only to the sentence-boundary decision.
peer-reviewed bridge Mirostat: A Neural Text Decoding Algorithm that Directly Controls Perplexity
Basu et al. · ICLR 2021 · 2021
MUST-DIFFERENTIATE (top scoop risk): controls a distributional output statistic (perplexity) in-generation via closed-form feedback. We differ: structural sequence-VARIANCE not token-mean; boundary-only not whole-vocab; future-discriminator not running set-point. Frame ours as the structural-variance, boundary-altitude generalization.
preprint CARD: Cluster-level Adaptation with Reward-guided Decoding
· · arXiv · 2026
Logit corrections on frozen LLM
preprint Styles + Persona-plug = Customized LLMs
· · arXiv · 2026
Plug-in persona over frozen LLM
preprint GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot TTS
· · arXiv · 2026
GRPO-trained LoRA for a continuous style attribute - method analogue for the GRPO burstiness controller arm.
preprint Continuous Control of Editing Models via Adaptive-Origin Guidance
· · arXiv · 2026
Continuous control via guidance - relevant to the smooth burstiness dial.
preprint A Unified Study of LoRA Variants: Taxonomy, Review, Codebase
· · arXiv · 2026
LoRA-variant taxonomy - reference for the controller adapter choice.
preprint From Weights to Activations: Is Steering the Next Frontier of LLMs?
Turner et al. · arXiv · 2026
2025/2026 survey-style — situates activation steering as the next frontier
preprint bridge TADA! Tuning Audio Diffusion Models through Activation Steering
· · arXiv · 2026
Activation steering applied to audio diffusion
preprint Self-Supervised Honesty Steering via Anti-Parallel Representations
· · arXiv · 2026
Self-supervised steering method
preprint bridge Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation
· · arXiv · 2026
GENERATES LoRA params as variable-length sequences — direct precedent for personalized burstiness-LoRA generation
preprint bridge AgentSteerTTS: Multi-Agent Closed-Loop TTS Steering
· · arXiv · 2026
Multi-agent closed-loop steering — adversarial disentanglement
preprint bridge Personalized Text Generation with Contrastive Activation Steering
Liu et al. · arXiv · 2025
User-specific style vectors via contrastive activation — closest analogue to a personal burstiness fingerprint
preprint Merge and Guide: Unifying Model Merging and Guided Decoding for Controllable Multi-Objective Generation
· · arXiv · 2025
Closed-form logit-combination decoding for semantic/reward objectives, not structural statistics: shows the closed-form-decoding family does not cover structural-variance.
preprint Low-Rank Adaptation for Foundation Models — Survey
· · arXiv · 2025
Comprehensive LoRA survey
preprint Continuous Language Model Interpolation for Dynamic Control
· · arXiv · 2024
Weight-space interpolation for fine-grained control
preprint bridge StyleRemix: Authorship Obfuscation via Distillation
Fisher et al. · arXiv · 2024
Reverse of burstiness-injection — same machinery
preprint Large Language Models can be Guided to Evade AI-Generated Text Detection (SICO)
Lu et al. · TMLR 2024 · 2024
Forces sentence-length variance for detector evasion via prompt/in-context substitution: black-box, post-hoc, no logit steering. Cite-and-dismiss as orthogonal humanizer mechanism.

Q5 · Variance vs inter-token timing

peer-reviewed AI-Generated Text Detection: A Comprehensive Review of Active Methods
· · ScienceDirect · 2026
2025/26 detection review — paraphrasing vulnerability
preprint The Statistical Signature of LLMs
· · arXiv · 2026
How probabilistic sampling reshapes statistical structure
preprint bridge Can You Detect the Difference? Stylometric Comparison of Diffusion vs Autoregressive Text
Tarım & Onan · arXiv · 2025
COMPETING WORK — first systematic stylometric burstiness comparison. Cite, don't ignore.
preprint Beyond Checkmate: Creative Choke Points in AI Text
· · arXiv · 2025
Segment-level burstiness analysis — where the rhythm signal lives
preprint Detecting LLM-Generated Short Answers
· · arXiv · 2025
Documents GPTZero operationalization
preprint A Training-free Method for LLM Text Attribution
· · arXiv · 2025
Uses GPTZero burstiness definition upstream
preprint bridge DivEye: Diversity Boosts AI-Generated Text Detection
· · arXiv · 2025
CUTTING-EDGE — surprisal-based features capturing how unpredictability FLUCTUATES across text. Most direct burstiness-adjacent 2025 work. Vittoria pole-position read.
preprint Stylometry Recognizes Human and LLM-Generated Texts
· · arXiv · 2025
Stylometric human-vs-LLM discrimination — burstiness sits inside a working stylometric feature set.
preprint Detecting AI-Generated Text: Factors Influencing Detectability
· · arXiv · 2024
Survey treating perplexity-burstiness as standard

Q6 · Fingerprint vs idiosyncratic rhythm

peer-reviewed bridge Prosodic cues strengthen human-AI voice boundaries
Bakkouche et al. · ScienceDirect · 2025
The named Bakkouche perception study — listeners discriminate human vs AI by prosody
peer-reviewed bridge Style Vectors for Steering Generative Large Language Models
Konen, Jentzsch, Diallo et al. · EACL 2024 Findings · 2024
Foundational steering template for a 'burstiness vector'
preprint bridge Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation
· · arXiv · 2026
GENERATES LoRA params as variable-length sequences — direct precedent for personalized burstiness-LoRA generation
preprint bridge Personalized Text Generation with Contrastive Activation Steering
Liu et al. · arXiv · 2025
User-specific style vectors via contrastive activation — closest analogue to a personal burstiness fingerprint
preprint bridge LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday People
· · arXiv · 2025
Prompt-only personalized-style imitation FAILS — dual support for Q1 (prompt ceiling) and Q6 (fingerprint hard by instruction). Strongest single motivation for model-level approach.
preprint Stylometry Recognizes Human and LLM-Generated Texts
· · arXiv · 2025
Stylometric human-vs-LLM discrimination — burstiness sits inside a working stylometric feature set.
preprint bridge StyleRemix: Authorship Obfuscation via Distillation
Fisher et al. · arXiv · 2024
Reverse of burstiness-injection — same machinery
preprint Large Language Models can be Guided to Evade AI-Generated Text Detection (SICO)
Lu et al. · TMLR 2024 · 2024
Forces sentence-length variance for detector evasion via prompt/in-context substitution: black-box, post-hoc, no logit steering. Cite-and-dismiss as orthogonal humanizer mechanism.

Q7 · Punctuation / paragraph proxies

preprint A Statistical Journey into the Poetic World of Evgenij Onegin
· · arXiv · 2026
Symbolic time-series + Markov modeling of phonological rhythm. Method transfer for Q7: model punctuation/segmentation as a symbolic time series → timing-free burstiness proxy.

TTS · Raitio (Apple)

peer-reviewed bridge Towards Controllable Speech Synthesis in the Era of LLMs
Lee et al. · EMNLP 2025 main · 2025
EMNLP 2025 unified TTS+LLM control framework. Cites DiTTo-TTS for speech rate via latent length.
peer-reviewed Style and Prosody control for Zero-shot Speech Synthesis
Suni, Antti et al. · SSW 2025 · 2025
Derived-vector approach to predictably modify synthesized speech
peer-reviewed Enhanced Prosody Modeling and Character Voice Controlling for Audiobooks
· · ACM · 2025
Prosodic expressiveness for audiobook synthesis
peer-reviewed Emphasis control for parallel neural TTS / Hierarchical Prosody Modeling
Raitio et al. · arXiv / Interspeech · 2022
Hierarchical latent space for prosodic emphasis
peer-reviewed bridge Controllable neural TTS using intuitive prosodic features
Raitio, Rasipuram, Castellani · Interspeech 2020 · 2020
The Raitio paper Vittoria named
preprint GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot TTS
· · arXiv · 2026
GRPO-trained LoRA for a continuous style attribute - method analogue for the GRPO burstiness controller arm.
preprint bridge TADA! Tuning Audio Diffusion Models through Activation Steering
· · arXiv · 2026
Activation steering applied to audio diffusion
preprint MAGIC-TTS: Fine-Grained Controllable Speech Synthesis
· · arXiv · 2026
Token-level duration conditioning
preprint bridge AgentSteerTTS: Multi-Agent Closed-Loop TTS Steering
· · arXiv · 2026
Multi-agent closed-loop steering — adversarial disentanglement

TTS · CTRL-P

peer-reviewed Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Mohan, Hu, Klimkov et al. · Interspeech 2021 · 2021
RESOLVED: the true 'CTRL-P' Vittoria referenced, distinct from Raitio 2020. Conditions on three acoustic correlates of prosody. PHASE 2 anchor, not core.

TTS · Bakkouche perception

peer-reviewed bridge Prosodic cues strengthen human-AI voice boundaries
Bakkouche et al. · ScienceDirect · 2025
The named Bakkouche perception study — listeners discriminate human vs AI by prosody

Bio · NN ↔ biological rhythm

peer-reviewed Language experience shapes predictive coding of rhythmic sound sequences
· · eLife · 2024
Brain rhythm-prediction is shaped by native-language experience
peer-reviewed Predictive Coding or Just Feature Discovery? An Alternative Account
· · PMC · 2024
Critical view — LM features predict brain but mechanism may not be predictive coding
peer-reviewed bridge Long-range and hierarchical language predictions in brains and algorithms
Caucheteux, Gramfort, King · Nature Human Behaviour · 2023
LM hierarchy matches brain hierarchy — strongest Vaswani↔biology bridge
preprint Toward a realistic model of speech processing in the brain with SSL
· · arXiv · 2022
SSL models match brain processing

Arch · Vaswani extensions

peer-reviewed bridge Long-range and hierarchical language predictions in brains and algorithms
Caucheteux, Gramfort, King · Nature Human Behaviour · 2023
LM hierarchy matches brain hierarchy — strongest Vaswani↔biology bridge

Research sessions

pass-1-2-baseline

Started 2026-06-21 18:00:56 · isolation: no-personal-carryover

Initial 8-query Firecrawl sweep of academic literature mapped to Vittoria's 7 questions + TTS + Bio anchors.

Queries (8)

Q2,Q4 · 10 results
controllable text generation LoRA style adapter frozen LLM stylistic steering arXiv
TTS-A · 10 results
Raitio Apple controllable prosody TTS conditioning CTRL-P prosodic speech synthesis
Q5,Q7 · 10 results
burstiness sentence length variance perplexity AI generated text detection GPTZero stylometry
Q6 · 10 results
stylometric fingerprint authorship attribution sentence rhythm English prose Mendenhall
Q3 · 10 results
AI generated text burstiness longitudinal GPT-3 GPT-4 stylistic evolution detection across model generations
Q1,Q7 · 10 results
prompt engineering ceiling stylistic distributional control LLM negative results long output drift punctuation paragraph rhythm
TTS-C · 9 results
Bakkouche prosody perception Interspeech 2025 TTS speech synthesis listener
Bio,Arch · 10 results
neural oscillations language predictive coding speech rhythm thought chunking transformer attention prosody biology Pickering Garrod

Findings (7)

Q4 novelty-bar The 'burstiness LoRA' steering mechanism already exists for OTHER style axes (sentiment, persona, authorship). Novelty bar = apply to rhythm + personal fingerprinting + perception validation, NOT invent the mechanism. cite · cite · cite
Q5 competing-work Tarım & Onan 2025 published the first systematic stylometric burstiness comparison (diffusion vs autoregressive). Must cite and position against, not ignore. cite
Q5 gap GPTZero's burstiness operationalization (Tian 2023) is a blog post, not peer-reviewed. This IS the formalization gap the Burstiness Engine identifies.
Q3 general Empirically confirmed: AI text style measurably shifts across model generations, detection rates drop with newer models. cite · cite
TTS-C perception Bakkouche 2025 perception study: listeners do NOT easily perceive humans and AI clones as the same person — prosody is the discriminator. Direct precedent for Vittoria's perception hypothesis.
Bio,Arch methodology Caucheteux et al. (Nature Human Behaviour 2023) show brain uses long-range hierarchical predictions matching LM architecture — the strongest single bridge between Vaswani and biology. cite
Q1 gap Prompt-engineering ceiling for distributional stylistic control is UNDER-PUBLISHED. The negative-result paper Vittoria proposes is itself a publishable contribution.

pass-3-cutting-edge

Started 2026-06-21 18:00:56 · isolation: no-personal-carryover

Last-12-months arXiv sweep for progression vs baseline. A/B against pass 1+2.

Queries (4)

Q2,Q4 [qdr:y] · 10 results
controllable text generation LoRA style adapter 2025 frozen LLM activation steering
Q3,Q5 [qdr:y] · 4 results
burstiness LLM text generation 2025 2026 stylometric variance peer-reviewed EMNLP ACL NAACL
TTS-A [qdr:y] · 10 results
prosody control TTS 2025 NaturalSpeech VALL-E neural speech synthesis variance adaptor latent
Q3,Q5 [qdr:y] · 4 results
AI generated text detection 2025 2026 burstiness perplexity DetectGPT Binoculars latest

Findings (5)

Q5,Q3 competing-work DivEye (arXiv:2509.18880, Sep 2025) captures how unpredictability FLUCTUATES across text via surprisal-based features. This is the most direct 2025 burstiness-adjacent detection paper. Must be in Vittoria's prior-art section. cite
TTS-A,Q2 methodology EMNLP 2025 main 'Towards Controllable Speech Synthesis in the Era of LLMs' (Lee et al.) is the unified TTS+LLM control framework. DiTTo-TTS controls speech rate via latent-length prediction — direct precedent for in-generation rhythm conditioning.
Q4,Q6 methodology Plug-and-Play LLM Fingerprinting (arXiv:2605.18474) generates LoRA params as variable-length sequences. This is the precedent for GENERATING personalized burstiness LoRAs from a user sample, not just training one. cite
Q2,Q4 methodology 'From Weights to Activations' (arXiv:2604.14090) positions activation steering as the 2025/26 frontier. The Burstiness Engine should explicitly choose between weight-space (LoRA) and activation-space (steering vectors) and justify. cite
Q4,TTS-A methodology AgentSteerTTS (arXiv:2605.17583) multi-agent closed-loop steering shows the field moving toward composed-controller systems. Burstiness Engine could be one controller in such a system.

pass-4-gap-closing

Started 2026-06-21 18:00:56 · isolation: no-personal-carryover

Targeted Firecrawl sweep closing Q1 prompt-ceiling, Q7 punctuation-proxy, and CTRL-P verification. Re-anchored to Vittoria's canonical scope (prompt-baseline → model-level; TTS Phase 2).

Queries (4)

Q1 [qdr:y] · 10 results
prompt-only stylistic control distributional ceiling negative results LLM sentence length variance kurtosis
Q7 [qdr:y]
punctuation paragraph rhythm proxy text generation LLM burstiness sentence segmentation
Q7 · 10 results
punctuation as proxy for sentence rhythm stylometry text segmentation author attribution
TTS-B · 10 results
Wagner Klimkov CTRL-P prosodic boundaries phrasing prominence ICASSP text-to-speech

Findings (5)

TTS-B resolution RESOLVED: CTRL-P is a distinct paper (arXiv:2106.08352, Ctrl-P: Temporal Control of Prosodic Variation, Interspeech 2021), NOT Raitio 2020. Closes the Pass-3 open action. Per Vittoria's scope it anchors Phase 2 (voice personalization), not the core paper. cite
Q1 competing-work Q1 is no longer empty: 'Benchmark of Stylistic Variation in LLM-Generated Texts' (2509.10179) measures variation across 16 frontier models + prompts. It does NOT frame this as a control ceiling vs a distributional target, so the Q1 wedge survives — but must now cite and differentiate. cite
Q1,Q6 methodology 'LLMs Still Struggle to Imitate Implicit Writing Styles' (2509.14543) shows prompt-only personalized-style imitation fails. This is the strongest single motivation for the model-level (LoRA/steering) approach over prompting. cite
Q1 methodology Measure burstiness only among quality-passing generations (effective semantic diversity, 2504.12522) so the Q1 experiment does not reward high-variance incoherence. cite
Q7 gap Q7 remains a genuine gap — no dedicated paper treats punctuation/paragraph as a burstiness proxy. Onegin time-series (2604.20221) gives a method: model segmentation as a symbolic time series for a timing-free proxy. Like Q1, a targeted standalone contribution. cite

Learnings (cross-session)

methodology: Run real Firecrawl academic searches; ResearchGravity's delegate_research is a stub that returns hollow 100% completes.
Why: Pass 1 chain-a360b2b3619b returned 'completed' in 1 second routed to insight_review + webhook_test — no papers retrieved.
How to apply: Always use Firecrawl --categories research as the actual fetcher; use ResearchGravity only as the INDEX layer.
methodology: Time-filter cutting-edge queries with --tbs qdr:y for last-12-months. Default arXiv search is undated and surfaces old work.
Why: Pass 3 only worked because of --tbs qdr:y; without it the same queries surface 2020-2023 papers.
How to apply: Every new-pass run should include --tbs qdr:y or qdr:m by default; explicitly opt out for historical sweeps.
methodology: Add as URL sources to NotebookLM, not arbitrary scraped PDFs. Wrong arXiv ID → wrong paper ingested silently.
Why: Pass 2 added arXiv:2208.07814 (jet physics) thinking it was Caucheteux — fixed by re-search.
How to apply: VERIFY arxiv ID matches the paper title BEFORE source_add. Search-then-add, never type-then-add.
methodology: Avoid PMC URLs in source_add — they hit reCAPTCHA and ingest the captcha page as 'source'.
Why: PMC10038805 was added and the source title showed 'Checking your browser - reCAPTCHA'.
How to apply: Prefer arXiv preprint over PMC final when both exist; or use SciHub/researcher-portal URLs.
novelty-bar: The Burstiness Engine novelty is the COMPOSITE (rhythm-target + personal fingerprinting + perception validation), not any single mechanism. Each ingredient has 2024-2026 precedent.
Why: Style Vectors (2024), StyleVector (2025), Bakkouche (2025), DivEye (2025), DiTTo-TTS (2025) collectively cover the building blocks.
How to apply: Frame paper as 'composing established mechanisms onto a new target with TTS-derived validation', not 'inventing burstiness steering'.
competing: Two competing papers are now live: Tarım & Onan 2025 (stylometric burstiness comparison) and DivEye Sep 2025 (surprisal-based diversity detection).
Why: Both published 2025, both treat burstiness/diversity as central. The Burstiness Engine ships into a crowded room.
How to apply: Cite both in the related-work section; differentiate on the in-generation CONTROL angle, not the measurement angle.
perception: Bakkouche 2025 + 'reduced F0 variation lowers naturalness' from ResearchGate 396811851 = direct evidence that suppressed prosodic variance harms perception. Strong analogy to suppressed burstiness in text.
Why: If TTS listeners notice suppressed variance, text readers likely do too — testable hypothesis with Bakkouche's methodology adapted.
How to apply: Use Bakkouche's experimental design as the template for Vittoria's text-perception study.
gap: Q1 (prompt-engineering ceiling for distributional stylistic control) is genuinely under-published. The systematic-negative-result paper does not exist yet.
Why: 8 Firecrawl queries surfaced zero peer-reviewed papers measuring prompt-only ceiling against distributional targets like sentence-length variance.
How to apply: Treat Q1 as a publishable contribution in itself OR as the motivation section of the main paper.
verification: Dario Amodei has no 'thesis on neural networks and biology'. His PhD was condensed-matter physics.
Why: Dico's notebook references this; no such work exists. Possible misattribution to Olah / Elhage / Anthropic interpretability thread. Dico's email cited a Princeton dataspace link (dsp013f462544k) — verify what it actually is.
How to apply: Ask Dico for the original source before paper-writing; the bio-bridge work can be carried by Caucheteux + eLife predictive coding without the Amodei citation.
scope: CANONICAL SCOPE (from Vittoria's email thread, May 21-28 2026): title = 'The Burstiness Engine: Modeling Human Rhythmic Patterns in Language Generation'. Spine = prompt-level control as baseline (show where it breaks) → model-level steering (auxiliary rhythm model / LoRA) as the answer. TTS is PHASE 2, not core. Emotion out of scope unless a dataset already exists.
Why: Vittoria explicitly flagged the original 'Engineering Synthetic Human Rhythm' title for ethics, said the TTS prosody angle 'makes the surface too big, we'd lose depth', and chose model-level as the real contribution with prompt-level as baseline.
How to apply: All synthesis and the paper outline follow this spine. Do not foreground TTS. The deliverable Vittoria is waiting on is shareable ResearchGravity results she can read independently before a sync call.

All papers

Title	Authors	Venue	Year	Peer?
AI-Generated Text Detection: A Comprehensive Review of Active Methods	·	ScienceDirect	2026	✓
Trusting AI to detect AI?	·	Computers in Human Behavior	2026	✓
Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs	·	PMC	2025	✓
Enhanced Prosody Modeling and Character Voice Controlling for Audiobooks	·	ACM	2025	✓
Prosodic cues strengthen human-AI voice boundaries	Bakkouche et al.	ScienceDirect	2025	✓
Style and Prosody control for Zero-shot Speech Synthesis	Suni, Antti et al.	SSW 2025	2025	✓
Towards Controllable Speech Synthesis in the Era of LLMs	Lee et al.	EMNLP 2025 main	2025	✓
How is ChatGPT's Behavior Changing Over Time?	Chen, Zaharia, Zou	Harvard Data Science Review	2024	✓
Language experience shapes predictive coding of rhythmic sound sequences	·	eLife	2024	✓
Predictive Coding or Just Feature Discovery? An Alternative Account	·	PMC	2024	✓
Style Vectors for Steering Generative Large Language Models	Konen, Jentzsch, Diallo et al.	EACL 2024 Findings	2024	✓
Distinguishing ChatGPT-3.5 vs -4 vs human Japanese texts	·	PMC	2023	✓
Long-range and hierarchical language predictions in brains and algorithms	Caucheteux, Gramfort, King	Nature Human Behaviour	2023	✓
Emphasis control for parallel neural TTS / Hierarchical Prosody Modeling	Raitio et al.	arXiv / Interspeech	2022	✓
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis	Mohan, Hu, Klimkov et al.	Interspeech 2021	2021	✓
FUDGE: Controlled Text Generation With Future Discriminators	Yang, Klein	NAACL 2021	2021	✓
Mirostat: A Neural Text Decoding Algorithm that Directly Controls Perplexity	Basu et al.	ICLR 2021	2021	✓
Controllable neural TTS using intuitive prosodic features	Raitio, Rasipuram, Castellani	Interspeech 2020	2020	✓
A Statistical Journey into the Poetic World of Evgenij Onegin	·	arXiv	2026	·
A Unified Study of LoRA Variants: Taxonomy, Review, Codebase	·	arXiv	2026	·
AgentSteerTTS: Multi-Agent Closed-Loop TTS Steering	·	arXiv	2026	·
CARD: Cluster-level Adaptation with Reward-guided Decoding	·	arXiv	2026	·
Continuous Control of Editing Models via Adaptive-Origin Guidance	·	arXiv	2026	·
From Weights to Activations: Is Steering the Next Frontier of LLMs?	Turner et al.	arXiv	2026	·
GLASS: GRPO-Trained LoRA for Acoustic Style Steering in Zero-Shot TTS	·	arXiv	2026	·
MAGIC-TTS: Fine-Grained Controllable Speech Synthesis	·	arXiv	2026	·
Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation	·	arXiv	2026	·
Self-Supervised Honesty Steering via Anti-Parallel Representations	·	arXiv	2026	·
Styles + Persona-plug = Customized LLMs	·	arXiv	2026	·
TADA! Tuning Audio Diffusion Models through Activation Steering	·	arXiv	2026	·
The Statistical Signature of LLMs	·	arXiv	2026	·
A Training-free Method for LLM Text Attribution	·	arXiv	2025	·
Benchmark of Stylistic Variation in LLM-Generated Texts	·	arXiv	2025	·
Beyond Checkmate: Creative Choke Points in AI Text	·	arXiv	2025	·
Can You Detect the Difference? Stylometric Comparison of Diffusion vs Autoregressive Text	Tarım & Onan	arXiv	2025	·
Detecting LLM-Generated Short Answers	·	arXiv	2025	·
DivEye: Diversity Boosts AI-Generated Text Detection	·	arXiv	2025	·
Evaluating the Diversity and Quality of LLM Generated Content	·	arXiv	2025	·
LLMBraces: Straightening Out LLM Predictions	·	arXiv	2025	·
LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday People	·	arXiv	2025	·
Low-Rank Adaptation for Foundation Models — Survey	·	arXiv	2025	·
Merge and Guide: Unifying Model Merging and Guided Decoding for Controllable Multi-Objective Generation	·	arXiv	2025	·
Personalized Text Generation with Contrastive Activation Steering	Liu et al.	arXiv	2025	·
RedNote-Vibe: Temporal Dynamics of AI-Generated Text	·	arXiv	2025	·
Stylometry Recognizes Human and LLM-Generated Texts	·	arXiv	2025	·
Continuous Language Model Interpolation for Dynamic Control	·	arXiv	2024	·
Detecting AI-Generated Text: Factors Influencing Detectability	·	arXiv	2024	·
Large Language Models can be Guided to Evade AI-Generated Text Detection (SICO)	Lu et al.	TMLR 2024	2024	·
StyleRemix: Authorship Obfuscation via Distillation	Fisher et al.	arXiv	2024	·
Toward a realistic model of speech processing in the brain with SSL	·	arXiv	2022	·