Synthetic Participants Fail Where It Counts

Week 20May 2026

Week 22May 2026

ABOUT THIS ISSUE

How was this newsletter synthesized?

Methodology

This newsletter is generated by an AI pipeline (leveraging Anthropic Sonnet 4.5 & Haiku 4.5) that processes the metadata and abstracts of every new arXiv HCI paper from the past week—99 this issue. Each paper is scored on three dimensions: Practice (applicability for practitioners), Research (scientific contribution), and Strategy (industry implications), with scores from 1-5. Papers passing threshold are grouped into topic clusters, and each cluster is summarized to capture what that body of research is exploring.

Selection Criteria

The pipeline builds a curated selection that balances high scores with topic diversity—and deliberately includes at least one 'contrarian' paper that challenges prevailing assumptions. This selection is then analyzed to identify key findings (patterns across multiple papers) and surprises (results that contradict conventional wisdom). A narrative synthesis ties the week's research together under a unifying frame.

Key Themes Discovered

Field Report: ai-interaction

Trust, Calibration, and Behavioral Misalignment

This cluster examines how users form and maintain appropriate reliance on AI systems amid systematic behavioral misalignment. Core tensions emerge: synthetic AI participants fail to replicate human decision-making patterns; users overestimate AI efficiency gains and underestimate their own usage; LLM-generated preferences diverge from real user preferences; and AI assistance paradoxically erodes skill development. Research spans trust calibration frameworks, skill atrophy mechanisms, and design interventions that surface AI limitations. The work is primarily relevant for UX researchers, product teams, and system designers managing human-AI workflows.

1/8

The efficiency-gain illusion: People underestimate the rate of AI use and overestimate its benefits on simple tasks

2605.20149

Less Back-and-Forth: A Comparative Study of Structured Prompting

2605.21035

What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience

Platform architecture determines whether recommendation algorithms can shape information quality on social media

PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience

Distorted Perspectives of LLM-Simulated Preferences: Can AI Mislead Design?

Multi-Week, In-Class Deployments of Telepresence Robots With Four Homebound K-12 Students: Benefits, Challenges, and Recommendations

Multi-site PPG

PROTEA

PaintCopilot

PLACES

Loom

HITL-D

Conflict Context Evaluation Framework

Reflecti-Mate

The efficiency-gain illusion: People underestimate the rate of AI use and overestimate its benefits on simple tasks

Proximal State Nudging: Reducing Skill Atrophy from AI Assistance

Less Back-and-Forth: A Comparative Study of Structured Prompting

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents

When Support Escalates Distress: Regulation and Escalation in LLM Responses to Venting and Advice-Seeking

The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning

AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users

How was this newsletter synthesized?

Methodology

Selection Criteria

Key Themes Discovered

Field Report: ai-interaction

Trust, Calibration, and Behavioral Misalignment

Top Papers in this Theme

Proximal State Nudging: Reducing Skill Atrophy from AI Assistance

DARE-EEG: A Foundation Model for Mining Dual-Aligned Representation of EEG

The efficiency-gain illusion: People underestimate the rate of AI use and overestimate its benefits on simple tasks

Less Back-and-Forth: A Comparative Study of Structured Prompting

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents

Synthesized using AI

What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience

Platform architecture determines whether recommendation algorithms can shape information quality on social media

PROTEA: Offline Evaluation and Iterative Refinement for Multi-Agent LLM Workflows

What Would GPT Click: Practical Effects of Human-AI Behavioral Misalignment and the Cost of Synthetic Participants in User Experience

Distorted Perspectives of LLM-Simulated Preferences: Can AI Mislead Design?

Multi-Week, In-Class Deployments of Telepresence Robots With Four Homebound K-12 Students: Benefits, Challenges, and Recommendations

Multi-site PPG

PROTEA

PaintCopilot

PLACES

Loom

HITL-D

Conflict Context Evaluation Framework

Reflecti-Mate

The efficiency-gain illusion: People underestimate the rate of AI use and overestimate its benefits on simple tasks

Proximal State Nudging: Reducing Skill Atrophy from AI Assistance

Less Back-and-Forth: A Comparative Study of Structured Prompting

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents

When Support Escalates Distress: Regulation and Escalation in LLM Responses to Venting and Advice-Seeking

The Impact of AI Usage and Informativeness on Skill Development in Logical Reasoning

AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users

Efficiency gains hide competence losses

How was this newsletter synthesized?

Methodology

Selection Criteria

Key Themes Discovered

Field Report: ai-interaction

Trust, Calibration, and Behavioral Misalignment

Top Papers in this Theme

Proximal State Nudging: Reducing Skill Atrophy from AI Assistance

DARE-EEG: A Foundation Model for Mining Dual-Aligned Representation of EEG

The efficiency-gain illusion: People underestimate the rate of AI use and overestimate its benefits on simple tasks

Less Back-and-Forth: A Comparative Study of Structured Prompting

The Quiet Path from Seemingly Minor Design Errors to Workplace AI Incidents