ABOUT THIS ISSUE

How this newsletter was synthesized?

Methodology

This newsletter is generated by an AI pipeline (leveraging Anthropic Sonnet 4.5 & Haiku 4.5) that processes the metadata and abstracts of every new arXiv HCI paper from the past week—95 this issue. Each paper is scored on three dimensions: Practice (applicability for practitioners), Research (scientific contribution), and Strategy (industry implications), with scores from 1-5. Papers passing threshold are grouped into topic clusters, and each cluster is summarized to capture what that body of research is exploring.

Selection Criteria

The pipeline builds a curated selection that balances high scores with topic diversity—and deliberately includes at least one 'contrarian' paper that challenges prevailing assumptions. This selection is then analyzed to identify key findings (patterns across multiple papers) and surprises (results that contradict conventional wisdom). A narrative synthesis ties the week's research together under a unifying frame.

Key Themes Discovered

Field Report: ai-interaction

Trust, Calibration, and Human Agency

This cluster examines how humans interact with and depend on AI systems, focusing on trust calibration, user agency, and performance outcomes. Core tensions emerge: AI tools often underperform expectations (developer productivity slows despite forecasts), users overrely on overconfident outputs across languages, and literacy gaps determine independent capability post-support. Research spans trust formation in ChatGPT, multilingual miscalibration risks, and the "agency gap" where GenAI literacy predicts self-directed performance. Methodologically diverse—mixing RCTs, behavioral experiments, qualitative interviews, and human-centered evaluations—the work prioritizes understanding *when* and *why* humans calibrate trust incorrectly, rather than optimizing AI accuracy alone.

1/10