ABOUT THIS ISSUE

How this newsletter was synthesized?

Methodology

This newsletter is generated by an AI pipeline (leveraging Anthropic Sonnet 4.5 & Haiku 4.5) that processes the metadata and abstracts of every new arXiv HCI paper from the past week—115 this issue. Each paper is scored on three dimensions: Practice (applicability for practitioners), Research (scientific contribution), and Strategy (industry implications), with scores from 1-5. Papers passing threshold are grouped into topic clusters, and each cluster is summarized to capture what that body of research is exploring.

Selection Criteria

The pipeline builds a curated selection that balances high scores with topic diversity—and deliberately includes at least one 'contrarian' paper that challenges prevailing assumptions. This selection is then analyzed to identify key findings (patterns across multiple papers) and surprises (results that contradict conventional wisdom). A narrative synthesis ties the week's research together under a unifying frame.

Key Themes Discovered

Field Report: ai-interaction

Trust, Reliability, and Alignment

This cluster examines how humans calibrate trust in AI systems and whether AI outputs align with user intent and societal values. Core tensions emerge: LLMs systematically overgeneralize scientific findings; GUI agents require formal verification to prevent erroneous actions; AI-generated content risks manipulation through personification. Research spans evaluation frameworks (SPHERE), adversarial robustness, multimodal grounding, and domain-specific reliability (medical, financial, news). The work is primarily for systems designers and policymakers navigating deployment risks.

1/10