arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.25126 2026-04-29 cs.RO

HANDFUL: Sequential Grasp-Conditioned Dexterous Manipulation with Resource Awareness

Ethan Foong, Yunshuang Li, Hao Jiang, Gaurav S. Sukhatme, Daniel Seita

详情

英文摘要

Dexterous robot hands offer rich opportunities for multifunctional manipulation, where a robot must execute multiple skills in sequence while maintaining control over previously grasped objects. Most prior work in dexterous manipulation focuses on single-object, single-skill tasks. In contrast, our insight is that many sequential tasks require resource-aware grasps that conserve fingers for future actions. In this paper, we study sequential grasp-conditioned dexterous manipulation, where a robot first grasps an object and then performs a second, distinct manipulation subtask while preserving the initial grasp. We introduce HANDFUL, a learning framework that models finger usage as a limited resource and encourages exploration of resource-aware grasps through finger-level contact rewards. These grasps are subsequently selected for downstream tasks via curriculum-based policy learning. We further propose HANDFUL-Bench, a simulation benchmark that introduces sequential dexterous manipulation tasks across multiple secondsubtask objectives, including pushing, pulling, and pressing, under a shared grasp-conditioned setup. Extensive simulation results demonstrate that prioritizing resource-aware grasps improves second-subtask success and robustness compared to a baseline that greedily optimizes the initial grasp before attempting the second subtask. We additionally validate our approach on a real dexterous LEAP hand. Together, this work establishes resource-aware grasp planning as a key principle for multifunctional dexterous manipulation. Supplementary material is available on our website: https://handful-dex.github.io.

URL PDF HTML ☆

赞 0 踩 0

2604.25122 2026-04-29 cs.CV cs.AI

M$^3$-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering

Jiatong Ma, Longteng Guo, Yuchen Liu, Zijia Zhao, Dongze Hao, Xuanxu Lin, Jing Liu

2604.25119 2026-04-29 cs.LG cs.CY

Evaluation without Generation: Non-Generative Assessment of Harmful Model Specialization with Applications to CSAM

Vinith M. Suriyakumar, Ayush Sekhari, Lena Stempfle, Robertson Wang, Michael Simpson, Rebecca Portnoff, Marzyeh Ghassemi, Ashia C. Wilson

2604.25110 2026-04-29 cs.LG cs.AI

Knowledge Distillation Must Account for What It Loses

Wenshuo Wang

2604.25102 2026-04-29 cs.CV

One Perturbation, Two Failure Modes: Probing VLM Safety via Embedding-Guided Typographic Perturbations

Ravikumar Balakrishnan, Sanket Mendapara

2604.25096 2026-04-29 cs.CL cs.HC

The Dynamics of Delusion: Modeling Bidirectional False Belief Amplification in Human-Chatbot Dialogue

Ashish Mehta, Jared Moore, Jacy Reese Anthis, William Agnew, Eric Lin, Peggy Yin, Desmond C. Ong, Nick Haber, Carol Dweck

2604.25088 2026-04-29 cs.AI cs.CL

Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest

Abigail O'Neill, Alan Zhu, Mihran Miroyan, Narges Norouzi, Joseph E. Gonzalez

2604.25083 2026-04-29 cs.AI cs.AR

Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization

Alexander Blasberg, Vasilis Kypriotis, Dimitrios Skarlatos

详情

英文摘要

Rapid advances in Large Language Models (LLMs) create new opportunities by enabling efficient exploration of broad, complex design spaces. This is particularly valuable in computer architecture, where performance depends on microarchitectural designs and policies drawn from vast combinatorial spaces. We introduce Agentic Architect, an agentic AI framework for computer architecture design exploration and optimization that combines LLM-driven code evolution with cycle-accurate simulation. The human architect specifies the optimization target, seed design, scoring function, simulator interface, and benchmark split, while the LLM explores implementations within these constraints. Across cache replacement, data prefetching, and branch prediction, Agentic Architect matches or exceeds state-of-the-art designs. Our best evolved cache replacement design achieves a 1.062x geomean IPC speedup over LRU, 0.6% over Mockingjay (1.056x). Our evolved branch predictor achieves a 1.100x geomean IPC speedup over Bimodal, 1.5% over its Hashed Perceptron seed (1.085x). Finally, our evolved prefetcher achieves a 1.76x geomean IPC speedup over no prefetching, 17% over its VA/AMPM Lite seed (1.59x) and 21% over SMS (1.55x). Our analysis surfaces several findings about agentic AI-driven microarchitecture design. Across evolved designs, components often correspond to known techniques; the novelty lies in how they are coordinated. The architect's role is shifting, but the human remains central. Seed quality bounds what search can achieve: evolution can refine and extend an existing mechanism, but cannot compensate for a weak foundation. Likewise, objectives, constraints, and prompt guidance affect reliability and generalization. Overall, Agentic Architect is the first end-to-end open-source framework for agentic AI architecture exploration and optimization.

URL PDF HTML ☆

赞 0 踩 0

2604.25077 2026-04-29 cs.AI

Evaluating Risks in Weak-to-Strong Alignment: A Bias-Variance Perspective

Hamid Osooli, Kareema Batool, Rick Gentry, Tiasa Singha Roy, Ashwin Gupta, Anirudha Ramesh

2604.25076 2026-04-29 cs.LG

Zero Shot Coordination for Sparse Reward Tasks with Diverse Reward Shapings

Keenan Powell, Peihong Yu, Pratap Tokekar

2604.25073 2026-04-29 cs.LG

Feasible-First Exploration for Constrained ML Deployment Optimization in Crash-Prone Hierarchical Search Spaces

Christian Lysenstøen

Comments 22 pages, 5 figures, 10 tables. Code available at https://github.com/Chrislysen/Constrained-ML-Deployment

2604.25072 2026-04-29 cs.CV

Beyond Accuracy: Benchmarking Cross-Task Consistency in Unified Multimodal Models

Weixing Wang, Liudvikas Zekas, Anton Hackl, Constantin Alexander Auga, Parisa Shahabinejad, Jona Otholt, Antonio Rueda-Toicen, Gerard de Melo

2604.25065 2026-04-29 cs.CV

ShapeY: A Principled Framework for Measuring Shape Recognition Capacity via Nearest-Neighbor Matching

Jong Woo Nam, Amanda S. Rios, Bartlett W. Mel

2604.25057 2026-04-29 cs.LG cs.DL cs.HC cs.IR

CiteRadar: A Citation Intelligence Platform for Researcher Profiling and Geographic Visualization

Chenxu Niu, Yiming Sun

详情

英文摘要

Understanding the geographic reach and community structure of one's scholarly citations is increasingly valuable for career development, grant applications, and collaboration discovery -- yet accessible tools for answering these questions remain scarce. Existing bibliometric platforms either require costly institutional subscriptions or expose only aggregate citation counts without granular per-author metadata. We present CiteRadar, an open-source system that accepts a single Google Scholar user identifier and automatically produces a structured output folder containing: the author's complete publication list, all retrieved citing papers with enriched author metadata, two ranked author tables (by citation frequency and by h-index), a plain-text statistical summary, and a self-contained interactive HTML world map -- all from a single command-line invocation. CiteRadar integrates five heterogeneous data sources -- Google Scholar, OpenAlex, CrossRef, Semantic Scholar, and OpenStreetMap Nominatim -- through a carefully engineered five-stage pipeline. Key technical contributions include: (1) a Scholar meta-string parser resilient to Unicode non-breaking-space separators, a pervasive but undocumented quirk in Scholar's HTML that silently corrupts venue and year fields when unhandled; (2) a two-stage author disambiguation system using stop-word-filtered institution name similarity to guard against the well-known same-name entity-merging failure mode in bibliometric databases, demonstrated to eliminate h-index attribution errors of up to 9x the correct value; (3) an OpenAlex web-URL to API-URL conversion fix that raises the fraction of author records with city-level location data from 0% to ~60%; and (4) a logarithmically-scaled interactive Folium world map with per-city researcher popups, rendered as a fully self-contained HTML file.

URL PDF HTML ☆

赞 0 踩 0

2604.25053 2026-04-29 cs.CL cs.AI

Analyzing LLM Reasoning to Uncover Mental Health Stigma

Sreehari Sankar, Aliakbar Nafar, Mona Barman, Hannah K. Heitz, Ashwin Kumar, Pouria Tohidi, Dailun Li, Danish Hussain, Russell DuBois, Hamed Hasheminia, Farshad Majzoubi

2604.25040 2026-04-29 cs.AI cs.CL

Leverage Laws: A Per-Task Framework for Human-Agent Collaboration

Stan Loosmore

Comments 10 pages, 2 figures

2604.25039 2026-04-29 cs.CL cs.AI

Dual-Track CoT: Budget-Aware Stepwise Guidance for Small LMs

Sagnik Chatterjee, Atharva Patil, Sricharan Ramesh

2604.25028 2026-04-29 cs.LG cs.LO stat.ML

Null Measurability at the Symmetrization Interface in VC Learning

Dhruv Gupta

Comments 12 pages. Companion Lean 4 formalization: https://github.com/Zetetic-Dhruv/formal-learning-theory-kernel/tree/v3.3.0-paper

2604.25021 2026-04-29 cs.LG

Dynamic Regret for Online Regression in RKHS via Discounted VAW and Subspace Approximation

Dmitry B. Rokhlin, Georgiy A. Karapetyants

Comments 26 pages

2604.25012 2026-04-29 cs.LG

Why Search When You Can Transfer? Amortized Agentic Workflow Design from Structural Priors

Shiyi Du, Jiayuan Liu, Weihua Du, Yue Huang, Jiayi Li, Yingtao Luo, Xiangliang Zhang, Vincent Conitzer, Carl Kingsford

2604.25011 2026-04-29 cs.CL

Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language Models

Dan Shi, Zhuowen Han, Simon Ostermann, Renren Jin, Josef van Genabith, Deyi Xiong

Comments ACL 2026 Main Conference

2604.25000 2026-04-29 cs.AI cs.SE

Toward a Science of Intent: Closure Gaps and Delegation Envelopes for Open-World AI Agents

Maximiliano Armesto, Christophe Kolb

Comments 15 pages, 1 figure, 5 tables

2604.24999 2026-04-29 cs.CV cs.AI

BifDet: A 3D Bifurcation Detection Dataset for Airway-Tree Modeling

Ali Keshavarzi, Quentin Bouniot, Benjamin M. Smith, Elsa Angelini

Comments This manuscript is currently in preparation for submission

2604.24997 2026-04-29 cs.CV

DouC: Dual-Branch CLIP for Training-Free Open-Vocabulary Segmentation

Mohamad Zamini, Diksha Shukla

2604.24996 2026-04-29 cs.AI

Sparse Personalized Text Generation with Multi-Trajectory Reasoning

Bo Ni, Haowei Fu, Qinwen Ge, Franck Dernoncourt, Samyadeep Basu, Nedim Lipka, Seunghyun Yoon, Yu Wang, Nesreen K. Ahmed, Subhojyoti Mukherjee, Puneet Mathur, Ryan A. Rossi, Tyler Derr

2604.24993 2026-04-29 cs.LG

Laplace-Bridged Randomized Smoothing for Fast Certified Robustness

Miao Lin, MD Saifur Rahman Mazumder, Feng Yu, Daniel Takabi, Rui Ning

2604.24987 2026-04-29 cs.AI

Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

Seok Hwan Song, Azher Ahmed Efat, Wallapak Tavanapong

2604.24983 2026-04-29 cs.AI

Adaptive Prompt Embedding Optimization for LLM Jailbreaking

Miles Q. Li, Benjamin C. M. Fung, Boyang Li, Radin Hamidi Rad, Ebrahim Bagheri

2604.24978 2026-04-29 cs.CL cs.SE

Dont Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware Termination

Prafulla Kumar Choubey, Kung-Hsiang Huang, Pranav Narayanan Venkit, Jiaxin Zhang, Vaibhav Vats, Yu Li, Xiangyu Peng, Chien-Sheng Wu

Comments ACL Industry 2026

2604.24977 2026-04-29 cs.CL cs.HC

A Survey on LLM-based Conversational User Simulation

Bo Ni, Leyao Wang, Yu Wang, Branislav Kveton, Franck Dernoncourt, Yu Xia, Hongjie Chen, Reuben Leura, Samyadeep Basu, Subhojyoti Mukherjee, Puneet Mathur, Nesreen Ahmed, Junda Wu, Li Li, Huixin Zhang, Ruiyi Zhang, Tong Yu, Sungchul Kim, Jiuxiang Gu, Zhengzhong Tu, Alexa Siu, Zichao Wang, David Seunghyun Yoon, Nedim Lipka, Namyong Park, Zihao Lin, Trung Bui, Yue Zhao, Tyler Derr, Ryan A. Rossi

Comments Submitted in August 2025. MOD-81000 approved survey