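arXivDaily: a daily academic digest synced with the full arXiv dataset, with AI summaries and translations, covering artificial intelligence, robotics, computer science, finance, statistics, mathematics, physics, biology, economics, electrical engineering & systems, and other fields.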

2509.17766 2026-04-08 cs.CL cs.AI

A State-Update Prompting Strategy for Efficient and Robust Multi-turn Dialogue

Ziyi Liu

Abstract

Large Language Models (LLMs) struggle with information forgetting and inefficiency in long-horizon, multi-turn dialogues. To address this, we propose a training-free prompt engineering method, the State-Update Multi-turn Dialogue Strategy. It utilizes "State Reconstruction" and "History Remind" mechanisms to effectively manage dialogue history. Our strategy shows strong performance across multiple multi-hop QA datasets. For instance, on the HotpotQA dataset, it improves the core information filtering score by 32.6%, leading to a 14.1% increase in the downstream QA score, while also reducing inference time by 73.1% and token consumption by 59.4%. Ablation studies confirm the pivotal roles of both components. Our work offers an effective solution for optimizing LLMs in long-range interactions, providing new insights for developing more robust Agents.

2508.21435 2026-04-08 cs.CV cs.AI

MedShift: Implicit Conditional Transport for X-Ray Domain Adaptation

Francisco Caetano, Christiaan Viviers, Peter H. N. De With, Fons van der Sommen

Comments Accepted at the ICCV 2025 AIM Workshop

2508.08900 2026-04-08 cs.CV

DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation

Noor Islam S. Mohammad, Md Muntaqim Meherab

Comments We have recently had author conflicts with this work; I heartily request to withdraw his paper as soon as possible

2507.05084 2026-04-08 cs.LG stat.ML

Distribution-dependent Generalization Bounds for Tuning Linear Regression Across Tasks

Maria-Florina Balcan, Saumya Goyal, Dravyansh Sharma

Comments 55 pages

2507.03617 2026-04-08 cs.CL

EMERGE: A Benchmark for Updating Knowledge Graphs with Emerging Textual Knowledge

Klim Zaporojets, Daniel Daza, Edoardo Barba, Ira Assent, Roberto Navigli, Paul Groth

2506.13040 2026-04-08 cs.CV

MAMMA: Markerless & Automatic Multi-Person Motion Action Capture

Hanz Cuevas-Velasquez, Anastasios Yiannakidis, Soyong Shin, Giorgio Becherini, Markus Höschle, Joachim Tesch, Taylor Obersat, Tsvetelina Alexiadis, Eni Halilaj, Michael J. Black

Comments Main paper and supplementary material

Journal ref: CVPR2026

Abstract

We present MAMMA, a markerless motion-capture pipeline that accurately recovers SMPL-X parameters from multi-view video of two-person interaction sequences. Traditional motion-capture systems rely on physical markers. Although they offer high accuracy, their requirements of specialized hardware, manual marker placement, and extensive post-processing make them costly and time-consuming. Recent learning-based methods attempt to overcome these limitations, but most are designed for single-person capture, rely on sparse keypoints, or struggle with occlusions and physical interactions. In this work, we introduce a method that predicts dense 2D contact-aware surface landmarks conditioned on segmentation masks, enabling person-specific correspondence estimation even under heavy occlusion. We employ a novel architecture that exploits learnable queries for each landmark. We demonstrate that our approach can handle complex person-person interaction and offers greater accuracy than existing methods. To train our network, we construct a large, synthetic multi-view dataset combining human motions from diverse sources, including extreme poses, hand motions, and close interactions. Our dataset yields high-variability synthetic sequences with rich body contact and occlusion, and includes SMPL-X ground-truth annotations with dense 2D landmarks. The result is a system capable of capturing human motion without the need for markers. Our approach offers competitive reconstruction quality compared to commercial marker-based motion-capture solutions, without the extensive manual cleanup. Finally, we address the absence of common benchmarks for dense-landmark prediction and markerless motion capture by introducing two evaluation settings built from real multi-view sequences. Our dataset is available at https://mamma.is.tue.mpg.de for research purposes.

2505.22765 2026-04-08 cs.CL cs.SD eess.AS

StressTest: Can YOUR Speech LM Handle the Stress?

Iddo Yosha, Gallil Maimon, Yossi Adi

Comments Accepted to ACL 2026

2504.05995 2026-04-08 cs.CL cs.AI

NativQA Framework: Enabling LLMs and VLMs with Native, Local, and Everyday Knowledge

Firoj Alam, Md Arid Hasan, Sahinur Rahman Laskar, Mucahid Kutlu, Kareem Darwish, Shammur Absar Chowdhury

Comments LLMs, Native, Multilingual, Language Diversity, Contextual Understanding, Minority Languages, Culturally Informed, Foundation Models, Large Language Models

2503.18297 2026-04-08 cs.CV

Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module

Yishen Liu

Comments arXiv admin note: This submission has been withdrawn by arXiv administrators due to incorrect authorship. Author list truncated

2410.13469 2026-04-08 cs.LG

Interpreting Temporal Graph Neural Networks with Koopman Theory

Michele Guerra, Simone Scardapane, Filippo Maria Bianchi

2407.14971 2026-04-08 cs.CV cs.AI cs.CL cs.LG

Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

Md Zarif Hossain, Ahmed Imteaj

Comments Accepted at IJCNN 2026

2312.02095 2026-04-08 cs.LG

Single-sample versus case-control sampling scheme for Positive Unlabeled data: the story of two scenarios

Jan Mielniczuk, Adam Wawrzeńczyk

2604.05831 2026-04-08 cs.RO

BiCoord: A Bimanual Manipulation Benchmark towards Long-Horizon Spatial-Temporal Coordination

Xingyu Peng, Chen Gao, Liankai Jin, Annan Li, Si Liu

Comments 8 pages

2604.05830 2026-04-08 cs.CL cs.AI

"OK Aura, Be Fair With Me": Demographics-Agnostic Training for Bias Mitigation in Wake-up Word Detection

Fernando López, Paula Delgado-Santos, Pablo Gómez, David Solans, Jordi Luque

Comments Accepted at Speech Language Models in Low-Resource Settings: Performance, Evaluation, and Bias Analysis (SPEAKABLE) - LREC2026 Workshops

2604.05829 2026-04-08 cs.LG stat.ML

Bivariate Causal Discovery Using Rate-Distortion MDL: An Information Dimension Approach

Tiago Brogueira, Mário A. T. Figueiredo

Comments 22 pages

2604.05826 2026-04-08 cs.AI cs.CY

Reciprocal Trust and Distrust in Artificial Intelligence Systems: The Hard Problem of Regulation

Martino Maggetti

2604.05819 2026-04-08 cs.CV cs.LG

Learn to Rank: Visual Attribution by Learning Importance Ranking

David Schinagl, Christian Fruhwirth-Reisinger, Alexander Prutsch, Samuel Schulter, Horst Possegger

2604.05794 2026-04-08 cs.CV cs.GR

EfficientMonoHair: Fast Strand-Level Reconstruction from Monocular Video via Multi-View Direction Fusion

Da Li, Dominik Engel, Deng Luo, Ivan Viola

Comments 10 pages, 6 figures, conference

2604.05788 2026-04-08 cs.CV

Sparse Gain Radio Map Reconstruction With Geometry Priors and Uncertainty-Guided Measurement Selection

Zhihan Zeng, Ning Wei, Muhammad Baqer Mollah, Kaihe Wang, Phee Lep Yeoh, Fei Xu, Yue Xiu, Zhongpei Zhang

2604.05781 2026-04-08 cs.CV

RHVI-FDD: A Hierarchical Decoupling Framework for Low-Light Image Enhancement

Junhao Yang, Bo Yang, Hongwei Ge, Yanchun Liang, Heow Pueh Lee, Chunguo Wu

Comments 8 pages, 8 figures

2604.05780 2026-04-08 cs.CV

Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion

Yu Xue, Longjun Gao, Yuanqi Su, HaoAng Lu, Xiaoning Zhang

Comments Accepted at CVPR 2026

2604.05779 2026-04-08 cs.CL cs.AI

What Models Know, How Well They Know It: Knowledge-Weighted Fine-Tuning for Learning When to Say "I Don't Know"

Joosung Lee, Hwiyeol Jo, Donghyeon Ko, Kyubyung Chae, Cheonbok Park, Jeonghoon Kim

Comments 8 pages

2604.05775 2026-04-08 cs.CL q-bio.GN

PhageBench: Can LLMs Understand Raw Bacteriophage Genomes?

Yusen Hou, Weicai Long, Haitao Hu, Houcheng Su, Junning Feng, Yanlin Zhang

2604.05773 2026-04-08 cs.CV

PDMP: Rethinking Balanced Multimodal Learning via Performance-Dominant Modality Prioritization

Shicai Wei, Chunbo Luo, Qiang Zhu, Yang Luo

2604.05761 2026-04-08 cs.CV

Improving Controllable Generation: Faster Training and Better Performance via $x_0$-Supervision

Amadou S. Sangare, Adrien Maglo, Mohamed Chaouch, Bertrand Luvison

2604.05757 2026-04-08 cs.CL

Identifying Influential N-grams in Confidence Calibration via Regression Analysis

Shintaro Ozaki, Wataru Hashimoto, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

2604.05756 2026-04-08 cs.CL

Controlling Distributional Bias in Multi-Round LLM Generation via KL-Optimized Fine-Tuning

Yanbei Jiang, Amr Keleg, Ryandito Diandaru, Jey Han Lau, Lea Frermann, Biaoyan Fang, Fajri Koto

Comments Accepted at ACL Main Conference

2604.05749 2026-04-08 cs.RO cs.SY eess.SY

Hazard Management in Robot-Assisted Mammography Support

Ioannis Stefanakos, Roisin Bradley, Radu Calinescu, Beverley Townsend, Tianyuan Wang, Jihong Zhu

2604.05748 2026-04-08 cs.CV

SVC 2026: the Second Multimodal Deception Detection Challenge and the First Domain Generalized Remote Physiological Measurement Challenge

Dongliang Zhu, Zhiyi Niu, Bo Zhao, Jiajian Huang, Shuo Ye, Xun Lin, Hui Ma, Taorui Wang, Jiayu Zhang, Chunmei Zhu, Junzhe Cao, Yingjie Ma, Rencheng Song, Albert Clapés, Sergio Escalera, Dan Guo, Zitong Yu

Comments Accepted by the SVC workshop @ CVPR 2026

2604.05742 2026-04-08 cs.CV

ASSR-Net: Anisotropic Structure-Aware and Spectrally Recalibrated Network for Hyperspectral Image Fusion

Qiya Song, Hongzhi Zhou, Lishan Tan, Renwei Dian, Shutao Li