arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.11048 2026-03-12 cs.CV cs.AI cs.CL cs.MA cs.NE

COMIC: Agentic Sketch Comedy Generation

Susung Hong, Brian Curless, Ira Kemelmacher-Shlizerman, Steve Seitz

Comments Project page: https://susunghong.github.io/COMIC/

2603.11047 2026-03-12 cs.CV cs.AI cs.GR

LiTo: Surface Light Field Tokenization

Jen-Hao Rick Chang, Xiaoming Zhao, Dorian Chan, Oncel Tuzel

Comments ICLR 2026; Project page: https://apple.github.io/ml-lito/

2603.11044 2026-03-12 cs.CV

Agentar-Fin-OCR

Siyi Qian, Xiongfei Bai, Bingtao Fu, Yichen Lu, Gaoyang Zhang, Xudong Yang, Peng Zhang

2603.11039 2026-03-12 cs.CL cs.AI cs.DS

Instruction set for the representation of graphs

Ezequiel Lopez-Rubio, Mario Pascual-Gonzalez

2603.11027 2026-03-12 cs.CL

Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge

Mingyang Song, Mao Zheng, Chenning Xu

2603.11021 2026-03-12 cs.LG

Leech Lattice Vector Quantization for Efficient LLM Compression

Tycho F. A. van der Ouderaa, Mart van Baalen, Paul Whatmough, Markus Nagel

2603.11000 2026-03-12 cs.LG q-bio.NC

Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons

Theo Schwider, Ramin Ramezani

2603.10995 2026-03-12 cs.LG

Factorized Neural Implicit DMD for Parametric Dynamics

Siyuan Chen, Zhecheng Wang, Yixin Chen, Yue Chang, Peter Yichen Chen, Eitan Grinspun, Jonathan Panuelos

2603.10990 2026-03-12 cs.CV

Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity

Zhengyao Fang, Zexi Jia, Yijia Zhong, Pengcheng Luo, Jinchao Zhang, Guangming Lu, Jun Yu, Wenjie Pei

Comments accepted by CVPR2026

2603.10987 2026-03-12 cs.LG

MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems

Heikki Haario, Zhi-Song Liu, Martin Simon, Hendrik Weichel

2603.10985 2026-03-12 cs.LG

The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers

Peter Balogh

2603.10983 2026-03-12 cs.LG physics.space-ph

Federated Learning-driven Beam Management in LEO 6G Non-Terrestrial Networks

Maria Lamprini Bartsioka, Ioannis A. Bartsiokas, Athanasios D. Panagopoulos, Dimitra I. Kaklamani, Iakovos S. Venieris

Comments 2 pages with 2 figures and 1 table. Accepted in 2026 International Applied Computational Electromagnetics Society (ACES) Symposium

2603.10980 2026-03-12 cs.RO

PPGuide: Steering Diffusion Policies with Performance Predictive Guidance

Zixing Wang, Devesh K. Jha, Ahmed H. Qureshi, Diego Romeres

Comments Accepted by ICRA'26

2603.10979 2026-03-12 cs.RO

Learning Adaptive Force Control for Contact-Rich Sample Scraping with Heterogeneous Materials

Cenk Cetin, Shreyas Pouli, Gabriella Pizzuto

Comments 8 pages, 6 figures, 4 tables; Submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2026

详情

英文摘要

The increasing demand for accelerated scientific discovery, driven by global challenges, highlights the need for advanced AI-driven robotics. Deploying robotic chemists in human-centric labs is key for the next horizon of autonomous discovery, as complex tasks still demand the dexterity of human scientists. Robotic manipulation in this context is uniquely challenged by handling diverse chemicals (granular, powdery, or viscous liquids), under varying lab conditions. For example, humans use spatulas for scraping materials from vial walls. Automating this process is challenging because it goes beyond simple robotic insertion tasks and traditional lab automation, requiring the execution of fine-granular movements within a constrained environment (the sample vial). Our work proposes an adaptive control framework to address this, relying on a low-level Cartesian impedance controller for stable and compliant physical interaction and a high-level reinforcement learning agent that learns to dynamically adjust interaction forces at the end-effector. The agent is guided by perception feedback, which provides the material's location. We first created a task-representative simulation environment with a Franka Research 3 robot, a scraping tool, and a sample vial containing heterogeneous materials. To facilitate the learning of an adaptive policy and model diverse characteristics, the sample is modelled as a collection of spheres, where each sphere is assigned a unique dislodgement force threshold, which is procedurally generated using Perlin noise. We train an agent to autonomously learn and adapt the optimal contact wrench for a sample scraping task in simulation and then successfully transfer this policy to a real robotic setup. Our method was evaluated across five different material setups, outperforming a fixed-wrench baseline by an average of 10.9%.

URL PDF HTML ☆

赞 0 踩 0

2603.10978 2026-03-12 cs.CV cs.AI

GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations

Boyuan Chen, Minghao Shao, Siddharth Garg, Ramesh Karri, Muhammad Shafique

2603.10977 2026-03-12 cs.LG

FRIEND: Federated Learning for Joint Optimization of multi-RIS Configuration and Eavesdropper Intelligent Detection in B5G Networks

Maria Lamprini A. Bartsioka, Ioannis A. Bartsiokas, Anastasios K. Papazafeiropoulos, Maria A. Seimeni, Dimitra I. Kaklamani, Iakovos S. Venieris

Comments 8 pages with 5 figures and 2 tables. Accepted in 29th Conference on Innovation in Clouds, Internet and Networks (ICIN 2026)

2603.10975 2026-03-12 cs.CV

VCR: Variance-Driven Channel Recalibration for Robust Low-Light Enhancement

Zhixin Cheng, Fangwen Zhang, Xiaotian Yin, Baoqun Yin, Haodian Wang

2603.10965 2026-03-12 cs.CV

Contrastive learning-based video quality assessment-jointed video vision transformer for video recognition

Jian Sun, Mohammad H. Mahoor

Comments 9 figures, 10 tables,

2603.10950 2026-03-12 cs.LG stat.ML

When should we trust the annotation? Selective prediction for molecular structure retrieval from mass spectra

Mira Jürgens, Gaetan De Waele, Morteza Rakhshaninejad, Willem Waegeman

2603.10938 2026-03-12 cs.LG cs.AI

Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control

Yaswanth Chittepu, Ativ Joshi, Rajarshi Bhattacharjee, Scott Niekum

2603.10937 2026-03-12 cs.LG stat.AP

Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators

Rajdeep Pathak, Sayantee Jana

2603.10933 2026-03-12 cs.CV

Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD

Qinxin Wu, Fucheng Niu, Hengchuan Zhu, Yifan Sun, Ye Shen, Xu Li, Han Wu, Leqi Liu, Zhiwen Pan, Zuozhu Liu, Fudong Zhu, Bin Feng

2603.10928 2026-03-12 cs.CV

Novel Architecture of RPA In Oral Cancer Lesion Detection

Revana Magdy, Joy Naoum, Ali Hamdi

2603.10921 2026-03-12 cs.SD

Training-Free Multi-Step Inference for Target Speaker Extraction

Zhenghai You, Ying Shi, Lantian Li, Dong Wang

2603.10916 2026-03-12 cs.LG

NCAA Bracket Prediction Using Machine Learning and Combinatorial Fusion Analysis

Yuanhong Wu, Isaiah Smith, Tushar Marwah, Michael Schroeter, Mohamed Rahouti, D. Frank Hsu

Comments 8 pages, 4 figures, Published in Proceedings of the 2024 IEEE Cyber Science and Technology Congress (CyberSciTech)

2603.09974 2026-03-12 cs.LG physics.ao-ph

Task Aware Modulation Using Representation Learning for Upsaling of Terrestrial Carbon Fluxes

Aleksei Rozanov, Arvind Renganathan, Vipin Kumar

Comments Accepted to the KGML Bridge at AAAI 2026 (non-archival)

2603.09480 2026-03-12 cs.CV

Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity

Zhengyao Fang, Pengyuan Lyu, Chengquan Zhang, Guangming Lu, Jun Yu, Wenjie Pei

Comments accepted by ICLR2026

2603.08935 2026-03-12 cs.CV cs.AI cs.CL cs.DL cs.IR

PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration

Abdul Rehman Akbar, Samuel Wales-McGrath, Alejadro Levya, Lina Gokhale, Rajendra Singh, Wei Chen, Anil Parwani, Muhammad Khalid Khan Niazi

2603.07789 2026-03-12 cs.CV

SGI: Structured 2D Gaussians for Efficient and Compact Large Image Representation

Zixuan Pan, Kaiyuan Tang, Jun Xia, Yifan Qin, Lin Gu, Chaoli Wang, Jianxu Chen, Yiyu Shi

Comments Accepted by CVPR 2026

2603.03281 2026-03-12 cs.CV cs.LG

CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance

Hanyang Wang, Yiyang Liu, Jiawei Chi, Fangfu Liu, Ran Xue, Yueqi Duan

Comments Accepted by CVPR 2026; Project Page: https://hanyang-21.github.io/CFG-Ctrl