arXiv:2104.08928v4 Announce Type: replace-cross
Abstract: Unstructured text provides decision-makers with a rich data source in many domains, ranging from product reviews in retail to nursing notes in healthcare. To leverage this information, words are typically translated into
arXiv:2305.14985v3 Announce Type: replace-cross
Abstract: The field of vision-and-language (VL) understanding has made unprecedented progress with end-to-end large pre-trained VL models (VLMs). However, they still fall short in zero-shot reasoning tasks that require multi-step i
arXiv:2309.15769v3 Announce Type: replace-cross
Abstract: Recent advances in deep learning have highlighted the phenomenon of benign overfitting in overparameterized statistical models, sparking significant interest in understanding its foundations. Owing to its simplicity and p
2026-06-19
· Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon
· 打开 ↗
arXiv:2402.14035v4 Announce Type: replace
Abstract: Knowledge distillation from foundation models to compact domain models is challenging due to substantial gaps in capacity, architecture, and modality. For example, in our experiments, distilling from a 76M-parameter language mo
arXiv:2412.18980v2 Announce Type: replace
Abstract: Uncertainty-aware deep learning (DL) models recently gained attention in fault diagnosis as a way to promote the reliable detection of faults when out-of-distribution (OOD) data arise from unseen faults (epistemic uncertainty)
2026-06-19
· Reza Jalayer, Masoud Jalayer, Andrea Mor, Carlotta Orsenigo, Carlo Vercellis
· 打开 ↗
arXiv:2501.17015v2 Announce Type: replace
Abstract: Simulation plays a crucial role in assessing autonomous driving systems, where the generation of realistic multi-agent behaviors is a key aspect. In multi-agent simulation, the primary challenges include behavioral multimodalit
2026-06-19
· Longzhong Lin, Xuewu Lin, Kechun Xu, Haojian Lu, Lichao Huang, Rong Xiong, Yue Wang
· 打开 ↗
arXiv:2501.18322v2 Announce Type: replace
Abstract: Transformers, which are state-of-the-art in most machine learning tasks, represent the data as sequences of vectors called tokens. This representation is then exploited by the attention function, which learns dependencies betwe
2026-06-19
· Val\'erie Castin, Pierre Ablin, Jos\'e Antonio Carrillo, Gabriel Peyr\'e
· 打开 ↗
arXiv:2502.06866v3 Announce Type: replace
Abstract: The drastic changes in the global economy, geopolitical conditions, and disruptions such as the COVID-19 pandemic have impacted the cost of living and quality of life. It is essential to comprehend the long-term implications of
arXiv:2502.19193v2 Announce Type: replace-cross
Abstract: Social media platforms frequently impose restrictive policies to moderate user content, prompting the emergence of creative evasion language strategies. This paper presents a multi-agent framework based on Large Language
arXiv:2503.02636v5 Announce Type: replace-cross
Abstract: Resting-state EEG provides a non-invasive view of spontaneous brain activity, but extracting meaningful patterns is often limited by scarce high-quality data and reliance on manually engineered features. Generative advers
arXiv:2503.17386v2 Announce Type: replace-cross
Abstract: Crashworthiness is a key performance measure in the design of safety-critical vehicle panel components such as B-pillars. Finite element (FE) simulations are widely used to evaluate crash responses but remain computationa
2026-06-19
· Haoran Li, Yingxue Zhao, Haosu Zhou, Tobias Pfaff, Nan Li
· 打开 ↗
arXiv:2504.02885v2 Announce Type: replace
Abstract: Automated medical report generation (MRG) is increasingly used to reduce the burden of manual reporting and for decision support. Large vision-language models (LVLMs) hold great promise for automated MRG due to their fine-grain
2026-06-19
· Hao Wang, Shuchang Ye, Jinghao Lin, Usman Naseem, Jinman Kim
· 打开 ↗
arXiv:2504.11171v5 Announce Type: replace-cross
Abstract: We present TerraMind, the first any-to-any generative, multimodal foundation model for Earth observation (EO). Unlike other multimodal models, TerraMind is pretrained on dual-scale representations combining both token-lev
2026-06-19
· Johannes Jakubik, Felix Yang, Benedikt Blumenstiel, Erik Scheurer, Rocco Sedona, Stefano Maurogiovanni, Jente Bosmans, Nikolaos Dionelis, Valerio Marsocci, Niklas Kopp, Rahul Ramachandran, Paolo Fraccaro, Thomas Brunschwiler, Gabriele Cavallaro, Juan Bernabe-Moreno, Nicolas Long\'ep\'e
· 打开 ↗
arXiv:2505.18201v2 Announce Type: replace-cross
Abstract: Controlling flapping-wing drones requires controllers that handle time-varying, nonlinear, underactuated dynamics from incomplete, noisy sensor data. Recent advances in artificial intelligence (AI), particularly reinforce
2026-06-19
· Romain Poletti, Lorenzo Schena, Lilla Koloszar, Joris Degroote, Miguel Alfonso Mendez
· 打开 ↗
arXiv:2505.18726v3 Announce Type: replace-cross
Abstract: Can we determine someone's geographic location solely from the sounds they hear? Are acoustic signals enough to localize within a country, state, or even city? In this work, we tackle the challenge of global-scale audio g
2026-06-19
· Mustafa Chasmai, Wuao Liu, Subhransu Maji, Grant Van Horn
· 打开 ↗
arXiv:2505.22829v2 Announce Type: replace
Abstract: This paper bridges distribution shift and AI safety through a comprehensive analysis of their conceptual and methodological synergies. While prior discussions often focus on narrow cases or informal analogies, we establish two
2026-06-19
· Chenruo Liu, Kenan Tang, Yao Qin, Qi Lei
· 打开 ↗
arXiv:2506.01678v2 Announce Type: replace-cross
Abstract: Scanning tunnelling microscopy (STM) is a powerful technique for imaging surfaces with atomic resolution, providing insight into physical and chemical processes at the level of single atoms and molecules. A regular task o
2026-06-19
· Nikola L. Kolev, Max Trouton, Filippo Federici Canova, Geoff Thornton, David Z. Gao, Neil J. Curson, Taylor J. Z. Stock
· 打开 ↗
arXiv:2506.14990v3 Announce Type: replace
Abstract: Benchmarks play a central role in reinforcement learning (RL) research, yet their computational constraints often shape what is studied. Despite the motivation of lifelong learning, most continual RL papers consider only 3-10 s
2026-06-19
· Tristan Tomilin, Luka van den Boogaard, Samuel Garcin, Constantin Ruhdorfer, Bram Grooten, Fabrice Kusters, Yali Du, Andreas Bulling, Mykola Pechenizkiy, Meng Fang
· 打开 ↗
arXiv:2507.00875v3 Announce Type: replace
Abstract: Translating Hong Kong Court Judgments from English to Traditional Chinese is mandated by Articles 8-9 of the Basic Law, yet remains constrained by a shortage of parallel resources and rigorous demands on legal terminology, cita
arXiv:2507.19137v3 Announce Type: replace-cross
Abstract: Prior research indicates that users prefer assistive technologies whose personalities align with their own. This has sparked interest in automatic personality perception (APP), which aims to predict an individual's percei
2026-06-19
· Alice Zhang, Skanda Muralidhar, Daniel Gatica-Perez, Mathew Magimai-Doss
· 打开 ↗
arXiv:2507.19712v3 Announce Type: replace-cross
Abstract: In this paper, we explore mission assignment and task offloading in an Open Radio Access Network (Open RAN)-based intelligent transportation system (ITS), where autonomous vehicles leverage mobile edge computing for effic
arXiv:2508.04266v4 Announce Type: replace
Abstract: Existing benchmarks in e-commerce primarily focus on basic user intents, such as finding or purchasing products. However, real-world users often pursue more complex goals, such as applying vouchers, managing budgets, and findin
arXiv:2508.05762v2 Announce Type: replace-cross
Abstract: Universal machine learning force fields (UMLFFs) promise to revolutionize materials science by enabling rapid atomistic simulations across the periodic table. However, their evaluation has been limited to computational be
2026-06-19
· Sajid Mannan, Vaibhav Bihani, Carmelo Gonzales, Kin Long Kelvin Lee, Nitya Nand Gosvami, Sayan Ranu, Santiago Miret, N M Anoop Krishnan
· 打开 ↗
arXiv:2509.15822v3 Announce Type: replace-cross
Abstract: Predictions from statistical physics postulate that recovery of the communities in the Stochastic Block Model (SBM) with a fixed number $K$ of communities is possible in polynomial time above, and only above, the Kesten-S
2026-06-19
· Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen
· 打开 ↗
arXiv:2509.15927v5 Announce Type: replace
Abstract: Auto-bidding is a critical tool for advertisers to improve advertising performance. Recent progress has demonstrated that AI-Generated Bidding (AIGB), which learns a conditional generative planner from offline data, achieves su
arXiv:2509.19658v2 Announce Type: replace-cross
Abstract: In-context imitation learning (ICIL) enables robots to learn tasks from prompts consisting of just a handful of demonstrations. By eliminating the need for parameter updates at deployment time, this paradigm supports few-
2026-06-19
· Youngju Yoo, Jiaheng Hu, Yifeng Zhu, Bo Liu, Qiang Liu, Roberto Mart\'in-Mart\'in, Peter Stone
· 打开 ↗
arXiv:2509.23806v2 Announce Type: replace-cross
Abstract: Concolic testing for neural networks alternates concrete execution with constraint solving to search for inputs that flip model decisions. We present a concolic tester for Transformer classifiers that uses SHAP estimates
arXiv:2509.25148v2 Announce Type: replace
Abstract: Post-training alignment of large language models often combines supervised fine-tuning (SFT) on expert demonstrations with reinforcement learning (RL) from preference or verifiable feedback. SFT provides a useful behavioral anc
arXiv:2510.01565v4 Announce Type: replace
Abstract: Diffusion Transformer (DiT) models excel at generating high-quality images through iterative denoising steps, but serving them under strict Service Level Objectives (SLOs) is challenging due to their high computational cost, pa
2026-06-19
· Runyu Lu, Shiqi He, Wenxuan Tan, Shenggui Li, Ruofan Wu, Jeff J. Ma, Ang Chen, Mosharaf Chowdhury
· 打开 ↗
arXiv:2510.08807v2 Announce Type: replace-cross
Abstract: From loco-motion to dextrous manipulation, humanoid robots have made remarkable strides in demonstrating complex full-body capabilities. However, the majority of current robot learning datasets and benchmarks mainly focus
arXiv:2510.18383v3 Announce Type: replace
Abstract: Distilling the tool-use capabilities of large language models (LLMs) into small language models (SLMs) is essential for their practical application. The predominant approach, supervised fine-tuning (SFT), suffers from poor out-
arXiv:2510.18784v3 Announce Type: replace
Abstract: Despite significant work on low-bit quantization-aware training (QAT), there is still an accuracy gap between such techniques and native training. To address this, we introduce CAGE (Curvature-Aware Gradient Estimation), a new
2026-06-19
· Soroush Tabesh, Mher Safaryan, Andrei Panferov, Alexandra Volkova, Dan Alistarh
· 打开 ↗
arXiv:2510.19893v2 Announce Type: replace
Abstract: Medical AI systems demonstrated impressive diagnostic performance, yet they routinely show uneven accuracy across demographic groups, disadvantaging underrepresented populations. Although multimodal reasoning foundation models
2026-06-19
· Shiqi Dai, Wei Dai, Jiaee Cheong, Paul Pu Liang
· 打开 ↗
arXiv:2510.21978v2 Announce Type: replace
Abstract: Reinforcement learning with verifiable rewards (RLVR) has delivered impressive gains in mathematical and multimodal reasoning and has become a standard post-training paradigm for contemporary language and vision-language models
arXiv:2510.27568v2 Announce Type: replace-cross
Abstract: Solving mathematical reasoning problems requires not only accurate access to relevant knowledge but also careful, multi-step thinking. However, current retrieval-augmented models often rely on a single perspective, follow
2026-06-19
· Ali Asgarov, Umid Suleymanov, Aadyant Khatri
· 打开 ↗
arXiv:2511.08378v4 Announce Type: replace-cross
Abstract: Session-based recommendation (SBR) aims to predict anonymous users' next interaction based on their interaction sessions. In the practical recommendation scenario, low-exposure items constitute the majority of interaction
arXiv:2512.03818v2 Announce Type: replace
Abstract: Due to their architecture and vast pre-training data, large language models (LLMs) demonstrate strong text classification performance. However, LLM output - here, the category assigned to a text - depends heavily on the wording
arXiv:2512.17473v3 Announce Type: replace-cross
Abstract: We present an algorithm based on the alternating direction method of multipliers (ADMM) for solving nonlinear matrix decompositions (NMD). Given an input matrix $X \in \mathbb{R}^{m \times n}$ and a factorization rank $r
arXiv:2512.20014v3 Announce Type: replace-cross
Abstract: While Vision-Language-Action (VLA) models generalize well to generic instructions, they struggle with personalized commands such as "bring my cup," where the robot must act on one specific instance among visually similar
arXiv:2601.03040v2 Announce Type: replace-cross
Abstract: A fundamental requirement for full autonomy is the ability to sustain accurate navigation in the absence of external data, such as GNSS signals or visual information. In these challenging environments, the platform must r
arXiv:2601.14430v2 Announce Type: replace-cross
Abstract: Controlling generative models is computationally expensive. This is because optimal alignment with a reward function--whether via inference-time steering or fine-tuning--requires estimating the value function. This task d
2026-06-19
· Peter Potaptchik, Adhi Saravanan, Abbas Mammadov, Alvaro Prat, Michael S. Albergo, Yee Whye Teh
· 打开 ↗
arXiv:2601.16233v2 Announce Type: replace-cross
Abstract: HIV is a retrovirus that attacks the human immune system and can lead to death without proper treatment. In collaboration with the WHO and the University of Witwatersrand, we study how to improve the efficiency of HIV tes
2026-06-19
· Akseli Kangaslahti, Davin Choo, Lingkai Kong, Milind Tambe, Alastair van Heerden, Cheryl Johnson
· 打开 ↗
arXiv:2601.22107v2 Announce Type: replace
Abstract: We introduce \textit{Prior-Informed Flow Matching (PIFM)}, a conditional flow model for graph reconstruction. Reconstructing graphs from partial observations remains a key challenge; classical embedding methods often lack globa
2026-06-19
· Harvey Chen, Nicolas Zilberstein, Santiago Segarra
· 打开 ↗
arXiv:2601.22300v3 Announce Type: replace-cross
Abstract: We propose a deep photonic neuromorphic network (PNN) architecture based on phase-change material (PCM) synapses and local optical feedback for online, unsupervised Hebbian learning. The proposed architecture combines opt
2026-06-19
· Xi Li, Disha Biswas, Peng Zhou, Wesley H. Brigner, Anna Capuano, Joseph S. Friedman, Qing Gu
· 打开 ↗
arXiv:2602.00510v2 Announce Type: replace-cross
Abstract: Most LLM code-synthesis benchmarks rely on unit tests as the reward oracle, but PCB schematic design has none: correctness is defined by structured physical constraints over real IC packages and pin-level assignments, per
arXiv:2602.01425v2 Announce Type: replace-cross
Abstract: Linear probes are a promising approach for monitoring AI systems for deceptive behaviour. Previous work has shown that a linear classifier trained on a contrastive instruction pair and a simple dataset can achieve good pe
arXiv:2602.04306v2 Announce Type: replace
Abstract: As large language models (LLMs) are increasingly deployed in real-world applications, ensuring their fair responses across demographics has become crucial. Despite many efforts, an ongoing challenge is hidden bias: LLMs appear
2026-06-19
· Kahee Lim, Soyeon Kim, Steven Euijong Whang
· 打开 ↗
arXiv:2602.05533v3 Announce Type: replace
Abstract: We study conditional generation in diffusion models under hard constraints, where generated samples must satisfy prescribed events with probability one. Such constraints arise naturally in safety-critical applications and in ra
arXiv:2602.07628v2 Announce Type: replace-cross
Abstract: While the shift toward unified foundation models has revolutionized many deep learning domains, sleep medicine remains largely restricted to task-specific models that focus on localized micro-structure features. These app
2026-06-19
· Keondo Park, Younghoon Na, Yourim Choi, Hyunwoo Ryu, Hyun-Woo Shin, Hyung-Sin Kim
· 打开 ↗
arXiv:2602.09689v2 Announce Type: replace
Abstract: Fine-tuning large pre-trained models on a target distribution often improves in-distribution (ID) accuracy, but at the cost of out-of-distribution (OOD) robustness as representations specialize to the fine-tuning data. Weight-s