2025-07-03 | MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real | Renhao Wang et al. | 2507.02864v1 | null
2025-07-03 | Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory | Yuqi Wu et al. | 2507.02863v1 | null
2025-07-03 | LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans | Zhening Huang et al. | 2507.02861v1 | null
2025-07-03 | RefTok: Reference-Based Tokenization for Video Generation | Xiang Fan et al. | 2507.02862v1 | null
2025-07-03 | Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation | Jiaer Xia et al. | 2507.02859v1 | null
2025-07-03 | Answer Matching Outperforms Multiple Choice for Language Model Evaluation | Nikhil Chandak et al. | 2507.02856v1 | null
2025-07-03 | AnyI2V: Animating Any Conditional Image with Motion Control | Ziye Li et al. | 2507.02857v1 | null
2025-07-03 | Proof of a magnificent conjecture | M. Kool et al. | 2507.02852v1 | null
2025-07-03 | Three-qubit W state tomography via full and marginal state reconstructions on ibm_osaka | H. Talath et al. | 2507.02849v1 | null
2025-07-03 | MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis | Kunyu Zhang et al. | 2507.02847v1 | null
2025-07-03 | Legal Requirements Translation from Law | Anmol Singhal et al. | 2507.02846v1 | null
2025-07-03 | Enhancement of the effects due to the Schrödinger-Newton equation | Davide Giordano Ario Altamura et al. | 2507.02845v1 | null
2025-07-03 | Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection | Ziqi Miao et al. | 2507.02844v1 | null
2025-07-03 | On the Structure of Replicable Hypothesis Testers | Anders Aamand et al. | 2507.02842v1 | null
2025-07-03 | StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason | Kaiyi Zhang et al. | 2507.02841v1 | null
2025-07-03 | Neutrino mixing parameters and masses from $Δ(96)\rtimes H_{CP}$ in the tri-direct CP approach | Li-Na Yan et al. | 2507.02840v1 | null
2025-07-03 | Early time hydrodynamic attractor in a nearly-unitary Fermi gas | Michal P. Heller et al. | 2507.02838v1 | null
2025-07-03 | Free boundary regularity for a tumor growth model with obstacle | Giulia Bevilacqua et al. | 2507.02837v1 | null
2025-07-03 | Revealing a transitional epoch of large-scale cosmic anisotropy in the quasar distribution | Amit Mondal et al. | 2507.02835v1 | null
2025-07-03 | Generalizing Verifiable Instruction Following | Valentina Pyatkin et al. | 2507.02833v1 | null
2025-07-03 | LCQNN: Linear Combination of Quantum Neural Networks | Hongshun Yao et al. | 2507.02832v1 | null
2025-07-03 | Trace Formulas for Deformed W-Algebras | Fabrizio Nieri et al. | 2507.02831v1 | null
2025-07-03 | Enhancing Noisy Quantum Sensing by GHZ State Partitioning | Allen Zang et al. | 2507.02829v1 | null
2025-07-03 | Designs from magic-augmented Clifford circuits | Yuzhen Zhang et al. | 2507.02828v1 | null
2025-07-03 | USAD: An Unsupervised Data Augmentation Spatio-Temporal Attention Diffusion Network | Ying Yu et al. | 2507.02827v1 | null
2025-07-03 | Confidence-driven Gradient Modulation for Multimodal Human Activity Recognition: A Dynamic Contrastive Dual-Path Learning Approach | Panpan Ji et al. | 2507.02826v1 | null
2025-07-03 | Establishing Best Practices for Building Rigorous Agentic Benchmarks | Yuxuan Zhu et al. | 2507.02825v1 | null
2025-07-03 | Osculating Geometry and Higher-Order Distance Loci | Sandra Di Rocco et al. | 2507.02823v1 | null
2025-07-03 | Genetic Features for Drug Responses in Cancer -- Investigating an Ensemble-Feature-Selection Approach | Johannes Schlüter et al. | 2507.02818v1 | null
2025-07-03 | ML-based muon identification using a FNAL-NICADD scintillator chamber for the MID subsystem of ALICE 3 | Jesus Eduardo Muñoz Mendez et al. | 2507.02817v1 | null