Skip to content

Text and Image Generation

Text and Image Generation

Publish Date Title Authors PDF Code
2025-07-03 MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real Renhao Wang et.al. 2507.02864v1 null
2025-07-03 Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory Yuqi Wu et.al. 2507.02863v1 null
2025-07-03 LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans Zhening Huang et.al. 2507.02861v1 null
2025-07-03 RefTok: Reference-Based Tokenization for Video Generation Xiang Fan et.al. 2507.02862v1 null
2025-07-03 Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching Xin Zhou et.al. 2507.02860v1 null
2025-07-03 Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation Jiaer Xia et.al. 2507.02859v1 null
2025-07-03 Requirements Elicitation Follow-Up Question Generation Yuchen Shen et.al. 2507.02858v1 null
2025-07-03 Answer Matching Outperforms Multiple Choice for Language Model Evaluation Nikhil Chandak et.al. 2507.02856v1 null
2025-07-03 AnyI2V: Animating Any Conditional Image with Motion Control Ziye Li et.al. 2507.02857v1 null
2025-07-03 Subtyping in DHOL -- Extended preprint Colin Rothgang et.al. 2507.02855v1 null
2025-07-03 Diffeomorphic approximation of piecewise affine homeomorphisms Daniel Campbell et.al. 2507.02854v1 null
2025-07-03 Imprints of information scrambling on eigenstates of a quantum chaotic system Bikram Pain et.al. 2507.02853v1 null
2025-07-03 Proof of a magnificent conjecture M. Kool et.al. 2507.02852v1 null
2025-07-03 MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs Purbesh Mitra et.al. 2507.02851v1 null
2025-07-03 LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to All Users Almog Hilel et.al. 2507.02850v1 null
2025-07-03 Three-qubit W state tomography via full and marginal state reconstructions on ibm_osaka H. Talath et.al. 2507.02849v1 null
2025-07-03 Quantum jet Hopf algebroids by cotwist Xiao Han et.al. 2507.02848v1 null
2025-07-03 MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis Kunyu Zhang et.al. 2507.02847v1 null
2025-07-03 Legal Requirements Translation from Law Anmol Singhal et.al. 2507.02846v1 null
2025-07-03 Enhancement of the effects due to the Schrödinger-Newton equation Davide Giordano Ario Altamura et.al. 2507.02845v1 null
2025-07-03 Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection Ziqi Miao et.al. 2507.02844v1 null
2025-07-03 LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding Yuchen Ma et.al. 2507.02843v1 null
2025-07-03 On the Structure of Replicable Hypothesis Testers Anders Aamand et.al. 2507.02842v1 null
2025-07-03 StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason Kaiyi Zhang et.al. 2507.02841v1 null
2025-07-03 Neutrino mixing parameters and masses from $Δ(96)\rtimes H_{CP}$ in the tri-direct CP approach Li-Na Yan et.al. 2507.02840v1 null
2025-07-03 Stiefel optimization is NP-hard Zehua Lai et.al. 2507.02839v1 null
2025-07-03 Early time hydrodynamic attractor in a nearly-unitary Fermi gas Michal P. Heller et.al. 2507.02838v1 null
2025-07-03 Free boundary regularity for a tumor growth model with obstacle Giulia Bevilacqua et.al. 2507.02837v1 null
2025-07-03 On the Boundary Harnack Principle for operators with different lower order terms Daniela De Silva et.al. 2507.02836v1 null
2025-07-03 Revealing a transitional epoch of large-scale cosmic anisotropy in the quasar distribution Amit Mondal et.al. 2507.02835v1 null