2025-05-01 |
Controllable Weather Synthesis and Removal with Video Diffusion Models |
Chih-Hao Lin et.al. |
2505.00704v1 |
null |
2025-05-01 |
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT |
Dongzhi Jiang et.al. |
2505.00703v1 |
null |
2025-05-01 |
RayZer: A Self-supervised Large View Synthesis Model |
Hanwen Jiang et.al. |
2505.00702v1 |
null |
2025-05-01 |
A log-depth in-place quantum Fourier transform that rarely needs ancillas |
Gregory D. Kahanamoku-Meyer et.al. |
2505.00701v1 |
null |
2025-05-01 |
Robotic Visual Instruction |
Yanbang Li et.al. |
2505.00693v1 |
null |
2025-05-01 |
Towards Autonomous Micromobility through Scalable Urban Simulation |
Wayne Wu et.al. |
2505.00690v1 |
null |
2025-05-01 |
GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution |
Aditya Arora et.al. |
2505.00687v1 |
null |
2025-05-01 |
On the Importance of Gaussianizing Representations |
Daniel Eftekhari et.al. |
2505.00685v1 |
null |
2025-05-01 |
Visual Test-time Scaling for GUI Agent Grounding |
Tiange Luo et.al. |
2505.00684v1 |
null |
2025-05-01 |
Comma 2-comonad I: Eilenberg-Moore 2-category of colax coalgebras |
Igor Baković et.al. |
2505.00682v1 |
null |
2025-05-01 |
MINERVA: Evaluating Complex Video Reasoning |
Arsha Nagrani et.al. |
2505.00681v1 |
null |
2025-05-01 |
Deep Reinforcement Learning for Urban Air Quality Management: Multi-Objective Optimization of Pollution Mitigation Booth Placement in Metropolitan Environments |
Kirtan Rajesh et.al. |
2505.00668v1 |
null |
2025-05-01 |
Open-Source LLM-Driven Federated Transformer for Predictive IoV Management |
Yazan Otoum et.al. |
2505.00651v1 |
null |
2025-05-01 |
Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI |
Merve Gülle et.al. |
2505.00643v1 |
null |
2025-05-01 |
Detecting Modeling Bias with Continuous Time Flow Models on Weak Lensing Maps |
Kangning Diao et.al. |
2505.00632v1 |
null |
2025-05-01 |
Bayes-Optimal Fair Classification with Multiple Sensitive Features |
Yi Yang et.al. |
2505.00631v1 |
null |
2025-05-01 |
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook |
Muyi Bao et.al. |
2505.00630v1 |
null |
2025-05-01 |
Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis |
Zhongying Deng et.al. |
2505.00627v1 |
null |
2025-05-01 |
Scaling limit of a weakly asymmetric simple exclusion process in the framework of regularity structures |
Ruojun Huang et.al. |
2505.00621v1 |
null |
2025-05-01 |
Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification |
Neng Dong et.al. |
2505.00619v1 |
null |
2025-05-01 |
Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction |
Simon Giebenhain et.al. |
2505.00615v1 |
null |
2025-05-01 |
Generating multi-wavelength phase screens for atmospheric wave optics simulations using fast Fourier transforms |
Milo W. Hyde IV et.al. |
2505.00613v1 |
null |
2025-05-01 |
Combining LLMs with Logic-Based Framework to Explain MCTS |
Ziyan An et.al. |
2505.00610v1 |
null |
2025-05-01 |
Wavefront errors in two-wavelength adaptive optics systems |
Milo W. Hyde IV et.al. |
2505.00609v1 |
null |
2025-05-01 |
Nonparametric Estimation of Matching Efficiency and Elasticity in a Marriage Agency Platform: 2014--2025 |
Suguru Otani et.al. |
2505.00607v1 |
null |
2025-05-01 |
Dietary Intake Estimation via Continuous 3D Reconstruction of Food |
Wallace Lee et.al. |
2505.00606v1 |
null |
2025-05-01 |
Frustration, dynamics and catalysis |
R. Gonzalo Parra et.al. |
2505.00600v1 |
null |
2025-05-01 |
Visual Trajectory Prediction of Vessels for Inland Navigation |
Alexander Puzicha et.al. |
2505.00599v1 |
null |
2025-05-01 |
Fast and Low-Cost Genomic Foundation Models via Outlier Removal |
Haozheng Luo et.al. |
2505.00598v1 |
link |
2025-05-01 |
Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading |
Shuo Tong et.al. |
2505.00592v1 |
null |