Image Caption

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2025-05-01 | Controllable Weather Synthesis and Removal with Video Diffusion Models | Chih-Hao Lin et al. | 2505.00704v1 | null |
| 2025-05-01 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | Dongzhi Jiang et al. | 2505.00703v1 | null |
| 2025-05-01 | RayZer: A Self-supervised Large View Synthesis Model | Hanwen Jiang et al. | 2505.00702v1 | null |
| 2025-05-01 | GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution | Aditya Arora et al. | 2505.00687v1 | null |
| 2025-05-01 | Visual Test-time Scaling for GUI Agent Grounding | Tiange Luo et al. | 2505.00684v1 | null |
| 2025-05-01 | TumorTwin: A python framework for patient-specific digital twins in oncology | Michael Kapteyn et al. | 2505.00670v1 | null |
| 2025-05-01 | Spreading Depolarization Detection in Electrocorticogram Spectrogram Imaging by Deep Learning: Is It Just About Delta Band? | Jeanne Boyer-Chammard et al. | 2505.00666v1 | null |
| 2025-05-01 | Why the hyperbolic polaritons are hyperbolic? | Xiaoyu Xiong et al. | 2505.00655v1 | null |
| 2025-05-01 | Deep Learning Assisted Outer Volume Removal for Highly-Accelerated Real-Time Dynamic MRI | Merve Gülle et al. | 2505.00643v1 | null |
| 2025-05-01 | Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis | Zhongying Deng et al. | 2505.00627v1 | null |
| 2025-05-01 | Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification | Neng Dong et al. | 2505.00619v1 | null |
| 2025-05-01 | Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction | Simon Giebenhain et al. | 2505.00615v1 | null |
| 2025-05-01 | A Novel Feature-Aware Chaotic Image Encryption Scheme For Data Security and Privacy in IoT and Edge Networks | Muhammad Shahbaz Khan et al. | 2505.00593v1 | null |
| 2025-05-01 | Uncertainty-Aware Multi-Expert Knowledge Distillation for Imbalanced Disease Grading | Shuo Tong et al. | 2505.00592v1 | null |
| 2025-05-01 | Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets | Mathis Morales et al. | 2505.00584v1 | null |
| 2025-05-01 | AI-Driven High-Resolution Cell Segmentation and Quantitative Analysis | Shuang Zhang et al. | 2505.00578v1 | null |
| 2025-05-01 | Multimodal Masked Autoencoder Pre-training for 3D MRI-Based Brain Tumor Analysis with Missing Modalities | Lucas Robinet et al. | 2505.00568v1 | null |
| 2025-05-01 | X-ray illicit object detection using hybrid CNN-transformer neural network architectures | Jorgen Cani et al. | 2505.00564v1 | null |
| 2025-05-01 | The Jackknife method as a new approach to validate strong lens mass models | Shun Nishida et al. | 2505.00553v1 | null |
| 2025-05-01 | A Methodological and Structural Review of Parkinsons Disease Detection Across Diverse Data Modalities | Abu Saleh Musa Miah et al. | 2505.00525v1 | null |
| 2025-05-01 | Inconsistency-based Active Learning for LiDAR Object Detection | Esteban Rivera et al. | 2505.00511v1 | null |
| 2025-05-01 | Towards Scalable Human-aligned Benchmark for Text-guided Image Editing | Suho Ryu et al. | 2505.00502v1 | link |
| 2025-05-01 | Implicit Neural-Representation Learning for Elastic Deformable-Object Manipulations | Minseok Song et al. | 2505.00500v1 | null |
| 2025-05-01 | JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers | Kwon Byung-Ki et al. | 2505.00482v1 | null |
| 2025-05-01 | Orbit-blocking words in free groups | Lucy Koch-Hyde et al. | 2505.00477v1 | null |
| 2025-05-01 | CORSTITCH - A free, open source software for stitching and georeferencing underwater coral reef videos | Julian Christopher L. Maya et al. | 2505.00462v1 | null |
| 2025-05-01 | Toward Automated Regulatory Decision-Making: Trustworthy Medical Device Risk Classification with Multimodal Transformers and Self-Training | Yu Han et al. | 2505.00422v1 | null |
| 2025-05-01 | Self-supervised surface-related multiple suppression with multidimensional convolution | Shijun Cheng et al. | 2505.00419v1 | null |
| 2025-05-01 | ScaleTrack: Scaling and back-tracking Automated GUI Agents | Jing Huang et al. | 2505.00416v1 | null |
| 2025-05-01 | Multi-dimensional optical imaging on a chip | Liheng Bian et al. | 2505.00408v1 | null |