Multi-modal

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2025-05-01 | OmicsCL: Unsupervised Contrastive Learning for Cancer Subtype Discovery and Survival Stratification | Atahan Karagoz et.al. | 2505.00650v1 | null |
| 2025-05-01 | Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis | Zhongying Deng et.al. | 2505.00627v1 | null |
| 2025-05-01 | Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification | Neng Dong et.al. | 2505.00619v1 | null |
| 2025-05-01 | ParkDiffusion: Heterogeneous Multi-Agent Multi-Modal Trajectory Prediction for Automated Parking using Diffusion Models | Jiarong Wei et.al. | 2505.00586v1 | null |
| 2025-05-01 | Multimodal Masked Autoencoder Pre-training for 3D MRI-Based Brain Tumor Analysis with Missing Modalities | Lucas Robinet et.al. | 2505.00568v1 | null |
| 2025-05-01 | Superintuitionistic predicate logics of linear frames: undecidability with two individual variables | Mikhail Rybakov et.al. | 2505.00531v1 | null |
| 2025-05-01 | A Methodological and Structural Review of Parkinsons Disease Detection Across Diverse Data Modalities | Abu Saleh Musa Miah et.al. | 2505.00525v1 | null |
| 2025-05-01 | Recursive inseparability of classical theories of a binary predicate and non-classical logics of a unary predicate | Mikhail Rybakov et.al. | 2505.00524v1 | null |
| 2025-05-01 | JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers | Kwon Byung-Ki et.al. | 2505.00482v1 | null |
| 2025-05-01 | Toward Automated Regulatory Decision-Making: Trustworthy Medical Device Risk Classification with Multimodal Transformers and Self-Training | Yu Han et.al. | 2505.00422v1 | null |
| 2025-05-01 | Prospects for Ultralow-Mass Nuclear Magnetic Resonance using Spin Defects in Hexagonal Boron Nitride | Declan M. Daly et.al. | 2505.00383v1 | null |
| 2025-05-01 | The Invisible Threat: Evaluating the Vulnerability of Cross-Spectral Face Recognition to Presentation Attacks | Anjith George et.al. | 2505.00380v1 | null |
| 2025-05-01 | Automated segmentation of pediatric neuroblastoma on multi-modal MRI: Results of the SPPIN challenge at MICCAI 2023 | M. A. D. Buser et.al. | 2505.00369v1 | null |
| 2025-05-01 | Edge Large AI Models: Revolutionizing 6G Networks | Zixin Wang et.al. | 2505.00321v1 | null |
| 2025-04-30 | Audo-Sight: Enabling Ambient Interaction For Blind And Visually Impaired Individuals | Bhanuja Ainary et.al. | 2505.00153v1 | null |
| 2025-04-30 | Emergent oscillations and chaos in non-compliant microfluidic networks | Yanxuan Shao et.al. | 2505.00068v1 | null |
| 2025-04-30 | Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization | Anas Anwarul Haq Khan et.al. | 2504.21831v1 | null |
| 2025-04-30 | An Underwater, Fault-Tolerant, Laser-Aided Robotic Multi-Modal Dense SLAM System for Continuous Underwater In-Situ Observation | Yaming Ou et.al. | 2504.21826v1 | null |
| 2025-04-30 | Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields | Yixin Gao et.al. | 2504.21814v1 | null |
| 2025-04-30 | Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline | Minwoo Oh et.al. | 2504.21772v1 | null |
| 2025-04-30 | Task-Agnostic Semantic Communications Relying on Information Bottleneck and Federated Meta-Learning | Hao Wei et.al. | 2504.21723v2 | null |
| 2025-04-30 | VividListener: Expressive and Controllable Listener Dynamics Modeling for Multi-Modal Responsive Interaction | Shiying Li et.al. | 2504.21718v1 | null |
| 2025-04-30 | REHEARSE-3D: A Multi-modal Emulated Rain Dataset for 3D Point Cloud De-raining | Abu Mohammed Raisuddin et.al. | 2504.21699v1 | null |
| 2025-04-30 | BERSting at the Screams: A Benchmark for Distanced, Emotional and Shouted Speech Recognition | Paige Tuttösí et.al. | 2505.00059v1 | null |
| 2025-04-30 | Cascade Detector Analysis and Application to Biomedical Microscopy | Thomas L. Athey et.al. | 2504.21598v1 | null |
| 2025-04-30 | Iterative Trajectory Exploration for Multimodal Agents | Pengxiang Li et.al. | 2504.21561v1 | null |
| 2025-04-30 | TinyMA-IEI-PPO: Exploration Incentive-Driven Multi-Agent DRL with Self-Adaptive Pruning for Vehicular Embodied AI Agent Twins Migration | Zhuoqi Zeng et.al. | 2505.00055v1 | null |
| 2025-04-30 | Consistency-aware Fake Videos Detection on Short Video Platforms | Junxi Wang et.al. | 2504.21495v1 | null |
| 2025-04-30 | A Comprehensive Survey of Electrical Stimulation Haptic Feedback in Human-Computer Interaction | Simin Yang et.al. | 2504.21477v1 | null |
| 2025-04-30 | GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers | Xinyu Li et.al. | 2504.21476v1 | null |