| 2025-07-03 | Subtyping in DHOL -- Extended preprint | Colin Rothgang et al. | 2507.02855v1 | null |
| 2025-07-03 | Legal Requirements Translation from Law | Anmol Singhal et al. | 2507.02846v1 | null |
| 2025-07-03 | Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection | Ziqi Miao et al. | 2507.02844v1 | null |
| 2025-07-03 | Confidence-driven Gradient Modulation for Multimodal Human Activity Recognition: A Dynamic Contrastive Dual-Path Learning Approach | Panpan Ji et al. | 2507.02826v1 | null |
| 2025-07-03 | LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion | Fangfu Liu et al. | 2507.02813v1 | null |
| 2025-07-03 | No time to train! Training-Free Reference-Based Instance Segmentation | Miguel Espinosa et al. | 2507.02798v1 | null |
| 2025-07-03 | From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding | Xiangfeng Wang et al. | 2507.02790v1 | null |
| 2025-07-03 | From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images | Danrong Zhang et al. | 2507.02781v1 | null |
| 2025-07-03 | A Proof-Theoretic View of Basic Intuitionistic Conditional Logic (Extended Version) | Tiziano Dalmonte et al. | 2507.02767v1 | null |
| 2025-07-03 | DexVLG: Dexterous Vision-Language-Grasp Model at Scale | Jiawei He et al. | 2507.02747v1 | null |
| 2025-07-03 | Prompt learning with bounding box constraints for medical image segmentation | Mélanie Gaillochet et al. | 2507.02743v1 | null |
| 2025-07-03 | SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment | Qi Xu et al. | 2507.02705v1 | null |
| 2025-07-03 | Integrating path-planning and control for robotic unicycles | Máté B. Vizi et al. | 2507.02700v1 | null |
| 2025-07-03 | APT: Adaptive Personalized Training for Diffusion Models with Limited Data | JungWoo Chae et al. | 2507.02687v1 | null |
| 2025-07-03 | MEGANet-W: A Wavelet-Driven Edge-Guided Attention Framework for Weak Boundary Polyp Detection | Zhe Yee Tan et al. | 2507.02668v1 | null |
| 2025-07-03 | AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models | Ziyin Zhou et al. | 2507.02664v1 | null |
| 2025-07-03 | VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning | Siran Chen et al. | 2507.02626v1 | null |
| 2025-07-03 | ArtGS:3D Gaussian Splatting for Interactive Visual-Physical Modeling and Manipulation of Articulated Objects | Qiaojun Yu et al. | 2507.02600v1 | null |
| 2025-07-03 | Structure-aware Semantic Discrepancy and Consistency for 3D Medical Image Self-supervised Learning | Tan Pan et al. | 2507.02581v1 | null |
| 2025-07-03 | Parametric shape models for vessels learned from segmentations via differentiable voxelization | Alina F. Dima et al. | 2507.02576v1 | null |
| 2025-07-03 | Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning | Buzhen Huang et al. | 2507.02565v1 | null |
| 2025-07-03 | Multi-Utterance Speech Separation and Association Trained on Short Segments | Yuzhu Wang et al. | 2507.02562v1 | null |
| 2025-07-03 | Clarifying Before Reasoning: A Coq Prover with Structural Context | Yanzhen Lu et al. | 2507.02541v1 | null |
| 2025-07-03 | Open-Source System for Multilingual Translation and Cloned Speech Synthesis | Mateo Cámara et al. | 2507.02530v1 | null |
| 2025-07-03 | MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention | Zunhui Xia et al. | 2507.02488v1 | null |
| 2025-07-03 | On the width and profiles of cosmic filaments | Qi-Rui Yang et al. | 2507.02476v1 | null |
| 2025-07-03 | Optimisation of amplification and gas mixture for directional Dark Matter searches with the CYGNO/INITIUM project | Giorgio Dho et al. | 2507.02474v1 | null |
| 2025-07-03 | Resolving CAP Through Automata-Theoretic Economic Design: A Unified Mathematical Framework for Real-Time Partition-Tolerant Systems | Craig S Wright et al. | 2507.02464v1 | null |
| 2025-07-03 | Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection | Weiwei Duan et al. | 2507.02454v1 | null |
| 2025-07-03 | Network structural change point detection and reconstruction for balanced neuronal networks | Kai Chen et al. | 2507.02450v1 | null |