Image Matching

Publish Date	Title	Authors	PDF	Code
2025-07-03	Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory	Yuqi Wu et.al.	2507.02863v1	null
2025-07-03	LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans	Zhening Huang et.al.	2507.02861v1	null
2025-07-03	Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation	Jiaer Xia et.al.	2507.02859v1	null
2025-07-03	Answer Matching Outperforms Multiple Choice for Language Model Evaluation	Nikhil Chandak et.al.	2507.02856v1	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857v1	null
2025-07-03	MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis	Kunyu Zhang et.al.	2507.02847v1	null
2025-07-03	Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection	Ziqi Miao et.al.	2507.02844v1	null
2025-07-03	LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion	Fangfu Liu et.al.	2507.02813v1	null
2025-07-03	Tailoring the Electronic Properties of Monoclinic (InxAl1-x)2O3 Alloys via Substitutional Donors and Acceptors	Mohamed Abdelilah Fadla et.al.	2507.02805v1	null
2025-07-03	Multimodal Mathematical Reasoning with Diverse Solving Perspective	Wenhao Shi et.al.	2507.02804v1	null
2025-07-03	No time to train! Training-Free Reference-Based Instance Segmentation	Miguel Espinosa et.al.	2507.02798v1	null
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792v1	null
2025-07-03	Metric dimension reduction modulus for logarithmic distortion	Dylan J. Altschuler et.al.	2507.02785v1	null
2025-07-03	From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images	Danrong Zhang et.al.	2507.02781v1	null
2025-07-03	Discovery and Preliminary Characterization of a Third Interstellar Object: 3I/ATLAS	Darryl Z. Seligman et.al.	2507.02757v1	null
2025-07-03	Generation of Intense Deep-Ultraviolet Pulses at 200 nm	X. Xie et.al.	2507.02756v1	null
2025-07-03	Linear Attention with Global Context: A Multipole Attention Mechanism for Vision and Physics	Alex Colagrande et.al.	2507.02748v1	null
2025-07-03	DexVLG: Dexterous Vision-Language-Grasp Model at Scale	Jiawei He et.al.	2507.02747v1	null
2025-07-03	Prompt learning with bounding box constraints for medical image segmentation	Mélanie Gaillochet et.al.	2507.02743v1	null
2025-07-03	Hierarchical Multi-Label Contrastive Learning for Protein-Protein Interaction Prediction Across Organisms	Shiyi Liu et.al.	2507.02724v1	null
2025-07-03	FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models	Yuxuan Wang et.al.	2507.02714v1	null
2025-07-03	UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation	Qin Guo et.al.	2507.02713v1	null
2025-07-03	A note on maximal plane subgraphs of the complete twisted graph containing perfect matchings	Elsa Omaña-Pulido et.al.	2507.02711v1	null
2025-07-03	SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment	Qi Xu et.al.	2507.02705v1	null
2025-07-03	APT: Adaptive Personalized Training for Diffusion Models with Limited Data	JungWoo Chae et.al.	2507.02687v1	null
2025-07-03	Learning few-step posterior samplers by unfolding and distillation of diffusion models	Charlesquin Kemajou Mbakam et.al.	2507.02686v1	null
2025-07-03	Real-time Image-based Lighting of Glints	Tom Kneiphof et.al.	2507.02674v1	null
2025-07-03	Embedding-Based Federated Data Sharing via Differentially Private Conditional VAEs	Francesco Di Salvo et.al.	2507.02671v1	null
2025-07-03	MEGANet-W: A Wavelet-Driven Edge-Guided Attention Framework for Weak Boundary Polyp Detection	Zhe Yee Tan et.al.	2507.02668v1	null
2025-07-03	AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	Ziyin Zhou et.al.	2507.02664v1	null