论文 | 项目ä¸æ–‡ç®€ä»‹
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-07 | Multi-style Neural Radiance Field with AdaIN | Yu-Wen Pao et.al. | 2406.04960v1 | link |
2024-06-07 | DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu et.al. | 2406.04322v2 | link |
2024-06-06 | How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen et.al. | 2406.04101v1 | link |
2024-06-03 | Self-Calibrating 4D Novel View Synthesis from Monocular Videos Using Gaussian Splatting | Fang Li et.al. | 2406.01042v1 | link |
2024-05-30 | $\textit{S}^3$Gaussian: Self-Supervised Street Gaussians for Autonomous Driving | Nan Huang et.al. | 2405.20323v1 | link |
2024-05-30 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields | Haodi He et.al. | 2405.19678v1 | link |
2024-06-02 | NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren et.al. | 2405.18715v2 | link |
2024-05-24 | Neural Elevation Models for Terrain Mapping and Path Planning | Adam Dai et.al. | 2405.15227v1 | link |
2024-05-23 | Camera Relocalization in Shadow-free Neural Radiance Fields | Shiyao Xu et.al. | 2405.14824v1 | link |
2024-06-10 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717v2 | link |
2024-06-05 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857v3 | link |
2024-05-11 | TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization | Zhen Tan et.al. | 2405.07027v1 | link |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-10 | Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation | Shenghao Li et.al. | 2406.06374v1 | link |
2024-06-06 | GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang et.al. | 2406.04340v1 | link |
2024-06-02 | Visual place recognition for aerial imagery: A survey | Ivan Moskalenko et.al. | 2406.00885v1 | link |
2024-05-20 | UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization | Wenjia Xu et.al. | 2405.11936v1 | link |
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966v1 | link |
2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429v1 | link |
2024-05-12 | BoQ: A Place is Worth a Bag of Learnable Queries | Amar Ali-bey et.al. | 2405.07364v1 | link |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527v1 | link |
2024-04-20 | CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning | Haojian Huang et.al. | 2404.09640v3 | link |
2024-04-23 | 2DLIW-SLAM:2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644v5 | link |
2024-04-02 | TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Yehui Shen et.al. | 2404.01587v1 | link |
2024-03-28 | JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition | Gabriele Berton et.al. | 2403.19787v1 | link |
2024-03-26 | Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Dongjin Kim et.al. | 2403.17420v1 | link |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297v2 | link |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002v2 | link |
2024-04-01 | CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feng Lu et.al. | 2402.19231v2 | link |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011v1 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-05-21 | OmniGlue: Generalizable Feature Matching with Foundation Model Guidance | Hanwen Jiang et.al. | 2405.12979v1 | link |
2024-05-14 | Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation | Rezkellah Noureddine Khiati et.al. | 2405.08556v1 | link |
2024-06-10 | MinBackProp -- Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993v2 | link |
2024-05-23 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302v2 | link |
2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928v1 | link |
2024-03-23 | MatchSeg: Towards Better Segmentation via Reference Image Matching | Ruiqiang Xiao et.al. | 2403.15901v1 | link |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974v2 | link |
2024-03-20 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359v2 | link |
2024-01-18 | Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Songhe Deng et.al. | 2401.09883v1 | link |
2024-01-26 | RomniStereo: Recurrent Omnidirectional Stereo Matching | Hualie Jiang et.al. | 2401.04345v2 | link |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733v1 | link |
2024-04-02 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152v2 | link |
2023-11-29 | LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching | Wenhao Zhong et.al. | 2311.17571v1 | link |
2023-11-08 | Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Leonard Salewski et.al. | 2311.05043v1 | link |
2024-03-11 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992v2 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-06-03 | Scale-Free Image Keypoints Using Differentiable Persistent Homology | Giovanni Barbarani et.al. | 2406.01315v1 | link |
2024-06-01 | Benchmarking Fish Dataset and Evaluation Metric in Keypoint Detection -- Towards Precise Fish Morphological Assessment in Aquaculture Breeding | Weizhen Liu et.al. | 2405.12476v2 | link |
2024-03-28 | Towards Long Term SLAM on Thermal Imagery | Colin Keil et.al. | 2403.19885v1 | link |
2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527v1 | link |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662v1 | link |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173v1 | link |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336v1 | link |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742v1 | link |
2024-04-30 | An Effective Image Copy-Move Forgery Detection Using Entropy Information | Li Jiang et.al. | 2312.11793v2 | link |
2023-12-11 | VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data | Jian Shi et.al. | 2312.08871v1 | link |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865v1 | link |
2024-03-27 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113v2 | link |
2024-04-02 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024v2 | link |
2024-04-26 | Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models | Xiaoyu Yang et.al. | 2311.12327v2 | link |
2023-11-20 | CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement | Boni Hu et.al. | 2311.11604v1 | link |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443v1 | link |
2023-11-06 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains | Alexander Naumann et.al. | 2311.03124v1 | link |
2023-10-12 | UniPose: Detecting Any Keypoints | Jie Yang et.al. | 2310.08530v1 | link |