Papers | Project introduction (Chinese) |
Updated on 2024.05.20
NeRF
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-05-15 | From NeRFs to Gaussian Splats, and Back | Siming He et.al. | 2405.09717v1 | link |
2024-05-13 | Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Mingyu Kim et.al. | 2405.07857v1 | link |
2024-05-10 | OneTo3D: One Image to Re-editable Dynamic 3D Model and Video Generation | Jinwei Lin et.al. | 2405.06547v1 | link |
2024-05-07 | Tactile-Augmented Radiance Fields | Yiming Dou et.al. | 2405.04534v1 | link |
2024-05-03 | Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids | Junchen Liu et.al. | 2405.02386v1 | link |
2024-04-30 | MicroDreamer: Zero-shot 3D Generation in $\sim$20 Seconds by Score-based Iterative Reconstruction | Luxi Chen et.al. | 2404.19525v1 | link |
2024-04-26 | Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields | Tianqi Liu et.al. | 2404.17528v1 | link |
2024-04-26 | ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis | Zichen Tang et.al. | 2404.13711v2 | link |
2024-04-20 | EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment | Guanghao Li et.al. | 2404.13346v1 | link |
Visual Localization
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-05-13 | OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Qiuchi Xiang et.al. | 2405.07966v1 | link |
2024-05-13 | JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation | Xubo Luo et.al. | 2405.07429v1 | link |
2024-05-12 | BoQ: A Place is Worth a Bag of Learnable Queries | Amar Ali-bey et.al. | 2405.07364v1 | link |
2024-04-16 | SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard et.al. | 2404.10527v1 | link |
2024-04-20 | CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning | Haojian Huang et.al. | 2404.09640v3 | link |
2024-04-23 | 2DLIW-SLAM: 2D LiDAR-Inertial-Wheel Odometry with Real-Time Loop Closure | Bin Zhang et.al. | 2404.07644v5 | link |
2024-04-02 | TSCM: A Teacher-Student Model for Vision Place Recognition Using Cross-Metric Knowledge Distillation | Yehui Shen et.al. | 2404.01587v1 | link |
2024-03-28 | JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition | Gabriele Berton et.al. | 2403.19787v1 | link |
2024-03-26 | Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge | Dongjin Kim et.al. | 2403.17420v1 | link |
2024-03-20 | Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression | Huy-Hoang Bui et.al. | 2403.10297v2 | link |
2024-03-11 | LHMap-loc: Cross-Modal Monocular Localization Using LiDAR Point Cloud Heat Map | Xinrui Wu et.al. | 2403.05002v2 | link |
2024-04-01 | CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition | Feng Lu et.al. | 2402.19231v2 | link |
2024-02-28 | Representing 3D sparse map points and lines for camera relocalization | Bach-Thuan Bui et.al. | 2402.18011v1 | link |
2024-02-29 | Active propulsion noise shaping for multi-rotor aircraft localization | Gabriele Serussi et.al. | 2402.17289v2 | link |
2024-03-21 | NocPlace: Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer | Bingxi Liu et.al. | 2402.17159v2 | link |
2024-03-18 | Deep Homography Estimation for Visual Place Recognition | Feng Lu et.al. | 2402.16086v2 | link |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961v1 | link |
2024-04-03 | Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Feng Lu et.al. | 2402.14505v3 | link |
2024-02-15 | Self-Supervised Learning of Visual Robot Localization Using LED State Prediction as a Pretext Task | Mirko Nava et.al. | 2402.09886v1 | link |
2024-03-20 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359v2 | link |
2024-03-03 | Night-Rider: Nocturnal Vision-aided Localization in Streetlight Maps Using Invariant Extended Kalman Filtering | Tianxiao Gao et.al. | 2402.00330v2 | link |
Image Matching
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-05-14 | Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation | Rezkellah Noureddine Khiati et.al. | 2405.08556v1 | link |
2024-04-27 | MinBackProp – Backpropagating through Minimal Solvers | Diana Sungatullina et.al. | 2404.17993v1 | link |
2024-04-17 | A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching | Francesco Pro et.al. | 2404.11302v1 | link |
2024-04-13 | DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector | Johan Edstedt et.al. | 2404.08928v1 | link |
2024-03-23 | MatchSeg: Towards Better Segmentation via Reference Image Matching | Ruiqiang Xiao et.al. | 2403.15901v1 | link |
2024-02-21 | Visual Style Prompting with Swapping Self-Attention | Jaeseok Jeong et.al. | 2402.12974v2 | link |
2024-03-20 | Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang et.al. | 2402.08359v2 | link |
2024-01-18 | Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Songhe Deng et.al. | 2401.09883v1 | link |
2024-01-26 | RomniStereo: Recurrent Omnidirectional Stereo Matching | Hualie Jiang et.al. | 2401.04345v2 | link |
2023-12-22 | Harnessing Diffusion Models for Visual Perception with Meta Prompts | Qiang Wan et.al. | 2312.14733v1 | link |
2024-04-02 | Steerers: A framework for rotation equivariant keypoint descriptors | Georg Bökman et.al. | 2312.02152v2 | link |
2023-11-29 | LGFCTR: Local and Global Feature Convolutional Transformer for Image Matching | Wenhao Zhong et.al. | 2311.17571v1 | link |
2023-11-08 | Zero-shot Translation of Attention Patterns in VQA Models to Natural Language | Leonard Salewski et.al. | 2311.05043v1 | link |
2024-03-11 | Segment Anything Model is a Good Teacher for Local Feature Learning | Jingqian Wu et.al. | 2309.16992v2 | link |
2023-09-11 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris | Guoyuan An et.al. | 2309.05438v1 | link |
Keypoint Detection
Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2024-03-28 | Towards Long Term SLAM on Thermal Imagery | Colin Keil et.al. | 2403.19885v1 | link |
2024-03-28 | Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Xiao Lin et.al. | 2403.19527v1 | link |
2024-03-18 | FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events | Xiangyuan Wang et.al. | 2403.11662v1 | link |
2024-01-29 | Reconstructing Close Human Interactions from Multiple Views | Qing Shuai et.al. | 2401.16173v1 | link |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336v1 | link |
2024-01-08 | Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach | Huanyu Liu et.al. | 2401.03742v1 | link |
2024-04-30 | An Effective Image Copy-Move Forgery Detection Using Entropy Information | Li Jiang et.al. | 2312.11793v2 | link |
2023-12-11 | VoxelKP: A Voxel-based Network Architecture for Human Keypoint Estimation in LiDAR Data | Jian Shi et.al. | 2312.08871v1 | link |
2023-12-11 | Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach | Travis Driver et.al. | 2312.06865v1 | link |
2024-03-27 | Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer et.al. | 2311.18113v2 | link |
2024-04-02 | Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt et.al. | 2311.17024v2 | link |
2024-04-26 | Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models | Xiaoyu Yang et.al. | 2311.12327v2 | link |
2023-11-20 | CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement | Boni Hu et.al. | 2311.11604v1 | link |
2023-11-11 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer | Haoyu Ma et.al. | 2311.06443v1 | link |
2023-11-06 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains | Alexander Naumann et.al. | 2311.03124v1 | link |
2023-10-12 | UniPose: Detecting Any Keypoints | Jie Yang et.al. | 2310.08530v1 | link |
2023-10-10 | L-DYNO: Framework to Learn Consistent Visual Features Using Robot’s Motion | Kartikeya Singh et.al. | 2310.06249v1 | link |
2023-10-04 | Self-supervised Learning of Contextualized Local Visual Embeddings | Thalles Santos Silva et.al. | 2310.00527v3 | link |