This is a reimplemented end-to-end version of the S2LD correspondence estimation described in the original paper:

**Sparse-to-Local-Dense Matching for Geometry-Guided Correspondence Estimation**
Shenghao Li, Qunfei Zhao*, Zeyang Xia
*IEEE Transactions on Image Processing*, 2023
S2LD proposes a novel sparse-to-local-dense matching framework for geometry-guided correspondence estimation. The architecture consists of three key components:

**1. Attention-Based Feature Extraction**
- Utilizes attention mechanisms to extract features with global receptive fields
- Enables feature descriptors to capture global contextual information
- Enhances matching robustness and accuracy across different viewing conditions

**2. Sparse-to-Local-Dense Matching** (sketched after this list)
- Sparse matching stage: first establishes sparse correspondences across the entire image
- Local dense matching stage: performs dense matching in local regions around the sparse keypoints
- Progressively refines correspondences at multiple levels to reduce reprojection errors
- Maintains sub-pixel consistency while reducing computational complexity

**3. Noise-Aware 3D Supervision**
- Designed around differentiable triangulation
- Provides additional 3D geometric guidance during training
- Handles supervision noise arising from camera pose and depth map errors
- Improves the model's generalization capability
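The local dense stage can be pictured as correlation plus soft-argmax around each coarse match. The PyTorch sketch below is a minimal illustration under assumed shapes and names (`local_dense_refine`, the window size, and the temperature are ours, not the repository's API); the actual model refines progressively at multiple levels.

```python
import torch

def local_dense_refine(feat_a, feat_b, kpts_a, kpts_b, win=5, temp=0.1):
    """Refine coarse matches kpts_a -> kpts_b to sub-pixel accuracy via dense
    correlation inside a local window of feat_b (hypothetical sketch).
    feat_a, feat_b: [C, H, W] feature maps; kpts_*: [N, 2] long (x, y)."""
    C, H, W = feat_b.shape
    r = win // 2
    # Descriptor of each sparse keypoint in image A: [N, C]
    desc_a = feat_a[:, kpts_a[:, 1], kpts_a[:, 0]].t()
    refined = []
    for d, (x, y) in zip(desc_a, kpts_b.tolist()):
        # Keep the search window inside the feature map
        x0 = max(0, min(x - r, W - win))
        y0 = max(0, min(y - r, H - win))
        patch = feat_b[:, y0:y0 + win, x0:x0 + win]         # [C, win, win]
        corr = (d[:, None, None] * patch).sum(0) / temp     # correlation scores
        prob = corr.flatten().softmax(0).view(win, win)     # matching distribution
        ys, xs = torch.meshgrid(torch.arange(win, dtype=torch.float32),
                                torch.arange(win, dtype=torch.float32),
                                indexing="ij")
        # Soft-argmax: expectation over the window -> sub-pixel location
        refined.append(torch.stack([(prob * xs).sum() + x0,
                                    (prob * ys).sum() + y0]))
    return torch.stack(refined)                             # [N, 2] refined matches
```

Because the expectation is differentiable, gradients flow through the refined coordinates, which is what allows sub-pixel supervision end to end.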
Instead of performing dense matching across entire images (computationally expensive) or only sparse matching (less accurate), S2LD adopts an asymmetric approach:

**Asymmetric sparse-to-dense matching**
- Efficiently detects sparse feature points with global context
- Densifies matches locally around geometrically promising regions
- Achieves a strong trade-off between accuracy and efficiency

**Global receptive fields** (sketched below)
- Feature descriptors leverage attention mechanisms to achieve global receptive fields
- Captures long-range dependencies and contextual information
- Significantly improves matching robustness in challenging scenarios (occlusions, texture-poor regions)
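To make the global receptive field idea concrete, here is a minimal sketch of attention-augmented features. The module name and dimensions are illustrative; a practical model would use an efficient attention variant (full self-attention over all H*W positions is quadratic) and also cross-attention between the two images.

```python
import torch
import torch.nn as nn

class GlobalContextFeatures(nn.Module):
    """Sketch: give every descriptor a global receptive field by running
    self-attention over the flattened CNN feature map."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, fmap):
        # fmap: [B, C, H, W] local CNN features
        B, C, H, W = fmap.shape
        x = fmap.flatten(2).transpose(1, 2)           # [B, H*W, C] tokens
        # Each position attends to all others -> long-range context
        ctx, _ = self.attn(x, x, x, need_weights=False)
        x = self.norm(x + ctx)                        # residual + norm
        return x.transpose(1, 2).view(B, C, H, W)
```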
**Geometry-guided search**
- Explicitly uses geometric information from sparse features to guide dense matching
- Reduces ambiguity in the correspondence search space
- Ensures geometric consistency throughout the matching pipeline

**Noise-aware 3D regularization** (sketched below)
- A novel 3D noise-aware regularizer handles imperfect supervision signals
- Differentiable triangulation provides 3D geometric constraints during training
- Improves robustness to camera pose and depth estimation errors
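The regularizer hinges on triangulation being differentiable. Below is a minimal sketch of standard DLT (direct linear transform) two-view triangulation written so gradients flow back to the 2D matches; this illustrates the generic technique, not the paper's exact loss, and `P1`/`P2` are assumed 3x4 camera projection matrices.

```python
import torch

def dlt_triangulate(P1, P2, x1, x2):
    # P1, P2: [3, 4] camera projection matrices
    # x1, x2: [N, 2] corresponding points in the two images (pixels)
    # Per correspondence, build the 4x4 DLT system:
    #   x * P[2] - P[0] = 0,  y * P[2] - P[1] = 0   (for each view)
    A = torch.stack([
        x1[:, 0:1] * P1[2] - P1[0],
        x1[:, 1:2] * P1[2] - P1[1],
        x2[:, 0:1] * P2[2] - P2[0],
        x2[:, 1:2] * P2[2] - P2[1],
    ], dim=1)                                  # [N, 4, 4]
    # The 3D point is the null vector of A: the smallest right singular vector.
    # torch.linalg.svd is differentiable, so gradients reach x1 and x2.
    _, _, Vh = torch.linalg.svd(A)
    X = Vh[:, -1]                              # [N, 4] homogeneous points
    return X[:, :3] / X[:, 3:4]                # [N, 3] Euclidean points
```

A noise-aware loss could then compare points triangulated from predicted matches against (noisy) ground-truth geometry and down-weight unstable correspondences; the paper's actual formulation is richer than this sketch.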
S2LD demonstrates state-of-the-art performance on multiple geometric estimation benchmarks:
| Method | Type | AUC@5° | AUC@10° | AUC@20° | MMA@5E-4 |
|---|---|---|---|---|---|
| Sup.+SG. | Sparse | 36.78% | 54.68% | 71.02% | 99.21% |
| NCNet | Dense | 25.89% | 41.79% | 57.94% | 82.62% |
| DRCNet | Dense | 27.70% | 43.04% | 56.78% | 83.50% |
| LoFTR-DS | Dense | 48.41% | 65.04% | 78.28% | 95.43% |
| S2LD (Ours) | Sparse-to-Dense | 49.73% | 65.69% | 78.84% | 96.16% |
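For reference, AUC@τ here is the area under the cumulative curve of pose error (typically the maximum of the angular rotation and translation errors) up to threshold τ, normalized by τ. A NumPy sketch following the common evaluation protocol of SuperGlue-style benchmarks (assumed, not taken from this repository):

```python
import numpy as np

def pose_auc(errors, thresholds=(5, 10, 20)):
    # errors: per-image-pair pose error in degrees
    errors = np.sort(np.asarray(errors, dtype=np.float64))
    recall = (np.arange(len(errors)) + 1) / len(errors)   # cumulative recall
    errors = np.concatenate([[0.0], errors])
    recall = np.concatenate([[0.0], recall])
    aucs = {}
    for t in thresholds:
        last = np.searchsorted(errors, t)                 # entries below t
        e = np.concatenate([errors[:last], [t]])
        r = np.concatenate([recall[:last], [recall[last - 1]]])
        aucs[f"AUC@{t}"] = np.trapz(r, x=e) / t           # normalized area
    return aucs
```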
Key Achievements:
- +12.95 percentage points over the best sparse method (Sup.+SG.) at AUC@5°
- +1.32 percentage points over the best dense method (LoFTR-DS) at AUC@5°
- Superior pose estimation accuracy with better computational efficiency
- Accuracy: Achieves sub-pixel matching accuracy with high geometric consistency
- Efficiency: Faster than dense matching methods while more accurate than sparse methods
- Robustness: Handles challenging scenarios including occlusions, texture-poor regions, and large viewpoint changes
- Generalization: Strong performance across indoor and outdoor scenes
```bash
# For full pytorch-lightning trainer features (recommended)
conda env create -f environment.yaml
conda activate s2ld
```

We provide download links to:
- the MegaDepth-1500 test set;
- the pretrained models of end-to-end S2LD.
Run the matching demo with:

```bash
python demo_match.py --weight ./weights/s2ld-e2e-inference.pt
```

Generally, MegaDepth is needed for training: the original dataset plus the offline-generated dataset indices. The dataset indices store the scenes, image pairs, and other metadata within each dataset used for training/validation/testing. The relative poses between images used for training are cached directly in the indexing files.
Download the required dataset indices from the following link. After downloading, unzip the required files:
```bash
unzip downloaded-file.zip

# extract dataset indices
tar xf train-data/megadepth_indices.tar

# extract testing data (optional)
tar xf testdata/megadepth_test_1500.tar
```

Build the dataset symlinks:
```bash
# megadepth
# -- # train and test dataset (train and test share the same dataset)
ln -s /path/to/megadepth/Undistorted_SfM /path/to/S2LD/data/megadepth/train
ln -s /path/to/megadepth/Undistorted_SfM /path/to/S2LD/data/megadepth/test

# -- # dataset indices
ln -s /path/to/megadepth_indices/* /path/to/S2LD/data/megadepth/index
```

To train the end-to-end model, run:

```bash
scripts/train/train_outdoor_ds_e2e.sh
```

NOTE: Training uses 2 GPUs only, with an image size of 640x640. This is a reproduction of end-to-end sparse-to-local-dense correspondence estimation, so results may not be exactly aligned with those presented in the paper.
To evaluate on the test set, run:

```bash
scripts/test/test_outdoor_ds_e2e.sh
```

For visualizing the results, please refer to `notebooks/visualize_dump_results.ipynb`.
If you find this code useful for your research, please use the following BibTeX entry.
```bibtex
@ARTICLE{li2023s2ld,
  author={Li, Shenghao and Zhao, Qunfei and Xia, Zeyang},
  journal={IEEE Transactions on Image Processing},
  title={Sparse-to-Local-Dense Matching for Geometry-Guided Correspondence Estimation},
  year={2023},
  volume={32},
  number={},
  pages={3536-3551},
  doi={10.1109/TIP.2023.3287500}
}
```