A fast and simple structure-from-motion pipeline written in PyTorch with optional custom CUDA kernel acceleration, focused on images that densely cover a scene, for dense 3D reconstruction applications such as NeRFs.
[Paper] [Project Page]
Currently we only support Linux.
- Install PyTorch following the instructions in the official website.
- Install the other Python packages with the following commands:
```bash
pip install trimesh "pyglet<2" pyyaml dacite loguru prettytable psutil
pip install git+https://github.com/jiahaoli95/pyrender.git
```
- Install COLMAP following the instructions on the official website. If you already have the matching databases (see below) and want to run FastMap starting from them, this step is optional.
- (Optional but highly recommended for speed) Compile the custom CUDA kernels:
```bash
python setup.py build_ext --inplace
```
The structure-from-motion pipeline consists of two parts: feature matching and pose estimation. We use the image matching routines in COLMAP to obtain a database containing the matching results, and feed it to FastMap to estimate the camera poses and triangulate a sparse point cloud. Given a directory of images (possibly with nested subdirectories), the easiest way to run the pipeline with the default configuration is (assuming you have a monitor connected):
```bash
# keypoint detection and feature extraction
colmap feature_extractor --database_path /path/to/your/database.db --image_path /path/to/your/image/directory
# matching
colmap exhaustive_matcher --database_path /path/to/your/database.db
# pose estimation with FastMap (if you do not need the colored point cloud, you may omit the --image_dir option for some potential speedup)
python run.py --database /path/to/your/database.db --image_dir /path/to/your/image/directory --output_dir /your/output/directory
```
An interactive visualization will appear after FastMap finishes, and the results will be stored in the provided output directory in the same format as COLMAP. Please refer to the official COLMAP tutorial and command-line interface guide for the various options that can be passed to the feature extractor and matcher (e.g., accelerating with GPUs).
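Because the output follows the COLMAP model format, any COLMAP-compatible tooling can read it. As a minimal sketch (assuming pycolmap is installed, which is not a FastMap dependency; any reader of the COLMAP format works equally well):
```python
# Minimal sketch: load FastMap's output with pycolmap (our assumption;
# any reader of the COLMAP model format works equally well).
import pycolmap

rec = pycolmap.Reconstruction("/your/output/directory/sparse/0")
print(f"registered images: {len(rec.images)}")
print(f"3D points: {len(rec.points3D)}")
for image_id, image in rec.images.items():
    print(image_id, image.name)
```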
There are many hyperparameters and options that control the behavior of FastMap. They are specified as a set of dataclasses (with default values) in fastmap/config.py. To change the config, pass the path of a YAML file to run.py:
```bash
python run.py --config /path/to/your/config.yaml --database /path/to/your/database.db --image_dir /path/to/your/image/directory --output_dir /your/output/directory
```
The YAML file only needs to specify the parameters you want to change; for example (see fastmap/config.py for all available options):
```yaml
distortion:
  num_levels: 5
epipolar_adjustment:
  num_irls_steps: 4
  num_prune_steps: 2
sparse_reconstruction:
  reproj_err_thr: 10.0
```
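For intuition, here is a minimal sketch of how such YAML overrides can be merged into dataclass defaults with pyyaml and dacite (both listed in the dependencies above); the dataclass and merging logic here are illustrative stand-ins, and what run.py actually does may differ:
```python
# Illustrative sketch only: merge YAML overrides into dataclass defaults.
# DistortionConfig is a hypothetical stand-in for those in fastmap/config.py.
from dataclasses import asdict, dataclass

import dacite
import yaml

@dataclass
class DistortionConfig:
    num_levels: int = 3  # hypothetical default

with open("/path/to/your/config.yaml") as f:
    overrides = yaml.safe_load(f) or {}

# keys present in the YAML win; everything else keeps its default
merged = {**asdict(DistortionConfig()), **overrides.get("distortion", {})}
config = dacite.from_dict(data_class=DistortionConfig, data=merged)
print(config.num_levels)  # 5 with the example YAML above
```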
By default, FastMap runs on the first available GPU. If you want to run it on another device, use the --device option:
```bash
python run.py --device cuda:2 --database /path/to/your/database.db --image_dir /path/to/your/image/directory --output_dir /your/output/directory
```
So far we only support running on a single GPU.
If you are running on a server without a monitor connected, you need to pass the --headless flag:
```bash
python run.py --headless --database /path/to/your/database.db --image_dir /path/to/your/image/directory --output_dir /your/output/directory
```
Default Camera Model: By default, we assume a SIMPLE_RADIAL camera model with the principal point at the image center and two unknown parameters: the focal length and a radial distortion coefficient. FastMap groups images that are believed to share the same camera parameters according to the following rules (see the sketch after the list):
- Images from different subdirectories are considered to have different intrinsics.
- Within a subdirectory, images with the same size and whose EXIF specifies the same focal length are considered to be from the same camera.
- For images within the same directory that lack EXIF data, those of the same size are considered to be from the same camera.
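As a rough illustration of these rules (not FastMap's actual implementation), a grouping key could be computed like this, assuming Pillow for reading image sizes and EXIF:
```python
# Rough illustration of the grouping rules above, assuming Pillow;
# not FastMap's actual implementation.
import os

from PIL import ExifTags, Image

def camera_group_key(image_path: str) -> tuple:
    subdir = os.path.dirname(image_path)  # rule 1: subdirectory
    with Image.open(image_path) as img:
        size = img.size  # rules 2 and 3: image size
        exif = img.getexif().get_ifd(ExifTags.IFD.Exif)
    focal = exif.get(ExifTags.Base.FocalLength)  # rule 2: None if no EXIF
    # images with identical keys would be treated as sharing a camera
    return (subdir, size, focal)
```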
Pinhole Camera: If you are sure that the distortion is negligible and a pinhole camera model is appropriate, you can use the --pinhole flag to tell FastMap not to estimate the distortion parameter, which saves some time. In practice, however, even distortion that is imperceptible to the human eye can negatively affect the estimation of focal length and pose.
Calibrated Camera: If the cameras are calibrated, and the focal length and principal point are stored in the database, you can pass the --calibrated flag to use the known intrinsics. While the focal length might still change during optimization, the principal point will remain fixed. Note that with this flag FastMap assumes a pinhole camera model, so the images should be properly undistorted before feature extraction and matching.
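To check which intrinsics are actually stored in your database before passing --calibrated, one possibility is to query the COLMAP database directly. A sketch, assuming numpy is available:
```python
# Sketch: inspect the intrinsics stored in a COLMAP database, which is
# what --calibrated reads. The params column is a blob of float64 values
# whose interpretation depends on the camera model id.
import sqlite3

import numpy as np

with sqlite3.connect("/path/to/your/database.db") as conn:
    rows = conn.execute(
        "SELECT camera_id, model, width, height, params FROM cameras"
    )
    for camera_id, model, width, height, params in rows:
        values = np.frombuffer(params, dtype=np.float64)
        print(camera_id, model, width, height, values)
```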
We include a simple visualizer for inspecting results in the COLMAP format (which FastMap adopts). COLMAP supports outputting multiple models, so results are stored in numbered subdirectories such as sparse/0, sparse/1, and so on. FastMap always outputs a single model and discards the images that fail to be registered, but for consistency we follow the same naming convention, so everything is stored in the sparse/0 subdirectory. To interactively visualize a model stored in sparse/0 (including the camera poses and point cloud), use the following command:
```bash
python -m fastmap.vis /your/output/directory/sparse/0
```
You can pass options to the script to control the viewer behavior. For example, to use a different point size for the point cloud:
```bash
python -m fastmap.vis /your/output/directory/sparse/0 --viewer_options point_size=5
```
Please see fastmap/vis.py for the complete set of supported options.
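For intuition, key=value options like the one above could be parsed as follows; this is only a sketch, and the actual option handling in fastmap/vis.py may differ:
```python
# Sketch of parsing key=value viewer options such as "point_size=5";
# the real handling lives in fastmap/vis.py and may differ.
def parse_viewer_options(pairs):
    options = {}
    for pair in pairs:
        key, _, value = pair.partition("=")
        for cast in (int, float):
            try:
                options[key] = cast(value)
                break
            except ValueError:
                continue
        else:
            options[key] = value  # fall back to the raw string
    return options

print(parse_viewer_options(["point_size=5"]))  # {'point_size': 5}
```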
If FastMap fails on a database on which other methods such as COLMAP succeed, you can use the --gt_model option to provide a good model to run.py. Information potentially useful for debugging (such as a comparison of intrinsics and the errors in relative rotation and translation) will be printed.
Images, pre-computed databases and ground truths to reproduce our benchmarks are hosted here. Download a subset to start playing with FastMap.
This method works best on datasets with high-quality images intended for dense 3D reconstruction (e.g., as a preprocessing step before NeRF). It trades robustness for simplicity and speed, so it is not particularly careful in countering the negative effects of outlier matches. In cases such as sparse scene coverage, low-quality matching, or degenerate motion (e.g., collinear translation), it is less robust than COLMAP and GLOMAP and is prone to catastrophic failures.
If you use this tool in your research, please cite the following paper:
```bibtex
@article{fastmap2025,
  author  = {Jiahao Li and Haochen Wang and Muhammad Zubair Irshad and Igor Vasiljevic and Matthew R. Walter and Vitor Campagnolo Guizilini and Greg Shakhnarovich},
  title   = {FastMap: Revisiting Structure from Motion through First-Order Optimization},
  journal = {arXiv preprint arXiv:2505.04612},
  year    = {2025},
}
```