D2NT: A High-Performing Depth-to-Normal Translator

This repo is the official implementation of the paper:

"D2NT: A High-Performing Depth-to-Normal Translator"

[arXiv] [homepage] [video]

Introduction

D2NT is a high-performance Python library for converting depth maps directly to surface normal maps. Unlike prevalent methods or computational libraries (e.g., Open3D, Kornia) that typically require projecting depth images to 3D point clouds and then estimating normals through local plane fitting, D2NT explicitly constructs the mathematical relationship between depth maps and normal maps, enabling end-to-end normal estimation. On 640×480 images, D2NT achieves a computational speed 28× faster than Open3D and 1.8× faster than Kornia (even though it's implemented with cuda), significantly outperforming traditional point-cloud-based approaches in both speed and accuracy.

D2NT provides three algorithm versions with increasing accuracy: a fast basic version, an optimized version with Discontinuity-Aware Gradient (DAG) filter, and a refined version with DAG filter and MRF-based Normal Refinement. The library is designed for efficiency, accuracy, and ease of use in computer vision and robotics applications.

Installation

Install from PyPI (Recommended)

pip install d2nt

Install from Source

# Clone the repository
git clone https://github.com/fengyi233/depth-to-normal-translator.git
cd depth-to-normal-translator

# Install in development mode
pip install -e .

# Or install normally
pip install .

Prerequisites

Python >= 3.7
numpy >= 1.20.0
opencv-python >= 4.0.0
matplotlib >= 3.5.0 (optional, for visualization)

Dataset Preparation

Public real-world datasets generally obtain surface normals by local plane fitting, which makes the surface normal ground truth unreliable. Therefore, we use the synthesis 3F2N dataset provided in this paper to evaluate estimation performance.

The 3F2N dataset can be downloaded from:
GoogleDrive
The dataset is organized as follows:

3F2N
 |-- Easy
 |  |-- android
 |  |  |-- depth
 |  |  |-- normal
 |  |  |-- params.txt
 |  |  |-- pose.txt
 |  |-- cube
 |  |-- ...
 |  |-- torusknot
 |-- Medium
 |  |-- ...
 |-- Hard
 |  |-- ...

Usage

Python Package Usage

After installation, you can use the depth2normal() function directly:

import numpy as np
from d2nt import depth2normal

# Prepare depth map (example)
depth = np.random.rand(480, 640) * 10.0

cam_intrinsic = np.array([
    [525.0, 0, 320.0],  # fx=525.0, u0=320.0
    [0, 525.0, 240.0],  # fy=525.0, v0=240.0
    [0, 0, 1]
])

# Convert depth to normal
normal = depth2normal(depth, cam_intrinsic, version='d2nt_v3')

print(f"Normal map shape: {normal.shape}")  # (480, 640, 3)

Algorithm Versions

d2nt_basic: Basic version without any optimization method
d2nt_v2: With Discontinuity-Aware Gradient (DAG) filter
d2nt_v3: With DAG filter and MRF-based Normal Refinement (MNR) module (recommended)

Performance

We benchmarked D2NT against popular depth-to-normal conversion libraries on 640×480 images:

Method	FPS	Speedup vs d2nt_basic
d2nt_basic	65.5	1.0× (baseline)
Kornia	36.9	0.56× (1.8× slower)
Open3D	2.3	0.04× (28× slower)

Note: Performance was measured on a standard CPU. Results may vary depending on hardware configuration. Run python test_speed.py to benchmark on your system.

Python Demo

Run the demo script in the root directory to see visualization results and error maps:

python demo.py

This will display:

Ground truth normal map
Estimated normal map
Error map (in degrees) with mean angular error

The demo uses test data from demo_data/ directory. The results will be saved in demo_results/ directory.

You can change the VERSION parameter in demo.py to select different D2NT versions:

d2nt_basic: Basic version without any optimization method
d2nt_v2: With Discontinuity-Aware Gradient (DAG) filter
d2nt_v3: With DAG filter and MRF-based Normal Refinement (MNR) module (recommended)

Coordinate System and Normal Visualization

Coordinate System

The normal vectors returned by depth2normal() are defined in the camera coordinate system:

X-axis (nx): Points to the right in the image
Y-axis (ny): Points downward in the image
Z-axis (nz): Points forward into the image plane

The normal vectors are normalized unit vectors with values in the range [-1, 1] for each component.

Normal Visualization

This package provides visualization functions that convert normal vectors to RGB images for display:

from d2nt import get_normal_vis, get_normal_vis_reference
import matplotlib.pyplot as plt

# Visualize normal map
normal_vis = get_normal_vis(normal, valid_mask=mask)
plt.imshow(normal_vis)

# Generate normal visualization reference
plt.imsave("normal_vis_reference.png", get_normal_vis_reference())

Visualization Formula: The visualization uses the formula:

normal_img = (1 - normal) / 2

This formula maps normal vectors from the range [-1, 1] to [0, 1] for RGB display. This convention is chosen to match the 3F2N dataset ground truth encoding, where normal maps are stored with this specific encoding scheme.

Note on Different Conventions: Other projects (e.g., DSINE) may use the alternative formula (normal + 1) / 2 for visualization, which is also valid. However, when comparing results or using different datasets, it is important to be aware of which convention is being used, as the color mapping will be inverted. The get_normal_vis() function in this package uses (1 - normal) / 2 to maintain consistency with the 3F2N dataset format.

Cite

If you find our work useful in your research, please consider citing our paper:

@inproceedings{feng2023d2nt,
	author      = {{Yi Feng, Bohuan Xue, Ming Liu, Qijun Chen, and Rui Fan}},
	title       = {{D2NT: A High-Performing Depth-to-Normal Translator}},
	booktitle   = {{IEEE International Conference on Robotics and Automation (ICRA)}},
	year        = {{2023}}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
assets		assets
d2nt		d2nt
demo_data		demo_data
demo_results		demo_results
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
demo.py		demo.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test_speed.py		test_speed.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

D2NT: A High-Performing Depth-to-Normal Translator

Introduction

Installation

Install from PyPI (Recommended)

Install from Source

Prerequisites

Dataset Preparation

Usage

Python Package Usage

Algorithm Versions

Performance

Python Demo

Coordinate System and Normal Visualization

Coordinate System

Normal Visualization

Cite

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

D2NT: A High-Performing Depth-to-Normal Translator

Introduction

Installation

Install from PyPI (Recommended)

Install from Source

Prerequisites

Dataset Preparation

Usage

Python Package Usage

Algorithm Versions

Performance

Python Demo

Coordinate System and Normal Visualization

Coordinate System

Normal Visualization

Cite

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages