VQVDB: AI Compression for OpenVDB Grids

Download the latest version

Overview

VQVDB is a deep learning–powered compressor for volumetric data stored in OpenVDB. It uses Vector Quantized Variational Autoencoders (VQ-VAE) to learn a compact latent space and achieves up to 32× compression of float voxel grids, nearly lossless at the visual level.

VQVDB is designed for GPU-accelerated decoding via CUDA but also support CPU encoding / decoding, enabling real-time decompression of large volumes, with native support for integration into Houdini.

📂 File Format

Each .vqvdb file stores:

Section	Description
Header	Magic, version, codebook size, shape info
Codebook	256×128 float matrix
Index Tensors	[B × 4 × 4 × 4] uint8 values
Origins	Per-leaf grid coordinates

🧠 How it Works

Training Pipeline

I'm very bad at programming and couldn't compile pyopenvdb. so I had to extract vdb data to .npy files.

Leaf Extraction
Extract all non-empty 8×8×8 voxel blocks as well as the coords of each leaf origins (for reconstruction) from a dataset of VDB volumes.
VQ-VAE Model
A PyTorch-based encoder compresses each leaf into a latent vector of size D in this case 128 dimmenions.
The quantizer maps this vector to the closest of K learned codebook entries, the codebook as 256, which makes it fit in a uint8_t.
Loss Function
Optimized using the following objective:
$$\mathcal{L} = \|x - \hat{x}\|^2 + \beta \cdot \| \text{sg}[z_e(x)] - e \|^2$$
with Exponential Moving Average (EMA) updates to the embedding table.

Runtime Decompression (C++/CUDA)

Load .vqvdb file, which contains:
- Codebook (float32)
- Per-leaf index tensors (4×4×4×uint8_t = 64 bytes)
- Leaf origins (openvdb::Coord)
GPU Decoding
- Launch CUDA kernel to decode latent codes into dense voxel blocks
- Allocate leaf nodes in a new openvdb::FloatGrid
- Write the reconstructed 8×8×8 voxel blocks
Streaming-Friendly
- Decompression supports lazy loading in batches
- Low VRAM footprint when streaming large scenes

📈 Future Work

Hierarchical VQ-VAE (multi-res compression)
Residual VAE
Transformer-based latent upsampling
Real-time decompression via Vulkan compute
VDB segmentation / semantic-aware encoding

📜 Citation

If you use VQVDB in academic work, please cite the project:

@article{Crema2025,
author = "Enzo Crema",
title = "{VQVDB : VDB Compression using VQ-VAEs}",
year = "2025",
month = "7",
url = "https://figshare.com/articles/dataset/VQVDB_VDB_Compression_using_VQ-VAEs/29469083",
doi = "10.6084/m9.figshare.29469083.v1"
}

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
houdini_package		houdini_package
python		python
src		src
.clang-format		.clang-format
.gitignore		.gitignore
BUILDING.md		BUILDING.md
CMakeLists.txt		CMakeLists.txt
Jenkinsfile		Jenkinsfile
LICENSE		LICENSE
README.md		README.md
build.bat		build.bat
build.sh		build.sh
notebook_scalar.ipynb		notebook_scalar.ipynb
notebook_vec3f.ipynb		notebook_vec3f.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VQVDB: AI Compression for OpenVDB Grids

Download the latest version

Overview

📂 File Format

🧠 How it Works

Training Pipeline

Runtime Decompression (C++/CUDA)

📈 Future Work

📜 Citation

About

Uh oh!

Releases 5

Contributors 3

Languages

License

ZephirFXEC/VQVDB

Folders and files

Latest commit

History

Repository files navigation

VQVDB: AI Compression for OpenVDB Grids

Download the latest version

Overview

📂 File Format

🧠 How it Works

Training Pipeline

Runtime Decompression (C++/CUDA)

📈 Future Work

📜 Citation

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Contributors 3

Languages