Learning about voxelization, point clouds, 3D meshes and implicit surfaces
October 23, 2020 • Pratulya Bubna
Unlike in 2D where we have images (array of pixels — a grid) as a well-defined choice of representation, there is no such consensus in case of 3D data. Different 3D data representations have varying geometric structure and properties. In this post, we’ll cover some of the commonly chosen 3D representations and discuss their properties.
Needless to say, different data representations entail different datasets, different deep learning architectures and sometimes different tasks as well.
Voxels in 3D are analagous to pixels in 2D. Just as pixels are basic elements on a regular 2D grid, voxels are volumetric elements that make up volume in 3D space.
Imagine a voxel as a cube that represents a single data point on a regularly-spaced 3D grid. Many such voxels would approximate a continuous 3D surface.
Voxels don't have their positions encoded (inferred from relative positions) and can contain multiple scalar values like density, color, opacity etc.
discrete
units (similar to pixels) and hence serve as an approximation to continuous surfaces. Thus it is bound to suffer from artifacts.Voxelization is the process of converting a geometric object from its continuous geometric representation into a set of voxels that approximate it.
A point cloud is an unordered set of points in a space that approximates the geometry of 3D objects.
A 3D Mesh, or polygonal mesh, approximates surfaces via a set of 2D polygons in 3D space. A mesh structure consists of faces, a set of vertices (coordinates) in 3D space, and edges — a connectivity list that describes how the vertices are connected with each other.
A mesh provides an efficient, non-uniform representation of a shape: a small number of polygons (coarser
) can cover large, simple surfaces; and, many higher resolution polygons (finer
) can faithfully represent intricate, detailed geometry.
Most commonly, triangular meshes are used as they are non-planar, memory-efficient and can be rendered fast
Implicit representations, eg. level sets, represent the surface as a continuous function. They are more expressive and are able to capture more geometrical information of 3D shapes.
Level sets
, for instance, are equipped with mathematical formulations that permit the inclusion of geometric quantities such as surface orientation, smoothness and volume.
Park et al. in Signed Distance Function
(SDF).
An SDF is a continuous function that, for a given spatial point, outputs the point’s distance to the closest surface, whose sign encodes whether the point is inside (negative) or outside (positive) of the watertight surface.