LOGen: Toward Lidar Object Generation by Point Diffusion

A step toward augmenting lidar point clouds with object diversity

 

Real (white) and generated (green) objects by LOGen for all ten instance classes of nuScenes: barrier, bike, bus, car, construction vehicle, motorcycle, pedestrian, traffic cone, trailer, truck.

Abstract

A common strategy to improve lidar segmentation results on rare semantic classes consists of pasting objects from one lidar scene into another. While this augments the quantity of instances seen at training time and varies their context, the instances fundamentally remain the same. In this work, we explore how to enhance instance diversity using a lidar object generator. We introduce a novel diffusion-based method to produce lidar point clouds of dataset objects, including reflectance, and with extensive control of the generation via conditioning information. Our experiments on nuScenes show the quality of our object generations, measured with new 3D metrics developed to suit lidar objects.
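To make the generation pipeline concrete, below is a minimal sketch of how a conditional point diffusion model of this kind can be sampled. It follows a standard DDPM reverse process in PyTorch; the denoiser interface eps_model, the linear noise schedule and the 4-channel output (coordinates + intensity) are illustrative assumptions, not LOGen's exact implementation.

import torch

@torch.no_grad()
def sample_object(eps_model, cond, n_points=512, n_channels=4,
                  n_steps=1000, device="cpu"):
    """Minimal DDPM-style reverse process for a conditional point diffusion model.

    eps_model : callable (x_t, t, cond) -> predicted noise, same shape as x_t.
    cond      : (B, C) conditioning tensor (e.g. box size and viewing angle).
    Returns a (B, n_points, n_channels) point set: xyz + intensity.
    """
    B = cond.shape[0]
    # Linear beta schedule (an assumption; any standard schedule works).
    betas = torch.linspace(1e-4, 2e-2, n_steps, device=device)
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)

    x = torch.randn(B, n_points, n_channels, device=device)  # start from pure noise
    for t in reversed(range(n_steps)):
        t_batch = torch.full((B,), t, device=device, dtype=torch.long)
        eps = eps_model(x, t_batch, cond)                     # predict the added noise
        coef = betas[t] / torch.sqrt(1.0 - alpha_bars[t])
        mean = (x - coef * eps) / torch.sqrt(alphas[t])
        noise = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        x = mean + torch.sqrt(betas[t]) * noise               # one reverse diffusion step
    return x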

The objects generated by LOGen are indistinguishable from real objects in certain classes


SPVCNN is trained on the nuScenes train set with real objects, using either 3 channels (coordinates only, left) or 4 channels (coordinates + intensity, right). It is tested on the nuScenes val set where each object is replaced with a generated object of the same class, box size and sensor viewing angle, and with the same number of points.
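A hedged sketch of this object-replacement protocol is given below: every annotated instance in a validation scan is swapped for a generated object that shares its class, box size, viewing angle and point count, before the segmentation network is evaluated. The helpers generate_object and points_in_box are hypothetical placeholders standing in for the generator and the box membership test, not released code.

import numpy as np

def replace_objects(scan, boxes, generate_object, points_in_box):
    """Swap each annotated instance in a lidar scan for a generated one.

    scan            : (N, 4) array of x, y, z, intensity in the sensor frame.
    boxes           : list of dicts with 'class', 'center', 'size', 'heading'.
    generate_object : hypothetical callable (cls, size, view_angle, n_points) -> (n, 4)
                      points in the box frame.
    points_in_box   : hypothetical callable (scan, box) -> boolean mask over scan rows.
    """
    keep = np.ones(len(scan), dtype=bool)
    new_points = []
    for box in boxes:
        mask = points_in_box(scan, box)
        n = int(mask.sum())
        if n == 0:
            continue
        keep &= ~mask  # drop the real instance points
        cx, cy, _ = box["center"]
        # Viewing angle: heading relative to the ray from the box center to the sensor.
        view_angle = box["heading"] - np.arctan2(-cy, -cx)
        obj = generate_object(box["class"], box["size"], view_angle, n).copy()
        # Place the generated object back at the original box pose.
        c, s = np.cos(box["heading"]), np.sin(box["heading"])
        rot = np.array([[c, -s], [s, c]])
        obj[:, :2] = obj[:, :2] @ rot.T
        obj[:, :3] += np.asarray(box["center"])
        new_points.append(obj)
    background = scan[keep]
    return np.concatenate([background] + new_points, axis=0) if new_points else background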

LOGen can generate intensities for lidar point cloud objects

LOGen is the first diffusion method that leverages a transformer architecture for lidar point cloud generation


LOGen conditions the generation on the following box information: the box center (x, y, z); the box length, width and height (l, w, h); and the angle ϕ between the object heading ψ and the ray from the bounding box center to the sensor.
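A minimal sketch of how such a conditioning vector could be assembled is shown below; the exact layout, normalization and angle encoding used by LOGen are not specified here, so the (cos ϕ, sin ϕ) parametrization is an assumption.

import numpy as np

def build_condition(center, size, heading):
    """Assemble the box conditioning described above.

    center  : (x, y, z) box center in the sensor frame (sensor at the origin).
    size    : (l, w, h) box dimensions.
    heading : psi, the object heading (yaw) in the sensor frame.
    Returns a 1D vector [x, y, z, l, w, h, cos(phi), sin(phi)], where phi is the angle
    between the heading and the ray from the box center to the sensor.
    """
    x, y, z = center
    # The ray from the box center toward the sensor (at the origin) has azimuth atan2(-y, -x);
    # phi is the heading measured relative to that ray.
    phi = heading - np.arctan2(-y, -x)
    phi = (phi + np.pi) % (2 * np.pi) - np.pi  # wrap to [-pi, pi]
    # Encode phi as (cos, sin) to avoid the wrap-around discontinuity (an assumption).
    return np.array([x, y, z, *size, np.cos(phi), np.sin(phi)], dtype=np.float32)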

LOGen can generate new objects from novel views by interpolating the viewing angle of the condition

Columns, left to right: reference ground-truth; recreation (rotation 0); novel views at rotations 1/5, 2/5, 3/5 and 4/5.

Novel objects produced by LOGen. Recreations are generated using the conditioning information of a real object; the remaining objects are created from novel views by interpolating the viewing angle ϕ of that condition.
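As a sketch of this novel-view protocol: the reference condition is kept fixed except for the viewing angle ϕ, which is rotated in fifths of a full turn. The code below assumes the hypothetical build_condition helper sketched above; since the center-to-sensor ray is fixed, rotating ϕ is equivalent to rotating the heading ψ.

import numpy as np

def novel_view_conditions(center, size, heading, n_views=5):
    """Yield conditioning vectors whose viewing angle phi is rotated by k/n_views turns.

    Relies on the hypothetical build_condition helper sketched above.
    """
    for k in range(n_views):
        rotated_heading = heading + 2 * np.pi * k / n_views
        yield k, build_condition(center, size, rotated_heading)

# Usage sketch (a trained model and the sampler sketched after the abstract assumed):
# for k, cond in novel_view_conditions((10.0, 2.0, -1.5), (4.5, 1.9, 1.6), heading=0.3):
#     points = sample_object(eps_model, batched(cond))  # k = 0 is the recreation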

BibTeX

@inproceedings{logen,
  author    = {Ellington Kirby and Mickael Chen and Renaud Marlet and Nermin Samet},
  title     = {LOGen: Toward Lidar Object Generation by Point Diffusion},
  booktitle = {arXiv},
  year      = {2024},
}