Self-supervised adversarial masking for 3D point cloud representation learning

Michał Szachniewicz, Wojciech Kozłowski, Michał Stypułkowski, Maciej Zięba

PointCAM architecture

PointCAM (Point Clouds with Adversarial Masking) is an adversarial method for learning a masking function for point clouds. The model utilizes a self-distillation framework with an online tokenizer for 3D point clouds (similar to iBOT). Compared to previous techniques that optimize patch-level and object-level objectives, we postulate applying an auxiliary network that learns how to select masks instead of choosing them randomly.

Installation

pip install -e . && pip install -e .[3d]

If you have trouble installing pytorch3d, see INSTALL.md.

Usage

Download pretrained model

git lfs pull --include="checkpoints/pointcam.pt" --exclude=""

Extracting features

import torch

from pointcam.models import PointCAM
from pointcam.utils.crop import PatchFinder

model = PointCAM.from_exported_checkpoint('path/to/checkpoint.pt').eval().cuda()
patch_finder = PatchFinder(64, 32)

xyz = torch.rand(1, 1024, 3)
xyz = patch_finder(xyz).to("cuda")

with torch.no_grad():
    features = model(xyz) # [1, 768]

Exploring generated masks

To explore generated masks on ModelNet40 dataset, run the notebook notebooks/visualize_masks.py. You should see a GUI that allows to iterate over the dataset:

Rendering masks in Mitsuba

In order to use the RENDER button in the GUI, you need to have Mitsuba service running in the background. Clicking RENDER will send a request to the service and save rendered image in the images directory.

Data

Set the DATA_PATH environment variable to the directory where you want to store the data. Inside this directory you should have the following structure:

$DATA_PATH/
├── ShapeNetFPS
├── ScanObjectNN
├── ModelNet40FPS
└── ModelNet40Fewshot

ShapeNet

Download ShapeNetCore v2 from the official website and preprocess it with:

python scripts/prepare_points_fps.py --dir ShapeNet --out-dir ShapeNetFPS --ext obj --n-points 8192 --n-process 16

ScanObjectNN

Download ScanObjectNN from the official website, extract to ScanObjectNN.

ModelNet40

Download ModelNet40 from the official website and preprocess it with:

python scripts/prepare_points_fps.py --dir ModelNet40 --out-dir ModelNet40FPS --ext off --n-points 1024 --n-process 16

ModelNet40 Few-shot

Use splits defined in PointBERT, extract to ModelNet40Fewshot.

Experiments

Experiments configs are located in pointcam/configs directory.

Results

Experiment	Dataset	Accuracy [%]
Linear SVM	ModelNet40	91.52 ± 0.26
Finetuning	ModelNet40	92.6 ± 0.2
Finetuning	ScanObjNN.OBJ_BG	90.36 ± 0.25
Finetuning	ScanObjNN.OBJ_ONLY	88.35 ± 0.53
Finetuning	ScanObjNN.PB_T50_RS	84.49 ± 0.38

Pretraining

To pretrain the model on ShapeNet, run:

Adversarial masking

python scripts/train.py -m pointcam

Random masking

python scripts/train.py -m pcbot

Downstream tasks

Linear evaluation with SVM on ModelNet40

python scripts/eval_svm.py path/to/checkpoint.pt

Finetuning on ModelNet40 and ScanObjectNN

python scripts/eval_mlp.py path/to/checkpoint.pt -c {mn40, scanobjbg, scanobjonly, scanobjhardest}

Few-shot classification on ModelNet40

python scripts/eval_fewshot.py path/to/checkpoint.pt --way {5, 10} --shot {10, 20}

Part segmentation on ShapeNetPart

Based mostly on PointBERT implementation. Needs to be integrated with the rest of the codebase.

Citing PointCAM

@misc{szachniewicz2023selfsupervised,
      title={Self-supervised adversarial masking for 3D point cloud representation learning}, 
      author={Michał Szachniewicz and Wojciech Kozłowski and Michał Stypułkowski and Maciej Zięba},
      year={2023},
      eprint={2307.05325},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
checkpoints		checkpoints
images		images
notebooks		notebooks
pointcam		pointcam
scripts		scripts
.gitattributes		.gitattributes
.gitignore		.gitignore
.lfsconfig		.lfsconfig
README.md		README.md
pyproject.toml		pyproject.toml
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-supervised adversarial masking for 3D point cloud representation learning

PointCAM architecture

Installation

Usage

Download pretrained model

Extracting features

Exploring generated masks

Rendering masks in Mitsuba

Data

ShapeNet

ScanObjectNN

ModelNet40

ModelNet40 Few-shot

Experiments

Results

Pretraining

Downstream tasks

Citing PointCAM

About

Releases

Packages

Languages

szacho/pointcam

Folders and files

Latest commit

History

Repository files navigation

Self-supervised adversarial masking for 3D point cloud representation learning

PointCAM architecture

Installation

Usage

Download pretrained model

Extracting features

Exploring generated masks

Rendering masks in Mitsuba

Data

ShapeNet

ScanObjectNN

ModelNet40

ModelNet40 Few-shot

Experiments

Results

Pretraining

Downstream tasks

Citing PointCAM

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages