* *Pretraining Large Vision and Multimodal Models
* Benefits of Synthetically Pre-trained Depth-Prediction Networks for Indoor/Outdoor Image Classification
* Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds
* RarePlanes Soar Higher: Self-Supervised Pretraining for Resource Constrained and Synthetic Datasets
* Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data
Index for "p"