[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; PyTorch implementation of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Official code for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Reproduction of semantic segmentation using Masked Autoencoder (MAE)
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
[ECCV 2024] PyTorch code for NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
[CVPR'23] Hard Patches Mining for Masked Image Modeling
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Unofficial PyTorch implementation of Masked Autoencoders that Listen
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
PyTorch implementation of LVMAE, from the paper "Extending Video Masked Autoencoders to 128 Frames"
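Most of the repositories above build on the same core idea: randomly mask a large fraction of patch tokens and train an encoder-decoder to reconstruct the missing content. The snippet below is a minimal illustrative sketch of that random-masking step in PyTorch; it is not taken from any of the listed codebases, and the function name and the 75% mask ratio (the value used in the original MAE paper) are assumptions for the example.

```python
# Minimal sketch of MAE-style random patch masking (illustrative only).
import torch

def random_masking(patches: torch.Tensor, mask_ratio: float = 0.75):
    """Keep a random subset of patch tokens, as in MAE-style pretraining.

    patches: (batch, num_patches, dim) tensor of patch embeddings.
    Returns the visible patches, a binary mask (1 = masked), and the
    indices needed to restore the original patch order.
    """
    batch, num_patches, dim = patches.shape
    num_keep = int(num_patches * (1 - mask_ratio))

    # Random scores decide which patches survive; argsort gives a shuffle.
    noise = torch.rand(batch, num_patches, device=patches.device)
    ids_shuffle = torch.argsort(noise, dim=1)
    ids_restore = torch.argsort(ids_shuffle, dim=1)

    # Gather the kept (visible) patches for the encoder.
    ids_keep = ids_shuffle[:, :num_keep]
    visible = torch.gather(patches, 1, ids_keep.unsqueeze(-1).expand(-1, -1, dim))

    # Binary mask in the original patch order: 0 = kept, 1 = masked.
    mask = torch.ones(batch, num_patches, device=patches.device)
    mask[:, :num_keep] = 0
    mask = torch.gather(mask, 1, ids_restore)
    return visible, mask, ids_restore

# Example: 196 patches (14x14) of dimension 768, keep 25% of them.
x = torch.randn(2, 196, 768)
visible, mask, ids_restore = random_masking(x)
print(visible.shape, mask.shape)  # torch.Size([2, 49, 768]) torch.Size([2, 196])
```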