papers
[maskformer 2021] Per-Pixel Classification is Not All You Need for Semantic Segmentation:Facebook,
[mask2former 2022] Masked-attention Mask Transformer for Universal Image Segmentation:
[mask2former+1] Mask2Former for Video Instance Segmentation:追加了一个在video上面的report