Paper | Code | mIoU (%) | Model Name | Release Date |
---|---|---|---|---|
FoodSAM: Any Food Segmentation | ✓ | 46.4 | FoodSAM | 2023-08-11 |
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers | ✓ | 45.1 | SeTR-MLA (ViT-16/B) | 2020-12-31 |
A Large-Scale Benchmark for Food Image Segmentation | ✓ | 43.9 | SeTR-Naive (ReLeM-ViT-16/B) | 2021-05-12 |
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | ✓ | 41.6 | Swin-Transformer (Swin-Small) | 2021-03-25 |
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers | ✓ | 41.3 | SeTR-Naive (ViT-16/B) | 2020-12-31 |
A Large-Scale Benchmark for Food Image Segmentation | ✓ | 36.8 | CCNet (ReLeM-ResNet-50) | 2021-05-12 |
CCNet: Criss-Cross Attention for Semantic Segmentation | ✓ | 35.5 | CCNet (ResNet-50) | 2018-11-28 |