PolarMask: Single Shot Instance Segmentation with Polar Representation
动机
- instance segmentation
 - anchor-free
 - single-shot
 - modified on FCOS
 
论点
- two-stage methods 
- FCIS, Mask R-CNN
 - bounding box detection then semantic segmentation within each box
 
 - single-shot method
- formulate the task as instance center classification and dense distance regression in a polar coordinate
 - FCOS can be regarded as a special case that the contours has only 4 directions
 
 this paper
- two parallel task:
- instance center classification
 - dense distance regression
 
 - Polar IoU Loss can largely ease the optimization and considerably improve the accuary
 - Polar Centerness improves the original idea of “Centreness” in FCOS, leading to further performance boost
 

- two parallel task:
 
- two-stage methods 
 方法
- architecture 
- back & fpn are the same as FCOS
 - model the instance mask as one center and n rays
- conclude that mass-center is more advantageous than box center
 - the angle interval is pre-fixed, thus only the length of the rays is to be regressed
 - positive samples:falls into 1.5xstrides of the area around the gt mass-center,that is 9-16 pixels around gt grid
 - distance regression
- 如果一条射线上存在多个交点,取最长的
 - 如果一条射线上没有交点,取最小值$\epsilon=10^{-6}$
 
 
 
 - potential issuse of the mask regression branch
- dense regression task with such as 36 rays, may cause imbalance between regression loss and classification loss
 - n rays are relevant and should be trained as a whole rather than a set of independent values—->iou loss
 
 - inference
- multiply center-ness with classification to obtain final confidence scores, conf thresh=0.05
 - take top-1k predictions per fpn level
 - use the smallest bounding boxes to run NMS, nms thresh=0.5
 
 - polar centerness
- to suppress low quality detected centers
 - $polar\ centerness=\sqrt{\frac{min(\{d_1,d_2, …, d_n\})}{max(\{d_1,d_2, …, d_n\})}}$
 - $d_{min}$和$d_{max}$越接近,说明中心点质量越好
 - Experiments show that Polar Centerness improves accuracy especially under stricter localization metrics, such as $AP_{75}$
 
 - polar IoU loss
- polar IoU:$IoU=lim_{N\to\inf}\frac{\sum_{i=1}^N\frac{1}{2} d_{min}^2 \Delta \theta}{\sum_{i=1}^N\frac{1}{2} d_{max}^2 \Delta \theta}$
 - empirically observe that 去掉平方项效果更好:$polar\ IoU=\frac{\sum_{i=1}^n d_{min}}{\sum_{i=1}^n d_{max}}$
 - polar iou loss:bce of polar IoU,$-log(\frac{\sum_{i=1}^n d_{min}}{\sum_{i=1}^n d_{max}})$
 - advantage
- differentiable, enable bp
 - regards the regression targets as a whole
 - keep balance with classification loss
 
 
 
- architecture