====== Mask RCNN ======

Extends Faster R-CNN
  * Additional branch for predicting segmentation masks on each Region of Interest (RoI)
  * Mask branch is a small fully convolutional network (FCN)

Problem: Faster R-CNN is not designed for pixel-to-pixel alignment between input and output.
  * The RoIPool operation used for attending to instances performs coarse spatial quantization during feature extraction

Solution: Quantization-free layer that preserves exact spatial locations (RoIAlign)

===== Architecture =====

  * Convolutional backbone used for feature extraction
    * Alternative 1: ResNet-50-C4 (features from the final conv layer of the 4th stage (C4))
    * Alternative 2: Feature Pyramid Network (FPN): extracts features at different scales
  * Network head for bounding-box recognition (classification and regression)
    * Extends the Faster R-CNN box heads from the ResNet/FPN papers with a mask prediction branch

data_mining/neural_network/cnn/mask_rcnn.txt · Last modified: 2019/10/26 10:09 by phreazer
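
The contrast between RoIPool's quantization and RoIAlign's quantization-free sampling noted above can be illustrated with a minimal NumPy sketch. It is not the reference implementation: the function names (''bilinear_sample'', ''roi_align'', ''roi_pool''), the single-channel feature map, the ''(y_start, x_start, y_end, x_end)'' RoI layout in feature-map coordinates, and the choice of 2x2 sample points averaged per bin are all assumptions made for illustration.

```python
import numpy as np

def bilinear_sample(feat, y, x):
    """Bilinearly interpolate a 2-D feature map at a fractional (y, x) location."""
    h, w = feat.shape
    # Clamp to the valid range so neighbours always exist (assumed border handling).
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(np.floor(y)), int(np.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = (1 - dx) * feat[y0, x0] + dx * feat[y0, x1]
    bottom = (1 - dx) * feat[y1, x0] + dx * feat[y1, x1]
    return (1 - dy) * top + dy * bottom

def roi_align(feat, roi, out_size=2, samples=2):
    """RoIAlign-style: no rounding; average bilinear samples inside each bin."""
    y_start, x_start, y_end, x_end = roi
    bin_h = (y_end - y_start) / out_size
    bin_w = (x_end - x_start) / out_size
    out = np.zeros((out_size, out_size))
    for i in range(out_size):
        for j in range(out_size):
            vals = []
            for sy in range(samples):
                for sx in range(samples):
                    # Regularly spaced sample points within bin (i, j).
                    y = y_start + (i + (sy + 0.5) / samples) * bin_h
                    x = x_start + (j + (sx + 0.5) / samples) * bin_w
                    vals.append(bilinear_sample(feat, y, x))
            out[i, j] = np.mean(vals)
    return out

def roi_pool(feat, roi, out_size=2):
    """RoIPool-style: round the RoI to integer cells, then max-pool integer bins."""
    y0, x0, y1, x1 = (int(round(v)) for v in roi)  # first quantization step
    y1, x1 = max(y1, y0 + 1), max(x1, x0 + 1)
    ys = np.linspace(y0, y1, out_size + 1)
    xs = np.linspace(x0, x1, out_size + 1)
    out = np.zeros((out_size, out_size))
    for i in range(out_size):
        for j in range(out_size):
            # Second quantization step: bin edges snapped to integers.
            r0, r1 = int(np.floor(ys[i])), int(np.ceil(ys[i + 1]))
            c0, c1 = int(np.floor(xs[j])), int(np.ceil(xs[j + 1]))
            r1, c1 = max(r1, r0 + 1), max(c1, c0 + 1)
            out[i, j] = feat[r0:r1, c0:c1].max()
    return out

# Toy feature map whose value encodes its position: feat[y, x] = 10*y + x.
feat = 10.0 * np.arange(8)[:, None] + np.arange(8)[None, :]
roi_a = (1.2, 1.2, 5.2, 5.2)
roi_b = (1.4, 1.4, 5.4, 5.4)  # same box shifted by 0.2 cells

print(roi_pool(feat, roi_a), roi_pool(feat, roi_b))    # both RoIs round to the same cells
print(roi_align(feat, roi_a), roi_align(feat, roi_b))  # outputs follow the sub-cell shift
```

In this toy setup the two RoIs differ only by a sub-cell shift: the RoIPool-style function rounds both to the same integer box and so cannot distinguish them, while the RoIAlign-style function tracks the shift. That loss of sub-cell position is the misalignment that matters for pixel-accurate masks.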