Ordinal Multi-task Part Segmentation with Recurrent Prior Generation
Yifan Zhao, Jia Li*, Yu Zhang,
State Key Laboratory of Virtual Reality Technology and Systems, Beihang University
Yafei Song, Yonghong Tian
School of Electronics Engineering and Computer Science, Peking University
Current Status: Early Acess
Semantic object part segmentation is a fundamental task in object understanding and geometric analysis. The clear understanding of part relationships can be of great use to the segmentation process. In this work, we propose a novel Ordinal Multitask Part Segmentation (OMPS) approach which explicitly models the part ordinal relationship to guide the segmentation process in a recurrent manner. Quantitative and qualitative experiments are conducted first to explore the mutual impacts among object parts and then an ordinal part inference algorithm is formulated via experimental observations. Specifically, our framework is mainly composed of two modules, the forward module to segment multiple parts as individual subtasks with prior knowledge, and the recurrent module to generate appropriate part priors with the ordinal inference algorithm. These two modules work iteratively to optimize the segmentation performance and the network parameters. Experimental results show that our approach outperforms the state-of-the-art models on human and vehicle part parsing benchmarks. Comprehensive evaluations are conducted to demonstrate the effectiveness of our approach in object part segmentation.
Approach
Pipeline of proposed OMPS framework. In the forward module, we adopt a weight-sharing network PartNet to extract image-level features and a task-relevant shallow network PriorNet to encode part-level mask priors. After combining image-level features and prior knowledge, a multitask decoder is constructed to parse each part individually. The recurrent module generates and updates appropriate part priors with an ordinal subtask inference algorithm. These two modules are conducted iteratively to get the optimal results. The arrows between different subtasks indicate the ordinal relationships among subtasks (i.e. the results of the former subtasks serve as prior to the latter).
Benchmark
Segmentation Results on PASCAL-Car and PASCAL-Aeroplane dataset. Our model shows superior performances, especially in parsing parts with small size and conflicts between classes. DeepLabv3 is the most efficient model with finer results while FCN and DeepLab-LFOV miss many details in this part parsing task
Citation
-
Yifan Zhao, Jia Li*, Yu Zhang, Yafei Song, and Yonghong Tian. Ordinal Multi-task Part Segmentation with Recurrent Prior Generation.