Abstract
We present a novel fusion scheme between multiple intermediate convolutional features within convolutional neurual network (CNN) for dense correspondence estimation. In contrast to existing CNN-based descriptors that utilize a single convolutional activation, our approach jointly uses multiple intermediate features of CNN through the attention weight that balances the contribution of each features. We formulate the overall network as two sub-networks, correspondence network and attention network. The correspondence network is designed to provide multiple intermediate matching costs while the attention network is to learn the optimal weight between them. These two networks are learned in a joint manner to boost the correspondence estimation performance. Experiments demonstrate that our proposed method outperforms the state-of-the-art methods on various correspondence estimation tasks including depth estimation, optical flow, and semantic correspondence.
Original language | English |
---|---|
Title of host publication | 2017 IEEE International Conference on Image Processing, ICIP 2017 - Proceedings |
Publisher | IEEE Computer Society |
Pages | 1007-1011 |
Number of pages | 5 |
ISBN (Electronic) | 9781509021758 |
DOIs | |
Publication status | Published - 2018 Feb 20 |
Event | 24th IEEE International Conference on Image Processing, ICIP 2017 - Beijing, China Duration: 2017 Sept 17 → 2017 Sept 20 |
Publication series
Name | Proceedings - International Conference on Image Processing, ICIP |
---|---|
Volume | 2017-September |
ISSN (Print) | 1522-4880 |
Other
Other | 24th IEEE International Conference on Image Processing, ICIP 2017 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 17/9/17 → 17/9/20 |
Bibliographical note
Funding Information:This work was supported by Institute for Information and communications Technology Promotion(IITP) grant funded by the Korea government(MSIP)(No.2016-0-00197)
Publisher Copyright:
© 2017 IEEE.
All Science Journal Classification (ASJC) codes
- Software
- Computer Vision and Pattern Recognition
- Signal Processing