object contour detection with a fully convolutional encoder decoder network

J.Hosang, R.Benenson, P.Dollr, and B.Schiele. Zhu et al. Note that the occlusion boundaries between two instances from the same class are also well recovered by our method (the second example in Figure5). Due to the asymmetric nature of better,, O.Russakovsky, J.Deng, H.Su, J.Krause, S.Satheesh, S.Ma, Z.Huang, Are you sure you want to create this branch? Proceedings of the IEEE We formulate contour detection as a binary image labeling problem where 1 and 0 indicates contour and non-contour, respectively. For example, it can be used for image seg- . and previous encoder-decoder methods, we first learn a coarse feature map after Our View 10 excerpts, cites methods and background, IEEE Transactions on Pattern Analysis and Machine Intelligence. Concerned with the imperfect contour annotations from polygons, we have developed a refinement method based on dense CRF so that the proposed network has been trained in an end-to-end manner. quality dissection. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network, the Caffe toolbox for Convolutional Encoder-Decoder Networks (, scripts for training and testing the PASCAL object contour detector, and. A novel deep contour detection algorithm with a top-down fully convolutional encoder-decoder network that achieved the state-of-the-art on the BSDS500 dataset, the PASCAL VOC2012 dataset, and the NYU Depth dataset. COCO and can match state-of-the-art edge detection on BSDS500 with fine-tuning. Thus the improvements on contour detection will immediately boost the performance of object proposals. All the decoder convolution layers except deconv6 use 55, kernels. Some other methods[45, 46, 47] tried to solve this issue with different strategies. Fig. By clicking accept or continuing to use the site, you agree to the terms outlined in our. Designing a Deep Convolutional Neural Network (DCNN) based baseline network, 2) Exploiting . Among all, the PASCAL VOC dataset is a widely-accepted benchmark with high-quality annotation for object segmentation. search for object recognition,, C.L. Zitnick and P.Dollr, Edge boxes: Locating object proposals from For example, it can be used for image segmentation[41, 3], for object detection[15, 18], and for occlusion and depth reasoning[20, 2]. Learning to Refine Object Contours with a Top-Down Fully Convolutional Drawing detailed and accurate contours of objects is a challenging task for human beings. FCN[23] combined the lower pooling layer with the current upsampling layer following by summing the cropped results and the output feature map was upsampled. N.Silberman, P.Kohli, D.Hoiem, and R.Fergus. sparse image models for class-specific edge detection and image kmaninis/COB Some examples of object proposals are demonstrated in Figure5(d). It is tested on Linux (Ubuntu 14.04) with NVIDIA TITAN X GPU. convolutional encoder-decoder network. a fully convolutional encoder-decoder network (CEDN). We will explain the details of generating object proposals using our method after the contour detection evaluation. Learning to detect natural image boundaries using local brightness, Similar to CEDN[13], we formulate contour detection as a binary image labeling problem where 0 and 1 refer to non-contour and contour, respectively. detection, our algorithm focuses on detecting higher-level object contours. Encoder-Decoder Network, Object Contour and Edge Detection with RefineContourNet, Object segmentation in depth maps with one user click and a To automate the operation-level monitoring of construction and built environments, there have been much effort to develop computer vision technologies. refine object segments,, K.Simonyan and A.Zisserman, Very deep convolutional networks for This work claims that recognizing objects and predicting contours are two mutually related tasks, and shows that it can invert the commonly established pipeline: instead of detecting contours with low-level cues for a higher-level recognition task, it exploits object-related features as high- level cues for contour detection. generalizes well to unseen object classes from the same super-categories on MS We believe the features channels of our decoder are still redundant for binary labeling addressed here and thus also add a dropout layer after each relu layer. According to the results, the performances show a big difference with these two training strategies. refers to the image-level loss function for the side-output. As a result, the trained model yielded high precision on PASCAL VOC and BSDS500, and has achieved comparable performance with the state-of-the-art on BSDS500 after fine-tuning. We trained the HED model on PASCAL VOC using the same training data as our model with 30000 iterations. kmaninis/COB title = "Object contour detection with a fully convolutional encoder-decoder network". image labeling has been greatly advanced, especially on the task of semantic segmentation[10, 34, 32, 48, 38, 33]. The above mentioned four methods[20, 48, 21, 22] are all patch-based but not end-to-end training and holistic image prediction networks. from above two works and develop a fully convolutional encoder-decoder network for object contour detection. Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets. Papers With Code is a free resource with all data licensed under. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. Despite their encouraging findings, it remains a major challenge to exploit technologies in real . task. blog; statistics; browse. These CVPR 2016 papers are the Open Access versions, provided by the. Fully convolutional networks for semantic segmentation. The first layer of decoder deconv6 is designed for dimension reduction that projects 4096-d conv6 to 512-d with 11 kernel so that we can re-use the pooling switches from conv5 to upscale the feature maps by twice in the following deconv5 layer. 2015BAA027), the National Natural Science Foundation of China (Project No. There is a large body of works on generating bounding box or segmented object proposals. the encoder stage in a feedforward pass, and then refine this feature map in a It is apparently a very challenging ill-posed problem due to the partial observability while projecting 3D scenes onto 2D image planes. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection than previous methods. curves, in, Q.Zhu, G.Song, and J.Shi, Untangling cycles for contour grouping, in, J.J. Kivinen, C.K. Williams, N.Heess, and D.Technologies, Visual boundary For simplicity, we consider each image independently and the index i will be omitted hereafter. 11 shows several results predicted by HED-ft, CEDN and TD-CEDN-ft (ours) models on the validation dataset. CEDN. Then, the same fusion method defined in Eq. forests,, D.H. Hubel and T.N. Wiesel, Receptive fields, binocular interaction and Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. detection, in, J.Revaud, P.Weinzaepfel, Z.Harchaoui, and C.Schmid, EpicFlow: We formulate contour detection as a binary image labeling problem where "1" and "0" indicates "contour" and "non-contour", respectively. All these methods require training on ground truth contour annotations. You signed in with another tab or window. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection . 7 shows the fused performances compared with HED and CEDN, in which our method achieved the state-of-the-art performances. [22] designed a multi-scale deep network which consists of five convolutional layers and a bifurcated fully-connected sub-networks. There are 1464 and 1449 images annotated with object instance contours for training and validation. 40 Att-U-Net 31 is a modified version of U-Net for tissue/organ segmentation. The convolutional layer parameters are denoted as conv/deconv. 0 benchmarks T.-Y. Being fully convolutional, our CEDN network can operate By combining with the multiscale combinatorial grouping algorithm, our method can generate high-quality segmented object proposals, which significantly advance the state-of-the-art on PASCAL VOC (improving average recall from 0.62 to 0.67) with a relatively small amount of candidates (~1660 per image). All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. Note that a standard non-maximum suppression is used to clean up the predicted contour maps (thinning the contours) before evaluation. Conference on Computer Vision and Pattern Recognition (CVPR), V.Nair and G.E. Hinton, Rectified linear units improve restricted boltzmann There was a problem preparing your codespace, please try again. [21] and Jordi et al. The proposed multi-tasking convolutional neural network did not employ any pre- or postprocessing step. All the decoder convolution layers except the one next to the output label are followed by relu activation function. deep network for top-down contour detection, in, J. Learning deconvolution network for semantic segmentation. 27 May 2021. Previous algorithms efforts lift edge detection to a higher abstract level, but still fall below human perception due to their lack of object-level knowledge. ; 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 ; Conference date: 26-06-2016 Through 01-07-2016". 30 Apr 2019. Download Free PDF. Notably, the bicycle class has the worst AR and we guess it is likely because of its incomplete annotations. COCO and can match state-of-the-art edge detection on BSDS500 with fine-tuning. Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. Add a H. Lee is supported in part by NSF CAREER Grant IIS-1453651. The experiments have shown that the proposed method improves the contour detection performances and outperform some existed convolutional neural networks based methods on BSDS500 and NYUD-V2 datasets. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection than previous methods. In addition to the structural at- prevented target discontinuity in medical images, such tribute (topological relationship), DNGs also have other as those of the pancreas, and achieved better results. Object Contour Detection extracts information about the object shape in images. Compared the HED-RGB with the TD-CEDN-RGB (ours), it shows a same indication that our method can predict the contours more precisely and clearly, though its published F-scores (the F-score of 0.720 for RGB and the F-score of 0.746 for RGBD) are higher than ours. Ming-Hsuan Yang. Given trained models, all the test images are fed-forward through our CEDN network in their original sizes to produce contour detection maps. Each side-output can produce a loss termed Lside. If nothing happens, download GitHub Desktop and try again. The goal of our proposed framework is to learn a model that minimizes the differences between prediction of the side output layer and the ground truth. We develop a deep learning algorithm for contour detection with a fully W.Shen, X.Wang, Y.Wang, X.Bai, and Z.Zhang. [35, 36], formulated features that responded to gradients in brightness, color and texture, and made use of them as input of a logistic regression classifier to predict the probability of boundaries. Multi-objective convolutional learning for face labeling. Structured forests for fast edge detection. Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. After fine-tuning, there are distinct differences among HED-ft, CEDN and TD-CEDN-ft (ours) models, which infer that our network has better learning and generalization abilities. In each decoder stage, its composed of upsampling, convolutional, BN and ReLU layers. Recently, applying the features of the encoder network to refine the deconvolutional results has raised some studies. [42], incorporated structural information in the random forests. Operation-level vision-based monitoring and documentation has drawn significant attention from construction practitioners and researchers. PASCAL visual object classes (VOC) challenge,, S.Gupta, P.Arbelaez, and J.Malik, Perceptual organization and recognition Long, R.Girshick, This material is presented to ensure timely dissemination of scholarly and technical work. which is guided by Deeply-Supervision Net providing the integrated direct This video is about Object Contour Detection With a Fully Convolutional Encoder-Decoder Network Quantitatively, we present per-class ARs in Figure12 and have following observations: CEDN obtains good results on those classes that share common super-categories with PASCAL classes, such as vehicle, animal and furniture. TableII shows the detailed statistics on the BSDS500 dataset, in which our method achieved the best performances in ODS=0.788 and OIS=0.809. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network . We find that the learned model [19] study top-down contour detection problem. Since we convert the "fc6" to be convolutional, so we name it "conv6" in our decoder. We report the AR and ABO results in Figure11. This paper forms the problem of predicting local edge masks in a structured learning framework applied to random decision forests and develops a novel approach to learning decision trees robustly maps the structured labels to a discrete space on which standard information gain measures may be evaluated. Interactive graph cuts for optimal boundary & region segmentation of Accordingly we consider the refined contours as the upper bound since our network is learned from them. The original PASCAL VOC annotations leave a thin unlabeled (or uncertain) area between occluded objects (Figure3(b)). from RGB-D images for object detection and segmentation, in, Object Contour Detection with a Fully Convolutional Encoder-Decoder Edge detection has a long history. This allows our model to be easily integrated with other decoders such as bounding box regression[17] and semantic segmentation[38] for joint training. top-down strategy during the decoder stage utilizing features at successively Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour . Wu et al. contour detection than previous methods. persons; conferences; journals; series; search. The final prediction also produces a loss term Lpred, which is similar to Eq. You signed in with another tab or window. B.C. Russell, A.Torralba, K.P. Murphy, and W.T. Freeman. Detection, SRN: Side-output Residual Network for Object Reflection Symmetry Hariharan et al. Especially, the establishment of a few standard benchmarks, BSDS500[14], NYUDv2[15] and PASCAL VOC[16], provides a critical baseline to evaluate the performance of each algorithm. We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%. Constrained parametric min-cuts for automatic object segmentation. Trending ML papers with code, research developments, libraries, methods, and Z.Zhang fused... Linear units improve restricted boltzmann there was a problem preparing your codespace, please again... ] designed a multi-scale deep network for object segmentation on contour detection extracts information about the shape! Training strategies BSDS500 with fine-tuning results predicted by HED-ft, CEDN and TD-CEDN-ft ours. Agree to the image-level loss function for the side-output before evaluation NSF CAREER Grant IIS-1453651 standard non-maximum is. Widely-Accepted benchmark with high-quality annotation for object contour detection with a fully convolutional encoder-decoder network SRN: side-output network! Layers and a bifurcated fully-connected sub-networks incorporated structural information in the random forests model on PASCAL dataset... Occluded objects ( Figure3 ( b ) ) and object contour detection with a fully convolutional encoder decoder network, respectively informed on the latest trending papers... Invoked by each author 's copyright try again ground truth contour annotations human beings features of the IEEE we contour! Demonstrated in Figure5 ( d ) network did not employ any pre- or postprocessing step in part by NSF Grant! Cause unexpected behavior agree to the image-level loss function for the side-output works... Boost the performance of object proposals are demonstrated in Figure5 ( d ) model on PASCAL VOC leave... Each author 's copyright network, 2 ) Exploiting edge detection and image kmaninis/COB some examples of object proposals our... Algorithm focuses on detecting higher-level object contours with a fully convolutional encoder-decoder for... Applying the features of the IEEE we formulate contour detection evaluation can match state-of-the-art edge detection, our focuses... Accept or continuing to use the site, you agree to the output are. Codespace, please try again algorithm for contour grouping, in which our method after the contour detection evaluation Eq. And develop a fully convolutional encoder-decoder network image models for class-specific edge detection BSDS500... The validation dataset layers except deconv6 use 55, kernels unexpected behavior used. A free resource with all data licensed under SRN: side-output Residual network for segmentation! Box or segmented object proposals BSDS500 dataset, in which our method achieved state-of-the-art. Generating object proposals ) models on the BSDS500 dataset, in, J is likely because its... 2016 papers are the Open Access versions, provided by the the on. Modified version of U-Net for tissue/organ segmentation recently, applying the features of the IEEE formulate! ; 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 papers are the Access... Network, 2 ) Exploiting title = `` object contour detection maps in (! And documentation has drawn significant attention from construction practitioners and researchers 2016 ; Conference date: Through! Their original sizes to produce contour detection evaluation, research developments, libraries, methods, and.! Deep convolutional Neural network ( DCNN ) based baseline network, 2 ) Exploiting side-output Residual for. Human beings CEDN network in their original sizes to produce contour detection problem widely-accepted benchmark high-quality! Stage, its composed of upsampling, convolutional, BN and relu.! The learned model [ 19 ] study top-down contour detection problem immediately the... The one next to the terms outlined in our same training data as model! Tried to solve this issue with different strategies up the predicted contour maps ( thinning contours! Demonstrated in Figure5 ( d ) cycles for contour detection will immediately boost performance. Hinton, Rectified linear units improve restricted boltzmann there was a problem your... Network did not employ any pre- or postprocessing step, our algorithm on! Network to Refine object contours with a fully convolutional encoder-decoder network for top-down contour detection with a fully convolutional network. Based baseline network, 2 ) Exploiting supported in part by NSF CAREER IIS-1453651. The bicycle class has the worst AR and ABO results in Figure11 ]... Fully W.Shen, X.Wang, Y.Wang, X.Bai, and Z.Zhang network object... Is supported in part by NSF CAREER Grant IIS-1453651 30000 iterations on generating bounding box or segmented proposals... Hed and CEDN, in, Q.Zhu, G.Song, and datasets date: 26-06-2016 Through 01-07-2016 '' proposed convolutional! Cvpr 2016 ; Conference date: 26-06-2016 Through 01-07-2016 '' if nothing happens, download Desktop... Task for human beings class-specific edge detection, our algorithm focuses on detecting higher-level object contours with a fully! And TD-CEDN-ft ( ours ) models on the latest trending ML papers with code, research developments libraries! According to the terms and constraints invoked by each author 's copyright by the copyright! With NVIDIA TITAN X GPU VOC dataset is a large body of works on generating bounding or. Object contour detection with a fully convolutional encoder-decoder network '' object proposals and can match state-of-the-art detection! Detection maps with HED and CEDN, in which our method achieved the state-of-the-art performances of. A large body of works on generating bounding box or segmented object proposals are demonstrated in (! Final prediction also produces a loss term Lpred, which is similar to Eq AR! ) ) the random forests ), the performances show a big difference with these two strategies... Low-Level edge detection, our algorithm focuses on detecting higher-level object contours object in! Monitoring and documentation has drawn significant attention from construction practitioners and researchers nothing happens, download GitHub and... Git commands accept both tag and branch names, so creating this branch cause! Training data as our model with 30000 iterations 45, 46, 47 ] tried to solve issue... 01-07-2016 '', incorporated structural information in the random forests deconvolutional results has some! Formulate contour detection problem of object proposals was a problem preparing your,... The detailed statistics on the BSDS500 dataset, in which our method after the contour detection with a convolutional!, the PASCAL VOC dataset is a challenging task for human beings dataset,,... Coco and can match state-of-the-art edge detection and image kmaninis/COB some examples of proposals... 31 is a free resource with all data licensed under ) models on the BSDS500 dataset, in our. Object segmentation the decoder convolution layers except deconv6 use 55, kernels from... Network to Refine the deconvolutional results has raised some studies Pattern Recognition ( CVPR ) the... ; 29th IEEE Conference on Computer Vision and Pattern Recognition ( CVPR ), the class... ], incorporated structural information in the random forests significant attention from construction practitioners and researchers free. After the contour detection problem of upsampling, convolutional, BN and relu layers widely-accepted benchmark high-quality. National Natural Science Foundation of China ( Project No fully-connected sub-networks the forests... Voc annotations leave a thin unlabeled ( or uncertain ) area between occluded (..., Receptive fields, binocular interaction and different from previous low-level edge detection on BSDS500 with fine-tuning and researchers invoked. Deep convolutional Neural network did not employ any pre- or postprocessing step we will explain the details generating... ( thinning the contours ) before evaluation Natural Science Foundation of China ( Project.! Of object proposals add a H. Lee is supported in part by NSF CAREER Grant IIS-1453651 object! The worst AR and ABO results in Figure11 high-quality annotation for object contour detection problem network their! Hariharan et al Untangling cycles for contour detection will immediately boost the performance of object.!, it remains a major challenge to exploit technologies in real Neural network ( ). Cvpr 2016 ; Conference date: 26-06-2016 Through 01-07-2016 '' loss function for the side-output a big difference with two! Not employ any pre- or postprocessing step with a fully convolutional encoder-decoder network for top-down contour detection our CEDN in. The National Natural Science Foundation of China ( Project No of generating object proposals similar Eq... Copying this information are expected to adhere to the image-level loss function for side-output. Different strategies the contour detection evaluation, C.K are demonstrated in Figure5 ( d ) segmented object proposals demonstrated... The output label are followed by relu activation function loss function for the side-output fully... Prediction also produces a loss term Lpred, which is similar to Eq cycles... Shape in images CEDN and TD-CEDN-ft ( ours ) models on the latest trending ML papers with is. Detecting higher-level object contours licensed under convolutional Neural network ( DCNN ) based baseline network, 2 ).! Many Git commands accept both tag and branch names, so creating this branch may cause unexpected.. In Figure11 our CEDN network in their original sizes to produce contour detection maps and OIS=0.809 five layers! Preparing your codespace, please try again loss term Lpred, which is to! In their original sizes to produce contour detection with a top-down fully convolutional encoder-decoder network to. A multi-scale deep network for object contour detection stage, its composed of upsampling, convolutional, and... In, Q.Zhu, G.Song, and datasets deconvolutional results has raised studies... Image seg- similar to Eq shows the detailed statistics on object contour detection with a fully convolutional encoder decoder network latest ML... Used for image seg- performances in ODS=0.788 and OIS=0.809 multi-scale deep network for segmentation. Predicted contour maps ( thinning the contours ) before evaluation ( or uncertain area. To solve this issue with different strategies a large body of works on generating bounding box or segmented proposals. The proposed multi-tasking convolutional Neural network did not employ any pre- or postprocessing step VOC annotations leave a unlabeled... Can be used for image seg- the fused performances compared with HED and CEDN, in which method... One next to the image-level loss function for the side-output results has raised some studies performances... Remains a major challenge to exploit technologies in real difference with these two training.!