HOW TO DEAL WITH MULTI-SOURCE DATA FOR TREE DETECTION BASED ON DEEP LEARNING

Lionel Pibre (a,e), Marc Chaumont (a,b), Gérard Subsol (a,c), Dino Ienco (d) and Mustapha Derras (e)

(a) LIRMM, Université de Montpellier, (b) Université de Nîmes, (c) CNRS, (d) IRSTEA, (e) Berger-Levrault

2017/11/14

Context

What is our goal?

} Detect and localize trees from aerial images

Why?

} Manage trees in cities

How?

} With Deep Learning
} With Multi-source data

LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. "Deep learning." Nature 521, no. 7553, pp. 436-444, 2015.


Context

What is the difficulty?

} It is complex to merge several information sources
} Trees are often grouped together and occluded

Some solutions exist [1]

} But not with multi-source data

[1] Yang, Lin, Xiaqing Wu, Emil Praun, and Xiaoxu Ma. "Tree detection from aerial imagery." In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 131-137. ACM, 2009.


Method

Figure: AlexNet network.

Two methods are tested:

} The Early Fusion
◦ Each sensor source is treated as a channel
◦ The stacked channels are fed to a classical CNN

} The Late Fusion [2]
◦ A subnet for each sensor source

[2] J. Wagner, V. Fischer, M. Herman and S. Behnke, "Multispectral pedestrian detection using deep fusion convolutional neural networks", in European Symp. on Artificial Neural Networks (ESANN), Bruges, Belgium, 2016.
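The difference between the two fusion strategies can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: images are nested lists indexed as `image[y][x][channel]`, and `channel_means` is a hypothetical toy stand-in for a convolutional subnet, not the network used in this work.

```python
def early_fusion(sources):
    """Stack the channels of co-registered sources pixel by pixel.

    Each source is indexed as image[y][x][channel]; the result is a single
    image whose pixels hold the channels of every source, ready to be fed
    to one CNN.
    """
    height, width = len(sources[0]), len(sources[0][0])
    for source in sources:
        if len(source) != height or any(len(row) != width for row in source):
            raise ValueError("all sources must share the same height and width")
    return [
        [[value for source in sources for value in source[y][x]] for x in range(width)]
        for y in range(height)
    ]


def channel_means(image):
    """Hypothetical stand-in for a subnet: returns one mean value per channel."""
    pixels = [pixel for row in image for pixel in row]
    n_channels = len(pixels[0])
    return [sum(p[c] for p in pixels) / len(pixels) for c in range(n_channels)]


def late_fusion(sources, subnets):
    """Run one subnet per source, then concatenate the resulting features."""
    features = []
    for source, subnet in zip(sources, subnets):
        features.extend(subnet(source))
    return features


# Two tiny 2 x 2 single-channel sources (made-up values).
ndvi = [[[0.5], [0.25]], [[0.75], [0.5]]]
dsm = [[[2.0], [4.0]], [[6.0], [8.0]]]

stacked = early_fusion([ndvi, dsm])          # one image with 2 channels per pixel
fused_features = late_fusion([ndvi, dsm], [channel_means, channel_means])
```

In the early variant the sources are merged before any processing, whereas in the late variant each source is processed separately and only the extracted features are merged.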


Early Fusion

Early Fusion diagram.


Late Fusion

Late Fusion diagram.


Experimental Settings - Database

} Database: Vaihingen
} Type of images: Red, Green and Near-Infrared (RGNIR) and Digital Surface Model (DSM). We also generated Normalized Difference Vegetation Index (NDVI) images (grayscale) from the RGNIR images.

$$\mathrm{NDVI} = \frac{\mathrm{NIR} - R}{\mathrm{NIR} + R} \tag{1}$$

} Training: 6,000 "tree" thumbnails and 40,000 "other" thumbnails. The thumbnail size is 64 × 64 pixels.
} Testing: 20 images of variable size (from 125 × 150 pixels up to 550 × 725 pixels) that contain about a hundred trees.
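Equation (1) can be applied pixel by pixel to the NIR and red bands. The sketch below is a minimal illustration, not the preprocessing code used here; the function names and the choice of returning 0 when both bands are zero are assumptions.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel, in [-1, 1]."""
    denominator = nir + red
    if denominator == 0:
        # Assumption: treat a pixel with no signal in either band as neutral.
        return 0.0
    return (nir - red) / denominator


def ndvi_image(nir_band, red_band):
    """Apply ndvi() to two bands given as lists of rows of equal shape."""
    return [
        [ndvi(n, r) for n, r in zip(nir_row, red_row)]
        for nir_row, red_row in zip(nir_band, red_band)
    ]


# Made-up 1 x 3 example: vegetation reflects strongly in the near-infrared.
nir_band = [[200, 50, 0]]
red_band = [[50, 50, 0]]
ndvi_values = ndvi_image(nir_band, red_band)
```

High values indicate pixels where the near-infrared response dominates the red response, which is typical of vegetation.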

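Experimental Settings - Evaluation

$$\text{label} = \begin{cases} \text{tree} & \text{if } \dfrac{\text{area}(\text{detection} \cap \text{ground truth})}{\text{area}(\text{detection} \cup \text{ground truth})} > 0.5 \\[2ex] \text{not tree} & \text{if } \dfrac{\text{area}(\text{detection} \cap \text{ground truth})}{\text{area}(\text{detection} \cup \text{ground truth})} \leq 0.5 \end{cases} \tag{2}$$

Figure: Example when the label will be "not tree".

Figure: Example when the label will be "tree".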
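The labeling rule of equation (2) can be sketched as follows, assuming that detections and ground truth are axis-aligned boxes given as `(x_min, y_min, x_max, y_max)`. The box representation and the function names are assumptions made for illustration.

```python
def box_area(box):
    """Area of an axis-aligned box; degenerate or inverted boxes have area 0."""
    x_min, y_min, x_max, y_max = box
    return max(0, x_max - x_min) * max(0, y_max - y_min)


def overlap_ratio(detection, ground_truth):
    """area(detection ∩ ground truth) / area(detection ∪ ground truth)."""
    intersection_box = (
        max(detection[0], ground_truth[0]),
        max(detection[1], ground_truth[1]),
        min(detection[2], ground_truth[2]),
        min(detection[3], ground_truth[3]),
    )
    intersection = box_area(intersection_box)
    union = box_area(detection) + box_area(ground_truth) - intersection
    return intersection / union if union > 0 else 0.0


def label(detection, ground_truth, threshold=0.5):
    """Return "tree" only when the overlap ratio is strictly above the threshold."""
    if overlap_ratio(detection, ground_truth) > threshold:
        return "tree"
    return "not tree"


ground_truth = (0, 0, 64, 64)
good_detection = (16, 0, 80, 64)   # overlap ratio 3072 / 5120 = 0.6
poor_detection = (32, 0, 96, 64)   # overlap ratio 2048 / 6144, about 0.33
```

With these made-up boxes, `label(good_detection, ground_truth)` gives "tree" and `label(poor_detection, ground_truth)` gives "not tree".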

Experimental Settings - Evaluation

} In green: True Positives
} In yellow: False Positives
} In blue: False Negatives

Experimental Settings - Evaluation

$$\text{Recall} = \frac{\text{TruePositives}}{\text{TruePositives} + \text{FalseNegatives}} \tag{3}$$

$$\text{Precision} = \frac{\text{TruePositives}}{\text{TruePositives} + \text{FalsePositives}} \tag{4}$$

$$\text{F-Measure}_{\max} = \frac{2 \cdot \text{Recall} \cdot \text{Precision}}{\text{Recall} + \text{Precision}} \tag{5}$$

} TruePositives: Yeah! We really found a tree
} FalseNegatives: Oops, we missed this one
} FalsePositives: Oh really? Did you really think THAT was a tree?
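Equations (3) to (5) translate directly into code. In the sketch below, the reading of the "max" subscript as "best F-Measure over several operating points" (for example, several detection-score thresholds) is an assumption, and the counts are made up.

```python
def recall(true_positives, false_negatives):
    """Share of the real trees that were found, equation (3)."""
    total = true_positives + false_negatives
    return true_positives / total if total else 0.0


def precision(true_positives, false_positives):
    """Share of the detections that are real trees, equation (4)."""
    total = true_positives + false_positives
    return true_positives / total if total else 0.0


def f_measure(recall_value, precision_value):
    """Harmonic mean of recall and precision, equation (5)."""
    total = recall_value + precision_value
    return 2 * recall_value * precision_value / total if total else 0.0


def f_measure_max(operating_points):
    """Best F-Measure over (TP, FP, FN) counts, one triple per operating point.

    Assumption: "max" means the maximum over such operating points.
    """
    return max(
        f_measure(recall(tp, fn), precision(tp, fp))
        for tp, fp, fn in operating_points
    )


# Made-up counts for three hypothetical detection thresholds.
points = [(90, 60, 10), (80, 20, 20), (50, 5, 50)]
best = f_measure_max(points)
```

For the middle operating point, recall and precision are both 0.8, so its F-Measure is also 0.8, which is the best of the three.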

Results using one source

Table: Results using one source.

Source   F-Measure max   Recall   Precision
RGNIR    60.45%          57.89%   63.44%
DSM      62.47%          57.62%   68.56%
NDVI     63.97%          62.34%   67.04%

} The DSM gives the best precision
} NDVI gives better results than RGNIR and the best F-Measure max

Early Fusion and Late Fusion

Table: Results using multi-source data and the Early Fusion architecture.

Early Fusion   F-Measure max   Recall   Precision
RGNIR+DSM      67.12%          65.40%   69.54%
NDVI+DSM       75.30%          68.37%   84.11%

Table: Results using multi-source data and the Late Fusion architecture.

Late Fusion    F-Measure max   Recall   Precision
RGNIR+DSM      62.14%          62.54%   62.65%
NDVI+DSM       72.57%          70.99%   74.83%


Discussion - Early Fusion and Late Fusion

} From one source to multi-source, the F-Measure max increases by about 11 percentage points (from 63.97% with NDVI alone to 75.30% with NDVI+DSM in Early Fusion)
} No matter the architecture used, NDVI+DSM gives the best results
} The Early Fusion gives the best performance
} Precision increases markedly with the Early Fusion compared with the Late Fusion
◦ 74% up to 84% with NDVI+DSM
◦ 62% up to 69% with RGNIR+DSM
} The recall does not increase with the Early Fusion
} The Early Fusion architecture reduces the number of False Positives

Complementarity between sources

Table: Results of the correlation between each source.

Sources      Correlation   Distribution
RGNIR/DSM    47.86%        26.47%   25.66%
NDVI/DSM     48.96%        28.75%   22.27%

} About 50% of the trees are found in both sources
} The remaining 50% is split between the two sources, which shows the value of combining several sources
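One plausible way to obtain such figures is to compare the sets of trees found with each source. The sketch below is an assumption about how the overlap could be computed, not the procedure used for the table; the tree identifiers are made up.

```python
def complementarity(found_a, found_b):
    """Shares of trees found with both sources, only with A, and only with B.

    Shares are relative to the trees found with at least one of the two
    sources, so the three values sum to 1.
    """
    found_any = found_a | found_b
    if not found_any:
        return 0.0, 0.0, 0.0
    total = len(found_any)
    both = len(found_a & found_b) / total
    only_a = len(found_a - found_b) / total
    only_b = len(found_b - found_a) / total
    return both, only_a, only_b


# Hypothetical tree identifiers detected with each source.
found_with_ndvi = {1, 2, 3}
found_with_dsm = {2, 3, 4}
shares = complementarity(found_with_ndvi, found_with_dsm)
```

Here half of the trees are found with both sources and a quarter with each source alone, which mirrors the roughly 50/25/25 split reported above.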


Conclusions

} The Early Fusion gives better performance than the Late Fusion
} NDVI gives the best performance
} This highlights the importance of the data used to train a CNN model (RGNIR is not enough)
} We show the effectiveness of CNNs in merging different information, with a performance gain exceeding 10 percentage points


THE END
