Transfer learning for hierarchical Conditional Random Fields for the classification of urban aerial and satellite images
Aerial and satellite images are an important source of information for the geo-information systems. Semantic information from the remotely sensed data can be derived by the classification. One problem related to the machine learning techniques used in this context is the necessity to provide a sufficient amount of training data for each new classification task or new data set. The acquisition of the data can be tedious and time consuming manual task. Therefore it is desirable to transfer classifier to the new data without any or with just few new training samples.
The goal of the project is to develop a new methodology based on transfer learning technique called domain adaptation. Domain adaptation techniques try to reduce the amount of training data required for a successful classification by adapting the classifier trained on samples from so called source domain to a new data set called target domain, where the features may have different distributions but assumed to be related.
Our approach for domain adaptation is based on logistic regression considering multiclass problems. Logistic regression is a simple and fast discriminative probabilistic classifier, which should require fewer training samples than a generative approach. The adaptation performs iteratively by removing labelled source samples from the current training data set and simultaneous addition of semi-labelled target samples to the current training data set. The semi-labels are determined automatically applying the current state of the classifier. After each iteration the classifier is retrained using the updated training data set. In this way the classifier and the decision boundaries can be slowly adapted to the distribution of the target domain data.
Currently, individual weights in the domain adaptation process are used, to achieve more stability and overcome problems of abrupt changes in the decision boundaries. These weights modulate the impact on the basis of a sample’s distance from the decision boundary. Additional stability is achieved by using the current state of the classifier for regularisation. An example for classification results applying the domain adaptation is shown in Figure 1(c).
Fig. 1: Results of classification of the target data: (a) classifier was trained and applied to the target data without domain adaptation (best possible performance using logistic regression); (b) classifier was trained on source domain and applied to target domain without domain adaptation; (c) results after domain adaptation procedure.
Paul, A.; Rottensteiner, F.; Heipke, C. (2015): Transfer Learning based on Logistic Regression. In: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-3/W3, pp. 145-152, 2015 | file |
Paul, A.; Rottensteiner, F.; Heipke, C. (2016): Iterative re-weighted instance transfer for domain adaptation. In: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences III-3, pp. 339-346
Vogt, K.; Paul, A.; Ostermann, J.; Rottensteiner, F.; Heipke, C. (2018): Unsupervised source selection for domain adaptation. In: Photogrammetric Engineering and Remote Sensing 84(5): 249-261, 2018. more