290
L. Bruzzone et al. / Information Fusion 3 (2002) 289–297
detail, the proposed system is based on two different
unsupervised retraining classification algorithms: a para-
metric ML classifier and a non-parametric RBF neural-
network classifier. Both techniques allow the existing
on several factors (e.g., differences in the atmospheric
and light conditions at the image-acquisition dates,
sensor non-linearities, different levels of soil moisture,
etc.) that alter the spectral signatures of land-cover
classes in different images and consequently the distri-
butions of such classes in the feature space.
‘
‘knowledge’’ of the classifiers (i.e., the parameters of the
classifiers obtained by supervised learning on a first im-
age, for which a training set is assumed available) to be
updated in an unsupervised way, on the basis of the
distribution of the new image to be categorised. The
combination of the above-mentioned classification al-
gorithms is used as a tool for increasing the accuracy
and the reliability of the classification maps obtained by
each single classifier. Classical approaches to classifier
combination are adopted. As compared to previous
works [6], the main novelty of this paper consists in the
original retraining technique proposed for the RBF
classifier and in the multiple-classifier architecture used
in the context of partially unsupervised classification.
The paper is organized into seven sections. In Section
It is worth noting that the proposed approach is
based on a separate analysis of the two images X
. Consequently, it does not require that the images are
accurately co-registrated.
1
and
X
2
3. Description of the architecture of the proposed
classification system
The proposed classification system is based on a
multiple-classifier architecture. The choice of this ar-
chitecture mainly depends on the intrinsic complexity of
the unsupervised retraining procedures, which may re-
sult in less reliable and less accurate classifiers than the
corresponding supervised ones, especially for complex
data sets. In this context, the use of a multiple-classifier
approach allows one to integrate the complementary
information provided by an ensemble of different clas-
sifiers, thus involving a more robust and reliable classi-
fication system.
2
the considered problem is formulated. The architec-
ture of the proposed system is described in Section 3.
The unsupervised retraining classifiers are described in
Section 4. Section 5 presents the strategies adopted for
the combination of the ensemble of unsupervised re-
training classifiers considered. Experimental results are
given in Section 6. Finally, in Section 7, discussion is
provided and conclusions are drawn.
The classifiers composing the ensemble are developed
within the framework of the Bayes decision theory.
Consequently, the decision rule adopted to classify a
1
j
generic pixel x of the image X can be expressed as [10]:
1
2
. Formulation of the problem
1
j
1
j
x 2 x if x ¼ arg maxfP ðx =x Þg
ð1Þ
k
k
1
i
1
1
1
2
1
B
2
1
2
2
2
B
xi2X
Let X ¼ fx ; x ; . . . ; x g and X ¼ fx ; x ; . . . ; x g
1
2
1
j
denote two multispectral images composed of B pixels
and acquired in the area under analysis at the time t and
, respectively. Let x be the 1 Â d feature vector asso-
where P ðx =x Þ is the estimate of the posterior proba-
1
i
1
j
1
bility of the class x
(1), the classification of the image X
mation of the posterior probabilities P
classes x
2 X. These estimates involve the computation
of a parameter vector # , which represents the ‘‘knowl-
i
at t , given the pixel x . According to
1
i
j
t
2
1
requires the esti-
ðx =X Þ for all
ciated with the jth pixel of the image X
dimensionality of the input space). Let X
variate random variable that represents the pixel values
i.e., the feature vector values) in X . Let us assume that
i
(where d is the
1
i
1
i
be a multi-
i
1
(
edge’’ of the classifier concerning the distributions of the
classes in the feature space (i.e., the status of the clas-
i
the same set X ¼ fx ; x ; . . . ; x g of C land-cover
1
2
C
classes characterizes the considered geographical area at
both t and t . This means that in our system only the
sifier at t ). The number and nature of the vector com-
1
1
2
ponents will be different depending on the specific
classifier used. In our system, we propose to consider
two different unsupervised retraining approaches: the
former is a parametric approach, which is based on the
ML classifier; the latter consists of a non-parametric
spatial and spectral distributions of such land-covers
classes are supposed to vary (i.e., the set of land-cover
classes that characterize the considered site is fixed over
time). This assumption is quite realistic in several real
applications of classification of remote-sensing data [7–
technique, which is based on RBF neural networks.
p
9
]. Finally, let us assume that a reliable training set Y is
1
Both techniques allow the parameter vectors # (corre-
sponding to the parametric approach) and # (corre-
sponding to the non-parametric approach), which are
1
n
1
available at t , whereas a training set is not available at
1
t
2
. This prevents the generation of the t
as the training of the classifier on the image X
performed. At the same time, it is not possible to apply
the classifier trained on the image X to the image X
2
land-cover map,
2
cannot be
1
obtained by supervised learning on the first image X , to
be updated in an unsupervised way.
1
2
In the proposed multiple-classifier approach, N dif-
ferent classifiers are trained at the time t by using the
information contained in the available training set Y . In
because, in general, the estimates of the statistical pa-
rameters of the classes at t do not provide accurate
1
1
1
approximations for the same terms at t . This depends
particular, a classical parametric ML classifier [10] and
2