Psychological Validity
of Fuzzy Representations

(abridged version)

Maurice Clerc 1995

mcft10@calva.net

Introduction

In a memory system for Artificial Intelligence, or memoware, one way to represent information is to create some stereotypes comprising a set of descriptors on which other objects of memory, or memobjects, are described asfuzzy sets, i.e., practically, as weighted combinations of these descriptors:this is the fuzzy representation principle, which generalizes the multi-dimensional psychological spaces[1-4,7]

It is then possible to mathematically define some similarities, which are symmetrical, and have a kind of partial transitivity. Since memoware built with these principles has to be accepted by users as an extension of their ownmemory, it is important to verify that these properties are psychologically valid.

This paper presents the approach and some mathematical tools. A study was conducted with human subjects classifying pairs of objects (outlines of birds)on the basis of their similarity.

.

Method

Stimuli for the experiment were six outlines of birds. The outlines could be arranged in a sequence such that the image metamorphosed incrementally from a bird of prey to a wader. This sequence was chosen for two reasons :

- on the one hand each intermediate outline is calculated according to the two extremes, by linear variation of only one numerical parameter.

- on the other hand, this simplicity is not obvious, especially when all

the sequence in order is not visible (see figure 1).

Figure 1. Stimuli. Each point of the first outline has a counterpart on the last one. For intermediate outlines, the equivalent point is calculated by linear interpolation.

Preliminary tests showed that identical pairs were always found unambiguously.So they are finally not included into the material available for the subjects,which is then made of 30 pairs on 3x3 cm "cards" , and a A4 sheet of paper for arranging them

Without speaking here in detail of tested population, and of protocol, let us simply say that at each session instructions are given to put each card more or less "high" on the sheet, depending of the similarity of the two outlines on the card. Figure 2 shows an example of test sheet which has been judged satisfactory by a subject.

Each subject has to do five sessions, with at least some days between each one and the next, and with five different sets of 15 cards, in order to eliminate some artifacts (e.g. some pairs are rotated of 180 degrees).

Figure 2.-Filled test-sheet (scale 1/3). Subject were told to arrange each card accordingly to the similarity of the two outlines, the more similar they are the more the card should be close to the top, and vice versa.

Results and Models

For each session first results are measures of card positions, from the bottom of the sheet, knowing there is never a pair just at the bottom. It should be noticed that, according to verbalizations, this last point is probably due to a partly semantic component (e.g. "anyway, they are both birds"). They are then normalized on [0,1], simply by dividing them by the height of the sheet paper.Let be Ri,j the average similarity obtained for each card(birdi, birdj)

Symmetry

Before to model, the question is "Have we here the symmetry property ?", i.e.have we, more or less, ? The figure 3 shows a quite good correlation. In fact the probability fort he hypothesis of symmetry to be true is 0.905, which is acceptable, but not perfect, and it could be interesting to do more tests to examine this point.

Figure 3 Symmetry check. Each pair of outlines has been placed several times, sometimes with the card (i,j), and sometimes with the card(j,i). So it is possible to compute two separated averages, one for each order. The most important differences are for "intermediary" pairs, for which subjects have difficulty deciding the best position. On the whole, the correlation is however good.

Fuzzy Representations and Models

We are studying a symmetrical function F : ,whose estimated values are coming from measures, for one hand, and from the constraint that the identity is valued to 1, for the other hand. This values can be retrieved by a similarity function, in the mathematical meaning, applied to fuzzy representations of the studied objects. Figure 4 gives such are presentation for two descriptors.

Figure 4 Two descriptors Fuzzy Representation . Here, the sum ofthe values for each object is exactly 1, but it is not the general case.

Several kinds of similarity have been tested ("ensemblistic", "distance","angular"). The angular model is the only one which takes into account the typicality, in the sense that, for example, outlines 5 and 6 are indeed more similar than 3 and 4.

More precisely, a formula like gives (with lambda=1.29 and mu=1.04) a excellent rank correlation (0.96),and a small maximum relative difference (0.17).

Quasi-transitivity

Such a mathematical model implies a quasi-transitivity which could be roughly said:

"If the similarity between i and j is high, and also between jand k, then there must be some similarity between i and k"

and this can be defined by a strict archimedian t-norm T .where ri,j is the similarity between the object i and the object j.

More precisely, one must have

with, for example, for the automorphism j

It seems complicated, but this is only a generalization of the very simple case ,obtained for nu=1.

The exhaustive check is a bit boring, but it can be shown that the data have indeed an underlying structure, which constrains the similarities between the objects of a triplet. This can be visualized by a surface ,drawn for nu=1.374, so that, for each triplet ,the point is"above" this surface (figure 5). In passing, the subjects were completely unaware of this kind of transitivity. And some of them contested it, even when it could be found in their own test sheets.

Figure 5 Constraint surface for quasi-transitivity. For each triplet of objects, (bird outlines), the three similarities two by two,can be seen as the coordinates of a point. With the collected data all points are "above" the surface.

Conclusion

Metric properties of psychological similarities are studied from a long time ago, and it is well known, for example, that the attempts of formalizations of a kind of transitivity by triangle inequalities have been not very convincing [5-6]I show here that a more general formulation (coming from the model HFR:Hierarchical Fuzzy Representation) give results which are compatible with the data. In return, the pertinence of this model is a little reinforced

References

[1] X. Chanet, Décompositions floues, ressemblances, catégorisations, 1992, France Télécom: Annecy, France.

[2] M. Clerc, F. Guérin, et al. Représentations flouesdans un mémoriel. in JIOSC (Journées Internationales d'Orsay sur les Sciences Cognitives). 1994. Orsay, France: CNRS.

[3] W. Duch, G.H.F. Diercksen, Feature Space Mapping as a Universal adaptive System, Computer Physics Communications (1994)

[4] K. Tanabe, J. Ohya, et al., Similarity retrieval method using multi-dimensional psychological space, Systems and Computers in Japan 24 (1993)98-109.

[5] A. Tversky, I. Gati, Similarity, separability, and the triangle inequality,Psychological Review 89 (1982) 123-154.

[6] A. Tversky, D.H. Krantz, The dimensional representation and the metric structure of similarity data, Journal of Mathematical Psychology 7 (1970)572-596.

[7] W. Wagner, A fuzzy model of concept representation in memory, Fuzzy Sets& Systems 6 (1981) 11-26.