IML346 Blog

March 29, 2010

Chapter 8

Filed under: Uncategorized — Natural Light @ 8:14 pm

8 The third dimension

THE THIRD DIMENSION: FROM ‘READER’ TO ‘USER’

So far we have restricted ourselves to still rather than moving images, and to two dimensional forms of visual communication rather than three-dimensional ones such as sculpture, product design, architecture or stage set design. In this chapter we will explore to which degree the descriptive framework we have developed in this book can be applied also to the three-dimensional visual and the moving image. Our three-dimensional examples will mainly be drawn from the fields of sculpture and children’s toys. Toys are of particular interest as they occupy a space somewhere in between sculptures, which are primarily symbolic objects, objects for contemplation and veneration, and ‘designed objects’, which are primarily objects for use, even though they may also convey symbolic messages. In addition we will consider everyday objects such as cups and motor cars, and architecture. The chapter is intended as a first exploration. We are not able, within the space of this book, to present a systematic account of all aspects of three-dimensional visual communication, as this would require too many additional concepts (some of these are discussed in van Leeuwen, 2003). Instead, we will concentrate on the concepts we have described in the preceding chapters, to show what role they play in three-dimensional visual communication. In this way we will at least be able to indicate in which ways three-dimensional visual communication is similar to and different from two-dimensional communication, and to outline the theoretical issues which follow from this.
Starting with the issue of visual representation, we found that the key categories we introduced in chapters 2 and 3 can be applied also to three-dimensional visuals, and do indeed seem sufficient to describe a wide range of such objects. Many sculptures, for example, have what we called in chapter 2 a ‘narrative’ structure. Take Epstein’s Jacob and the Angel (1940), shown in figure 8.1: the arms of Jacob and the Angel form powerful vectors, relating the two participants in a complex and interesting way. The Angel’s action is transactional. It has a Goal, as he holds Jacob in a firm grip. But Jacob’s action, though foregrounded, is non-transactional – his arm hangs limply, and he does not hold or grab anything.


Kenneth Armitage’s People in the Wind (1952), shown in figure 8.2, also has strong vectors, formed by the way the figures are bent forwards as they struggle against the wind. But here (as in sculptures of discus throwers, ballerinas and other active subjects) the action is ‘non-transactional’. The vectors do not point at or lead to another participant, a Goal. The figures, it seems, ’strain forwards’, but they do not ’strain towards something’.

‘Reactions’ are also common, although in sculpture the eyes do not usually form as strong a focus of attraction as they do in two-dimensional images, because they lack the strong tonal contrast between the whites of the eyes and the pupils, which, in pictures as in nature, makes eyes so salient. It is as if the added naturalism of the third dimension must be counteracted by greater abstraction in other means of expression, such as colour, to prevent sculpture from crossing the line between art and the uncannily real make-believe of the waxworks show (or of certain contemporary forms of art). Nevertheless, in a sculpture like Rodin’s The Kiss (1880), the man and the woman not only hold each other with their arms, they hold each other with their gaze as well, in a ‘transactional reaction’, and it is not difficult to find examples of ‘non-transactional reactions’ also. Jacob (figure 8.1) looks up, in a non-transactional reaction, while the Angel looks at Jacob, in a transactional reaction. Again, the Angel acts on Jacob, but Jacob does not act on the Angel. Arnheim’s description of Michelangelo’s Moses provides another example. ‘The deflection of the lawgiver’s head and the fierce concentration of his glance introduce an oblique vector that moves outwards like the beam of a lighthouse. But no goal object is included’ (Arnheim, 1982: 46).

Even the design of objects and buildings can be vectorial and hence ‘narrative’, as shown in figure 8.3. The vector formed by the  tailfins of  1950s cars, for instance, represented (in an abstract way) the idea of dynamic motion, as if it was not enough that cars are in fact dynamic moving objects, regardless of whether they have  tailfins or not. The fact that cars do not have tailfins now points to the ideological dimension of sculptural representation: the meanings represented were those of the ‘jet age’. But here an important complication occurs. In the case of objects, the ideational relations we have discussed in chapters 2 and 3 can be realized in two ways: they can be realized by the designer, as forms to be ‘read’ by a viewer, as when a cup has a ‘dynamic’, vectorial handle (and also, of course, in the case of the pictorial or decorative designs printed or painted on, or moulded or carved in, the cup); or they can be realized by the user of the object, as when the cup is held or drunk from, in a ‘transactional action’ with its user – the vectorial handle is then a ‘non-transactional action’ from the point of view of the design of the cup, and a potential for transactional action, a means, from the point of view of its use.

Reactions can even occur in objects. The toy telephone shown in figure 8.4 not only includes the dog’s tongue, as an oblique vector signifying a non-transactional speaking; it also has eyes. The inclusion of eyes is in fact quite common in toys for young children, particularly in toys with the themes of time (clocks), communication (toy telephones) and transport (toy locomotives and cars), as if to encourage the child to form an emotive, personalized bond with these three key technologies as early as possible.

In contrast to two-dimensional visuals, sculptures rarely include a Setting. Their setting is the environment in which they are displayed, a gallery, a niche in a church, or a public square. It is not a represented Setting. Of course, sculptures can include a Setting, as in the works of Edward Kienholz and George Segal, for instance, or Asian sculpture gardens, such as the Tiger Balm Gardens in Singapore. But in contemporary Western sculpture the inclusion of a Setting is relatively rare. Decontextualization, it seems, has to counteract the added naturalism of the third dimension. Sculptures do, however, often have another participant, the pedestal on which they stand. Such pedestals can be (mere) framing devices, creating a degree of separation between the sculpture and its environment, and so enhancing its status as a representation, an object for contemplation, set apart from its environment. But they can also and at the same time form part of the representation, as in Canova’s Paolina Borghese (1805), which has Paolina resting on a couch which forms also a sarcophagus-like support for her reclining body. The absence of such framing can have a strong effect, as in the lifesize bronze of a middle-aged, corpulent man in a raincoat and hat placed as if mingling with the shoppers in the middle of the footpath of a busy shopping street in Amsterdam.

Turning now to ‘conceptual’ rather than ‘narrative’ structures of representation, these, too, can be found in sculpture. Miró’s Woman (1970), reproduced in figure 8.5, is what, in chapter 3, we called an ‘analytical’ representation. The sculpture does not just play with the forms of found objects, making eyes of the headlights and a mouth of the windscreen of a car; it is also an ‘analysis’ of ‘Woman’. ‘Woman’, in all its generality, is the ‘Carrier’, and the parts, the ‘Possessive Attributes’, are, in Miró’s conception: a head which is also the empty shell of a car; an upper body which is also a tray on which two aggressively pointed breasts are presented to the viewer; and a lower body which is a barrel-shaped container with a vagina-like slit and two handles to hold her by. When we visited an
exhibition of Miró’s sculptures we noted that the misogynistic quality of Miró’s ‘analysis’ was not lost on the viewers. There was a guest book in which the visitors of the gallery could write down their impressions. Many had used the opportunity to draw a quick caricature of a Miro woman with a contemptuous comment such as ‘WOMAN????’ These readings contrasted sharply with the artspeak in the catalogue, which described the formal qualities of the sculptures only, and did not dwell on Miro’s way of representing the female gender.

Giacometti’s Hour of the Traces (1930), shown in figure 8.6, is an analysis of the genderless human being, of the human condition in general. The parts: a kind of antenna, with an abstract eye, forms an active sensory tentacle and protrudes from the sculpture at an angle, constituting an oblique vector; a rigid rusty frame, the body; and, within the open frame, a plaster heart, suspended on a thin string, and moving slightly to and fro. Thus two actions are embedded in the analysis, both non-transactional: the action of the sensory apparatus, and the movement of the heart – the human being as a skeletal frame that is alive and surveys its environment.

We might add that analytical sculpture is used not only in art but also in science – for instance, to show the construction of a molecule; or as a teaching aid, for instance in anatomy, in which case the parts can often be detached from the whole. The kinetic design of sculptures and other objects, the way they can move or be made to move, taken apart and put back together again, and so on, is a subject to which we cannot do justice in this chapter, as it would again demand the introduction of a new set of concepts (but see van Leeuwen and Caldas-Coulthard, 2004).

The third dimension creates an additional option in representation, a relation between the representational structure and the position of the viewer. Seen from the side Epstein’s Jacob and the Angel (figure 8.1) has a narrative structure (‘transactional action’). It is in the first place about what Jacob and the Angel do. But if we look at the Angel from behind (figure 8.7), we are faced with an ‘analysis’ of the Angel, and a very striking one: the three principal ‘Possessive Attributes’ Epstein emphasizes are the Angel’s long hair, his wings – and his balls.

Not all sculptures use this possibility. One can imagine a continuum running from reliefs, which perhaps differ from two-dimensional images only in terms of modality, to
fully ‘multifaceted’ sculptures such as Jacob and the Angel. In between there are sculptures which, though free-standing, are clearly not designed to be seen from behind and leave the back ‘unworked’, perhaps because they were meant to be placed against a wall or in a niche. And even when a sculpture is a fully multifaceted representation, like Jacob and the Angel, its placement in a particular environment can block access to alternative viewing positions, and hence to alternative readings. This may be because the work is placed with its back against the wall, or because a barrier prevents the viewer from access to other than more or less frontal viewing positions. But it may also be done in subtler ways. When we first analysed Jacob and the Angel, it was placed in the centre of the octagonal entrance hall of the Tate Gallery, in such a way that the viewer first saw it from the side, with the Angel on the left. In other words, its position favoured the narrative reading, the drama of the sculpture, rather than Epstein’s striking ‘analysis’ of the Angel.

But the viewer did have access to the other sides, as the sculpture was placed in the centre of the hall. Playmobil figures such as those shown in figure 8.8 are also analytical structures. They show the significant attributes, the significant characteristics, of (for example) an ‘ethnic family’. The family has-five members – a father, a mother and three children. Each member of the family has black hair and dark skin. Note the difference from the neutrally labelled ‘family set’: the composition of the family is the same, and all the members of the family have pink skin, but they differ in the colour of their hair and therefore have individual characteristics as well as social characteristics, whereas (lesson number one) the members of the ‘ethnic family’ are ‘others’ who ‘all look the same’. The children of the two families are dressed identically, but the parents are not. Lesson number two: second-generation immigrants are already ‘more like us’. The Playmobil company brochure says that these toys ‘form an aid to the training of your minds’ and will ‘acquaint children with what they will meet in the big real world’ – but not in an entirely neutral fashion. As we have said in chapter 3, any analytical structure is only one of the many ways in which a given ‘Carrier’ can be analysed.

We have to remember, of course, that Playmobil figures enter into representation in two ways, like our earlier example of cups with vectorial handles (figure 8.4). On the one hand, they are like sculptures, pre-designed representations, to be ‘read’ by the child; on the other hand, they have movable limbs and detachable parts and they can hold objects in their hands. Children can therefore use them to create a variety of representational structures, narrative ’scenes’, and they can even subvert the pre-designed representations, for instance by giving an ‘ethnic’ child red hair. They can also create their own classificatory or analytical arrangements, for instance by making a display of different kinds of Playmobil children on their toy shelf or by creating a new analysis of the ‘family’, with only a mother, perhaps, or with five children of different ‘ethnic’ origins (the arrangement of Our Society and Others, figure 3.29, could be reconstructed with Playmobil figures!). In the same way a cup can be used, not only in the ‘transactions’ of holding or drinking, but also: to create an analytical structure, as when the cup is arranged on a sideboard together with the other parts of the set to which it belongs, or on the kitchen shelf, to become a ‘Possessive Attribute’ of the ‘Carrier’ ‘dishes’; or to create a classificational structure, as when a number of different cups are arranged symmetrically in a shop window or in a design exhibition.

The Vitrinen (display cases) of the German avant-garde artist Joseph Beuys are an intriguing example of the classificational sculpture. They are the kind of sculpture which is usually referred to as an ‘installation’ – glass display cases containing a variety of objects, some of them altered by Beuys. V/ti-/ne 2 (1960-70), for instance, contains a film can (itself containing a film which features a performance by Beuys), a pair of boxing gloves, a sausage, a cassette player (with a tape of music performed by Beuys), a Beethoven score with a small blackboard eraser on it, two wineglasses (one coated with a white substance, the other looking as though something has been burnt in it), and a zinc box. Clearly a work like this raises the question of what these objects have in common (what their ’superordinate’ is, in our terms), and this is exactly the question Beuys’ interpreters have asked. According to Theewen (1993: 139), Verwandschaft (‘relatedness’) is the key to understanding Beuys’ Vitrinen: ‘By bringing related objects together an association between them is created’, and in the case of the Vitrine we have just described this association is that all the objects ‘have played a role in performances of Beuys, and [that they] all contain something’ (1993: 29).

The third kind of conceptual structure discussed in chapter 3 was the symbolic relation. We saw how in pictures an overall colour – a blue haze, or a golden glow – could realize what we called a ’suggestive symbolic’ process, endowing the depicted scene with an overall significance, ’saying’, as it were, ‘this scene is cold and desolate’ (in the case of the blue haze) or ‘these objects are very valuable’ (in the case of the golden glow). Colour can play this role also in the case of three-dimensional objects – think of the colours of cars, for instance, the difference between a black, a bright-red and a white Mercedes, say.

But in addition to colour there are other factors, such as the material from which an object is made, the way the surface of a sculpture or other object is ‘worked’, or the overall shape of the objects, in so far as these are not determined by considerations of naturalistic representation, or by the functions served by an object. Giacometti’s Man Pointing (1947) is much less ‘analytical’ than the sculptures we have so far discussed. It places no emphasis on the distinctness of the parts of the human body. Even the facial features are hardly stated – small indentations to indicate a mouth and eyes. The sculpture does of course have a clear vector, as the man is making an expansive oratorical gesture. The most striking characteristic of this sculpture, however, is its rough, black, craggy surface. It is difficult to put into words exactly what is suggested by this surface, but whatever transcoding we attempt it will have to express somehow that this figure is ‘weatherbeaten’, affected by exposure to – but here we can fill in a number of things – the elements suffering, ageing and so on. The sculpture is abstract enough to allow all these readings and more.

The same applies, again, to other kinds of three-dimensional objects. Cups can be smooth, made of delicate china, suggesting, perhaps, an overall quality of elegance and refinement; or they can be sturdy and solid and made of brick-red terracotta, suggesting, perhaps, an overall quality of down-to-earth simplicity. Cars can be elongated and streamlined, suggesting power and speed, or, as in the case of the currently fashionable ‘retro’ look, rounded and egglike, suggesting a safe cocoon, a kind of womb.

The second kind of synbolic relation we discussed was the ’symbolic attributive’ process, where one represented participant has no other function than to endow another with symbolic significance. This occurs, for instance, in some of Miró’s sculptures, where birds and eggs attribute symbolic qualities to the figures depicted (usually ‘women’). But it occurs also in toys for young children, where telephones can have wheels (an early lesson about the concept of communication as ‘transport of information’ and ‘bridging the distance’ rather than ’sharing of information’), and ‘interactive learning centres’ for children aged 21/2 to 5 (‘6 built-in functions which teach the alphabet, numbers, shapes, colours, sound effects and nursery rhymes’, and all this for £29.50) have a steering wheel and dashboard, as a symbol of the power and control afforded by knowledge. Or in adult toys: emblems and other decorations on cars, for instance. Or in architecture, where sculptures and murals can become symbolic attributes for buildings.

On the whole, then, we feel that the account of visual representation we have presented in chapters 2 and 3 can be applied to three-dimensional visual communication. Yet there are some significant differences. First, three-dimensional objects can be placed on a continuum which runs from objects that allow only one reading (by offering the reader only one aspect, usually the front) to objects which allow more than one reading, depending on the position of the viewer relative to the object.

Second, three-dimensional objects can be placed on a continuum which runs from objects which have been designed only to be looked at, only to be ‘read’, to objects which enter into representational relations in three ways: (1) the relations encoded in the design of the object itself, to be ‘read’ only by the viewer; (2) interactive relations between the object and its user (e.g. holding the cup, or drinking from it); and (3) conceptual relations created by the user (e.g. creating a classificational syntagm with a number of different cups).

Third, even when an object does have a potential for multifaceted representation and/or for being ‘used’ as well as ‘read’, external conditions can inhibit this potential, block the viewer’s access to alternative reading positions, or to interactive engagement with the representational potential of the object.

INTERACTIVE VIEWING

We will now turn to the interactive relations we discussed in chapter 4, trying to explore, again, how applicable they are to three-dimensional visual communication. In that chapter we distinguished between ‘demand’ pictures from which represented participants address the viewer directly with their gaze and ‘want something from the viewer’, and ‘offer’ pictures which position the viewer as an observer only, and offer the represented participants as ‘information’ to be taken in by the viewer.

Clearly, this distinction can be applied also to sculpture – but, again, with a difference. Henry Moore’s Recumbent Figure (1938), shown in figure 8.9, addresses the viewer
powerfully. Although the eyes are little more than indentations in the surface of the stone, the whole attitude of the figure suggests a concentrated look. But, as viewers, we can take up a position from which that look w/// directly address us (as did the photographer, in the case of figure 8.9), so that the picture forms a ‘demand’; or a position from which the figure looks past us, at something else, or at nothing in particular, in any case, at something not included in our view, and in that case the look will become a ‘non-transactional reaction’. In the two-dimensional medium we cannot, as viewers, decide whether or not we will allow ourselves to be directly addressed by a represented participant; the decision has been made for us. In the three-dimensional medium we can -that is, if the placement of the sculpture allows us to do so. In the Tate Gallery, Moore’s sculpture could have been placed in such a way that the figure’s gaze would fix the viewer immediately upon entering the room. But this was not done when we viewed the sculpture there, and as a result the figure became just one of a number of Moore’s works, presented as part of a class ificational syntagm, and favouring tie ‘offer’ rather than the ‘demand’.

The same would be true of the toy telephone (figure 8.4). The gaze of this telephone can only become a ‘demand’ by virtue of an active decision on the part of its user. Some toys, of course, lend themselves more to this than others. Playmobil characters have small black dots for eyes. They are biased more towards the ‘offer for information’ than towards the interactive ‘demand’. And the eyes of many ‘boys’ ‘ dolls (Batmen, Crash-dummies, Megazords, etc.) are often obscured by helmets, masks or dark glasses. The eyes of ‘girls’ dolls (and of many cuddly animal toys), on the other hand, tend to be large and highly detailed. While boys are steered towards a more manipulative relation to their dolls, for girls the look, the interactive dimension, is made to matter more. And the same is true for very young children: even their bedclothes, pillowcases, cups, plates may have eyes, and are thus personalized, animated, capable of entering into a ‘direct address’ relation with the child. As with many other things, some of this may well live on, unconsciously, in the adult relation with objects.

The same reasoning can be applied to the other interactive dimensions we discussed in chapter 4. In principle the viewer can decide whether to see the object from close up or from a distance, frontally (hence with ‘involvement’) or from an oblique angle (hence with ‘detachment’); from above (hence from a position of power over the object) or from below (hence from a position in which the object has power over the viewer). We say ‘in principle’, because here, too, the viewer’s choice may be restricted by external factors, by barriers that prevent viewers from coming up close or seeing the object from a different angle. And large objects can make the high-angle viewpoint and the close distance impossible. What towers over us has, by design, power over us, and is, by design, socially distant: the vertical dimension is the dimension of power and reverential distance, the dimension of ‘highly placed’ people, places and things. In this connection it is also significant that sculptures, as works of ‘high’ art, cannot usually be approached from the most intimate distance, the distance that makes touching possible: as soon as the gallery visitor comes too close, a guard will become alert.
When sculptures are taken out of their original context and moved into another, their interactive meanings may change significantly. They may be, literally, taken down from the pedestal – in a church perhaps – where they were to be looked at from below, with reverence, to be moved into a gallery, where they are positioned at a level of equality, and viewed from a more ‘familiar’ distance: Michelangelo’s David, removed to the rotunda of a museum, no longer calls to the citizens of Florence and is unaware of their calling on him, and now ‘can be explored by the viewer, but makes no advances to him’ (Arnheim, 1982: 50; also Hodge and l<ress, 1988: 201-3).

MODALITY IN THREE DIMENSIONS

In chapter 5 we described visual modality as resulting from the degree to which certain means of pictorial expression (colour, representational detail, depth, tonal shades, etc.) are used. Each of these dimensions can be seen as a scale, running from the absence of any rendition of detail to maximal representation of detail, or from the absence of any rendition of depth to maximally deep perspective. And on each of these scales there is a point that represents the way the given pictorial dimension is used in what could be called standard naturalism. To the degree that the use of a dimension is reduced, it becomes, at least in one respect, more abstract, ‘less than real’. To the degree that it is amplified, it becomes ‘more than real’, and we associated this with a ’sensory coding orientation’, an emphasis on sensory pleasure (or displeasure, as in the case of ‘more than real’ horror images), and an attempt to come as close as possible to a representation that involves all the senses.

Some of these play much the same role in three-dimensional visuals. Clearly, sculptures and toys can represent what they represent in naturalistic detail or more abstractly. And when the shape of everyday design objects no longer betrays their function, when, for instance, refrigerators, washing machines and kitchen storage cabinets all become sleek featureless white boxes, there is also a strong sense of abstraction. The same can be argued for buildings. Also, like pictures, three-dimensional representations can include several levels of modality.

The heads of Henry Moore’s King and Queen (1952-3), for instance, shown in figure 8.10, are abstract symbols, while their hands are rendered in naturalistic detail. This expresses the contradictory nature of the powerful. Their minds may have lost touch with the detail of everyday concrete reality, but look at their hands – they are after all still human, and their work, their doing, is still the work of humans. Machin and Suleiman (2004) have pointed out that in American computer war games the weaponry is represented in realistic detail, while the settings have lower modality, forming a generic desert that could be anywhere. This foregrounds American technological supremacy and backgrounds the specifics of specific conflicts. In a Lebanese computer war game produced by Hezbollah, the landscape is reconstructed from photographs of the sites of actual conflicts and represented in more detail. Here the specifics of historical and geographical accuracy matter.

The representation of detail in toys is particularly interesting. Barthes’ still highly readable essay about the semantics of toys (1973: 53ff.) is now perhaps overtaken by semiotic events. He describes French toys as highly detailed, highly naturalistic miniature versions of adult objects – and notes how unsatisfactory they are, therefore, both from the point of view of pleasure, of the sensory dimension, and from the point of view of their interactive potential, as objects to play with: ‘the child can only identify himself as owner, as user, never as creator; he does not invent the world, he uses it: there are, prepared for him, action without adventure, without wonder, without joy’ (1973: 54). Toys of this kind can be seen in museums such as London’s Museum of Childhood. One’s first impression on seeing Victorian toys is that children are addressed as miniature adults, their subjectivity ’scaled down’, but not ‘reduced’, from that of the adult world. This is represented through a large number of semiotic modes: the materials (glass, clothes of various kinds, metals, etc. – all of them rarely, if at all, used in contemporary toys) as much as the miniature naturalism of the represented objects. Today’s toys vary in their detail. Toys for young children are abstract. Shapes and textures are brought down to their essentials. Locomotives have featureless wheels, one featureless chimney, two yellow circles for windows. They are, from the point of view of detail of representation, like the simplest line drawings. As the child gets older, detail increases. The wheels of the locomotives get spokes and driveshafts. The texture of the machine’s body acquires detail. Headlights with miniature Fresnel lenses are added. But on the whole the contemporary toy remains simple, essentialized, as for instance (again) in the popular Playmobil figures, where the eyes are two dots, the mouth a curved line, the hair an almost featureless helmet, with a few indentations suggesting texture and the length signifying gender. Girls’ dolls, on the other hand (and other girls’ toys: realistic washing machines, ‘beauty shops’, vacuum cleaners, dolls’ cots), imitate the adult world, or at least that of the glamour girl, the housewife and the mother, much more so than boys’ toys, which depict a make-believe world of sciencefiction vehicles and weapons, or a world of dinosaurs and other monsters. The latter are often ‘more than real’, with highly textured, glistening scales, irregular teeth, and menacing eyes, set behind wrinkled lids. They are designed to create the kind of sensory, visceral reaction also sought, for instance, in horror films. Cuddly animal toys also tend to be ‘more than real’, with exaggeratedly soft furs and large moist eyes, this time to enhance the sensory pleasures of holding and touching. Researching this chapter, we spent many hours in toy shops as well as in toy museums, and could not help being struck by the contrast between the ‘bourgeois’ naturalistic toys Barthes described and today’s make believe world of brightly coloured plastic and creatures and objects from fantasy stories.

The role of colour in the modality of three-dimensional visual representation also resembles that of the two-dimensional visuals. Some of Mir6’s sculptures of ‘women’, for instance, are painted in bright, primary colours – yellow, blue and red. As a result they are schematic and analytical from the point of view of representational detail, a simplified, abstract view of ‘woman’, but ‘more than real’, ’sensory’ from the point of view of colour. Mir6’s women are not just machine-like assemblages of parts, they are also pleasurably colourful (‘woman’ as a machine for pleasure). Many toys, especially toys for young children, have exactly the same kind of modality configuration: abstract and schematized, ‘conceptual’, from the point of view of colour; ‘un-naturalistic’ primary colours, colours for the sake of pleasure rather than naturalistic representation. This ’sensory’ aspect is then further enhanced by the way these toys appeal to all the senses, and include touch and sound, as stressed in this description of the ‘Chicco animal train’: ‘a locomotive featuring eight different animal sounds and four different train sounds. Sounds are activated by pressing appropriately shaped buttons.’ Again, for older children colour can become more naturalistic, as in Thomas the Tank Engine, which replaces bright reds, yellows, blues and greens with more mute steel blue, grey and black, and just a touch of red. And, while the colour in all these examples is unmodulated, the colour of girls’ dolls becomes more varied, with blushes on the cheeks, shadows under the eyes, a shine on the lips. And the same applies to the slimy greens and greys and pinks on dinosaurs and monsters.

The materials used in three-dimensional representation, similarly, can be motivated naturalistically, as when toy cars are made of metal, or cuddly animals of soft, furry materials (though, on the other hand, even toy pigs can be furry!). But they can also be ‘less than real’, abstracting from the variety and specificity of the range of materials available, as in the case of plastic. To quote Barthes again.

Many toys are now moulded from complicated mixtures; the plastic material of which they are made has an appearance at once gross and hygienic, it destroys all the pleasure, the sweetness, the humanity of touch …. Henceforth, toys are chemical in substance and colour; their very material introduces one to coenaesthetics of use, not pleasure. (1973: 55)

Again, materials can have ‘more than real’ modality, when their choice is motivated not by an attempt to make the object look like what it represents, but by an attempt to create pleasure or displeasure. And this can apply also to everyday objects, and to buildings, if we replace naturalism by an attempt to reveal the material from which the object is actually made and divergence from naturalism by attempts to conceal these in one direction (say, plain sheets of a synthetic material over bricks) or another (say, timber cladding over bricks or concrete).

In other respects, however, three-dimensional modality differs from two-dimensional modality. There is no need to represent depth: the object already has depth, by virtue of its three-dimensionality. And there is no need to represent the play of light and shade: it already occurs, naturally. It is to compensate for these intrinsic naturalistic qualities that most Western sculpture (1) is decontextualized, lacking a Setting; (2) refrains from using colour as a means of representation, except in the sense of overall, symbolic colour; and (3) increasingly tends towards highly reduced forms of representation. In this way ‘high art’, which seeks to go beyond the mere replication of reality by representing an ideal of beauty or an abstract truth, distinguishes itself from everyday sculpture – from the dummies in shop windows, from girls’ dolls, from miniature dinosaurs, and so on. These need not themselves be naturalistic in every dimension in any case, and may use reduced naturalism either for didactic purposes, as in toys for young children, or to create class distinctions in taste, as with the more stylized dummies in expensive shops.

COMPOSITION IN THREE DIMENSIONS

Many sculptures and other three-dimensional objects do not clearly polarize between left and right, top and bottom, centre and margin, but when they do, the values of Given and New, Ideal and Real and Centre and Margin apply, we think, in the same way as they do in two-dimensional visual communication. Yet the third dimension does introduce additional factors.

In figure 8.1 we saw Epstein’s Jacob and the Angel as it would have been seen by visitors entering the Tate Millbank Gallery when the work was positioned in the centre of the gallery’s octagonal entrance hall. Like God in figures 6.5 and 6.9, the Angel against whose force Jacob is so helpless is Given, and Jacob’s helplessness is New, the focus of the drama. But moving to the other side, the viewer can reverse this, and make Jacob Given and the Angel New, provided access to the other side is not blocked. The same is true for other multifaceted sculptures with two or more ‘polarized’ participants, such as Rodin’s The Kiss, where, depending on your point of view, either the man can be Given and the woman New, or the woman Given and the man New. This is why photographs cannot do justice to multifaceted sculptures – a photo can always give only one angle, and hence one reading. The same is not true, however, for Ideal and Real and Centre and Margin. These relations cannot be inverted by changing the angle from which the work is viewed. In other words, the horizontal dimension allows interactivity; the vertical dimension and centrality do not.

Ideal and Real and Centre and Margin are often the most significant compositional dimensions in three-dimensional visual composition. Architecture provides perhaps the clearest example. Left and right are not usually polarized. Horizontally there is symmetry, but vertically there is not. The vertical dimension is used to polarize, to produce difference, with the Ideal, the element(s) that give(s) the building its more general and ‘ideal’ significance on top – the tower, for instance, with its significant emblems, the cross, or the clock, the gable stone on Amsterdam canal houses, the frieze high up on the Greek temple. Below, on the other hand, is the space of the Real -the forecourts where we meet, the doors through which we enter. More generally, the façade of a building, its vertical dimension, is the building we ‘read’; the horizontal dimension, the floor plan, is the building we ‘use’: the compositional spectacle in the upright dimension is essentially visual. It restricts the user to observation from a distance …. In the horizontal plane, the corresponding dynamics involves the user directly and is therefore largely social. The level plan is the arena of human action. (Arnheim, 1982: 213)

Figure 8.11, shows an example of centrality in an architectural façade: the canopy with the Madonna, in the centre of the façade of the Church of Santa Maria della Spina in Pisa.
Multifaceted objects add further dimensions to three-dimensional composition, and allow (at least in principle) front and back, and the left and right side (and, in the case of open structures, an interior centre and the exterior) to be used for the production of difference. Of the latter we have already seen an example in figure 8.6-the plaster heart in the centre of the rigid frame of Giacometti’s Hour of the Traces. But not all multifaceted objects use these dimensions. As in nature, where trees or mountains do not have a front or back (other than one which stems from our positions towards them, e.g. which side of the mountain we live on), objects can be the same, and have the same meaning, whichever side we view them from. Sculptures of the ‘Three Graces’, for instance, are usually composed in the round, offering essentially the same view from whichever side one approaches them. And the same pattern can be observed in some children’s toys, for instance a kind of ball with symmetrically distributed pictures illustrating nursery rhymes (although ‘the one that ends up on top determines which song is heard’). Or you find it in buildings, such as the Queen Victoria Building in Sydney, which, in its latest reincarnation as a shopping centre, has, on every side, a central entrance and shop windows; or perfectly round buildings such as the Temple of Vesta in Rome, or the Pantheon. In other cases, however, front and back especially differ in meaning. It is tempting here to take the human body as a metaphor, with the front as the public side, the side where we articulate how we want to be read, and the back as the non-social side, the private side which is not meant to be viewed and often has no meaning except perhaps for those with whom we are most intimate. This is why, paradoxically, it can also be the most revealing, as in the case of Epstein’s Angel (figure 8.7). This idea is worked out in more detail in van Leeuwen (2003).

In chapter 3 we argued that the structures of diagrams (the top-down path of the taxonomy, the left-right path of the flowchart, the digital network) are modelled on forms of social organization. Perhaps it can be argued along the same lines that composition, both two- and three-dimensional, is ultimately modelled either on the ‘non-social’ roundness of the natural forms such as trees and mountains, or on the polarized human body, with the head as the Ideal, the feet as the Real, the heart as the Centre, movement and action as the more interactive and dynamic horizontal dimension, and the front and the back as, respectively, the social and public and the non-social and private side. As Lakoff and Johnson said, Spatial orientations arise from the fact that we have bodies of the sort we have and that they function as they do in our physical environment. Orientational metaphors have a basis in our physical and cultural experience. Though the polar opposites, up-down, in-out, etc., are physical in nature, the orientational metaphors based on them can vary from culture to culture. (1980: 14, our italics)

THE MOVING IMAGE

The representational, interactive and compositional patterns we have discussed in this book also apply to the moving image, as shown by a number of our examples in this book. Yet the addition of movement does of course lead to differences, and it is these differences we will discuss in the final section of this chapter.

Starting with the narrative processes we discussed in chapter 2, here the principal difference is that the role of the vector is taken over by movement. Instead of, for instance, a vector formed by an outstretched arm, as in figure 2.15, the process will be realized by the action of raising the arm and pointing the hand. Usually these actions are figurative, recognizable as driving, walking, jumping, pointing and so on. But they may also be abstract – as, for instance, in Walt Disney’s animation film Fantasia (1941) – or in technical films, where arrows may be animated, unfolding in front of our eyes.

But there is a complication. In moving images the relation between Actors and Goals may be represented in a single shot, showing both Actor and Goal; or in two subsequent shots, the first showing the Actor, the second the Goal (or vice versa), as demonstrated in figure 8.12. In both cases we see a soldier (Actor) and civilians (Goal). But in the one case they are spatially connected, shown together in the same shot; in the other they are disconnected, shown in separate shots. As every film and television director knows, the two shots in such a disconnected syntagm (usually referred to as a pair of ‘reverse angle shots’) have to be ‘matched’ carefully, to restore the connection. They have to be taken from the same side of the imaginary line running between the participants and from approximately the same horizontal angle, to make it appear that the participants are facing each other and looking at each other. The tonality of the two shots has to match as well, for instance by ensuring that they are shot under the same lighting conditions, and through colour grading, removing any discrepancies between the colour rendition of the shots.

Should we see such a ‘disconnected’ narrative process as one unit of meaning or two? Is it the equivalent of a sentence like ‘The soldier shoots the villagers’, or of a formulation that expresses the soldier’s agency less directly – for instance, ‘The soldier fires. The villagers are shot’? But such an attempt at translating moving images into words cannot fully capture the difference. Filmic ‘disconnection’ has no parallel in language. It does have semiotic potential, however. It can, for instance, show people as ‘isolated’ individuals, even while they are interacting with others, and it can radically disconnect Actors from the Goals of their actions, and from the effect of their actions on these Goals, just as happens, for instance, in long-distance telephone calls or the firing of long-range missiles.

The disconnection between Actors and Goals is an aspect of ‘film language’ that only developed twenty-five years or so after the invention of the medium, and it has been the subject of much discussion in film theory, not least because it allows ‘faking’. No history of the medium omits the experiments of ‘Constructivist’ film-makers in the Soviet Union of the early 1920s. In one of these experiments, film maker Lev Kuleshov cut together shots of two actors meeting and greeting each other. Each actor was filmed separately, in a different location. They had therefore never actually played out the scene together. Once the two shots were spliced together, however, the two appeared to meet in one and the same place, an effect which l<uleshov called ‘creative geography’. In 1930s Tarzan films, encounters with wild animals were often faked in this way, by intercutting stock shots of wild animals with shots of actors acting out the appropriate reactions and actions in a studio set. The famous French film critic André Bazin (1967), on the other hand, favoured the ‘connected’ method. To see a real event happening in real time was for him the quintessential film experience and the quintessential power of the medium. He praised the seal hunt in the documentary Nanook of the North (1921), a scene taken as one long, unedited shot of a seal being harpooned through a hole in the ice.

In contrast to the still image, the moving image can realize events that have neither an Actor nor a Goal. Shots of shimmering light on softly rippling water create a sense of pure process, pure movement, in which it is hardly possible to disentangle process and participants, and in which participants, if they can be discerned at all, are ‘caught up’ in the process in a way that is neither ‘active’ nor ‘passive’. The still image equivalent of such a shot would be a kind of abstract pattern, lacking the dynamic sense of ‘action’ or ‘event’.

The choice between ‘connection’ and ‘disconnection’ also exists in the case of reactions. Films can show Reacters and Phenomena either in one and the same shot, or in two subsequent shots. This pattern, known as ‘the point-of-view shot’, ties together three shots, with the Reacter reappearing in the third shot, so that the Phenomenon is wedged in between two shots of the Reacter. Here too the shots have to carefully ‘matched’. If, for instance, the Reacter looks down, the Phenomenon has to be shot from above, and if the Reacter looks at a moving Phenomenon, the angle of his or her head and the direction of his or her gaze should have changed in the third shot, to match the distance travelled by the Phenomenon during the second shot. Disconnected reactions have a particularly strong ’subjective’, ‘first-person’ feel, as the viewer is looking at the Phenomenon ‘through the eyes of the Reacter’. A variant shows the Reacter and the Phenomenon in the same image, ‘over the shoulder’ of the Reacter, hence also from the Reacter’s point of view. But here we do not look at the Phenomenon ‘through the Reacter’s eyes’, and the effect is less emotionally involving, as we see the Reacter from behind and therefore do not see his or her reactions to the Phenomenon. It is the angle used in contemporary computer war games, to make players identify with the ’special-ops’ soldier characters they play (figure 8.13).

Finally, while still images have developed dialogue balloons to realize verbal processes, in moving images dialogue is not represented visually, through writing, but directly, through speech. The synchronization between the speech and the Speaker’s lip movements replaces the vector that connects Speaker and Speech. Without such synchronization, moving images cannot signify that the speech we hear is actually spoken by the Speaker we see. Once the link between the Speaker and the dialogue has been established, the dialogue may become ‘off screen’, continuing, for instance, while viewers watch the reaction of a listener.

Turning now to the interactive dimension, in chapter 4 we have already seen how camera positions can create symbolic relations between viewers and what is depicted in an image. Moving images are no different in this respect, with one proviso: in moving images the relationship becomes dynamic. It can change in front of our eyes. The camera can zoom in to a closer shot, or zoom out to a wider shot; it can crane up to a high angle or crane down to a low angle; and so on. And even when the camera is not moving, the participants themselves can move, walk away from or towards the camera, or walk up or down a flight of stairs with the camera tilting up or down to follow them, thus changing the angle from which viewers see the participants. In other words, the moving image can represent social relations as dynamic, flexible and changeable. Distance and angle can be dynamicized, and this in two ways: subject-initiated, with the represented participants initiating the change, or camera-initiated, with the image-maker initiating the change (the contrast of course also applies to synthetic images where a camera is not involved). In the first case, the visual text takes a ‘neutral’ stance, a stance of ‘recording’ what is taking place (even though the events may of course be staged). In the second case, the image-maker more overtly positions viewers towards what is being represented.

In most films distance and angle change constantly. In other words, what in the case of still images has never moved in the mainstream, cubism, the use of multiple perspectives, has become so commonplace in movies that it is now hardly noticed. The only difference is that films show the different perspectives one after the other, rather than at the same time. Figure 8.14 shows how such changes of distance and angle can be used to signify both the relations between ‘characters’, between the people we see on the screen, and the ongoing, constantly shifting relations between these characters and the viewers. It is the opening scene from Howard Hawks’ The Big Sleep (1947). ‘Private Eye’ Marlowe (Humphrey Bogart) has been called to the house of General Sternwood (Charles Waldron) to help him deal with a case of blackmail involving his youngest daughter, Carmen (Martha Vickers). As Marlowe waits in the hall to be shown in by the butler, Carmen provocatively confronts him. The interview with Sternwood then follows.

In news and current affairs television, distance and angle create a symbolic relation between the people on the screen and the viewer. Anchorpersons are shown frontally, from slightly below eye level, and in a wider shot than most other participants in the programme. This enhances their authority. They are literally ‘higher up’ than the viewers and shown from a respectful distance – initial shots may even show them from a very long distance, sitting behind large, gleaming desks at the far side of an empty, palatial hail.

The distinction between ‘offer’ and ‘demand’ (see chapter 4) also applies to moving images, and it too can be dynamicized: represented participants can turn towards the camera and look at the lens (and hence at the viewer), or can avert their gaze. But the camera cannot initiate this; it must be initiated by the participant, whether on their own initiative or as a result of following instructions from a director.

‘Offers’ are still the rule in naturalistic drama, in the theatre as much as in film and television. Bertolt Brecht famously sought to reintroduce the ‘demand’ stance in the theatre, especially by means of interpolated songs, and film-makers like Jean-Luc Godard have followed him in this. In these contexts ‘demands’ were thought to create an ‘alienation effect’, to break with conventions meant to naturalize the fictional world of stage and screen, and so to make the audiences more aware that they were watching a fiction and invite them to reflect on its content. In many other contexts – for example, television news the ‘demand’ is the accepted convention, although not everyone is given the right to address the viewer directly. Anchorpersons and on-camera reporters may look at the camera, but interviewees may not; in chat shows hosts may look at the camera, but guests may not, and so on. In other words, the ‘demand’ is a privilege which media professionals have reserved for themselves.

The concept of modality (see chapter 5) is also fully applicable to moving images, but a further facto movement, needs to be added to the list of means of expression that can cue modality. Like visual detail, background, depth, light and shade, colour, etc., movement can be represented with different degrees of realism or abstraction and hence play a role in modality judgements. Representations of walking, for instance, can range from simple animations in which stick figures raise and lower their legs without any articulation of the joints or any movement of the rest of the body, to highly detailed animations showing the rippling of every muscle involved.

Most films invite us to use the naturalistic criterion, although this is perhaps changing as the use of synthetic images and animation increases. In many animated cartoons, the background has higher (naturalistic) modality than the foreground, a reversal of what normally happens in ‘live action’ films. There is, of course, a technical reason for this. Backgrounds do not have to be animated and can therefore be painted in detail without breaking the budget. Again, in computer games different characters and actions may be animated more or less intricately. In a Delta Force game one of us played, the movement of enemies falling down as they were killed was decidedly unconvincing and unnatural. Again, there may be pragmatic reasons. Detailed animation costs time and money, and may slow down the action. But that does not negate the semiotic effect of reducing the naturalistic impact of killing.

Finally, the elements of composition discussed in chapter 6 (information value, salience and framing) apply to the composition of the shots in a film or television programme just as much as they apply to still images and other visual compositions, with, again, the proviso that the moving image can make composition dynamic. Something that starts out as Given can move into the New position in front of our eyes. Something that has low salience can become highly salient in the middle of a shot – for instance, by moving or being moved into the light, or by a change of focus of the camera. In chapter 6 we showed how, in figure 6.1, the left edge of the door of the shed frames the two characters in the shot, causing them to inhabit different spaces and so emphasizing the lack of communication between them. But in a moving image characters can move into each other’s space and undo the framing between them. And all of these ways of dynamicizing composition can be subject-initiated or camera-initiated.

This brief discussion does not exhaust the ‘language of film and television’. It has concentrated on the spatial patterns of individual shots and on two specific time-ordered patterns, the ‘reverse angle’ and ‘point of view’. But film is also, and perhaps above all, a temporal mode, structured by intricate semantic and rhythmic patterns of editing (see van Leeuwen, 2004, for a social semiotic approach), and it is also characteristically multimodal, involving not just the visual, but also speech, sound and music. These aspects of the medium fall outside the scope of this book. But we do hope we have shown that the ideas presented in this book can usefully be applied to the spatial aspects of moving images or, more precisely, since movement is a temporal phenomenon, to an area where the spatial and the temporal interact and overlap.

March 22, 2010

Chapter 7

Filed under: Uncategorized — Natural Light @ 5:39 am

7) Materiality and meaning

MATERIAL PRODUCTION AS A SEMIOTIC RESOURCE

The semiotic resources we have discussed in this book abstract away from the materiality of the signifier. They can be applied, we have claimed (and tried to demonstrate through our examples) to the production and understanding of visuals which, materially, are quite different from each other: photographs, movies, websites, drawings, paintings, and so on. One of the major features – explicitly and implicitly – of the development of our ideas since we wrote the first edition of this book has been to pay more attention to the semiotic role of the material production of the sign. In music, the performance of a composition contributes a great deal to its meaning, and in many cases it is difficult, if not impossible, to separate composition and performance. In visual communication, similarly, the material production of a design is not just the execution of something already complete, but a vital part of meaning-making. Here we will focus on that aspect of semiosis in some more detail by looking at the materials used in what, in the first edition of this book, we called ‘inscription’, and have since come to call ‘production’ (Kress and van Leeuwen, 2001).

When, some time in 1988, we first presented our initial ideas on the visual to the Sydney Semiotics Salon, one of our friends said, ‘But what about brushstrokes? How can you describe brushstrokes as semiotic units?’ Our fumbled response was to say, ‘You have to start somewhere. We’ll get to brushstrokes later, when we are further on with our work.’ But the question stayed with us, and regularly kept coming up in our discussions. It was a question that responded to our view – then an odd one – that in a message all aspects matter and mean, and at the same time showed a profound scepticism about that assumption. However, it is and remains an important question. Nearly twenty years later, our answer would in principle be the same, though maybe a little less fumbled and somewhat more thought through. In our ‘grammar’ of visual design, we wanted to move away from a totalizing view of semiotic resources, a view in which semiotic resources are homogeneous systems in which there may be differences in the ’size’ of units, but in which all the units are of the same kind, all ‘belong’ to ‘the same system’, so that all texts are, in the end, built up from a single kind of ‘minimal unit’, be it the brushstroke, the ‘iconic figure’ (Eco, 1976a), the ‘coloureme’ (Saint-Martin, 1987), or the phoneme and morpheme – in their respective ‘tactic’ arrangements. By contrast, we wanted to maintain that a given form of semiosis for instance, ‘painting’ – involves a range of signifying resources. Some of these are like the signifying systems we have discussed in this book, resources which can be used, not just in painting, but also in photography, or in drawing, to mention just some examples. Any given type of production medium can, at least in principle, realize most of the choices from the ideational, interpersonal and textual networks we have presented in this book, though there are, in practice, historically and culturally specific restrictions on the combination of choices from these signifying systems; for example, restrictions on what can be painted and how. But other semiotic resources are more specifically tied to specific forms of material production and can be realized, for instance, only in the medium of paint or only in the medium of the photograph.

In the realm of art this is a relatively uncontentious point of view. Materiality matters: oil- and water-based paints offer different affordances, and hence different potentials for making meaning. The manner of production also matters, as we discussed with the examples of Robert Ryman in chapter 5. In the realm of linguistics it has been less obvious. If we ask the seemingly simple question ‘What is a text?’ or ‘Is a written text the same object or a different one when it is written with a pencil or with pen and ink or is wordprocessed?’, the answer of most linguists would be, ‘No question. It is the same text.’ The material, graphic expression of the text would not be seen as a relevant issue. If we asked a non-linguist the same question, the answer might be different – the teacher who responds negatively to an essay presented on scrappy bits of paper, badly handwritten (perhaps badly spelled), but responds favourably to a ‘well-presented’, typed version of the same text, uses a quite different criterion. So does the marketing executive when presenting a proposal to a client. Their notion of what a text is differs from that of the linguist. Like us, they would see ‘presentation’ as a significant part of the making of the text, increasingly often equal to, or even more important than, other aspects. For them, as for the painter or the viewer of a painting, the medium of inscription changes the text.

It is our impression that this aspect of text is rapidly gaining in importance, perhaps aided by new technologies of writing. The boundaries between the criteria prevailing in ‘art’ and those prevailing in everyday writing are no longer as sharply drawn as they once were. We do not want to engage in an argument with linguistics here, and the linguistic theory from which we draw much inspiration is in any case semiotically oriented. But we do want to say that the linguistic notion of text is an artefact of linguistic theory; as, indeed, is our notion of text – whether written and linguistic or painted and visual, or both. The question about the significance of brushstrokes comes, we think, out of a view in which everything representational is seen as belonging to the same unified, homogeneous representational system (language, or painting). The boundaries around what is ‘in’ and what is ‘out’ used to be strictly patrolled: in the linguistic training of one of the authors, phonetics was not part of linguistics, and everyone knew what was extra-, para- or simply non-linguistic. The material aspects of handwriting and typography were not even touched upon.

In our approach the material expression of signs, and therefore of the text, is always significant; it is what constitutes ’signifier material’ at one level, and it is therefore a crucial semiotic feature. So is the process of sign- (and therefore text-) production. Texts are material objects which result from a variety of representational and production practices that make use of a variety of signifier resources organized as signifying systems (we have called these ‘modes’), and a variety of ‘media’, of ’signifier materials’ – the surfaces of production (paper, rock, plastic, textile, wood, etc.), the substances of production (ink, gold, paint, light, etc.) and the tools of production (chisel, pen, brush, pencils, stylus, etc.)

Every culture has systems of meanings coded in these materials and means of production. Here, as in all areas of semiosis, signs in their materiality are fully motivated, though as always the motivations are those of a particular culture in a particular period, and those of the maker of the sign; they are not global, nor are they a-historical. Precious metals are precious because of their scarcity, and perhaps because of their malleability. But scarcity is not a globally uniform characteristic, and the preciousness of one metal need not be equally marked in another culture. It was one of the particular calamities of the cultures of Central and South America that they had attached different semiotic values to the material signifier goldfrom those of the invading Spaniards.

We regard material production as particularly significant because often it is in its processes that unsemioticized materiality is drawn into semiosis. At times production is therefore somewhat less subject to the various forms of semiotic policing than are other regions of the semiotic landscape, and thus leaves more room for individual possibilities of expression than those regions which have better-known cultural histories, are more foregrounded and have better-understood conventions. To explore material production is therefore also to explore the boundaries between the semiotic and the non-semiotic, and between individual expression and social semiosis.

PRODUCTION SYSTEMS AND TECHNOLOGY

Like all cultural technologies, forms of production are entirely related to the overall state of a society’s technologies. Indeed, dependence on technology may be one of the strongest features of graphically realized semiotics; it distinguishes them from semiotic modes in which signs are articulated by the body without any technological aids (as, for instance, in speech, singing, ‘non-verbal communication’, dance). Modes like music straddle the two categories; yet the boundaries between them are in any case always fuzzy: one can draw or write with one’s finger in sand, using only the body and a natural surface. But generally the surfaces, substances and tools of the visual semiotic are made available by technologies, as much in the case of pencil and paper as in the case of the modern word processor. Technology enters fundamentally into the semiotic process: through the kinds of means which it facilitates or favours, and through the differential access to the means of production and reception which it provides.

We distinguish three major classes of production technologies: (1) production in the narrower sense – that is, technologies of the hand, technologies in which representations are, in all their aspects, articulated by the human hand, aided by hand-held tools such as chisels, brushes, pencils, etc.; (2) recording technologies – that is, technologies of the eye (and ear), technologies which allow more or less automated analogical representation of what they represent, for instance, audiotape, photography and film; and (3) synthesizing technologies which allow the production of digitally synthesized representations. While remaining tied to the eye (and ear), these reintroduce the human hand via a technological ‘interface’, at present still in the shape of a tool (keyboard, mouse), though in future perhaps increasingly through direct articulation by the body (e.g. through issuing spoken commands to the computer, or through other gestures).

The boundaries between these categories are not clear-cut; and are always subject to further transformative semiotic work. A photograph can be hand-coloured once it has been printed, for instance, or digitally altered, and many artists experiment with precisely these mixed production systems. It should also be noted that the possibility of ‘mechanical reproduction’, to use Benjamin’s term, is not uniquely tied to any of the three categories. Printing can be done from a hand-carved master, photographically, or with a modern laser printer. But we think the categories are useful, particularly as they can be tied to major periods in the history of production and to the epistemologies that went with them.

While production technologies – technologies of the hand – have continued to play a role, the development of recording technologies has dominated the visual semiotic from the moment Renaissance artists began to use the camera obscura as an aid in painting, and particularly during the last two centuries or so, when a variety of recording technologies were developed, beginning with photography. They, in their turn, are now beginning to be superseded by synthesizing technologies. Quite different ontological orientations go with these different technologies. Walter Benjamin (1973) commented on the transition between manual production and recording, stressing reproduceability rather than the modes of representation themselves, and linking it to the dissolution of traditional forms of social organization in ‘mass society’ and to the disappearance of the ‘aura’ of the work of art.

Today the transition from recording to synthesizing technologies is the more pressing issue. The ‘crisis of representation’ which has characterized theoretical debate over the last two decades or so may be an indication of this. ‘Recording’ leads, we believe, to ontologies of referentiality, a view of representation being founded on direct, referential relations between the representations and the world. In an earlier publication we developed this idea in more detail (van Leeuwen and Kress, 1992). Synthesizing technologies undermine or even abolish such notions of referentiality, whereas as recently as in the 1970s, ‘Electronic News Gathering’ was ubiquitous enough to have developed an acronym, ‘ENG’, a deceptively naive metaphor reminiscent of other unproblematic gatherings – wild mushrooms, apples, the children. That metaphor is now entirely untenable; not only because news never was simply ‘out there’ to be gathered, but even more so because the technology now exists literally to produce it – a development anticipated by the critical media theory of the 1970s. ‘Reference’ has given way to ’signification’, the production, out of existing semiotic resources, of new semiotic means, new signs, new texts, new images, new visions, new worlds. This does not mean that representation has ceased. Rather, the formerly naturalized relation, the identity of representation and reference, has broken down, irreparably for the time being. A new relation is becoming established instead, between representation and signification. If present social and technological developments continue, this relation will, in its turn, first become naturalized and dominant, and then come into crisis. In the year that we revised this book for its second edition, the ‘production’ of photographs of abuses of prisoners by British troops in Iraq was one (notorious) case in point. (Leaving the crucial matter of veracity aside, it is interesting to note that the much earlier ‘production’ of the same ‘news event’ by means of writing produced no outcry of any kind.)

These technological and semiotic developments perhaps help us understand contemporary theoretical developments. With ‘recording’ goes an older semiotics of ‘representation’ and of naturalistic modality, which itself gave rise to particular ontologies of truth, of fact (hence the interminable debates of prior decades around ‘bias’). To deconstruct representation as recording, representations (‘texts’) themselves had to be ‘deconstructed’, which was done by emphasizing ‘production’ and therefore displaying the ‘constructedness’ of representations, in texts, images, etc. This can be seen as the period of ‘critique’ with the adjective ‘critical’ used as a descriptor of many practices. Inherent in this there was already a theorizing of one new stage of semiotic practice, namely synthesis, through new practices of construction and production – making new representations out of (constructed) representations – in the visual arts maybe some of the practices of ‘Brit Art’ or of the American artist Jeff Koons, and in music the currently ubiquitous practices of scratching, mixing.

From such a history and perspective, which both heralds and legitimizes the present stage of synthesis, we could project a further development, which will be to deconstruct current practices of production by showing that ‘underneath’ production there is an already-existing, already-produced ‘programme’, a system which defines the limits of production. This system, of course, is still a system of representation, a representation of the social/cultural system. We could also point to the present tendencies in semiotic theories (in cultural studies, in education, in literacy theory) to collapse reading into writing, or vice versa. The dissolution of that distinction (‘reading is writing’) was initiated in theory in the 1960s by Roland Barthes, though it is now enacted in electronic technologies which combine the acts of reading and (re)writing someone else’s text, whether in changes to an attachment to the email or, differently, in playing computer games. In other words, when the analogically based mode of ‘recording’ was dominant, the tendency of critical theory was to deconstruct representation-as-reference, and to emphasize the ‘constructedness’ of the sign, or the text.

As the synthetically based mode of production is becoming the dominant technology, critical theory will have to turn to deconstructing representation-as-programme, representation-as-design; that is, deconstructing the combinatorial possibilities and laying bare their cultural/social sources. It is for this reason that we concentrate on representational resources in this book, rather than (only) on texts. However, given the deconstruction of formerly stable frames – whether semiotic or social, cultural and economic – for the time being there exists newly the need for conceiving of social semiotic practice in terms of rhetoric and ‘design’, where the term ‘rhetoric’ focuses on the social relations which obtain in the process of communication, and the term ‘design’ focuses on the arrangement of the available semiotic resources in the making of the representation as a message.

Before we leave this subject, we should note two other aspects of the relation between production and technology. Our classification of production media was based on the way representations are produced, whether by hand, by more or less automated recording or by electronic synthesis. But production media also favour modes of reception, and here the surface plays a particularly important role. Some surfaces (walls, cinema screens) favour public reception, for instance, and others (pages, and paper generally, the computer screen) favour individual reception. Also – and more difficult to describe – there is the effect of the physicality, the tangibility of the surface, the difference between the forms carved in the hard rock and the fleeting flickers of light on the glass screen (we return to this in the next section). What matters is the site as much as the kind of surface on which the text is received. Now, unlike in previous periods, the surface of reception is no longer necessarily at all the same as the one on which the text was/is produced. Transcodings/transpositions of a wide variety may take place. An image may be produced in one medium – as a painting, say – and be received in a different medium – as a photograph, for instance. Or it may be produced in a recording medium, as a photograph, for instance, and received in a synthesis medium, retrieved from the image bank of a computer.

Finally, technology has also developed different distribution media, and it is here that the issue of (mass) reproduceability belongs, together with that of communication at (long) distance. The latter, although of crucial social importance, bears less directly on our subject. Whether images are distributed via electrical wires, optical fibres or the airwaves is irrelevant, semiotically, at the level of representation – though not at the level of dissemination. The fact that the internet is crammed full with images is in large part a matter of available technology; and it has profound semiotic consequences. At another level what matters most is the production medium in which images are produced, and the distribution medium in which they are received, if the latter is different from the former, be it because of transcoding (e.g. the photographic reproduction of paintings) or because of recoding at the other end of the telecommunication channel. Or, to be more precise, we would say that the mode of transmission is relevant only in relation to the potentials which it offers for reception as (re-) production.

BRUSHSTROKES

If one looks at Rothko’s Seagram Murals from a distance of 4-5m, the boundaries between the large blocks of colour seem sharp and clear-cut. The closer one moves to the paintings, however, the more uncertain and fuzzy the boundaries become, the more they overlap and run into each other. Yet a postcard of one of the paintings, taken from much more than 5m distance, shows nothing of this. An aspect of meaning is lost, because of the distance from which the photograph was taken, and because of the transcoding, from painting to photograph, from one production medium into another. This brings us back to the starting point of this chapter, to the argument about brushstrokes and about the status of ‘text’.

Painting allows the viewer a choice between different ways of relating to the text, even though this choice may be restricted in practice, as when a line on the floor in front of a painting prohibits the gallery visitor from coming too close to the painting. I may wish to view the painting as ‘a representation’, concentrating on what the painting ‘is about’, or view it in terms of its various techniques (‘the effective use of colour’), or effects (‘depression’). In each case Twill stand at the requisite distance. I may wish to engage with its materiality and with the way in which the hand of the artist ‘inscribed’ the canvas – the matter of the application of the paint; the brushstrokes – and in that case I would need to move very close up. Photography allows me choice of distance of viewing, but without any of these effects in meaning. This ‘recording’ medium is representational and can only be representational. It abstracts away from the imprint of the hand that made it, even when it reproduces art.

Certain discourses about art put much emphasis on material production, particularly in relation to a handful of great painters: the fine brushstrokes of Rembrandt, the violent brushstrokes of Van Gogh, the treatment of volume by the Pointillists. This encourages a focus on the ‘graphology’ of the painting as a symptom, a trace of the individual temperament of the artist. That is, it takes a semiotic approach to the matter of brushstrokes. Most of the descriptions of paintings in art galleries and in catalogues, however, focus on representation rather than on inscription. The preference of art historians for black-andwhite reproductions or even for drawings of paintings also points to an overriding concern with representation.

The question of transcoding is closely related to this. What does it signify when I purchase a print of Monet’s Poppiesto hang on my living-room wall? That I want to show my appreciation and admiration of Monet? That I have been to a gallery where the painting was exhibited? That I like the theme of the painting? That I am familiar with the intellectual histories of which Impressionism is a part? The print will allow me to signify all of these, but twill not allow me to signify my interest in the material production of the work, simply because it does not enable me to focus on that; it does not even make it available as a question.

To take another example, in an original painting by Mondrian the lines are, in close-up, not straight, but overpainted and the colour of the various rectangles is modulated rather than plain and flat. Postcards and other reproductions of the same painting make the lines appear straight, remove the overpainting and present flat, unmodulated colour. It is these reproductions which produce the Mondrian of innumerable high-school art lessons, and so reinforce and reproduce a particular (incorrect) version of Mondrian and a particular (ideological) version of abstract painting. It is through these ideologized readings that such paintings have their effect on, and in, other practices.

As a final example, consider Kandinsky’s Cossacks (plate 5). Representation is strongly reduced in this painting: one can ’see’ Cossacks on horseback and drawn and flashing sabres, but figuration is not foregrounded; or, we might say that what is represented ideationally is ‘violent action’. Colour differentiation, on the other hand, is amplified. And colour is used, not in a representational/referential function (shades of white to represent the ‘real’ shades of the white of uniforms, for instance), but to allow the substance of the production medium and the traces of the act of production themselves to signify. Colour is, in this painting, the key semiotic resource: jagged blocks of white, flashes of red, curves of yellow and blue. Action, energy, movement and violence are represented through the way the production medium has been handled. And this takes us back to the issue of individuality with which we ended the first section of this chapter, and perhaps also to the distinction between ‘art’ and ‘design’. The handling of paint is close to the individuality of handwriting, and it was precisely this mark of individuality which came to be lauded as the distinguishing characteristic of art and as its mark of difference from ‘recording’. On the other hand, this celebration of individuality was also somewhat of a last- ditch stand for art, in its losing battle with photography, and the very same art which came to stand for individual expression had its principal social effects in the form of photographic reproductions – that is, without the marks of individuality – while the originals became the priceless relics of a past ethos. In most other domains such marks of individuality became proscribed. Handwriting, for instance, has now become unacceptable in all but the most private forms of writing, despite the increased emphasis on ‘presentation’ which we noted earlier: this new valuation of ‘presentation’ is in no way a return to the kind of individual expressiveness that hovers on the border between the individual and the social, the ostensibly unsemioticized and the semioticized; it is thoroughly semiotic and social.

Figure 7.1

The individuality of the brushstroke not only became a symbol of individual expression, of ‘the essence of things seen through an individual temperament’, but it also came to be drawn into the domain of the semiotic, the domain of culture. It ‘made school’. It was reproduced, faked, developed, imitated, and so entered into the world of semiosis as a transformative element, in a process which then transformed the brushstroke itself, as witnessed by Roy Lichtenstein’s pop art parody (figure 7.1). Thus the brushstroke becomes a paradigm case of how inscription (the ‘how’ of painting) is allowed to play a key role in some domains – for instance, the domain of the great Modernist art of the immediate past – but required to play a humbler role in others, where representation (the ‘what’ of painting) dominates, and where product matters more than process and practice. The division exists within painting also. A painter such as Robert Ryman, whose work we discussed in chapter 5, focuses on the ‘how’, on the inscriptional practices of painting, whereas a painter such as Gainsborough focused on the ‘what’, on the analysis of social arrangements, the recording of social states of affairs; in short, on the referential. (Though maybe the better way of describing it would be to say that for Ryman the ‘how’ has become the ‘what’. In that view the question that painters ask is, ‘What is painting for?’ and that question has different answers in different periods and in different cultures.)

Such shifts of emphasis are themselves ’signs’. Interest in the materiality of representation and representational practices reflects wider social and cultural concerns with questions of substance and materiality in a world in which the concrete becomes abstract, the material immaterial, the substantial insubstantial and reality ‘virtual’. Today we have, side by side, a hankering for the individual, the subjective, the affective, the non-semiotic and non-social ‘punctum’ of the photograph or the ‘grain’ of the voice (Barthes, 1977, 1984), and at the same time (and in large part as a result) the increasing semioticization of all these phenomena and more. As we have noted already, it is as representations rather than as material productions that modern art has informed and shaped practices in other domains, that paintings such as those of Mondrian came to be transformed into blueprints for designed objects, buildings, cities, and that paintings such as those of Kandinsky had their effect on the layout of European newspapers such as the Bildzeitung and The Sun, which are further translations/reductions of other translations/reductions, but no less potent for that.

Any systematizing semiotics of materiality (of the brushstroke, or of handwriting) would follow the trend of semioticizing the not yet semioticized. Reluctance to do so would follow the opposing trend set by Barthes to ‘protect’ the non-semiotic, to protect the ‘unspoilt nature’ after which the tourist hankers. In this chapter, we (like him) are also at least a little reluctant to follow the path of relentlessly making everything semiotically accounted. In reality, we think that the choice is not one between turning the unsemiotic into the semiotic – if I as an interpreter take meaning from some phenomenon, then it is semiotic by virtue of that action. The choice rather is one between assisting in laying bare, in making overtly visible meanings which are as yet not visible and made systematic; and being clear about the consequences of that process. Nevertheless, as material production is semioticized, it becomes more important to be able to talk about it. Simply asserting the value of the non-semiotic at the very moment when everywhere around us it is being semioticized, marginalized or repressed seems at least equally problematic.

THE MEANING OF MATERIALITY

From the 1920s onwards, there has been a ‘functionalist’ current in Modernism, a trend to ‘let materials speak for themselves’, which is only now beginning to change. This had various roots, but it culminated in now happily clichéd styles of ‘plainness’: whether of steel or timber furniture, or of ‘brutalist’ architecture, with its love of unadorned concrete. Behind this trend were notions of ‘authenticity’, themselves explicit or implicit critiques of the ‘distortions’ of representation, the ‘falsities’ introduced by ‘decorative’ art and its ideologies. This trend had its origins in art, where it could even become the subject matter of artworks, as in Ryman’s paintings. In some periods or genres of art, artists have no choice of materials: all paintings are painted on cave walls with ochres, or on canvas with oil paints, all photographs are printed on paper. In other periods or genres, the material becomes a fully exploitable and exploited resource. Modern sculpture is perhaps the best example. The smoothly turned wood of Brancusi’s Head(1919-23) or the veined redness of the alabaster of Epstein’s Jacob and the Angel (1940) (see figure 8.1) become part of the meaning.

In some cases it is precisely the opposition between the materiality of the material and the mimetic quality of the product which becomes the issue. In Rodin’s The Kiss (1880), the figures are perfectly worked when seen from the front. The material resistances of marble have been entirely and successfully overcome. The material has become ‘invisible’, just as the materiality of the canvas is invisible in most paintings. If we change our viewing position by walking around the sculpture, however, we are permitted to see that this first impression is ‘produced’, and therefore ideological. The contrast poses the question of the sculptor’s work, his semiotic action. It forces us to reflect on the borderlines between the seemingly unsemioticized materiality of the representation and the semioticized, fully cultural work of the sculptor, and on the dialectic between the expression of individuality and the social semiotic framework in which it takes place. The two are connected: the ‘handwriting’ effect, so clearly visible in the less ‘polished’ parts of Rodin’s sculpture, becomes increasingly possible as we move from the highly policed, the highly conventionalized, the fully semiotic, towards the less semioticized, therefore less policed and less conventionalized. The uncertain lines of a Mondrian painting signal individuality, affect and art as clearly as the certain lines of a blueprint signal conventionality, reference and design. A painting by Ben Nicholson (1945) makes the same point in a different way. It consists, quite simply, of two circles: a perfect, compass-aided circle on the left, as the Given; and a hand-drawn circle on the right, as the New. Drawing by hand, with all its subtle marks of individuality, may once have been unproblematic, as there was no other way of drawing, but has now become problematic, an issue for concern.

Architects who develop blueprints for buildings also work with ‘unsemioticized’ materials. Their intentions are usually less semiotic or, to put it another way, it is harder for them to put the semiotic in the foreground. Other considerations may weigh more heavily: functional considerations (the fact that they are designing an office building, for instance), financial considerations, the wishes of a client. Art, on the other hand, has, since the 1960s, attempted critiques of mass society and in particular of mass production methods, which led it towards the very practices it sought to critique. When the American artist Jeff IKoons commissions sculptures from factories, from artisans or from other artists, and then signs them with his name, he works with attenuated, abstract ’surfaces’: the whole global domain of cultural production becomes the material of the work. Choice serves as the production tool, and industrialized cultural production as the material surface. Our focus on physical materiality should therefore be taken as a metaphor. In the highly semioticized world we live in, Jeff Koons’ work may be both more relevant and more usual than that of Giacometti or Moore, of Rothko or Ryman. The notion of artists producing objects ‘with their own hands’ dates back to the pre-industrial period, and had already come into crisis in the period of industrialization. It may be that in the post-industrial world it has lost much of its relevance.

The relative freedom of the artist, perhaps greater in the visual than in the verbal, remains, for the time being, in force, albeit on the margins. It lies in the possibility of foregrounding – either of the materiality of the means of production or of the object produced, or of the semioticization of this material in (referential or significatory) acts of representation. Wallpaper draws attention away from the materiality of a wall. A room without a hint of decoration on concrete or pine walls draws attention away from the facts of the wall itself and on to the materiality of the material.

What, then, is the meaning of material? Our assumption remains that signs are motivated. It is no accident that the statues erected to commemorate heroic figures are made of durable materials, or that tombstones are still carved: the durability of the materials makes them usable signifiers for the meanings of permanent feelings we intend to produce. Nor is it an accident that certain flowers or stones become signifiers for love: their rarity may make them precious, or else their colour, shape or perfume may make them suitable signifiers. Bone-china teacups do not produce the same meanings as tin mugs. Australian money is printed on plastic, a shock still to the returning expatriate, as if it is too boldly a signifier of what should not be signified.

To summarize, material production comprises the interrelated semiotic resources of surface, substance and tools of production. Each has its own semiotic effects, and in their interaction they produce complex effects of meaning. Production exists on many planes; that is, there are serial relations between surfaces. As with Barthes’ notion of the sign, signs at one level become available as signifiers at a higher level. And there are serial relations of translation between production media also, as in the case of the relation between paintings and their photographic reproductions.

COLOUR AS A SEMIOTIC MODE

Our focus so far has been on materiality of the sign – on surfaces, substances and tools. We now want to turn to the question of materiality as a means for representation more centrally. Of course we have touched on this many times, indirectly or more directly, but our question now is, ‘How does materiality actually enter into and shape the resources for representation, the modes?’ We referred to this when we pointed out that our ‘grammar’ of the visual could not be, simply, a transposition of the terms of a grammar of the linguistic mode, because it had to pay due attention to the material differences of the resource for representation – not the material of sound as in speech, but the material of graphic stuff as in images, not the ordering logic of time as in speech, but that of the space of surfaces of images.

One theoretical issue to be ‘got out of the way’, so to speak, is that of ‘abstraction’. If transpositions of linguistic terminology to other modes have been quite commonplace in the past, it has been because ‘grammars’ of language paid attention to an abstract entity, ‘language’, and did not see it or its elements in terms of their materiality. That is one reason why phonetics (as a discipline) had been excluded from linguistics (as a discipline). It had been too much concerned with the physicality of its domain. It is also one reason why for much of the twentieth century much of mainstream linguistics was concerned with ‘language’ as such, as an abstraction, rather than with the distinctiveness of the grammatical organization of speech and writing. At that level of abstraction it seemed possible to move from one medium to another: materiality was not seen, and hence did not figure and did not need to be accounted for.

In our approach the twin factors of materiality and of culture (always set in the social organizations in which cultures exist) are the means to an explanation of the resources for representation. Materiality enters into semiosis through tangible physical facts: speech happens as sound, and sound happens in temporal sequence. However, what culture does with these ‘facts of nature’ is then another matter. In any temporal sequence, something is first and therefore something else is second and something else last; cultures cannot get around that fact. But what meanings may be attached to that ordering is quite another matter, a matter for makers of signs in their cultures. ‘Being first’ may have any number of meanings: ‘that which is most important to me the speaker’, or ‘that which is my starting point, from where I can proceed’, or ‘the entity which is responsible for the action which is represented’, and so on. Using ’speech sounds’ means using the possibilities of changes in pressure in the air, as well as the potential of changes in the frequency of vibration of the vocal cords and changes in the volume and frequency of oscillation of a column of air in the ‘vocal tract’ to fashion a semiotic resource – speech sounds and intonation. Again, the same considerations apply: some cultures use the potentials of pitch for syntactic/semantic means, to make questions; others use it to produce differences in lexis – in the so-called tone languages, such as Cantonese, Mandarin, Igbo, etc. difference in pitch produces difference in lexis.

What material a culture chooses to fashion into a resource for making representations, into a mode, is a matter of the contingencies of that culture and of its history; though it is also a matter – a fundamental point neglected in traditional linguistics – of the bodilyness of humans as makers and as receivers/remakers of signs. A mode is a means for making representations, through elements (sounds, syllables, morphemes, words, clauses) and the possibilities of their arrangement as texts/messages. Colour is such a material, and here we will explore the question of colour as mode in social semiotic terms, letting colour stand in for all other instances of the relations of material, culture and mode as semiotic resource.

We begin with a bit of history, to demonstrate the point we made above. In the Middle Ages, pigments had value in themselves. Ultramarine, as the name indicates, had to be imported from across the sea and was expensive, not only for this reason, but also because it was made from lapis lazuli. Hence it was used for high-value motifs, such as the mantle of the Virgin Mary. Such pigments were not mixed, but used in unmixed form, or at most only mixed with white. The material identities of specific pigments had to remain visible, and it was these material qualities which motivated their use, their meaning. As a result ‘colour’ was a collection of distinct material substances, rather than a ’system’ – ‘lexis’ rather than ‘grammar’.

Around 1600, in Dutch painting, the technology changed. New techniques allowed each particle to be coated in a film of oil, which insulated it against chemical reaction with other pigments and made more extensive mixing possible. As a result the status and price of specific paints went down and colour became to some extent disengaged from its materiality. Colour was no longer used and thought of as a collection, an extensive catalogue of distinctly different individual pigments, each with their own affordances, and hence semiotic potentialities, but as a system with five elementary colours (black, white, yellow, red and blue) from which all other colours could be mixed. But this system was not a semiotic system. It was a (practical) physics of colour, just as phonetics is a physics of speech. Nevertheless it involved considerable abstraction, and this made it possible to apply the system to different media, just as the system of language is applied to the medium of speech as well as to the medium of writing. Newton drew a comparison between the elements (tones) and rules of combination (harmony) of music and the elements (colours) and rules of combination (‘colour harmony’) of colour. Colours would be ‘consonant’ or ‘dissonant’ on the basis of the same intervals between seven ordered ‘elements’ (green, blue, indigo, violet, red, orange and yellow) as music (the seven different tones in the octave). In his time it inspired Castel to built an ‘ocular harpsichord’, a type of experiment which has cropped up again and again since – for instance, in the interest in synaesthesia and audition colorée of early twentieth-century psychologists and, more recently, in the work of abstract film and computer animation artists. The point is, ‘colour’ was no longer a collection of material substances, pigments; it became ‘colour’. As Kandinsky would later say, ‘Colour is only loosely attached to objects …. It has a grammar of its own, akin to the grammar of music.’

What, then, of meaning? Of course, colour has always been used as a semiotic resource. In the Middle Ages there were many theoretical and practical debates about colour symbolism. Should monks wear black (penitence, humility) or white (glory, joy)? But there was no unified system. Green could mean ‘justice’ as well as ‘hope’, red ‘charity’ as well as ‘life’ and so on. Learned tomes such as F.P. Morato’s On the Meaning of Colours (1528) argued with and against each other about the symbolic meanings of colour, and in their arguments the meanings of colour were always motivated. Green could be the colour of unity, for instance, because it was used as a background in representations of the Trinity. Red could mean ‘life’ because it is the colour of blood. Some modern artists have tried to revive this kind of symbolism. For Malevich black denoted a worldly view of economy; red, the revolution; and white, action. With these elements, he thought, more complex ideas could be constructed. But as in the Middle Ages, contemporary ‘colour codes’ have limited domains of application, and specific colours can have very different meanings in different contexts. The work of Malevich, Mondrian, Kandinsky and others was an attempt to explore the possibility of a broader, more widely applicable ‘language of colour’. But they did not manage to bring such a language into being. As Gage has said, their experiments ‘offered the prospect of universality, [but became] thoroughly hermetic’ (1999: 248).

So colour has, on the one hand, developed into a ‘mode’, a systematically organized resource. But on the other hand, this system is a physical, rather than a semiotic system, a kind of ‘phonetics’, although the basic elements of the system, the ‘primary’ and ’secondary’ colours, have played a key role in visual semiotic practices and in accounts of the meaning of colour. Semiotically, a single ’system’ has not developed. ‘What people do’ with colour varies enormously, and social groups which share common purposes around uses of colour are often relatively small and specialized – they do not constitute a large group, as is the case with speech, or with the systems of visual communication we have discussed in this book. But if we stay with the notion that ‘what people do’ shapes the tools, and bear in mind that very different things are done by different groups, we might be able to make some sense of how colour becomes a usable resource for making meaning. If we relate the meanings of colour both to their materiality andto what people do with that, we might be able to ask the crucial questions: Is colour a mode of representation in its own right? Does it offer the full affordances of mode?

So the task is to discover the regularities of the resource of colour as they exist for specific groups; to understand them well enough to be able to describe what the principles for the use of the resource in signs are (that is, to understand how a specific group’s interest in colour shapes their signs of colour). From that we might begin to understand general principles of the semiosis of colour and of semiosis generally, and these in turn might provide a principled understanding of all uses of colour in all sociocultural domains.

THE COMMUNICATIVE FUNCTIONS OF COLOUR

In Halliday’s metafunctional semiotic theory, a communicational system simultaneously fulfils three functions: the ideational function, the function of constructing representations of the world; the interpersonal function, the function of enacting (or helping to enact) interactions characterized by specific social purposes and specific social relations; and the textual function, the function of marshalling communicative acts into larger wholes, into the communicative events or texts that realize specific social practices. We can ask the questions that we have asked of images generally of colour specifically. Can it create specific relations between ‘participants’; that is, between represented people, places, things and ideas? Can it represent social relations and help enact social interaction? And can it realize textual meanings – for instance, in a system of reference or in creating cohesion in an text?

In the preceding chapters we have, we hope reasonably plausibly, applied this model to a number of resources of visual communication (composition, gaze, angle and size of frame, and so on), thereby (re)constituting these resources as part of the ‘grammatical system’ of images, in Halliday’s terms. We did not, however, in the first version of this book, deal with colour in this way, because we found it difficult to assign colour plausibly to just one – and only one – of Halliday’s three metafunctions. It is true that there is a dominant discourse of
colour in which colour is primarily related to affect, and Halliday and others (e.g. Poynton, 1985; Martin, 1992) see affect as an aspect of the interpersonal metafunction. But the communicative function of colour is not restricted to affect alone. We think that colour is used metafunctionally, and that it is therefore a mode in its own right.

Starting with the ideational function, colour clearly can be used to denote people, places and things as well as classes of people, places and things, and more general ideas. The colours of flags, for instance, denote states, and corporations increasingly use specific colours or colour schemes to denote their unique identities. Car manufacturers, for instance, ensure that the dark blue of a BMW is quite distinct from the dark blue of a VW or a Ford, and they legally protect ‘their’ colours, so that others will not be able to use them. Even universities use colour to signal their identities. The Open University, for example, stipulates:

Two colours … for formal applications such as high-quality stationery and degree certificates – blue (reference PMS 300) for the shield and lettering, and yellow (PMS 123) for the circular inset. Single colour stationery should be in blue (PMS 300) if possible.
(quoted in Goodman and Graddol, 1996: 119)

On maps, colours can serve to identify water, arable land, deserts, mountains, and so on. In uniforms, colour can signal rank. In the safety code designed by US colour consultant Faber Birren (Lacy, 1996:75), ‘green’ identifies first-aid equipment, while ‘red’ identifies hoses and valves (which play a role, of course, in fire protection). On the London Underground, ‘green’ identifies the District Line and ‘red’ the Central Line, and on both Underground maps and in Underground stations many people look for those colours first, and speak of the ‘green line’ and the ‘red line’.

Colour is also used to convey ‘interpersonal’ meaning: it allows us to realize ‘colour acts’ (as language permits ’speech acts’). It can be and is used to do things to or for each other: to impress or intimidate through ‘power-dressing’, to warn against obstructions and other hazards by painting them orange, or to subdue people – apparently the Naval Correctional Centre in Seattle found that ‘pink, properly applied, relaxes hostile and aggressive individuals within 15 minutes’ (Lacy, 1996: 89). According to the Guardian’s ‘Office Hours’ supplement (3 September 2001: 5):

‘Colours are very powerful and can reduce or raise stress levels,’ believes Lilian Verner-Bonds, author of Colour Healing. Bright reds are energising and are good for offices in the banking or entertainment fields. Green is useful if there’s discord or disharmony as it is soothing. Blue is rated as the best colour for promoting calm and pastel orange is good for gently encouraging activity.

Elsewhere in the same article we learn that adding colour to documents can increase the reader’s attention span by more than eighty per cent and that ‘an invoice that has the amount of money in colour is thirty per cent more likely to be paid on time than a monocolour one’. In all these cases colour represents, projects, enables or constructs social relations – it is interpersonal. It is not just the case that colour ‘expresses’ or ‘means’ things such as ‘calm’ or ‘energy’; rather, people actually use colour to try to energize or calm down other people. Putting it more generally, colour is used to act on others, to send managerial messages to workers, or parental messages to children, as we have shown in an analysis of a child’s room (Kress and van Leeuwen, 2001). It is used by people to present themselves and the values they stand for, to say in the context of specific social situations, ‘I am calm’ or ‘I am energetic’, and to project ‘calm’ or ‘energy’ as positive values. We will address this in more detail below, when we analyse the use of colour in home decoration.

Colour also functions, maybe even most obviously, at the textual level. In many buildings, the differing colours of doors and other features – the colour schemes of floors – distinguish different departments from each other on the one hand, while creating unity and coherence within these departments on the other. Colour can be used to create coherence in texts. Textbooks make wide use of this, whether in ‘reading schemes’ or in mathematics texts to indicate ‘levels’ of difficulty, or in science textbooks to provide topical unity. In Pasos, a Spanish language textbook (Martin and Ellis, 2001), the chapter headings and page numbers of each chapter have a distinct colour, all section headings (‘Vocabulario en casa’, ‘Gramática’, etc.) are red throughout the book, and all ‘activities’ (e.g. ‘Make phrases with es or está’) have a purple heading and number. In an issue of the German edition of Cosmopolitan (November 2001), film reviews have orange headlines and other uses of orange in the typography, as the background of textboxes, etc. The art reviews use green in a similar way, book reviews use red, and so on. In some cases this is cued by a salient colour in the key illustration of the first page of the relevant review section; for instance, Cate Blanchett’s orange hair in a still from the film Bandits in the film review section.

‘Colour-coordination’, rather than the repetition of a single colour, can be used to promote textual cohesion. In this case the various colours of a page, or a larger section of a text (or of an outfit, or a room), may have roughly the same degree of brightness and/or saturation. In computer software such as PowerPoint,this feature is already built in, a kind of analogue of the spell-checker, showing just the development in the direction of a broader use of grammar. Choosing the initial background automatically selects a range of colours as a colour scheme. For instance, if the initial colour is a pastel, then the other colours will also be pastels. It is possible to override this by selecting another colour from a Munsell atlas type display, but this takes more effort and skill.

There are two points to make. First, colour fulfils the three metafunctions simultaneously. The colours on a map retain their ideational and their interpersonal value, their appealing brightness, or stuffy dullness; on maps colours are coordinated to enhance textual cohesion. And contemporary scientific visualizations are thought of as primarily ideational: veins and arteries will be represented using different colours to indicate the amount of oxygen – or the level of its depletion. Second, we are not arguing that colour always has and always will fulfil all three of these functions equally. Colour does what people do with it, in making a sign and in remaking the sign in its reception. We are not ‘discovering’ universal and suprahistorical facts about colour here. We are trying to document what kind of communicative work colour is made to do in today’s increasingly global semiotic practices, and how. The examples provide an indication that some of these uses of colour have fairly specific, limited domains, where they clearly relate to the specific interests of sign-makers (e.g. map-making, subduing prisoners) while others may have wider distribution (e.g. the use of colour-coding in magazines as a means of cohesion).

Finally, the central question: If we are right, if colour fulfils all three metafunctions, is it a semiotic mode in its own right, along with speech, image, writing, music? Maybe. But is there not also a difference? Language, image and music have been conceived of (and have in various ‘purist’ practices often operated) as relatively independent semiotic modes. Although a novel is a material object, and a page a visual artefact, its communicative work is done primarily through writing. In an art gallery images usually come with no words the small descriptive sign on the wall nearby is not a part of the image but a part of the environment of display in the gallery or museum. In the concert hall everything is concentrated on the music, while expression through semiotic modes such as dress, bodily performance, etc., is held back, certainly by comparison to contemporary popular music shows. There is a choice for the audience to focus on the music ‘as such’ or on the whole as ‘performance’, the ‘concert’.

Is this the case with colour? Painters have tried to make paintings that use only colour and nothing else (‘field painting’, Rothko, etc.), but this does not appear to have led to a whole new artform. Then again, maybe colour is a characteristic mode for the age of multimodality. It can combine freely with many other modes, with architecture, typography, product design, document design.

Let us step back for a moment. As one of us writes this – it is a day in mid-August, just before lunchtime – I am sitting looking out through the open French windows, on a tranquil French countryside. I see low hills, trees, forest in the background, some Charolais cattle in the pasture beyond the fence. There are a very few fluffy clouds, though above and beyond the forested hill there is a denser bank of cloud, just appearing, making me think of a late afternoon thunderstorm. I am describing this scene by selecting out specific elements, naming them, putting them here on the screen of my laptop as words in a particular order. I have avoided using any colour words.

Now let me try this again. I am looking out through the open French windows at a world of colour. Overwhelmingly, greens of the most varied hues dominate, though there are greys of various kinds and browns, dark purples, blues, off-white. Everything that I see I see as colour. And if I represent it here on my screen again as words, it is because I have translated the world as I see it into the mode that my culture has made most readily available to me. Usually I don’t even regard that action as translation, but as representation: that is how my culture has taught me to understand it. The elements of this translation code have to be inscribed -first on to this screen in ways I do not understand, later on to a printed page -though until about eight years ago or so I would have transcribed them on to the paper-page directly, using a pen.

I might, however, have learned (or might still learn) to paint. I would be able to represent this scene – it would of course not be a recording (even a photograph would not be that) – in a mode which is closer to the manner of my perception. Colour would be represented by colour, whereas at the moment colour is represented by words (in syntactic order). The colours would of course be organized, as blocks, splashes, lines, dots: the greys and browns would appear as thinner, differently vertical elements – as the trunks of the trees I see, or as the flecks of grey in the bank of cloud, the greens of various hues and brightness as leaves and blades of grass, and the purples as the dots of various sizes of plums and ripening elderberries. Colour would appear entirely by itself- on an inscriptional surface, of course, no less, but also (are we right in saying?) no more so than the words I used above need the inscriptional surface of page or, if spoken, the inscriptional surface of air and, if heard, the receptor organs of the ear.

In my banal account of this framed segment of the landscape I used words as my descriptional resource, having become so used to it that it also served as my means of analysis of the countryside. The mode gave me the terms with which to analyse that which I saw, and it gave me the means for its description. The mode of colour – if we see it as a mode – would give me different terms (not of course as here in my transcription as words), if I were able to paint what I see having mixed my own colours on my palette, using now a quite different set of analytical and descriptive ‘terms’.

None of this is new; and the Impressionists were just one ’school’ of painters who worked with ideas such as these, even if more subtly thought and expressed, and focused on the materiality of light rather than of colour. However, what would be new for us now is to see colour for what it is and what it does. Does colour here exist on its own? Well, yes, of course – at least as much as do words spoken or written. Once posed and seen in this context the answer becomes somewhat oddly self-evident. Can colour be or become a mode only in a multimodal environment? Well, yes, in the same way-no more no less, even if differently – as every other mode. And the experiments of Mondrian, of Rothko or Nicholson, as of others, would now be seen not so much as experiments in turning colour into mode, but as experiments in abstracting away from the (attempted) realisms of blocks (as tree trunks), slashes (as blades of grass), lines (as edges of all kinds) and dots (berries or plums), turning realism of the ideational kind into its abstraction.

A DISTINCTIVE FEATURE APPROACH TO THE SEMIOTICS OF COLOUR

In Kress and van Leeuwen (2001), we argued that signifiers – and colours are signifiers, not signs – carry a set of affordances from which sign-makers and interpreters select according to their communicative needs and interests in a given context. In some cases their choice will be highly regulated by explicit or implicit rules, or by the authority of experts and role models. In other cases – for instance, in the production and interpretation of art – it will be relatively free. In our brief analysis of the use of colour in home decoration below, we show how in most situations these two poles, constraint and creativity, are both in evidence and mixed in complex ways.

Like Kandinsky, we distinguish two types of affordance in colour, two sources for making meaning with colour. First there is association, or provenance – the question of’ where the colour comes from’, ‘where we have seen it before’. The associations taken up in many of the communicative uses of colour, such as in advertising or the entertainment media, will usually be with substances, objects, etc. that carry significant symbolic value in the given sociocultural context. While the affordances of a colour may be wide in theory, in practice they are not when the context of production and interpretation is taken into account, as we will try to do in the analysis below.

The second type of affordance is that of the ‘distinctive features’ of colour. Here we want to show some aspects of the affordances of the materiality of colour, and hence make a connection, not with the ‘grammar’, but with phonetics and phonology. In Jakobson and Halle’s distinctive feature phonology (1956), the features named real material phenomena – such as the point of articulation of a consonant, or the aperture of the mouth in the making of a vowel – and in description they were deployed as operating in opposition. So one consonant could be distinguished from another by an opposition, such as i+voicedi as against /-voiced/, an opposition which would distinguish fbi from ipf, or Id! from it!, in English. We focus less on opposition than on the quality, the characteristics of the features, and talk about values on a range of scales, such as the scale that runs from light to dark, the scale that runs from saturated to desaturated, and so on. Unlike Jakobson and Halle, we see these features not just as serving to distinguish different sounds or colours from each other, but above all as meaning potentials; that is, as their potential to become signifiers. Any specific instance of colour can be defined as a combination of specific values on each of these scales – and hence also as a complex and composite meaning potential, as we will try to show below.

Value

The scale of value is the grey scale, the scale from maximally light (white) to maximally dark (black). In the lives of all human beings light and dark are fundamental experiences, and there is no culture which has not built an edifice of symbolic meanings and value systems upon this fundamental experience – even though different cultures may have done so in different ways. Painters who emphasize value – for instance, Rembrandt – are often able to exploit this meaning potential in complex and profound ways.

Saturation

This is the scale from the most intensely saturated or ‘pure’ manifestations of a colour to its softest, most ‘pale’ or ‘pastel’, or dull and dark manifestations, and, ultimately, to complete desaturation, to black and white. Its key affordance lies in its ability to express emotive ‘temperatures’, kinds of affect. It is the scale that runs from maximum intensity of feeling to maximally subdued, maximally toned-down, indeed neutralized feeling. In context this allows many different more precise and strongly value-laden meanings. High saturation may be positive, exuberant, adventurous, but also vulgar or garish. Low saturation may be subtle and tender, but also cold and repressed, or brooding and moody.

Purity

This is the scale that runs from maximum ‘purity’ to maximum ‘hybridity’, and it has been at the heart of colour theory as it developed over the last few centuries. Many different systems of primary and mixed colours have been proposed – some physical, some psychological and some a mixture of both – and this search for primaries or basics has not resulted in a generally accepted system, but ‘has proved to be remarkably inconsequential and
freighted with the heavy burden of ideology’ (Gage, 1999: 107). Some writers have seen the issue as closely related to the question of colour names. Colours with commonly used single names, such as brown and green, would be considered pure. The names of other colours, like cyan, are mainly used by specialists, and non-specialists would refer to them by means of a composite name, for instance, blue-green. Such colours would then be perceived as mixed.

Terms like ‘purity’ and ‘hybridity’ already suggest something of the meaning potential of this aspect of colour. The ‘pure’ bright reds, blues and yellows of the ‘Mondrian’ colour scheme have become key signifiers of the ideologies of modernity, while a colour scheme of pale, anaemic cyans and mauves has become a key signifier of the ideologies of postmodernism, in which the idea of hybridity is positively valued. This is by no means the only way in which the affordances of this scale have been taken up, but it is a culturally salient one, and hence one which is currently quite widely understood.

Modulation

This is the scale that runs from fully modulated colour (for example, from a blue that is richly textured with different tints and shades, as in paintings by Cezanne) to flat colour, (as in comic strips, or paintings by Matisse). It was already recognized as a feature of colour in Goethe’s Farbenlehre (Theory of Colour) (1970 [18101). The affordances of modulation are various and, again, strongly value-laden. Flat colour may be perceived as simple and bold in a positive sense, or as overly basic and simplified. Modulated colour, similarly, may be perceived as subtle and doing justice to the rich texture of real colour, or as overly fussy and detailed. And, as we have discussed in chapter 5, modulation is also closely related to the issue of modality. Flat colour is generic colour, it expresses colour as an essential quality of things (‘grass is green’), while modulated colour is specific colour (‘the colour of grass depends on the time of day and the weather’), it attempts to show the colour of people, places and things as it is actually seen, under specific lighting conditions. Hence the truth of flat colour is an abstract truth, and the truth of modulated colour a naturalistic, perceptual truth.

Differentiation

Differentiation is the scale that runs from monochrome to the use of a maximally varied palette, and its very diversity or exuberance is one of its key semiotic affordances, as is the restraint involved in its opposite, lack of differentiation. In our analysis of an article from a home decoration magazine below, a couple ‘uses nearly the whole spectrum in their house’ and comment that ‘it’s great that there are so many bright shades in the house. It’s a shame people aren’t more adventurous. It’s when you start being timid that things go wrong’ (House Beautiful, September 1998: 21). So here high differentiation means ‘adventurousness’ and low differentiation ‘timidity’, but it is clear that in another context restraint might have a more positive value.

Hue

This is the scale from blue to red. In a distinctive feature theory of colour it becomes only one of the factors constituting the complex and composite meanings of colour, and maybe not even the most important one. Nevertheless, although ‘the’ meaning of red-in-general, of the abstract signifier ‘red’, cannot be established, the red end of the scale remains associated with warmth, energy, salience, foregrounding, and the blue end with cold, calm, distance, backgrounding. The cold-warm continuum has many correspondences and uses. Itten (1970) lists transparent/opaque, sedative/stimulant, rare/dense, airy/earthy, far/near, light/heavy and wet/dry. In an actual red, meanwhile, its warmth combines with other features. An actual red may, for instance, be very warm, medium dark, highly saturated, pure and modulated, and its affordances for sign-makers and sign interpreters flow from all these factors. In the next section we will see how such sets of affordances are actually taken up in a specific context, and what context-specific interests and values are at work in this process.

HOME DECORATION: COLOUR, CHARACTER AND FASHION

What colours are used in home decoration and why? The answer depends on the sociocultural context. There have been many different traditions, including, for instance, regional differences, such as the bright blues and greens of the doors and windows of farmhouses in Staphorst, a village in the Netherlands where traditional dress is still worn. But today a new approach has developed, in which the expertise of the colour consultant plays a key role. According to Lacy (1996: 29), the entrance hall of a home signals the identity of its owner or owners:

A yellow entrance hall usually indicates a person who has ideas and a wide field of interests. A home belonging to an academic would probably contain a distinctive shade of yellow as this colour is associated with the intellect, ideas and a searching mind …. A green entrance hall – say, a warm apple green – indicates a home in which children, family and pets are held in high importance …. A blue entrance hall indicates a place in which people have strong opinions – there could be a tendency to appear aloof as they can be absorbed too much in their own world.

In expert discourse of this kind the colours of a home above all express character, express the identity, the personal characteristics, and the values and interests of the home owner or owners, while the colours of workplaces (and prisons, schools, etc.) are more often discussed in terms of their effects on workers (prisoners, students, etc).

Most people will encounter this discourse in magazines and television makeover programmes where it is mediated by journalists, although the expertise of colour consultants and interior decorators is often explicitly drawn on. In magazines aiming at different sectors of the market, different types of home owners or celebrities may be introduced (for instance, the owner of a London art gallery versus an actor in a popular soap opera). Compare the following two quotes:
Her latest habitat (she moves as regularly and happily as a nomad) is surprisingly spare and elegant, as you might expect from someone with a sense of the aesthetic in her genes. After all, Jane’s great aunt was Nancy Lancaster, of Colefax and Fowler fame, while her brother, Henry Wyndham, is chairman of Sotheby’s.
(Ideal Home and Lifestyle, September 1998: 60)

Guessing what Hamish and Vanessa Dows do for a living isn’t too difficult – a pair of feet on the house numberplate is a dead giveaway for a couple who are both chiropodists, but it’s also an indication of the fun they’ve had decorating their home.
(House Beautiful, September 1998: 20)

In such articles colour choice is presented as an original and unique expression of the character and values of the home owners – as fully personal, rather than mediated by social codes. The two fun-loving chiropodists above, for instance, use nearly the whole spectrum in their house, from mustard yellow and leaf green in the sitting room, to brick red and blue in the dining room. Their bedroom is a soft buttery yellow combined with orange, there’s lemon and lime in the breakfast room and cornflower and Wedgwood blues on the stairs. ‘It’s great that there are so many bright shades in the house’, says Hamish. ‘It’s a shame people aren’t more adventurous. It’s when you start being timid that things go wrong.’
(House Beautiful, September 1998:21)

This shows the reader how colour semiosis works, but at the same time avoids the suggestion that such models can be slavishly followed, and suggests that colour semiosis should naturally flow from people’s unique character and values. It is here that the affordances of colour are taken up. High colour differentiation and high saturation become signifiers of ‘adventurousness’, with differentiation standing for the absence of monotony and routine, and saturation for an intensity of feeling, for ‘living to the full’ and not being ‘timid’.

A look at the actual colours in the illustrations of the article (e.g. plate 6) shows that the distinctive features are selectively used in the discourse. There are, on those bright walls, painted gold leaves and sunflowers which, Hamish and Vanessa say, ‘give such a lovely Victorian feel’. Indeed, the photos show a very cluttered interior, with many retro objects, including fringed lampshades and statuettes of servile black servants. But, even without the quote and without these objects, the provenance of the leaves and sunflowers would be clear. While the colours may be highly saturated, they are also relatively dark and relatively impure, certainly by reference to Modernist bright and light interiors (and Mondrian-type pure colours), and this aspect of the colours, their provenance as ‘historic’ colours, is not explicitly discussed in the article.

Such ‘historic’ colours were very much in fashion in the 1990s: ‘The specialist paint firm Farrow & Ball whose colours were used to recreate eighteenth- and nineteenthcentury England in television adaptations of Pride and Prejudice and Middlemarch, reports that its sales have consistently risen by 40% each year over the past ten years’ (Guardian Weekend Magazine, 19 January 2002: 67). It may be that Hamish and Vanessa’s interior is not just an original expression of their character, but also follows fashion, and also takes its cues from the media. It may be that Hamish and Vanessa not only use the affordances of the distinctive features of colour to express their unique interests and values, but also base their choice of colour on ‘provenance’, and thereby also express the values of the place and the time where these colours come from. It may be that Hamish and Vanessa, through the way they decorate their home, symbolically identify with the values of that era, and with the nostalgia for a ‘lost’ Englishness which had been so salient throughout the 1990s. In this article, this is expressed in a covert way in which colour plays an absolutely crucial role. It may be that what seems so uniquely their own is socially constructed in and through the media, in discourses realized through colour.

COLOUR SCHEMES

In this last section we will discuss one final example in order to focus briefly on the question of the colour scheme. The example is a pamphlet produced to describe and explain the corporate identity change of a major publishing house in the U K (plate 7).

A number of modes are involved – colour, typeface, icons of several kinds. The pamphlet briefly describes the function of each. In the case of colour, a caption states that ‘The colour palette provides a harmonious selection of 16 colours, all carefully chosen to complement the corporate colour Palgrave silver, and they should be used wherever possible.’ So the deliberateness and intent is clear. Rather than the traditional layout of the colour chart, here the corporate colour is central, to indicate its status and role, and the subsidiary colours cluster around it in a regular display. This clustering is organized – in part – on the principle of gradations in hue, though given the colours chosen this cannot be achieved entirely in the manner of the traditional chart; there are gaps. Coherence has been deliberately aimed for: all the hues have to be able to collocate with the corporate colour, in its support. There is, consequently, already a strong sense of structure – both in the explicit hierarchy of colours and in the delimiting of the range of permissible ‘units’. But one colour changes the overall effect of the scheme, the bright-yellow top left. Its introduction in effect makes all the colours in its palette different in their meaning-potentials. It puts this colour palette into the domain of the sharp, the bright, the upbeat – in the words of the pamphlet, ‘active pursuit of ideas’, ‘rapid change’, ‘a world of challenges to be met’, ‘a new company and a global force in publishing’, etc.

Today, colours increasingly are colours in a ‘colour scheme’, colours in systems of colour which can be defined on the basis of specific uses of the distinctive features we have discussed. We have come across several such schemes already: the ‘historic’ colour scheme, based on differentiation, relatively high saturation and dark value; the modernist ‘Mondrian’ colour scheme, based on purity and high saturation; the postmodern colour scheme, based on hybridity and pastel values. All these colour schemes have distinct historical placements. But they live on beyond their historical period as recognized semiotic resources which can continue to be used and combined (for instance, the bright-yellow accent in the overall postmodern scheme of Palgrave) to realize distinctly different ideological positions.

February 7, 2010

Chapter 6

Filed under: Uncategorized — Natural Light @ 3:52 am

6 The meaning of composition

COMPOSITION AND THE MULTIMODAL TEXT

In previous chapters we have considered the way images represent the relations between the people, places and things they depict, and the complex set of relations that can exist between images and their viewers. Any given image contains a number of such representational and interactive relations. In figure 6.1, an image from Bergman’s Through a Glass Darkly (1961), we see Karin (Harriet Andersson), who suffers from an incurable mental disease, and her younger brother Minus (Lars Passgard) From the point of view of representation, the shot contains what we have called a ‘non-transactive reaction’ (Karin looks out of the frame, at something the viewer cannot see) and a ‘transactive reaction’ (Minus looks up at his sister). These choices relate to the themes of the dramatic action: Karin has visions, sees things other people cannot see; Minus is caught in the here and now of his problematic relations with the other characters in the film. From the point of view of interactive meaning, the viewer is positioned closer to Karin (‘medium shot’) than to Minus (‘long shot’); and, while Minus is seen from behind, Karin faces the viewer frontally.

Clearly, the viewer is meant to be most centrally involved with Karin, and with her mental turmoil.

These patterns do not exhaust the relations set up by the image. There is a third element: the composition of the whole, the way in which the representational and interactive elements are made to relate to each other, the way they are integrated into a meaningful whole. Minus, for instance, is placed on the left, and Karin on the right. If this were turned around, the representational and interactive meanings would not be affected. Karin’s reaction would still be ‘non-transactive’ and Minus’ reaction ‘transactive’, and Karin would still be in medium shot, Minus still in long shot. But the meaning of the whole would no longer be the same. In other words, the placement of the elements (of the participants and of the syntagms that connect them to each other and to the viewer) endows them with specific information values relative to each other. We will discuss the value of ‘left’ and ‘right’ in the next section.

In addition, Karin is the most salient, the most eye-catching element in the composition, not just because she is placed in the foreground and because she forms the largest, simplest element in the picture, but also because she is in sharper focus and receives the greatest amount of light. Throughout much of the film Karin is dressed in light colours and made to bathe in light, in an almost supernatural fashion, this in contrast to the other characters. For these reasons she is also the most salient element in the shots where one of the other characters, for example her husband, is placed in the foreground. Her white clothes and the light on her pale face draw attention to her, even when she is placed in the background. To generalize, pictorial elements can receive stronger or weaker ’stress’ than other elements in their immediate vicinity, and so become more or less important ‘items of information’ in the whole.

A vertical line formed by the left edge of the door of the shed, and continued by the dividing line between a particularly light and a darker board on the roof of the shed, runs through the middle of the picture, dividing it into two sections, literally and figuratively ‘drawing a line’ between the space of Karin, who can ‘look into the beyond’, and the space of Minus, who cannot. The world of Karin is thus separated from the world of Minus, in this pictorial composition as in the dramatic action of the film as a whole, where Minus’ desire for contact and communion with his sister remains unfulfilled. There is yet another demarcation line in the picture: the horizon, which divides the picture into the zone of ‘heaven’ and the zone of ‘earth’. In his discussion of Titian’s Noli Me Tangere, Arnheim (1982: 112-13) describes how the staff of Christ forms a ‘visual boundary’ between Christ, who is already ‘removed from earthly existence’, and Magdalen, who is not; and how ‘the lower region is separated by the horizon from the upper region of free spirituality, in which the tree and the buildings on the hill reach heavenward’. In figure 6.1, similarly, Karin straddles the two zones, half still of the earth, half already in the realm of ‘free spirituality’, while Minus is ‘held down by the horizon into the region of the earth’. More generally, composition also involves framing (or its absence), through devices which connect or disconnect elements of the composition, so proposing that we see them as joined or as separate in some way, where, without framing, we would see them as continuous and complementary: there would be no visual ‘directive’ of this kind.

Composition, then, relates the representational and interactive meanings of the image to each other through three interrelated systems:

  1. Information value. The placement of elements (participants and syntagms that relate them to each other and to the viewer) endows them with the specific informational values attached to the various ‘zones’ of the image: left and right, top and bottom, centre and margin.
  2. Salience. The elements (participants as well as representational and interactive syntagms) are made to attract the viewer’s attention to different degrees, as realized by such factors as placement in the foreground or background, relative size, contrasts in tonal value (or colour), differences in sharpness, etc.
  3. Framing. The presence or absence of framing devices (realized by elements which create dividing lines, or by actual frame lines) disconnects or connects elements of the image, signifying that they belong or do not belong together in some sense.

These three principles of composition apply not just to single pictures, as in the example we have just discussed; they apply also to composite visuals, visuals which combine text and image and, perhaps, other graphic elements, be it on a page or on a television or computer screen. In the analysis of composite or multimodal texts (and any text whose meanings are realized through more than one semiotic code is multimodal), the question arises whether the products of the various modes should be analysed separately or in an integrated way; whether the meanings of the whole should be treated as the sum of the meanings of the parts, or whether the parts should be looked upon as interacting with and affecting one another. It is the latter path we will pursue in this chapter. In considering, for example, the picture of the train (figure 3.30) we do no seek to see the picture as an ‘illustration’ of the verbal text, thereby treating the verbal text as prior and more important, nor treat visual and verbal text as entirely discrete elements. We seek to be able to look at the whole page as an integrated text. Our insistence on drawing comparisons between language and visual communication stems from this objective. We seek to break down the disciplinary boundaries between the study of language and the study of images, and we seek, as much as possible, to use compatible language, and compatible terminology to speak about both, for in actual communication the two, and indeed many others, come together to form integrated texts.

In our view the integration of different semiotic modes is the work of an overarching code whose rules and meanings provide the modal text with the logic of its integration. There are two such integration codes: the mode of spatial composition, with which we will be concerned in this chapter; and rhythm, the mode of temporal composition. The former operates in texts in which all elements are spatially co-present – for example, paintings, streetscapes, magazine pages. The latter operates in texts which unfold over time – for example, speech, music, dance (see van Leeuwen, 1999) Some types of multimodal text utilize both, for example film and television, although rhythm will usually be the dominant integrative principle in these cases.

It follows that the principles of information value, salience and framing apply, not only to pictures, but also, for example, to layouts. Plate 2, an advertisement for Bushells instant coffee, contains two photographs and a small amount of verbal text. The larger photo is a pictorial representation of the ‘promise’ of the product, and it is placed in the top section. The photo of the product is smaller, and placed below the larger photograph, together with the text. Reversing this would produce an entirely different effect, and probably result in a rather anomalous layout. Just what information values this arrangement accords to the two sections of the page will be discussed below. As far as salience is concerned, we can note that this page is not divided into two equal halves. The top section is the most salient, not only because of its size but also because the salience of the woman, who is positioned on the right and catches most of the golden glow of the light. Thus the advertisement gives greater stress to the promise of the product than to the product itself, or the verbal information. Finally, a sharp line creates a boundary between the photo and the verbal text, dividing the page into two separate sections, two spaces, reserved for two different kinds of meaning – one for the promise of the product, enhanced intimacy between lovers; the other for the product itself. Just as there is a dividing line between heaven and earth in Titian’s Noli Me Tangere, and in the still from Bergman’s Through a Glass Darkly, so there is, in this advertisement too, a dividing line between the world of ‘what might be’, the happiness the product might bring, and the world of ‘what is’, the product itself – and, just as in the two earlier examples, this product, the jar of instant coffee, straddles the two domains of meaning, forming a bridge between them. The home page of Sony’s website (http://www.sony.com) has a similar structure. The top part shows the pleasure derived from using the company’s products, and welcomes the user to the ‘world of Sony’, while the bottom part shows a range of actual products and allows the user to click on the pages from where the products can be ordered.

Early printed pages still treated text as ‘visual material’. Walter Ong (1982: 119ff.) describes how sixteenth-century title pages broke up words without regard for syllable boundaries, and used different typesizes in a way that was not related to the relative importance of words, but served to create pleasing visual patterns. However, the printed page soon developed into the ‘densely printed page’ in which reading is linear and textual integration achieved by linguistic means (conjunctions, cohesive ties, etc.!. In books of this kind it seems that the page has ceased to be a significant textual unit. The page shown in plate 2, on the other hand, is a semiotic unit, structured, not linguistically, but by principles of visual composition. In such a page verbal text becomes just one of the elements integrated by information value, salience and framing, and reading is not necessarily linear, wholly or in part, but may go from centre to margin, or in circular fashion, or vertically, etc. And this is the case, not only in contemporary magazines and websites, but also in many other contexts – for instance, in modern school textbooks, as we will show later (e.g. figure 6.6).

It should be noted, of course, that the layout of the densely printed page is still visual, still carries an overall cultural significance, as an image of progress. The densely written pages of other cultural traditions are laid out differently – as, for example in the Talmud, which has the oldest text, the Mishna, in the centre, the Gemara written around it; and later, medieval commentaries again, around the Gemara, in concentric layers. In such cases,
however, every page is still read the same way. In the case of magazine pages and the pages of modern computer screens, each successive page may have a different reading path.

This development beyond the densely printed page began in the late nineteenth-century mass press, in a context in which the ruling class, itself strongly committed to the densely printed page, attempted to maintain its hegemony by taking control of popular culture, commercializing it, and so turning the media of the people into the media forthe people (see Williams 1977: 295) Their own comparable media – ‘high’ literature and the humanities generally – became even more firmly founded on the single semiotic of writing. Layout was not encouraged here, because it undermined the power of the densely printed page as, literally, the realization of the most literary and literate semiotic mode. The genres of the densely printed page, then, manifest the cultural capital (‘high’ cultural forms) controlled by the intellectual and artistic wing of the middle class, to use Bourdieu’s terms (1986). Yet it is this same social group which has been instrumental in spreading the new visual literacy to those who were not, or not yet, to be initiated into the forms of literacy which constituted its own mark of distinction (the ‘masses’, or children), and to embrace it, for example in ‘high’ culture avant-garde manifestations, as an expression of their oppositional role within the middle class as a whole. As so often in the twentieth century, they turned out, in the end, to have been sawing off the branch on which they were sitting. The distinction between ‘high’ and ‘low’ forms is now everywhere in crisis, and new ways of maintaining cultural hegemony are required, for instance the development of different and differently valued ways of talking about forms which, themselves, are no longer differentiated in the old way (the ‘discourses’ of different ‘audiences’!. But the most highly valued ways of talking (and semiotics is one of them) remain themselves bound to methods that cannot adequately describe the new forms. If we are to understand the way in which vital text-producing institutions like the media, education and children’s literature make sense of the world and participate in the development of new forms of social stratification, a theory of language is no longer sufficient and must be complemented by theories which can make the principles of the new visual literacy explicit, and describe, for instance, the role of layout in the process of social semiosis that takes place on the pages of the texts produced by these institutions – as we will try to do in this chapter.

GIVEN AND NEW: THE INFORMATION VALUE OF LEFT AND RIGHT


Many of the double-page spreads in the Australian women’s magazines we used as one of our data sets when we wrote the first version of this chapter use the layout shown in figure 6.2. Their right pages are dominated by large and salient photographs from which the gaze of one or more women engages the gaze of the viewer (what, in chapter 4, we called ‘demand’ pictures!. These pages show women in specific and sometimes contradictory roles, with which the readers of the magazine are invited to form a positive identification: a mother; a former ’soapie star’ turned housewife and happy in that role; working women capable of coping with ‘tough’, ‘masculine’ jobs. Their left pages contain mostly verbal text, with graphically salient photographs on the right. The spread shown in figure 6.2 has a photograph on the left also, but this photo is smaller and, in contrast to the photo on the right page, it is a ‘fly on the wall’ photograph, which does not acknowledge the presence of the photographer, nor therefore that of the viewer. It is what, in chapter 4, we called an ‘offer’ picture. On such pages there often is a sense of complementarity or continuous movement from left to right, as in figure 6.2, where the photograph on the left is tilted to form a vector that leads the eyes to the photograph on the right, and where the colour gold, with its obvious connections to the theme of the story, is used as another integrating device: it occurs in the photograph as the colour of the helmets and of the liquid being poured, and is used also as the background against which the verbal text is printed.

On such pages the right seems to be the side of the key information, of what the reader must pay particular attention to, of the ‘message’ – whether it is the invitation to identify with a role model highly valued in the culture of the magazine or something else; for example, an instance of what is to be learned in a textbook. It follows that the left is the side of the ‘already given’, something the reader is assumed to know already, as part of the culture, or at least as part of the culture of the magazine. In figure 6.2, gold mining is Given, and the fact that women can engage in it, and that you, the reader, should identify with such ‘tough’ women, is New, the message, the ‘issue’.

Looking at what is placed on the left and what is placed on the right in other kinds of visuals has confirmed this generalization: when pictures or layouts make significant use

GOLD-digging can now be done by women

And just as the image of the two women is the New in figure 6.2, so the word women would be the New, the key point of the message, in the clause above. In other words, there is a close similarity between sequential information structure in language and horizontal structure in visual composition, and this attests to the existence of deeper, more abstract coding orientations which find their expression differently in different semiotic modes. Such coding orientations are culturally specific, certainly where the horizontal dimension is concerned. In cultures which write from right to left, the Given is on the right and the New on the left, as shown in figure 6.3, where the English and the Arabic language versions of Sony’s Middle East website are compared.



So far we have taken a composite text as our example, but the Given-New relation applies also within an image. Figure 6.4 shows a fourteenth-century relief depicting the creation of Eve. God is the Given, agreed origin and departure point of all that exists. ‘Woman’, on the other hand, is New and, in the context of the Genesis story, problematic, the temptress who leads Adam into sin. Michelangelo, on the other hand, in his famous painting The Creation of Adam on the ceiling of the Sistine Chapel, placed God on the of the horizontal axis, positioning some of their elements left, and other, different ones right of the centre (which does not, of course, happen in every composition), the elements placed on the left are presented as Given, the elements placed on the right as New. For something to be Given means that it is presented as something the viewer already knows, as a familiar and agreed-upon point of departure for the message. For something to be New means that it is presented as something which is not yet known, or perhaps not yet agreed upon by the viewer, hence as something to which the viewer must pay special attention. Broadly speaking, the meaning of the New is therefore ‘problematic’, ‘contestable’, ‘the information “at issue” ‘,while the Given is presented as commonsensical, self-evident. This structure is ideological in the sense that it may not correspond to what is the case either for the producer or for the consumer of the image or layout. The important point is that the information is presented as though it had that status or value for the reader, and that readers have to read it within that structure, even if that valuation may then be rejected by a particular reader.

A similar structure exists in spoken English (see Halliday, 1985: 274ff.). As in visual communication, the structure of a ‘tone group’, an intonational phrase, is not a constituent structure, with strong framing between elements, but a gradual, wave-like movement from left to right (or, rather, from ‘before’ to ‘after’, since in language we are dealing with temporally integrated texts), and it is realized by intonation. Intonation creates two peaks of salience within each ‘tone group’ – one at the beginning of the group, and another, the major one (the ‘tonic’, in Halliday’s terminology), as the culmination ofthe New, atthe end. Just as in figure 6.2 we have one peak of salience on the left, in the bold headline and the red bar which separates it from the article itself, and another on the right, in the photo of the two women, so we would have one peak of salience on the syllable gold and another on the syllable wo- of women in: right, in keeping with the new, humanistic spirit of the Renaissance. In this period God suddenly became New, and problematic. Generations of philosophers were to attempt to redefine Him in ways commensurate with the new science, and to try to prove H is existence by the use of logic. In this picture the movement is no longer from God to ‘Man’, but from ‘Man’ to God. ‘Man’ reaches out, aspiring to divine status, and almost achieving it – but not quite.

In magazine layouts such as the one shown in figure 6.2, the space of the Given is filled by verbal text, and the space of the New, or at least a large part of it, by one or more images. But this is not always the case. A double-page advertisement for Mercedes-Benz showed, on the left, a Mercedes photographed objectively (rather than, for example, from the driver’s point of view), and with the well-known Mercedes emblem in the centre of the composition. The right page contained only verbal text, with a headline saying, ‘Mercedes-Benz agrees with its competitors. You should drive their cars before you drive a Mercedes-Benz.’ In other words, the advertisement treated the Mercedes as an alreadyknown, ‘G iven’ symbol of status, and the message that ‘you, too, might own a Mercedes’ as the New. More generally, if the left contains a picture and the right is verbal text, the picture is presented as Given, as a well-established point of departure for the text, and the text contains the New. If the left page has text and the right page a picture, the text contains the Given, and the picture the New. The example points to the social effects and uses of this structure. What is taken for granted by one social group is not taken for granted by another. We might expect to find, therefore, systematic differences in the dispositions of material in layout across different magazines – for instance, according to their readership.

The concepts of Given and New can be applied also to the design of diagrams. In Shannon and Weaver’s (1949) communication model (figure 2.2) it might seem that the horizontal order of the elements is motivated representationally: the process of ’sending information’, for instance, must take place before the information can be received. But the left does not always signify ‘before’, nor does the arrow of time always point to the right. A diagram from a 1990 issue of Time Magazine which we were not allowed to reproduce here showed, on the right, a stick figure whose very large head was a pie chart representing the composition of the workforce in the year 2000 (i.e. ten years into the future at the time of publication!. Another pie chart, on the left, was superimposed on a massive office building and represented the present composition of the work force. An arrow showed that the stick figure was walking towards the door of the massive office building, i.e. that a change in the composition of the work force was gradually coming closer to the present, but it was not moving towards the right, because the current composition of the work force had to be treated as Given and the future additions (more women, minorities and immigrants) as New and problematic. This shows how the Given-New structure can be ideological even in diagrams. If the horizontal order of the communication model were rearranged in a similar way (see figure 6.5), it would no longer depict communication from the point of view of the ’sender’, with the ‘receiver’ as New, and problematic (Will the message ‘hit the target’? Will it have the intended effect?). Instead, the reader would become the origin and departure of the communication process, and the ’sender’ (‘author’) problematic, as has indeed happened, for instance, in literary reception theory.


Given-New structures can also be found in film and television. Media interviews, for example, often place the interviewer on the left of the interviewee (from the viewer’s point of view!. Thus interviewers are presented as people with whose views and assumptions viewers will identify and are already familiar, indeed, as the people who ask questions on behalf of the viewers. The interviewees, on the other hand, present’ New’ information and are situated on the right (see Bell and van Leeuwen, 1994: 160-4!. The relation between Given and New may be emphasized by horizontal camera movements (‘pans’!. In a current affairs item from an ABC 7.30 Report (March 1987), the children from a Muslim Community School were initially shown as ‘ethnic’, ‘different’ from ‘us’, viewers – there was much emphasis on their non-Western dress, and there was Arabic musIc In the background. But it was the point of the programme to establish that they were, despite this, ‘just like ordinary Australian children’, playful, spontaneous, creative, etc. This was realized, among other things, by various horizontal camera movements: a shot which panned from children in non-Western clothes to the teacher, a young woman in a Western dress, tying a bow in the hair of a little girl; a shot which panned along a classroom wall from an Arabic sign to a picture of a clown, etc. In other words, ‘difference’, ethnic prejudice, was treated as Given; the fact that at least these children should be accepted as ‘like us’ was treated as New, and formed the message the programme was trying to get across.

In ongoing texts, each New can, in turn, become Given for the next New. The opening pages of the chapter ‘In search of a straw’ from the Dutch junior high-school geography textbook Werk aan de Wereld (Bois et al., 1986) have, on the far left, a single column of text, occupying about a fifth of the page, which has a ‘landscape’ format, favouring a horizontally oriented layout. The text contains assertions such as ‘Many people in the Third World have nothing’ and questions such as ‘What prospects do all these people have?’ Given is, in the first place, the Third World as a problem. The remainder ofthe left page has a large colour photo of a man sleeping on the street, covered by a blanket (there is no indication where this photo is taken!. This more emotive way of presenting the ‘problem’ therefore functions as New in relation to the text. The right page features a single photograph, showing a large crowd of people searching a rubbish dump, armed with cane bags and baskets. In relation to this photo, the image of the homeless man (an image now also familiar in Europe) becomes Given, while the photo itself, with its (in Northern Europe) less familiar image of shocking mass poverty, is presented as New. Together with the introductory text, the two images constitute the Given of the chapter as a whole. Thus each new item of information, once received, becomes, in turn, Given for the information which follows, as shown in figure 6.6. This pattern ofthe New becoming Given is characteristic of language also, both in speech and in writing.

IDEAL AND REAL: THE INFORMATION VALUE OF TOP AND BOTTOM

Like many other magazine advertisements and marketing oriented websites (see Myers, 1994: 139), the Bushells advertisement (plate 2) and the Sony website are structured along the vertical axis. In such texts the upper section visualizes the ‘promise of the product’, the status of glamour it can bestow on its users, or the sensory fulfilment it can bring. The lower section visualizes the product itself, providing more or less factual information about it, and telling the readers or users where it can be obtained, or how they can request more information about it, or order it. There is usually less connection, less ongoing movement, between the two parts of the composition than in horizontally oriented compositions. Instead, there is a sense of contrast, of opposition between the two. The upper section tends to make some kind of emotive appeal and to show us ‘what might be’; the lower section tends to be more informative and practical, showing us ‘what is’. A sharp dividing line may separate the two, although, at a less conspicuous level, there may also be connective elements. In plate 2 this is created by the way the jar of coffee forms a bridge between the upper and the lower section of the ad, while in the Sony website it is created by the colour scheme which unites the page as a whole: in both the top and the bottom part of the page the dominant colours are shades of beige, with some blue and blue-grey elements added (the jacket of the girl, the pictures in the bottom part of the page) as well as some red elements (e.g. the girl’s lips and the words ‘What’s new’ in the top half, and the headings of the four sections in the bottom half). Overall, however, the opposition between top and bottom is strongly emphasized, with products placed firmly in the realm of the real, as a solid foundation for the edifice of promise, and with the top section as the realm of the consumer’s supposed aspirations and desires.

In other contexts, the opposition between top and bottom takes on somewhat different values. In a fairly conservative but (in the early 1990s) still widely used Dutch geography textbook (Dragt et al., 1986), the upper half of the first page of a chapter on, again, ‘The Third World’, is fully verbal, presenting generalized assertions and definitions such as ‘A large part of the world has a low development’ and ‘These underdeveloped countries we call poor countries or developing countries’. This provides a more neutral and less emotive (but not less ideological) kind of idealization, a representation of the world which is divested of contradictions, exceptions and nuances. The lower half of the page is given over to a map of the world which uses colour-coding to divide the world into regions according to the average income of the inhabitants, thus providing specific and detailed evidence to support the assertions in the top half. Directions for action – for instance, coupons for ordering a product in advertisements, or assignments or questions in textbooks – also tend to be found on the lower half of the page, usually at the bottom right (hence also New!)

The information value of top and bottom, then, can perhaps be summarized along the following lines. If, in a visual composition, some of the constituent elements are placed in the upper part, and other different elements in the lower part of the picture space or the page, then what has been placed on the top is presented as the Ideal, and what has been placed at the bottom is put forward as the Real. For something to be ideal means that it is presented as the idealized or generalized essence of the information, hence also as its, ostensibly, most salient part. The Real is then opposed to this in that it presents more speCific information (e.g. details), more ‘down-to-earth’ information (e.g. photographs as documentary evidence, or maps or charts), or more practical information (e.g. practical consequences, di rections for action).

As is already evident from the examples given so far, the opposition between Ideal and

Real can also structure text-image relations. If the upper part of a page is occupied by the text and the lower part by one or more pictures (or maps or charts or diagrams), the text plays, ideologically, the lead role, and the pictures a subservient role (which, however, is important in its own way, as specification, evidence, practical consequence, and so on!. If the roles are reversed, so that one or more pictures occupy the top section, then the Ideal, the ideologically foregrounded part of the message, is communicated visually, and the text serves to elaborate on it.

As with the Given and New, the Ideal-Real structure can be used in the composition both of single images and of composite texts such as layouts. Figure 6.7, reproduced from one of the Dutch geography textbooks we have discussed (Bois et al., 1986), includes a photo which may have been taken in India – its origin is not mentioned, but to the left of the picture, as its Given, we see a map of India. A young mother, carrying a baby, occupies, by herself, the top section ofthe vertically composed photo, as a ‘Third World’ Madonna with child. The bottom section shows a group of women and children, sitting on the ground, tightly packed together. The young mother looks at this group, a worried expression crossing her face. In this way the picture as a whole expresses a contradiction between the deep-rooted Ideal of motherhood and the Real of overpopulation. Immediately below the photo we find a collage of newspaper headlines (‘India struggles against overpopulation’, ‘U nemployment nightmare in India’) as Real (the newspaper as source of ‘hard facts’, of evidence) with respect to the more symbolic, idealized and emotive representation of the problem in the picture.

Ideal and Real can also playa role in diagrams. It is striking, for instance, that diagrams based on a vertical timeline sometimes idealize the present, sometimes the past. The already-mentioned Dutch geography textbook Werk aan de Wereld (Bois et al., 1986) features a diagram which represents the decrease of living space per head of the population, by means of a vertical arrangement of what look like chessboards of different sizes. On these ‘chessboards’ stand cartoon figures. On top we see a gentleman from 1900, complete with top hat, on a large (‘6285 m2′) ‘chessboard’. At the bottom, on the smallest ‘chessboard’, we see a ‘punk’ character from 1980.. Here, as in many advertisements, the past, the ‘good old days’, is presented as Ideal. The other Dutch geography textbook we mentioned (Dragt et al., 1986) features a ‘geological calendar’ in which the present (‘development of vertebrates’, complete with a small drawing of a naked woman) becomes the Ideal, the culmination of progress and evolution.


Many visuals combine horizontal and vertical structuring. In figure 6.8 (as in figure 6.4) God is Given, and Adam and Eve are New. But their fall from grace has introduced a (New) opposition, between the Ideal of Paradise, of the Garden of Eden, and the Real of death and decay – and the two are visually separated by the river which surrounds the Garden of Eden.

The communication model in figure 6.9 also combines horizontal and vertical structuring, in an intricate piece of visual thinking about the impossibility of knowing reality ‘as it is’, objectively. Given is the ‘event’, as it exists ‘out there’, separate from our perception of it. New, and therefore problematic, is our perception of the event and, at the lower level, the way we communicate our perceptions through language. Ideal is the ‘empirical’, the world ‘as it is’, and our perception of it, unmediated by communication, culture, language (which are positioned in the lower section). Real are our interpretations of these perceptions, as mediated by communication. Clearly, this diagram could have been vertical, or horizontal. But it is not. Communication is positioned below the ‘event’ and its perception. The ‘empirical’ world and ‘pure’ observation are Ideal. But this Ideal is also depicted in isolation from our ’statements’ about it, and our perceptions of these statements. This is what the lower section of the diagram, the Real, tells us. Perception is secondhand, filtered through culture and language, which, as the double-headed arrows indicate, feed back into our perception of nature, and hence into nature itself. The diagram tells us that reality does exist, but that our perception of it can only be ’subjective, selective, variable and unpredictable’ (McQuail and Windahl, 1993: 25!.

Figure 6.10 is another one of our original examples from late 1980.s Australian women’s magazines, but it remains a good example of the combination of horizontal and vertical structuring. Ideal is the moment, one might say, that ‘marriage was made in Heaven’. Modality is ‘distant’, representing the ‘not now’, the ‘out of time’. The bottom section, by contrast, represents the world of ‘is’, ‘now’, ‘in our time’.

Fig 6.8 God Shows Death to Adam and Eve (French, fifteenth-century miniature from ms. of De civitate Del) (from Hughes, 1969)

Given is the royal couple, presented as the quintessential couple, the well-established symbol of family values. New are Sydney’s Gwen and Ray Kinkade, an instance, an example of these values. Hence what is Given is the pre-eminence (historically, socially, semiotically) of the royal couple as the paradigm example of the married couple. What is New is one instance of the paradigm – where many others of a set of acceptable instances would have served equally well. This distinction is perhaps sharpest in the bottom part. The two pictures in the top part become, at the safe distance of forty years, almost identical, an equation between equal terms.

In one, and a very real sense, the place of the New seems merely perfunctory: it is the place of the replication of the paradigm, of the reproduction of the existing classifications of the culture, the place where the underlying values of the culture are reaffirmed. The New instantiates and ‘naturalizes’ these values. But that very fact also makes the position problematic, for it is at the same time the place of the affirmation of what is, the place of the reproduction of social meanings, and the place where the contestation of paradigmatic values can take place, the place therefore of the constant production of social meanings (e.g. of new definitions of ‘women’s work’ in figure 6.2), even when that production seems to be mere reproduction and hence conservative in its effects. Could there, for instance, have been a Vietnamese or Lebanese or Aboriginal couple in this position (in 1987), not to mention a gay couple? This contestation over ‘established’, ‘G iven’ values may happen in one or two ways: a reader who is not Anglo-Australian will either identify with the syntagm of Anglo-Australianness, ‘assimilate’, in other words; or will refuse the syntagm as having no relevance or value to him or her. In the latter case there will be pressure on this place in the syntagm, and this in turn will result in pressure on the paradigm as a whole.

There is another aspect to this: while the syntagm declares itself as unquestionably established, its appearance points at the same time to a problem with the paradigm, to the need precisely for a testing and (re)affirmation of its legitimacy. Read from the right to the left, the syntagm declares that it is the willingness of readers to read it as a relation of identity (within a hyponymic structure) which gives legitimacy to the royal couple. Royalty is the established, the Given. What has to be reaffirmed anew is that subjects are still prepared to enter into this paradigmatic relation. A monarchy trying to establish itself, on the other hand, might need to utilize a structure where the power of the people is represented as Given, and the identity of the monarch is to be established – that is, the royal couple would appear on the right.

Thus this syntagm reveals a number of social facts: what is regarded as established and Given; what the cultural classification system is with respect to a certain feature; and whether the system is progressive or reactionary. It is above all a syntagm which does not permit deviance; or, rather, once an item is in the syntagm, it has to be read as being in the paradigm. Where it does permit deviance is on the part of the reader, who can refuse to be part of the community defined by this paradigm.

In the Western visual semiotic, then, the syntagmatic is the realm of the process of semiosis, and the top-bottom structure the result and record of semiosis, the realm of order, the paradigm, the mimetic representation of culture (H odge and Kress, 1988!. To maintain and unsettle top-bottom structures, one has to work on the left-right structures. That this system goes back a long way in Western art can be seen in genres such as fifteenth-century Flemish diptychs, which, for instance, may have the Virgin and the Child as Given, and a donor or Saint as New, as in the diptych by the master of Bruges in the Courtauld Gallery, and polyptychs from the same period, which may parallel a Real (earthly) and Ideal (heavenly) version of the same theme in the lower and upper part of the panels, as in Bosch’s Last Judgement, where the lower part of one of the left panels shows Adam and Eve being driven from the Garden of Eden and the upper part the expUlsion of the Rebel Angels from Heaven.

As we have said in the Introduction, we are largely concerned with the description of the visual semiotic of Western cultures. Cultures which have long-established reading directions of a different kind (right to left, or top to bottom) are likely to attach different values to these positions, as shown in figure 6.3. In other words, reading directions may be the material instantiations of deeply embedded cultural value systems. Directionality as such, however, is a semiotic resource in all cultures. All cultures work with margin and centre, left and right, top and bottom, even if they do not all accord the same meanings and values to these spatial dimensions. And the way they use them in their signifying systems will have relations of homology with other cultural systems, whether religious, philosophical or practical.

We will end this section with one further example of the uses of Given and New, the way in which Rembrandt used Given and New for the expression of affective aspects of meaning, and this especially in relation to the source and direction of light and the effect produced by that. In many, perhaps the majority, of Rembrandt’s paintings, whether in landscapes such as Landscape with a Stone Bridge or in portraits such as A Young Woman in Bed or Double Portrait of the Mennonite Preacher Cornelius Claesz Anslo and his Wife Aeltje Gerritsdr Schouten (figure 6.11), the light source is outside the left frame of the picture and illuminates mainly the left part, leaving the right of the painting in greater or lesser darkness. Iconographically speaking the metaphoric range of light is wide – light can signify ‘the divine’, ‘illumination’, ‘hope’, etc. In these paintings light, whatever its meaning, is in the area of the Given, the taken for granted, the now/present. ‘Light’ is Given, ‘darkness’ New.

The height of the unseen light source also varies: in Young Woman it is on or just below the centre; in Double Portrait it comes from a position above the halfway mark, perhaps two-thirds of the way up; and in Landscape it comes from somewhere high up, near the top corner and in the near middle distance. That is, light may be in the area of the Real coming from a ‘mundane’ source or it may be ‘divine’. In other paintings the light comes from within the painting; for instance, in The Holy Family on the Flight to Egypt, where it forms the (divine) light ‘in the world’ (there is also a second, faint light coming from outside, in the sky above). In Belshazzar’s Feas( by a most unusual contrast, the light source (the glowing script announcing the doom of the king) is situated in the top-right quadrant – the space of the New and the ‘Ideal’/'divine’. The variation in the source and directionality of light thus has a complex set of meanings. It can contrast the secular/mundane and the divine/ideal; light as Given and taken for granted, and light as New and astonishing: and all these in variable combinations. In Double Portrai( for example, the light comes from outside the depicted world, is situated in the area of the Given (the area where the scriptures are depicted), and comes from just above the midway point between Ideal and Real, so that it could be interpreted as ‘divine’, yet close to the Real. One overwhelming effect is the brightness of the area of the Given, and the total darkness of the area of the New (the future?) to which the two figures have, in any case, turned their backs. Are we entitled to read from this autobiographical, affective meanings – perhaps a deep, pervasive pessimism about both the future, the New, and the present, the Real, which then contrasts with a feeling of security about what was, a faith in a divine light from the past certainty, which entails that we must turn our back on the New, on the future? If so, these affective, personal meanings are surely as significant as social and cultural meanings and, of course, related to them.

THE INFORMATION VALUE OF CENTRE AND MARGIN

Visual composition may also be structured along the dimensions of centre and margin. The most typical manifestations of this can be found in children’s drawings or, for example, in Byzantine art. As Arnheim (1982: 73) notes,

In the Byzantine churches the dominant image of the divine ruler holds the centre of the apse. In portrait paintings, a pope or emperor is often presented in central position. More generally, when the portrait of a man shows him in the middle of a framed area, we see him detached from the vicissitudes of his life’s history, alone with his own being and his own thoughts. A sense of permanence goes with the central position.


Figure 6.12 is an example – a Buddhist painting in which the central figure is surrounded by a circle of subordinates. Arnheim in fact makes the centre the crucial element of his theory of composition, conceiving of the visual objects in a composition as ’so many cosmic bodies attracting and repelling one another in space’ (1982: 207).

In contemporary Western visualization central composition is relatively uncommon, though here too there may be changes in train. Most compositions polarize elements as Given and New and/or Ideal and Real. But when one of us was teaching on a media design course in Singapore, he found that central composition played an important role in the imagination of young Asian designers. Perhaps it is the greater emphasis on hierarchy, harmony and continuity in Confucian thinking that makes centring a fundamental organizational principle in the visual semiotic of their culture. Much of the work produced by these students had strong dominant centres, surrounded or flanked by relatively unpolarized marginal elements.

While many Anglo-Western tabloid newspapers tend to adhere to a basic left-right structure in the layout of their front pages, others place the main stories and photographs in the top section. The front pages of the business sections of the Sydney Morning Herald, however, for a time invariably used central composition, featuring a large photo (or, frequently, drawing) in the centre of the page: for instance, Asian students entering the neo-Gothic Quadrangle of the University of Sydney when the page featured articles on education as a money earner for the country’s economy; a cartoon-like drawing of two men playing Monopoly (based on Van Doesburg’s Card-players – see figure 5.6), when corporate takeovers dominated the news; and so on. Such pictures provided a symbolic kernel for the issues of the day, and a centre for the elements arranged around them – news stories at the top and to the left, as, still, the Ideal and the Given of the newspaper, even if now somewhat marginalized; advertisements as the Real; and a column of expert commentary as New, hence as the element to which readers should pay particular attention.

Figure 6.13 shows a diagram from a tourism studies textbook in which ‘going on holiday’ is the core issue, and in which a range of reasons for going on holiday is arranged around this Centre, without any sense of polarization.

To generalize, then, if a visual composition makes significant use of the Centre, placing one element in the middle and the other elements around it, we will refer to the central element as Centre and to the elements around it as Margins. For something to be presented as Centre means that it is presented as the nucleus of the information to which all the other elements are in some sense subservient. The Margins are these ancillary, dependent elements. In many cases the Margins are identical or at least very similar to each other, so that there is no sense of a division between Given and New and/or Ideal and Real elements among them. In other cases – for instance, the newspaper pages we discussed above – Centre and Margin combine with Given and New and/or Ideal and Real.

Not all Margins are equally marginal. Circular structures can create a gradual and graded distinction between Centre and Margin, as for instance in the communication model by Andersch et al. in figure 6.14, where the process of ’structuring’ is more marginal than the process of ‘evaluating’. In this model, nature (the ‘environment’) is Centre, origin and prime mover of communication. Compared to the dominant position of nature, communication is a marginal phenomenon, just as, in the medieval maps of cities we discussed in chapter 3, the cities themselves were placed in the centre and depicted with topographical accuracy, while the surrounding countryside was represented on a smaller scale, and with less accuracy. Verbal commentaries do not necessarily try to ‘translate’ such meanings. Watson and Hill <1980: 76), for example, say that in this model the ‘message’ is ‘interacting with factors in the environment’. Yet, the model itself represents the relation between communication and the ‘environment’, not as interaction, but as a one-way process, a ‘nontransactive reaction’, according to our terminology in chapter 2 (there is an arrow only from the ‘environment’ to the communicative processes that surround it!. And ‘interacting’ suggests greater equality between the ‘message’ and the ‘factors in the environment’ than does the centred composition of the model.

As we have seen, Given-New and Ideal-Real can combine with Centre and Margin. Dividing visual space according to these dimensions results in the figure of the Cross, a fundamental spatial symbol in Western culture (see figure 6.15). Just how marginal the margins are will depend on the size and, more generally, on the salience of the Centre. But even when the Centre is empty, it continues to exist in absentia, as the invisible (denied) pivot around which everything else turns, the place of the ‘divine ruler’. The relative infrequency of centred compositions in contemporary Western representation perhaps signifies that, in the words of the poet, ‘the centre does not hold’ any longer in many sectors of contemporary society.

One common mode of combining Given and New with Centre and Margin is the triptych.

In many medieval triptychs there is no sense of Given and New. The Centre shows a key religious theme, such as the Crucifixion or the Virgin and Child, and the side panels show Saints or donors, kneeling down in admiration. The composition is symmetrical rather than polarized, although the left was regarded as a slightly less honorific position. In the sixteenth century altarpieces become more narrative, showing, for instance, the birth of Christ or the road to Golgotha in the left panel, the Crucifixion on the centre panel, and the Resurrection on the right panel. This could involve some polarization, albeit subordinated to the temporal order, with the left as the ‘bad side’ (e.g. the transgression of Adam), the right as the ‘good side’ (e.g. the Resurrection) and the middle panel representing Christ’s role as Mediator and Saviour (e.g. the Crucifixion!. Bosch’s Last Judgement (and also his Earthly Delights) inverts this, showing on the left the Garden of Eden and on the right a cataclysmic vision of Hell in which there is no place for the ‘ascent of the blessed’.

The triptychs in modern magazines and newspaper layouts are generally polarized, with a ‘Given’ left, a ‘New’ right, and a centre which bridges the two and acts as ‘Mediator’. In August 2004, the top banner of Nokia’s website showed on the left an image of a fashionable woman and on the right a Nokia imaging phone. The text in the Centre connected the two. In fact it consisted of two alternating texts. First we read ‘Inspiringly. Welcome to London Fashion Week’, then this first text made way for ‘Inspiringly different. The new official imaging phone of London Fashion Week’. The concept of fashion is Given, and the Nokia imaging phone as a fashion accessory New.

Triptychs can also be used to structure diagrams. Iedema et al. n 994: 217) shows how the left column of the organizational chart of a local council lists that council’s ‘Corporate Services’, so making the ‘administrative and financial backbone of the organization’ Given, while the council’s’ Development and Environmental Services’, the department which ‘connects all other departments’, is described in the central column, as the Mediator. In a lecture on social cognition attended by the authors, the lecturer used the blackboard (conventional blackboards also have a triptych structure!) to list, on the left panel, a number of key issues in linguistics (this was Given because most of those present were linguists and students of applied linguistics), on the right panel a number of issues in sociology this was New, as the linguists were meeting to discuss the social relevance of discourse analysis), and on the central ‘panel’ an outline of his own theory of social cognition, which he presented as the necessary link between the two fields, and as an issue that should be the central concern of those present at the meeting.

Vertical triptychs are also common in websites. The triptych from the University of Oxford website (figure 6.16) can be interpreted as a simple Margin-Centre-Margin structure, though there is some polarization in that the top image is a ‘long shot’ and the bottom image a close up. Overall, however, the student is Centre here, while images of history and tradition surround and support her.

The triptych in figure 6.17 comes from a German junior high-school politics textbook (Nitzschke, 1990!. As the Ideal, we see (in colour) immigrants (Auslander, ‘foreigners’) in high-status professions. As the Real, we see ‘foreigners’ in low-status professions. This Real is divided into a Given and a New, with a colour photo as Given and a black-and-white photo as New, as though, in the 1990s, the low status of immigrants should be looked at in a more sober light, and no longer as ‘Given’ as it once was. In the Centre we see, again in black and white, a single immigrant worker cleaning a train. The accompanying text encourages students to explore what would happen if ‘one day all foreign workers had to leave’. What, it asks, would be the consequences for the building industry, the children of the workers, the owners of hostels, the workers themselves, the managers of hospitals and cleaning firms? In other words, this triptych (itself the New on the double page in which it appears) tells us that foreign workers should, perhaps, ideally be able to move into high-status positions, but in reality are needed to do ‘our’ menial jobs. The central image is an attempt to overcome, or at least mitigate, this contradiction. It shows a worker who, like the high-status immigrants in the Ideal, is depicted as an individual, and as involved in ‘clean’ work, but who, also like the workers shown in the Real, has a low-status job – and is shown in the sober, documentary modality of black-and-white realism.

The structure of the triptych, then, can be either a simple and symmetrical MarginCentre-Margin structure or a polarized structure in which the Centre acts as a Mediator between Given and New or between Ideal and Real (see figure 6.18)

In this and the preceding section of this chapter, we have not drawn any parallels with language. Though spoken English has its own Given-New structure, this is not the case with the Ideal-Real and the Centre-Margin structures. This is not to say that the meanings these structures express cannot, in some form, be expressed in language, but rather that they are more readily and frequently expressed visually, and that language, unlike visual communication, has not developed ‘grammatical’ forms to express them. As we have emphasized throughout this book, sometimes language and visual communication express the same kind of semantic relations, albeit in very different ways, but there are also many types of semantic relation which are more often and more easily expressed visually, just as there are others which are more often and more easily expressed linguistically, with epistemological consequences of the kind we discussed in the Introduction and chapter 1.

SALIENCE

The fundamental function of integration codes such as composition is textual. Integration codes serve to produce text, to place the meaningful elements into the whole, and to provide coherence and ordering among them. So far we have discussed how composition determines ‘where things can go’ and how the positioning of the elements in a composition endows these elements with different information values in relation to other elements. But the composition of a picture or a page also involves different degrees of salience to its elements. Regardless of where they are placed, salience can create a hierarchy of importance among the elements, selecting some as more important, more worthy of attention than others. The Given may be more salient than the New, for instance, or the New more salient than the Given, or both may be equally salient. And the same applies to Ideal and Real and to Centre and Margin.

The same phenomenon occurs in temporally integrated texts. Rhythm always involves cycles which consist of an alternation between successive sensations of salience (stressed syllables, accented notes, etc.! and non-salience (unstressed syllables, unaccented notes) and these cycles repeat themselves with the time intervals that are perceived as equal even when, measured objectively, they are not. The perception of salience, in speech as in music, results from a complex interplay between a number of auditory factors: the duration of the strong and weak elements of the cycle (‘Iong’-’short’), the pitch of the strong and the weak elements (‘high’-'Iow’) their loudness (‘Ioud’-’soft’), and in speech also the vowel colour (vowels may be fully pronounced, for instance the first ‘e’ in element, or pronounced as a ’schwa’, like the second ‘e’ in elemen( or the second ‘a’ in alabaster!. Indeed anything that can create an auditory contrast between successive sounds can serve to realize salience. And even when objective clues for salience are absent, the first element of each cycle can be perceived as ’stronger’: perception imposes rhythm, waves of salience and non-salience on sound (and on movement) even when, strictly speaking, there is none.

When composition is the integration mode, salience is judged on the basis of visual clues. The viewers of spatial compositions are intuitively able to judge the ‘weight’ of the various elements of a composition, and the greater the weight of an element, the greater its salience. This salience, again, is not objectively measurable, but results from complex interaction, a complex trading-off relationship between a number of factors: size, sharpness of focus, tonal contrast (areas of high tonal contrast – for instance, borders between black and white – have high salience), colour contrasts (for instance, the contrast between strongly saturated and ’soft’ colours, or the contrast between red and blue), placement in the visual field (elements not only become ‘heavier’ as they are moved towards the top, but also appear ‘heavier’ the further they are moved towards the left, due to an asymmetry in the visual field), perspective (foreground objects are more salient than background objects, and elements that overlap other elements are more salient than the elements they overlap), and also quite specific cultural factors, such as the appearance of a human figure or a potent cultural symbol. And, just as rhythm creates a hierarchy of importance among the elements of temporally integrated texts, so visual weight creates a hierarchy of importance among the elements of spatially integrated texts, causing some to draw more attention to themselves than others.

Being able to judge the visual weight of the elements of a composition is being able to judge how they ‘balance’. The weight they put into the scales derives from one or more of the factors just mentioned. Taken together, the elements create a balancing centre, the point, one might say, from which, if one conceived of the elements as part of a mobile, this mobile would have to be suspended. Regardless of whether this point is in the actual centre of the composition or off-centre, it often becomes the space of the central message, and this attests to the ‘power of the centre’ (Arnheim, 1982) to which we have alluded already, a power which exerts itself even if the Centre is an empty space around which the text is organized – d. Barthes’ remarks about the ‘empty heart of Tokyo’ (1970: 44).

Perspective produces centres of its own, and by doing so contributes to the hierarchization of the elements in compositions. As a result viewers may relate to compositions in two ways: perspectivally, in which case the composition is ostensibly based on the viewer’s perspective/position; or non-perspectivally, in which case the composition is not based on the viewer’s position/perspective. In the former case the viewers, face-to-face with the infinite recess of perspective, become themselves the centre of the composition, thus taking the place o( for example, the deities in Byzantine or Buddhist paintings. In the latter case the representation is coded from an internal point of view, as is borne out by the fact that what is left and what is right is judged from the point of view of the represented participants rather than from the point of view of the viewer. Uspensky (1975: 33-9) has documented this with respect to icon-painting. He cites traditional guides for icon-painters which state, for instance, ‘On the right, or good side, is Mount Sinai, on the left, or bad side, Mount Lebanon’, and then shows how, from the viewer’s point of view, Mount Sinai is on the right and Mount Lebanon on the left. He adds that this is a general feature of pre-Renaissance art, and also of primitive cartographic drawing.

In the theory of art, composition is often talked about in aesthetic and formal terms (‘balance’, ‘harmony’, etc.!. In the practice of newspaper and magazine layout it is more often discussed in pragmatic terms (does it ‘grab the readers’ attention’?!. In our view these two aspects are inextricably intertwined with the semiotic function of composition. As we have seen, in many magazine advertisements (e.g. plate 2) the top section, the ‘promise of the product’, is the most salient element due to its size. This suggests, not just that such advertisements attempt to make readers notice the attractive picture first, so as to ‘hook’ them, but also that Ideal and Real are ranked in importance and opposed to each other in this way. Composition is not just a matter of formal aesthetics and of feeling, or of pulling the readers (although it is that as well); it also marshals meaningful elements into coherent texts and it does this in ways which themselves follow the requirements of mode-specific structures and themselves produce meaning.

Rhythm and balance also form the most bodily aspects of texts, the interface between our physical and semiotic selves. Without rhythm and balance, physical coordination in time and space is impossible. They form an indispensable matrix for the production and reception of messages and are vital in human interaction. Moreover, it is to quite some degree from the sense of rhythm and the sense of compositional balance that our aesthetic pleasure in texts and our affective relations to texts are derived.

FRAMING

The third key element in composition is framing. In temporally integrated texts framing is, again, brought about by rhythm. From time to time the ongoing equal-timed cycles of rhythm are momentarily interrupted by a pause, a rallentando, a change of gait, and these junctures mark off distinct units, disconnect stretches of speech or music or movement from each other to a greater or lesser degree. Where such junctures are absent, the elements are connected in a continuous flow. In spatially integrated compositions it is no different. The elements or groups of elements are either disconnected, marked off from each other, or connected, joined together. And visual framing, too, is a matter of degree: elements of the composition may be strongly or weakly framed.

The stronger the framing of an element, the more it is presented as a separate unit of information. Context then colours in the more precise nature of this ’separation’. The members of a group, for instance, may be shown in a group portrait (as in group photos of school classes or employees of a company) or in a collage of individual photos, marked off by frame lines and/or empty space between them (as with photos of the managers of a company in a company brochure!. The absence of framing stresses group identity, its presence signifies individuality and differentiation. In figure 6.1, framing acquires dramatic significance. The left post of the door and the dividing line between the light and dark boards on the roof create a frame line which, literally and figuratively, separates Minus from his sister, expressing the communicative gap between them. In film and video a similar effect can be created by the choice between showing two or more actors together in one shot, or editing between individual shots of the actors in which each is isolated from the others by frame lines.

The more the elements of the spatial composition are connected, the more they are presented as belonging together, as a single unit of information. In the Nokia triptych referred to above, for instance, there are no frame lines to demarcate the elements of the triptych strongly from each other. There is a sense of continuous flow from left to right. But in figure 6.16 the ‘panels’ of the triptych are separate units – there is a sharp demarcation here between past and present. The same applies to figure 6.17, where the empty space between the top and the central ‘panel’ and the colour contrast (the top panel is in colour, the middle panel in black and white) create a strong division between the Ideal and the reality of immigration. The example also illustrates the many ways in which framing can be achieved – by actual frame lines, by white space between elements, by discontinuities of colour, and so on.

Connectedness, too, can be realized in many ways. It can be emphasized by vectors, by depicted elements (structural elements of buildings, perspectivally drawn roads leading the eye to elements in the background, etc.! or by abstract graphic elements, leading the eye from one element to another, beginning with the most salient element, the element that first draws the viewer’s attention. In figure 6.2 the tilting of the left-hand photo forms a vector leading the eye from left to right, and the repetition of the colour gold in all the elements of the two pages privides a strong sense of unity and cohesion – visual ‘rhymes’ of this kind, repetition of colours and shapes in different elements of the composition, form another key connection device, often used in advertisements to stress the connection between the ‘promise of the product’ and the product itself (ct. also the colour-coordination in the Sony homepage!.

It should finally be noted that, at a deeper level, there is also an element of framing in styles of drawing and painting. In line drawings, for instance, the outlines of objects strictly demarcate them from their environment, whereas in certain styles of painting (e.g. Impressionism) they are set apart from their environment only by subtle transitions of colour.

LINEAR AND NON-LINEAR COMPOSITIONS

In densely printed pages of text, reading is linear and strictly coded. Such texts must be read the way they are designed to be read – from left to right and from top to bottom, line by line. Any other form of reading (skipping, looking at the last page to see how the plot will be resolved or what the conclusion will be) is a form of cheating and produces a slight sense of guilt in the reader. Other kinds of pages (e.g. traditional comic strips) and images (e.g. timeline diagrams) are also designed to be read in this linear way.

The pages we have described in this chapter are read differently – and can be read in more than one way. Their reading path is less strictly coded. Readers of magazines, for instance, may flick though the magazine, stopping every now and again to look at a picture or read a headline, and perhaps later returning to some of the articles which drew their attention, and websites are specifically designed to allow multiple reading paths. Yet in many pages composition does set up particular hierarchies of the movement of the hypothetical reader within and across their different elements. Such reading paths begin with the most salient element, and from there move to the next most salient element, and so on. Their trajectories are not necessarily similar to that of the densely printed page, leftright and top-bottom, but may move in a circle, as in figure 6.2, where the gold being poured is the most salient element, because of its extreme brightness (somewhat reduced in reproduction), the photo of the two gold-diggers the next most salient, the headline the third most salient, and the text the next most salient – but it may also be that the vector formed by the tilting of the left photograph leads the eye back to the larger photo, and so on, in circular fashion. Whether the reader only ‘reads’ the photos and the headline, or also part or all of the verbal text, a complementarity, a to and fro between text and image, is guaranteed. For anyone reader the photograph or the headline may form the starting point of the reading. Our assumption is that the most plausible reading path is the one in which readers begin by glancing at the photos, and then make a new start from left to right, from headline to photo, after which, optionally, they move to the body of the verbal text. Such pages can be ’scanned’ or read, just as pictures can be taken in at a glance or scrutinized for their every detail. We deliberately make a modest claim here and speak of the ‘most plausible’ reading path, for this type of reading path is not strictly coded, not as mandatory, as that of the densely printed page or the conventional comic strip. Different readers may follow different paths. Given that what is made salient is culturally determined, members of different cultural groupings are likely to have different hierarchies of salience, and perhaps texts of this kind are the way they are precisely to allow for the possibility of more than one reading path, and hence for the heterogeneity and diversity of their large readership.

As non-linear texts become more common, even densely printed pages of text begin to be read differently. The scientist, reading a journal of organic chemistry, will glance at the diagrammatic representations of organic compounds before deciding whether or not to read the paper or, when reading that only one rat has been used in the experiment, skip to find out first why this was done (G ledhill, 1994!. Students preparing for their exams will use the index of the textbook to find out and highlight the passages they need, rather than read the textbook from cover to cover. The more a text makes use of subheadings, emphatic devices (italics, bold type, underlining), numbered lines of typical elements or characteristics of some phenomenon, tables, diagrams and so on, the more likely it is to be scanned, skip-read, ‘used’ rather than read: linear reading is gradually losing ground.

We noted that reading paths may be circular, diagonal, spiralling, and so on. As soon as this possibility is opened up, as soon as there is a choice between differently shaped reading paths, these shapes can themselves become sources of meaning. If the reading path is circular, one reads outwards, in concentric circles, from a central message which forms the heart, so to speak, of the cultural universe. If the reading path is linear and horizontal, it constitutes a progression, moving inexorably forwards towards the future (or backwards, towards the ‘origin’ of all things). If it is vertical, a sense of hierarchy is signified, a movement from the general to the specific, from the ‘headline’ to the ‘footnote’. The shape of the reading path itself conveys a significant cultural message.

Sixteenth-century books of emblems explicitly described the meanings of different kinds of reading path. The reading path of figure 6.19, an illustration from a Flemish book of emblems, is a spiral, which was an emblem for the inexorable progress of time. It is also a serpent, so that the reading proceeds from the tail, a low, base element, to the head, a superior element. Alain-Marie Bassy (1975: 303-5) explains the sequence of the emblematically expressed meanings one encounters as one follows the spiral from the centre outwards: the hand (‘work’), the head (‘intelligence’), the tail of the serpent (a ‘base’ element), the hand which holds down and imposes its will on the tail. Joining these meanings together results in the visual proposition also expressed in the title of the picture: ‘Ex literatum studiis inmortalitem acquiri’ (‘Through intellectual endeavours I gained immortality’!. Today, the study of the meaning of new kinds of reading paths has barely begun.

Analysing reading paths with students, we found that some are easy to agree on, others harder, again others impossible. This was not, we think, because of a lack of analytical ability on our part or on the part of our students, but because of the structure of the texts themselves. Texts encode reading paths to different degrees. Some, though no longer densely printed pages, still take the readers by the hand, guiding them firmly through the text. Others (we might call them ’semi-linear’ texts) at best provide readers with a few hints and suggestions, and for the rest leave the readers to their own devices. In again others we can, with the best will in the world, not detect any reading path that is more plausible than any number of others. In figure 6.20, a comic strip from the magazine Cracke~ the headline stands out and this, together with the strong vector formed by the waterslide on the left page, predisposes us to start our reading top left. But where the eye will move from here is difficult to predict. There is neither chronology (despite the resemblance to a flowchart) nor a clear hierarchy of salience.

Increasingly many texts (newspapers, billboards, comic strips, advertisements, websites) are of this kind. They offer the reader a choice of reading path and, even more so than in the case of texts where a plausible reading can be discerned, leave it up to the reader how to traverse the textual space. They are ‘interactive’ – and it is perhaps no accident that they have their clearest antecedent in the genre of the ‘activities’ books which offer children a choice of puzzles, riddles, colouring-in pictures, etc. for a rainy day during the holidays. This is not to say that the order of the elements on such pages is random. The comic strip, for instance, still has its ‘welcome’ sign at top left, and its most gruesome images in the Real, a division between depictions of holiday fun and of sadistic torture that recalls the division between the Garden of Eden and Death in figure 6.8.

Linear texts, then, are like movies, where the viewers have no choice but to see the images in an order that has been decided for them, or like an exhibition in which the paintings are hung in long corridors through which the visitors must move, following signs perhaps, to eventually end up at the exit. In non-linear texts viewers can select their own images and view them in an order of their own choosing. They are like an exhibition in a large room which visitors can traverse in any way they like. But, again, the way these exhibits are arranged will not be random. It will not be random that a particular major sculpture is placed in the centre of the room, or that a particular major painting has been hung on the wall opposite the entrance, to be noticed first by all visitors entering the room.

Linear texts thus impose a syntagmatics on the reader, describe the sequence of, and the connection between, the elements. As a result the meanings of individual elements can be less strictly coded, as for instance in documentary films, where the meaning of the individual shots can be largely determined by the editing, rather than by the intrinsic meanings of the shots. Non-linear texts impose a paradigmatics. They select the elements that can be viewed and present them according to a certain paradigmatic logic – the logic of Centre and Margin or of Given and New, for instance – but leave it to the reader to sequence and connect them. In the design of such texts there will be pressure to put more of the meaning in the individual elements of the composition, to use more highly coded images – symbolic and conceptual images, tightly written, self-contained items of information, stereotyped characters, drawings or highly structured images rather than realistic photographs, and so on. Linear and non-linear texts thus constitute two modes of reading and two regimes of control over meaning, exactly in the same way as we discussed in chapter I, in connection with Baby’s First Book (figure 1.1) and the page from Dick Bruna’s On My Walk (figure 1.2).

A SUMMARY


Figure 6.21 provides a summary of the distinctions we have introduced in this chapter. The double-headed arrows (t) stand for graded contrasts (‘more or less’, rather than ‘either-or’!. The superscript ‘I’ means ‘if’ and the superscript ‘T’ means ‘then’. In other words, ‘if there is no horizontal polarization, then there must be vertical polarization’ the opposite follows from this. In the next section we will discuss a number of examples in greater detaiI.

GIVEN AND NEW IN CHILDREN’S DRAWINGS AND CD-ROMs

In any sequential structure, that which is about to be said or shown is by definition always New, not yet known. By contrast, what has (just) been seen, heard, discovered is, by comparison, now known, Given. In visual media, sequence can of course be represented in a number of dimensions, right to left, bottom to top, in a spiral from outside, etc. (and in medieval painting perspective can indicate sequence, with the foreground as the present and the background as the future!. Such dimensions have been used throughout history, and are still used by different cultures, as primary visual sequencing orientations. The medium of the book, bringing the possibility of turning the page, adds a further means of reprinting sequence visually, the left page/right page structure and the possibility of the two-page structures (right page and following left page!)

Figure 6.22 shows a double page from a book produced by a six-year-old boy while staying in Paris for half a year with his parents. It records events and experiences he was involved in, and sights and objects he encountered during his stay in Paris. Clearly, in this situation everything was New for the child, literally. He was faced with the question of how to represent new information, new ideas, new objects, without the possibility of relating them to already-established, known domains.

The book opens with the name and address of the author, on the first lefthand page.

This is the Given for the book as a whole, an element of security and familiarity in the new environment. On the first righthand page that new environment is represented visually: a picture of the Eiffel Tower. It is only when this page is turned that the picture is named, commuted into language. Once named, the Eiffel Tower becomes Given, and on the adjoining righthand page the child faces the next aspect of his new environment. Thus the book continues: the new picture, too, is only identified on the next lefthand page – the Arc de Triomphe. The child obviously realized that this structure could be misunderstood, and used left-facing arrows to refer the reader to the picture on the previous page. But his impulse was to first visually represent Paris as the New, and then to master it, make it known and Given by means of language, by means of naming it. H is attitude was empirical and he used language as an ‘anchorage’ in his effort to come to terms with his new experiences.

We will end with an example that brings all the elements of this chapter together.

Figure 6.23 shows the first screen of an ‘edutainment’ CD-ROM for children, titled ‘3D Body Adventure’. The top of the screen shows a range of media on a desktop. A slide is projected on a screen. A video monitor shows an animated sequence. Half-hidden behind the monitor, a loudspeaker plays soft music. In other words, the Ideal here is what we might call ‘information media’, media to read, look at and listen to. The Real, on the other hand, presents things the user can do. It offers games to play, media to interactively engage with. ‘Emergency’, for instance, is a game which mixes laser surgery and the shooting gallery the player zaps brain cells in a race against time (‘ Hurry doctor, save the patient’!. And in ‘Body Recall’ body parts must be matched with their names. Thus the composition of the screen uses the vertical dimension to separate information-as-knowledge from information-as-action, or information-as-knowledge from information-as-entertainment.

And, while it continues to put the former, literally and figuratively, on a pedestal, it places real learning squarely in the zone of interactive activities. We might say that ‘entertaining activities’ are here represented as ‘consolidating’ (giving a firm ‘footing’ or ‘grounding’ to) authoritatively presented, ‘high’ knowledge. Reversing the two – putting the games on top and the information media at the bottom – would create a very different meaning, perhaps something like ‘knowledge provides a “foundation” for (“highly” regarded) active experiences’.

The screen also uses the horizontal dimension, and this in two ways. First, the left is the domain of the still image, and the right the domain of the animated ‘3D’ images, of the move from two-dimensional representation to ‘virtual reality’. Second, the left is the domain of what has already been formulated for the users, while the right is the domain of what users can do themselves: they can rotate the skeleton with their mouse so as to view the image from whichever angle they choose, and they can exit the screen at will. Note that the monitor straddles the boundary between Ideal and Real: like interactive games, user-activated 3D viewing has (still) some entertainment value, because of its novelty; but, like information media, it also has instruction value – the animated skeleton can serve as a stand-in for a real or reproduction skeleton and make a good learning aid for students. In other words, as we move from left to right, we move from the traditional 2 D diagram to the new animated 3 D diagram or drawing, and from the traditional ‘passive learner’ to the new ‘interactive’ mode of learning.

Another dimension used here is that of foreground and background. The loudspeaker is placed behind the monitor, which is congruent with the role played by sound and image in this CD-ROM: all information is provided visually, and the soundtrack only offers soft background music.

Most salient on the screen is the monitor image of the moving skeleton, and this for two reasons: it moves, and it displays the greatest tonal contrast. Next most salient are perhaps the names of the games. Although they do not occupy much space, their colours – bright red and yellow – contrast strongly with the cool whites, blues and greys that dominate the rest of the screen. And the images (the doors of the Emergency Ward and the’ Body Recall’ keyboard) are both sharper and more saturated in colour than the rest of the screen. Relative size can also establish salience, and as a result the ’slide’ with the X-Ray picture of the body and the title of the C D- RO M is perhaps the next most salient element. Which leaves the loudspeaker and the ‘exit’ sign.

From the point of view of framing, finally, the most significant ‘disconnection’ is that between the space of the interactive games and the rest of the screen. The games, against a brighter, more garish blue than can be found elsewhere on the screen, insert themselves into the more traditional, naturalistic continuity (and natural palette) of the desktop. They could have been placed on the desktop. But they are not. They are represented as a quite separate, ‘alien’ element, disrupting the natural perspectival homogeneity of the semiotic space. Within the picture of the desktop on the other hand, there is a sense of continuity, both because of the harmony of the muted colours, and because of the way the elements form part of a continuous, homogeneous, non-fragmented space. Thus the traditional media are represented as naturalistic and complementary to each other, but also as radically different from the new ‘interactive’ media.

The example shows that the composition of this screen positions the component modes of the multimodal text in relation to each other, making some playa foreground role, some a background role, presenting some as complementary to each other, others as each other’s opposites, and so on. It visually realizes a discourse of ‘edutainment’, and visually defines its characteristic relations and values, and the part played in it by different semiotic modes.

January 21, 2010

Chapter 5

Filed under: Uncategorized — admin @ 12:01 am

5 Modality:

designing models of reality

MODALITY AND A SOCIAL THEORY OF THE REAL

One of the crucial issues in communication is the question of the reliability of messages. Is what we see or hear true, factual, real, or is it a lie, a fiction, something outside reality? To some extent the form of the message itself suggests an answer. We routinely attach more credibility to some kinds of messages than to others. The credibility of newspapers, for instance, rests on the ‘knowledge’ that photographs do not lie and that ‘reports’ are more reliable than ’stories’, though since we wrote the first edition of this book the rise of Photoshop and ’spin’ have begun to undermine both these types of knowledge.

More generally, and with particular relevance to the visual, we regard our sense of sight as more reliable than our sense of hearing, ‘I saw it with my own eyes’ as more reliable evidence than ‘I heard it with my own ears’.

Unfortunately, we also know that, while the camera may not lie – or not much, at any rate – those who use it and its images can and do. The questions of truth and reality remain insecure, subject to doubt and uncertainty and, even more significantly, to contestation and struggle. Yet, as members of a society, we have to be able to make decisions on the basis of the information we receive, produce and exchange. And in so far as we are prepared to act, we have to trust some of the information we receive, and do so, to quite some extent, on the basis of modality markers in the message itself, on the basis of textual cues for what can be regarded as credible and what should be treated with circumspection. These modality markers have been established by the groups within which we interact as relatively reliable guides to the truth or factuality of messages, and they have developed out of the central values, beliefs and social needs of that group.

In this chapter we will discuss these modality cues. As throughout the book, we take them to be motivated signs – signs which have arisen out of the interest of social groups who interact within the structures of power that define social life, and also interact across the systems produced by various groups within a society. As we have discussed in the Introduction, the relation between the signifiers and signifieds of motivated signs is, in principle, one of transparency. Sign-makers choose what they regard as apt, plausible means for expressing the meanings they wish to express. We are therefore focusing on the range of signs from which such choices can be made – some of them specialized modality markers, others part of a much wider and more general range of means of expressing meanings of truth and falsehood, fact and fiction, certainty and doubt, credibility and unreliability.

A social semiotic theory of truth cannot claim to establish the absolute truth or untruth of representations. It can only show whether a given ‘proposition’ (visual, verbal or otherwise) is represented as true or not. From the point of view of social semiotics, truth is a construct of semiosis, and as such the truth of a particular social group arises from the values and beliefs of that group. As long as the message forms an apt expression of these beliefs, communication proceeds in an unremarkable, ‘felicitous’ fashion. This does mean, however, that our theory of modality has to account for a complex situation: people not only communicate and affirm as true the values and beliefs of their group. They also communicate and accord degrees of truth or untruth to the values and beliefs of other groups.

The term ‘modality’ comes from linguistics and refers to the truth value or credibility of (linguistically realized) statements about the world. The grammar of modality focuses on such modality markers as the auxiliary verbs which accord specific degrees of modality to statements, verbs like may/ will and must (the difference between He may come and He will come) and their related adjectives (e.g. possible/ probable/ certain) and adverbs (see Halliday, 1985: 85-9J. But modality is not only conveyed through these fairly clear-cut linguistic systems (see Kress and Hodge, 1978, 1993: 127). Take this example, from Our Society and Others (Oakley et al., 1985). Clearly, it not only contains statements such as ‘Aboriginal people had no religion’ and ‘the whole land was a cathedral’, but also indications of the truth value of these statements:

Governor Phillip, the settlers, and the convicts could find no churches or cathedrals or works of art like those of Britain. Perhaps this made them think that Aboriginal people had no religion. In fact, the Aborigines had very complicated religious beliefs. These had been passed down from one generation to the next through Dreamtime stories for thousands of years. For the Aborigines the whole land was a cathedral. Their art was joined to their religion. Much of their art had been kept safely for thousands of years. (Oakley etal., 1985: 142)

The statement ‘Aboriginal people had no religion’ is given low modality. The writers distance themselves from it by attributing it to ‘Governor Phillip, the settlers, and the convicts’ by formulating it as a subjective idea (‘this made them think that. . .’) and by using the past tense (after all, what was true in the past need not be true in the present). The writers’ own statements (e.g. ‘the Aboriginals had very complicated religious beliefs’) are not qualified in this way; they are formulated as objective facts (‘In fact, the Aborigines had. . .’) and not attributed (it is curious, however that they are not extended to the present time! J. The statements that embody the ‘beliefs’ of the Aborigines, finally, are given lower modality. They are explicitly attributed to the Aborigines and therefore not subscribed to by the writers, and they are qualified by terms like ’story’, ‘dream’ and ‘belief’ – terms which, in Western culture, signify low modality and are contrasted with high-modality terms such as ‘reality’, ‘fact’ and ‘truth’.

The example shows that modality is ‘interpersonal’ rather than ‘ideational’. It does not express absolute truths or falsehoods; it produces shared truths aligning readers or listeners with some statements and distancing them from others. It serves to create an imaginary ‘we’. It says, as it were, these are the things ‘we’ consider true, and these are the things ‘we’ distance ourselves from, for instance: ‘we’ have no religion, but the Aborigines do, and although this religion is true for ‘them’, it is not true for ‘us’. Nevertheless, as (for ‘us’) art and religion are not ‘joined’, ‘we’ can appreciate Aboriginal religion as ‘art’, as beautiful ’stories’ and ‘dreams’ (art, in Western culture, has lower modality than, for instance, science – hence the greater licence given to artists). We call the ‘we’ the text attempts to produce ‘imaginary’ because many of the children who are made to read the book may in fact ‘have religion’. However we realize that the social groupings discursively instituted in this way may be very real and may have very real effects on children’s lives.

The concept of modality is equally essential in accounts of visual communication. Visuals can represent people, places and things as though they are real, as though they actually exist in this way, or as though they do not – as though they are imaginings, fantasies, caricatures, etc. And, here too, modality judgements are social, dependent on what is considered real (or true, or sacred) in the social group for which the representation is primarily intended. Conside~ for instance, the ’speech circuit’ diagram from de Saussure’s famous Course in General Linguistics Cl974 0916J), shown once more in figure 5.1 (see also figure 2.18).

It depicts two humans, ‘A and ‘B’, and a process, circular and continuous, described as the ‘unlocking of sound-images in the brain’, followed by the transmitting of an impulse corresponding to the image to the organs used in producing sounds’, followed by the ‘travelling of the sound waves from the mouth of A to the ear of B (1974 0916J: 11-12), In another version (figure 5.2), de Saussure schematizes the diagram even further making it look almost like an electrical circuit.

The photograph in figure 5.3 also represents the speech process, or rather part of it, since we see only ‘A’ speaking, and only ‘8′ ‘unlocking sound-images in his brain’. It is a scene from Robert Aldridge’s movie The Big Knife Cl955), starring Rod Steiger and Jack Palance.

The three representations of the speech process differ in a number of ways. First, while the photograph restricts itself to representing what would normally be visible to the naked eye, the diagrams do not: they make visible what is normally invisible (mental processes, ’sound-images in the brain’) and they do show what can normally only be heard (’sound waves’). To do so they take recourse to abstract graphic elements (dotted and continuous lines, arrows) and to language. Second, while the photograph presents us with a moment frozen in time, the diagrams depict a process that takes a certain amount of time to unfold: one utterance of ‘A’ as well as one utterance of ‘B’, at the very least. Third, while the photograph depicts ‘A’ and ‘B’ in great detail, showing strands of hair wrinkles, glimmers of light in Steiger’s dark glasses, the diagrams reduce the two to schematic profiles, or even circles, minimal geometric shapes, abstract elements. And, while the photograph shows depth, modelling caused by the play of light and shade, and a setting, a background, the diagrams omit all of these. They are abstract and schematic where the photograph is concrete and detailed; conventionalized and coded where the photograph presents itself as a naturalistic, unmediated, uncoded representation of reality.

Does this mean that diagrams are less ‘real’ than photographs, and hence lower in modality, and that photography is more true than diagrammatic representation? Not necessarily. To the viewers for whom de Saussure’s diagrams are intended, they may in fact be more real than the photograph, in the sense that they reveal a truth which represents more adequately what the speech process is really like.

Reality is in the eye of the beholder; or rather, what is regarded as real depends on how reality is defined by a particular social group. From the point of view of naturalism reality is defined on the basis of how much correspondence there is between the visual representation of an object and what we normally see of that object with the naked eye (or, in practice, on the capacity of 35mm photography to resolve detail and render tonal or colour differentiation: images, including photographs, can be experienced as ‘hyper-real’, as showing ‘too much detail’, ‘too much depth’, ‘too much colour’ to be true). Scientific realism, on the other hand, defines reality on the basis of what things are like generically or regularly. It regards surface detail and individual difference as ephemeral, and does not stop at what can be observed with the naked eye. It probes beyond the visual appearance of things. In other words, reality may be in the eye of the beholder, but the eye has had a cultural training, and is located in a social setting and a history; for instance, in the community of linguists, or of semioticians in de Saussure’s day, a community which saw reality in that form, in terms of abstractions and deeper regularities. A ‘realism’ is produced by a particular group, as an effect of the complex of practices which define and constitute that group. In that sense, a particular kind of realism is itself a motivated sign, in which the values, beliefs and interests of that group find their expression.

As the examples suggest, definitions of reality are also bound up with technologies of representation and reproduction. The relatively recent change from the dominance of black and white to the dominance of colour in many domains of visual communication shows how quickly these histories can develop, and how closely they are related to techno¬logical change. For us, now, as common sense viewers, everyday members of society at large, the defining technology is perhaps still that of 35mm colour photography, as we suggested above. But the shift to digital photography is already creating a new standard for naturalism, which still aims at ever higher resolution, naturalistic colour rendition, and so on, but has in fact made a decrease in resolution and contrast to become acceptable as the norm in many domains.

Each realism has its naturalism – that is, a realism is a definition of what counts as real – a set of criteria for the real, and it will find its expression in the ‘right’, the best, the (most) ‘natural’ form of representing that kind of reality, be it a photograph, digital or otherwise, or a diagram. This is not to say that all realisms are equal. Although different realisms exist side by side in our society, the dominant standard by which we judge visual realism, and hence visual modality, remains for the moment, naturalism as conventionally understood, ‘photorealism’. In other words, the dominant criterion for what is real and what is not is based on the appearance of things, on how much correspondence there is between what we can ‘normally’ see of an object, in a concrete and specific setting, and what we can see of it in a visual representation – again, at least in theory, for in effect it is based on currently dominant conventions and technologies of visual representation. We judge an image real when, for instance, its colours are approximately as saturated as those in the standard, the most widely used photographic technology. When colour becomes more saturated, we judge it exaggerated, ‘more than real’, excessive. When it is less saturated we judge it ‘less than real’, ‘ethereal’, for instance, or ‘ghostly’. And the same can be said about other aspects of representation, the rendition of detail, the representation of depth, and so on. Pictures which have the perspective, the degree of detail, the kind of colour rendition, etc. of the standard technology of colour photography have the highest modality, and are seen as ‘naturalistic’. As detail, sharpness, colour etc. are reduced or amplified, as the perspective flattens or deepens, so modality decreases.

Like many other advertisements, the advertisement in plate 2 is a composite text. It shows a picture of the product (the jar of instant coffee), with a verbal caption, and a picture which visualizes the pleasure the product will afford. This picture, showing two lovers sharing an intimate moment, uses soft focus and soft colours, tending towards the same golden-brown hue, and so deliberately lowering modality, representing ‘what using the product will be like’ as fantasy or promise, as ‘what might be’, rather than as reality, as ‘what is’. The picture of the product itself, however, is in sharper focus and uses more saturated and differentiated colours: the product is given higher modality, higher reality value, than the promise of bliss attached to it, and the advertisement as a whole therefore accords varying degrees of ‘credibility’ to the different representations it contains, just like the text on ‘Aboriginal religion’. The lower modality of the photo of the romantic scene, however, is not a matter of the scene itself being improbable or fantastic (although that, too, often happens in advertising photographs). Probable as well as improbable events may have high or low modality. Just as one can say There certainly are ghosts (high modality) and I believe ghosts may exist (low modality), so one can also show realistic and not-so¬realistic depictions of ghosts.

What is the difference between these uses of colour? We would put it this way: the more that is taken away, abstracted from the colours of the representation, the more colour is reduced, the lower the modality. There is a continuum which runs from full colour saturation to the absence of colour, black and white, in which only the brightness values of the colours, their ‘darkness’ or ‘lightness’, remains. There is also a continuum which runs from full colour differentiation to a ‘reduced palette’ and eventually monochrome. For example, eighteenth-century landscape painting (e.g. Claude Lorrain) was often restricted to various shades of brown for the foreground and to desaturated, silvery blues for the distance. This is not the only way in which abstraction from ‘naturalistic’ colour is possible. The colour of many objects is not even. Pale skin, for instance, may vary in redness, may have the blue veins showing through, and so on, and such differences may either be rendered or abstracted from. In other words, colour may be idealized to a greater or lesser degree – a scale which runs from naturalistic photography via the choice of different values of a colour for the representation of light and shade, to the flat, unmodulated colour used by children in their drawings, or, for example, in the work of painters such as Matisse. Matisse was not a child, of course, when he produced the paintings we now admire. His unmodulated colours expressed a different view of what counts as real, as do the unmodulated colours in children’s drawings – we will comment on this in more detail later. From the point of view of naturalism, however, modality is decreased in such images. The continuum from modulated to flat colour is at the same time a continuum from high to low modality. And in both cases the rule applies: the greater the abstraction (away from saturation, differentiation and modulation), the lower the modality.

It should be stressed that what we are talking about is not abstraction from what we actually see, from ‘the real world’. The literature of other ages and cultures attests to the fact that people have marvelled at the ‘lifelikeness’ of works which, by our standards, are far from ‘naturalistic’. What we are talking about at this point is abstraction relative to the standards of contemporary naturalistic representation.

MODALITY MARKERS

So far we have discussed the role of colour as a marker of naturalistic modality, in terms of three scales:

  1. Colour saturation, a scale running from full colour saturation to the absence of colour; that is, to black and white.
  2. Colour differentiation, a scale running from a maximally diversified range of colours to monochrome.
  3. Colour modulation, a scale running from fully modulated colour, with, for example, theuse of many different shades of red, to plain, unmodulated colour.

At one end of these scales the particular dimension of colour is maximally reduced. At the other end it is most fully articulated, used to its maximum potential. Each point of the scale has a certain modality value in terms of the naturalistic standard. However, the point of highest modality does not coincide with either extreme of the scale: naturalistic modality increases as articulation increases, but at a certain point it reaches its highest value and thereafter it decreases again. Naturalistic modality scales could therefore be represented as in the following example:

We will now discuss the other key markers of visual modality on which we already touched in our discussion of figure 5.3.

  1. Contextualization, a scale running from the absence of background to the most fully articulated and detailed background.

Within the naturalistic coding orientation, the absence of setting lowers modality. By being ‘decontextualized’, shown in a void, represented participants become generic, a ‘typical example’, rather than particular, and connected with a particular location and a specific moment in time. The scale of ‘contextualization’ runs from ‘full contextualization’, to ‘plain, unmodulated background’. One step away from ‘full contextualization’ we find settings which are out of focus to a greater or lesser degree, or which lose detail through overexposure, resulting in a kind of ethereal brightness, or underexposure, resulting in muddy darkness, or through the loss of visual detail in the depiction. Further decontextualization can be achieved through ellipsis: a few ‘props’ suffice to suggest a setting, or a small, irregularly shaped patch of green under the feet of a figure with a few lines suggesting grass indicates the setting, while the rest of the paper is left blank. Or perhaps the background may merely show an irregular pattern of light and shade, or a field of unmodulated colour, or black, or white. Again, the most fully articulated background does not have the highest naturalistic modality. The limitations imposed by the resolution of standard 35mm photographic emulsions and by the depth of field of standard lenses have accustomed us to images in which the background is less articulated than the foreground. When the background is sharper and more defined than this, a somewhat artificial, ‘more than real’ impression will result – as, for instance in older Hollywood movies shot in a studio with back projection (a close-up of an actor in a car, in front of a rear window behind which we see the receding landscape in sharp focus), or in much Surrealist painting, such as in the work of Salvador Dali.

  1. Representation, a scale running from maximum abstraction to maximum representation of pictorial detail.

An image may show every detail of the represented participants: the individual strands of hair, the pores in the skin, the creases in the clothes, the individual leaves of the tree, and so on, or it may abstract from detail to a greater or lesser degree. Again, there is a point beyond which a further increase of detail becomes ‘hyper-real’ and hence lower in modality from the point of view of ‘photographic’ naturalism. Similarly, in discussing decontextualization, above, we have pointed out that reduced representation of detail may form one of the ways in which the modality of backgrounds, of what is ‘distant’, is lower than the modality of the foreground (there is a parallel here with the lower modality of the past tense in language). In photography it is not only sharpness of focus, but also exposure which can reduce detail. In artwork a variety of techniques could be ranked on a scale from maximum to minimum detail. Texture can become stylized, rendered by lines which trace the folds in the clothes, for example, and these lines may be many and fine, as in detailed engravings, or few and coarse, as in quick and ready styles of drawing. In medical drawings, for example, texture may become entirely conventional: dots to indicate the texture of one layer of skin, short, curved lines to indicate the texture of another. Texture can also be omitted altogether – the participant is then represented merely by the lines that trace its contour. Beyond this, the contour may be simplified to different degrees: a head may become a circle, the eyes two dots, the mouth a short, straight line. Diagrams and geometrical art take abstraction even further and reduce the shape of things to a small vocabulary of abstract forms, as in the paintings of Mondrian, or in figure 5.2, de Saussure’s schematized ’speech circuit’ diagram.

  1. Depth, a scale running from the absence of depth to maximally deep perspective.

By the criteria of standard naturalism, central perspective has highest modality, followed by angular-isometric perspective, followed by frontal-isometric perspective, followed by depth created by overlapping only. Again, perspective can become ‘more than real’, as when strong convergence of vertical lines is shown, or a ‘fish-eye’ perspective is used.

  1. Illumination, a scale running from the fullest representation of the play of light and shade to its absence.

Naturalistic depictions represent participants as they are affected by a particular source of illumination. Less naturalistic images, on the other hand, may abstract from illumination, and show shadows only in so far as they are required to model the volume, especially of round objects. They have ’shading’ rather than shadow. Or they use shading to indicate receding areas and highlights to indicate protruding areas, often in ways which have no explanation in terms of the logic of illumination. This can be done to different degrees: with a fully modulated darkening of the areas of shadow; with just two degrees of brightness, one for the ‘lit’ areas and one for those in shadow; with more or less dense hatching or dotting of the shaded areas; and so on. At the extreme end of the scale,light and shade are ab¬stracted from altogether, and line rather than shading is used to indicate receding contours.

  1. Brightness, a scale running from a maximum number of different degrees of brightness to just two degrees: black and white, or dark grey and lighter grey, or two brightness values of the same colour.

Brightness values can also contrast to a greater or lesser degree: in one picture the difference between the darkest and the lightest area may be very great (deep blacks, bright whites), in another the difference may be minimal, so that a misty, hazy effect is created. The paintings of Rembrandt are interesting from this point of view, in part because his use of illumination and brightness is so often invoked as a paradigm example of naturalism, and in part because of the way his subtle divergences from naturalism often acquire ideational functions, in a broadly allegorical sense.

The ability of photography to render black and white is limited, as is its ability to differentiate brightness values. Again, a contrast range and a range of brightness values which exceeds this ability may be experienced as ‘more than real’ and hence as being of lower modality.

It follows from our discussion that modality is realized by a complex interplay of visual cues. The same image may be ‘abstract’ in terms of one or several markers and ‘naturalistic’ in terms of others. Impressionist paintings, for example, often have a narrow brightness range, and abstract from light and shadow, but they have a highly naturalistic approach to colour. Yet, from this diversity of cues an overall assessment of modality is derived by the viewer.

From all this it might seem that the realization of modality in images is much more complex and finely graded than the realization of modality in language. Yet language, too, allows complex combinations of different modality cues. Take, for instance, the sentence I absolutely don’t think he could possibly have done it. Is this ‘low’, ‘middle’ or ‘high’ modality? How does one ‘compute’ these various modality cues into one ‘degree of credibility’? Frequently there are even contradictions: It is probably definitely true that. . .And in language, too, the value of modality cues depends on context. In academic writing, for example, qualifications such as It may well be the case. . . or It is quite possible that. . .(both low modality, strictly speaking) serve in fact to increase the credibility of the text, as indicators of the care with which the writer’s judgements were made, and hence of the reliability of these judgements.

CODING ORIENTATION

So far we have described the value of modality markers in terms of the naturalistic criteria for ‘what counts as real’. We have hypothesized that the ability of modern colour photography to render detail, brightness, colour, etc. constitutes for our culture today a kind of standard for visual modality. When this standard is exceeded, an image becomes ‘more than real’ – an effect which can be achieved not only in art (and is often the favoured modality in Surrealism), but also by means of the special techniques, materials and equipment of studio photography. A certain standard of photographic naturalism, dependent on the state of photographic technology and on current photographic practices, hence ever evolving, has become the yardstick for what is perceived as ‘real’ in images, even when these images are not photographs. Underpinning this is the belief in the objectivity of photographic vision, a belief in photography as capable of capturing reality as it is, unadulterated by human interpretation. Behind this, in turn, is the primacy which is accorded to visual perception in our culture generally. Seeing has, in our culture, become synonymous with understanding. We ‘look’ at a problem. We ’see’ the point. We adopt a ‘viewpoint’. We ‘focus’ on an issue. We ’see things in perspective’. The world ‘as we see it’ (rather than ‘as we know it’, and certainly not ‘as we hear it’ or ‘as we feel it’) has become the measure for what is ‘real’ and ‘true’.

So visual modality rests on culturally and historically determined standards of what is real and what is not, and not on the objective correspondence of the visual image to a reality defined in some ways independently of it. At the moment holograms are probably still seen by most people as ‘more than real’. In the images we are most used to, the absence of the third dimension, the flatness of the picture, does not function as an indicator of low modality, just as the absence of perspective in cultures whose art does not employ it does not function as an indicator of low modality for members of those cultures.

As we have already discussed in relation to de Saussure’s ’speech circuit’, however, even within our own culture the same standards for what is ‘real’ and what is not do not apply in every context. In technological contexts, a different concept of reality underlies visual modality, a concept we could call ‘Galilean reality’. In the early seventeenth century, Galileo wrote:

I do not find myself absolutely compelled to apprehend [objects] as necessarily accompanied by such conditions as that they must be white or red, bitter or sweet, sonorous or silent, smelling sweetly or disagreeably. . . . I think that these tastes, smells, colours, etc. with regard to the object in which they appear to reside are nothing more than mere names. . . . I do not believe that there exists anything in external bodies for exciting tastes, smells, sounds, etc., except size, shape, quantity and motion.
(quoted in Mumford, 1936: 48)

Here ‘real’ means ‘what can be known by means of the methods of science’; that is, by means of counting, weighing and measuring. By this standard of what is real, a technical line drawing, without colour or texture, without light or shade, and without perspective, can have higher modality than a photograph. Everyday common sense naturalism and realism no longer merge here. The realism (and hence the ‘naturalism’) of scientific-technical images is of a different kind, based, in the end, on the questions ‘Can we use it?’, ‘Can we measure the real dimensions from it?’, ‘Can we find out from it how to set up the experiment?’, and so on. Whatever does not contribute to this purpose merely adds a dimension of ‘illusionism’ to the picture, and dilutes ‘Galilean realism’ with common sense ‘naturalism’. The latter, of course, is sometimes done for the purpose of communicating scientific ideas or technological complexities to a public of non-initiates. In his book Writing Biology (990) Greg Myers compares the reporting of the ’same’ research findings in specialist and popular journals such as Scientific American, and visual representations in the latter tend to be lavish, full-colour and ‘hyper-real’, while in the former sparse line drawings are the only form of visual image. Furthermore, we have to be aware that there are competing theories of reality in today’s science, despite the fact that for many practical purposes Galilean reality remains of overriding importance. Alternative theories might lead to different standards for high and low modality.

In other contexts the ‘hyper-real’ does not have the decreased modality it has in ‘photo¬graphic’ naturalism. Magazine photos of food are one example. A different principle for what counts as real operates here, the converse of Galilean reality: the more a picture can create an illusion of touch and taste and smell, the higher its modality. In such images everything is done to appeal to ’sensory’ qualities: reality here is constituted precisely by those sensations which Galileo branded as illusions: texture, colour, ‘feel’. It is here that the affective values of colours come into their own, for example. The emotive value of colour is sometimes seen as a general characteristic of colour. But in scientific-technological contexts, colour may be conventional (more or less arbitrary ‘colour codes’ to facilitate the reading of complex diagrams), and in naturalism colours are there ‘because they are there in reality’. From the point of view of the ’sensory’ definition of reality, on the other hand, colours are there to be experienced sensually and emotively – it is for this reason that people enjoy the highly saturated and unmodulated colours of, say, Matisse, or that children enjoy the highly saturated and unmodulated colours of their plastic toys. Within naturalism these colours are ‘less than real’, but within a realism that takes subjective emotions and sensations as the criterion for what is real and true, they have the highest modality.

There is, finally, a third area in which the standard of ‘photographic’ naturalism does not apply, the area of ‘abstract realism’ – both in science (e.g. the ’speech circuit’ diagram in figures 5.1 and 5.2) and in abstract art. Higher education in our society is, to quite some extent, an education in detachment, abstraction and decontextualization (and against naturalism), and this results in an attitude which does not equate the appearance of things with reality, but looks for a deeper truth ‘behind appearances’. Just as academically trained persons may accord greater truth to abstract expository writing than to stories about concrete, individual events and people, so they may also place higher value on visual representations which reduce events and people to the ‘typical’, and extract from them the ‘essential qualities’.

While our ideas here are drawn to a large extent from the theoretical work of Jurgen Habermas (especially his Theory of Communicative Action, 1984), and to some extent from that of Bourdieu Cl986), we will use Bernstein’s term ‘coding orientation’ Cl981) for these different reality principles. Coding orientations are sets of abstract principles which inform the way in which texts are coded by specific social groups, or within specific institutional contexts. We distinguish the following:

  1. Technological coding orientations, which have, as their dominant principle, the ‘effectiveness’ of the visual representation as a ‘blueprint’. Whenever colour for example, is useless for the scientific or technological purpose of the image, it has, in this context, low modality.
  2. Sensory coding orientations, which are used in contexts in which the pleasure principle is allowed to be the dominant: certain kinds of art, advertising, fashion, food photography, interior decoration, and so on. Here colour is a source of pleasure and affective meanings, and consequently it conveys high modality: vibrant reds, soothing blues, and so on – a whole psychology of colour has evolved to support this.
  3. Abstract coding orientations, which are used by sociocultural elites – in ‘high’ art, in academic and scientific contexts, and so on. In such contexts modality is higher the more an image reduces the individual to the general, and the concrete to its essential qualities. The ability to produce and/or read texts grounded in this coding orientation
    is a mark of social distinction, of being an ‘educated person’ or a ’serious artist’.
  4. The common sense naturalistic coding orientation, which remains, for the time being, the dominant one in our society. It is the one coding orientation all members of the culture share when they are being addressed as ‘members of our culture’, regardless of how much education or scientific-technological training they have received. Individuals with special education or group allegiance may draw on non-naturalistic coding orientations in certain contexts, but they are likely to revert to the naturalistic coding orientation when they are ‘just being themselves’. They may, for example, use the abstract coding orientation when visiting a gallery, and the naturalistic coding orientation when watching television or reading a magazine. For those without such education, however, abstract and technological images will never have high modality and always remain ‘unreal’. Today, however, naturalism is coming into crisis, as a result of new ways of thinking and new image technologies. In this context the role of some or all of the non-naturalistic coding orientations is likely to become of increasing importance.

The diagram in figure 5.5 shows how the same colour continuum, running from ‘no abstraction’ to ‘full abstraction’ (abstraction always being a matter of degree) can have different modality values, according to the four coding orientations. It is drawn here for colour saturation, but it could also have been drawn for any of the other modality markers we discussed in the previous section.

MODALITY IN MODERN ART

The issue of modality becomes particularly complex in modern art, because it has, to a large extent, been the project of modern art to redefine ‘reality’, and to do so in contra¬distinction to photographic naturalism. In this section we will attempt to discuss a few of the issues, beginning with some Australian examples.

The picture in plate 3 shows William Dobell’s portrait of Joshua Smith, a painting which won the annual Australian Archibald Prize competition for portrait painting in 1943. It was the first modern painting to do so. All previous winners had been conventional ‘academic’ portraitists, staying well within the bounds of naturalistic depiction. After Dobell had been awarded the prize, a number of conservative painters took the trustees of the prize to court for giving it to a painting which, they argued, was not eligible, as it was not a portrait but a caricature. The prosecutor, Garfield Barwick, interrogated Dobell about every detail of the painting, asking him whether he had faithfully represented the ears, the neck, the arms, and so on. The painter in exasperation, answered: ‘Yes, within the limits of art.’ From a naturalistic point of view, the painting does of course have comparatively low modality, both in the direction of ‘less than real’ and in the direction of ‘more than real’. Colour differentiation is greatly reduced, to a palette of orange, yellow and brown. The representation of detail, on the other hand, is amplified, exaggerated, ‘more than real’. From a naturalistic point of view, the prosecutor was right. But he applied a criterion which was no longer valid in the context of modern art, a way of matching modality values to the scales of colour differentiation, represen¬tation and so on which, in the history of modern art, had been successfully contested decades earlier. In modern art, the truth of painting no longer lies in being faithful to appearances, but in being faithful to something else – for example, to some modern abstract truth, in the case of this rather ‘expressionist’ painting, to ‘the spirit of the man’, and ‘the essence of what he looks like’, as Dobell himself formulated it during the trial.

Attempts to alter definitions of reality are always likely to produce scandal, and therefore resisted. Whether in the trials of D.H. Lawrence’s novel or Dobell’s painting, the issues are far larger and more far-reaching than aesthetics or artistic convention. Changes in the definition of reality have profound cultural and social effects, and this helps to explain extreme reactions such as the proscription of ‘entartete Kunst’ or the burning of books.

Despite the trial, future winners of the Archibald Prize would, without exception, be modern artists rather than conventional portrait painters. The picture shown in plate 4, a portrait of the writer Patrick White by Louis Kahan, is one of them. It shows a different modality configuration, a different set of abstractions and amplifications. Unlike Dobell, Kahan does not deviate from the naturalistic representation of detail: if one disregards the strangely unseeing eyes, White’s features are rendered with naturalistic faithfulness. But texture is amplified: the figure of White seems to have been carved out of some porous, chalky rock, and the rendition of its surface is so detailed that one can almost feel its cold, wet touch – a ’sensory’ orientation, but in the direction of displeasure rather than pleasure. Colou~ on the other hand, is greatly reduced: in what is at once a pun on White’s name and a symbolic gesture, White is drained of all colour. Thus Kahan depicts him as cold, hard, almost repulsive to the touch, and the expression of this ‘truth’ has taken precedence over the faithful rendering of outward appearances; indeed, has become possible only by means of these ‘deviations’ from the naturalistic standard. The visual pun reminds us that modality is always related to the values, meanings and beliefs of a particular group, in this case an ‘ordering’ of the figure of Patrick White within the system of Australian high culture, and of Australian society generally.

Our second example pertains to the geometric abstractionism of the 1920s. When European painters, after studying visible reality for centuries, began to conceive of it as made up of abstract, geometrical elements (circles, cones, squares, triangles), reality was redefined as a configuration of basic elements, just as had already happened, for instance, in physics. Within this new definition of reality, painters at first still sought to produce recognizable representations, as shown in figure 5.6. Mondrian, who tried to paint trees in this way, complained that it was difficult to represent trees as arrangements of rectangular shapes. But soon these painters went a step further and abandoned the attempt to reconcile the visible surface appearance of things with their geometric inner structure (see figure 5.n. From here it was only one step to, for instance, Gerrit Rietveld’s Colour Project for the SchrOder Residence (1923-4),

Rietveld’s work (figure 5.8) is no longer a reduced, abstract representation of reality, but a design for a new reality, yet to be constructed. Of course, blueprints and plans had existed alongside visual representation long before the 1920s, but in separate domains. In the twentieth century, however, they became intertwined. Art became intertwined with design, just as science had already become intertwined with technology. The boundaries between representing reality and constructing reality became blurred. And when real things were produced from designs such as these, the processes of abstraction could come full circle and yield ‘naturalistic’ images again (figure 5.9).

In the work of Ryman, a contemporary American artist, abstraction is perhaps taken to its limit. Many of his paintings are, at least at first glance, white surfaces. Everything is reduced, everything abstracted. There is no colour, no line, no background. And, in terms of our earlier chapters, there is neither representation of action or of social constructs, nor yet any indication of textuality, of composition. This truly is the degree zero of representation.

But if reduction and abstraction serve to reveal otherwise hidden, inner truths, the same might be said about Ryman’s paintings. Indeed, like so many other non-naturalistic artists, he sees his work as realistic, and calls his paintings ‘realistic paintings’: they aspire to present the reality and the truth of the process of representation and the process of perception, and thereby perhaps also of the social and cultural world.

There is, however, another much less abstract feature of Ryman’s painting: his concern with texture. H is work shows a constant preoccupation with the materials of representation, and with the materiality of the processes of representation. Some of the paintings leave a patch of the canvas uncovered and only thinly cover the rest. Others display a variety of brushstrokes or, by contrast, completely de-emphasize the way in which the paint is applied, resulting in totally flat, presumably sprayed surfaces. Again others emphasize the frame, or the means by which frames are attached to walls, or the flatness of the painting, by rotating it through ninety degrees, and so foregrounding its two dimensionality.

In other words, there is a strong representational concern in these paintings, but it is a concern with representing the process of representation. Does this suggest low modality, given the enormous distance from everyday naturalism? Or does it suggest the highest modality, in which the negation of representation forms an ultimate truth, or in which the highest modality is accorded to the representation which does not represent but simply is?

As before, our answer is one which refers to the social. Whether a representation is judged credible or not is not necessarily a matter of absolute truth. What one social group considers credible may not be considered credible by another. This is why we see modality as interactive, rather than ideational, as social, rather than as a matter of some independently given value. Modality both realizes and produces social affinity, through aligning the viewer (or reader, or listener) with certain forms of representation, namely those with which the artist (or speaker, or writer) aligns himself or herself, and not with others. Modality realizes what ‘we’ consider true or untrue, real or not real. In this lies some of the power of art. To the extent that people are drawn into this ‘we’, new values, new modes of thinking and perceiving can establish themselves. And when enough people are drawn in, the organs of popularizing culture, such as advertising, will quickly move in to amplify the new forms, and move them into the mainstream of culture.

MODALITY CONFIGURATIONS

The examples in the previous section show that the modality values in art can be complex. A painting can reduce naturalism in the way it treats colour, amplify it in the way it treats texture, and yet represent its subject in a naturalistic way, as in plate 4. It can be abstract in respect of one modality marker, naturalistic in respect of another and sensory in respect of yet another, and this allows a multiplicity of possible modality configurations, and hence a multiplicity of ways in which artists can relate to the reality they are depicting and ‘define’ reality in general. In many other kinds of images, too, ‘modality markers’ do not move en bloc in a particular direction across the scales, say from the abstract to the sensory, but behave in relatively independent ways. Most glossy magazine food photo¬graphs, for instance, are highly sensory in their depiction of the food. The colours are intense. The texture of the food is shown in sharp detail. Lighting enhances the fresh droplets of water on a bunch of grapes, or the viscosity of a sauce, or the glazing of the ham and the cherries in a pie. But the surrounding objects tend to have lower modality. The weave of the tablecloth on which the food is displayed, for instance, may be only just be visible and often the setting is absent altogether, with the food shown against a black background. In other words, such pictures are not only sensory, they are also abstract. The
’sensorily’ depicted food is taken out of its context, idealized and essentialized. And this shows that each of the modality choices in such a modality configuration is expressive of specific meanings, which then come together in the whole.

From our inventory of modality markers we could construct ‘modality prints’ (borrowing the metaphor of ‘voice print’, ‘DNA print’, etc.J to characterize the modality configurations, and show which modality markers are reduced, made ‘less than real’, and which are amplified, made ‘more than real’ – and this either in relation to an anchoring point of common sense high naturalistic modality (as one might do for an audience of ‘lay’ people at an art exhibition, taking the representational function of art as a common sense point of departure) or in relation to an anchoring point situated in some other realism. Figure 5.10 is an attempt to show what we have in mind.

Such modality configurations would describe what, in a specific genre or a specific work, is regarded as real, as adequate to reality. And it would also demonstrate that images are polyphonic, weaving together choices from different signifying systems, different representational modes, into one texture. In this view, a term such as ‘painting’ is an artificial construct which brings together and treats as a homogeneous unit what is in reality a complex configuration of different voices, different representational modes. (In the same way it can be said that ‘grammar’ is an artifice of theory, describing widely different representational modes – phonic substance, intonation, lexis, syntax, etc.) And it is of Course from here that the interesting questions can be asked. Are there, or could there be, social and historical explanations for these modality configurations?

Here is an example, from a science textbook for the upper years of primary school, produced in Australia (figure 5.11), This is a scientific-technical picture for children. As such it forms a compromise between the naturalistic and the technological coding orientation, perhaps because a ‘pure’ technological picture would have been regarded by the writer as beyond the understanding of young children. On the one hand, it is a drawing and not a photograph and it lacks a Setting; on the other hand, it uses perspective (angular-isometric), colour (idealized, fiat colour), and it shows at least something of light and shade (though in a rather simple and, in part, inconsistent way), and of texture (the grain of the wood, the texture of the head of the nail, the creases in the piece of cloth). The producer of this image perhaps operates with the assumption that children are familiar with the naturalistic coding orientation (that is, ‘where they come from’) and have to be inducted into the technological coding orientation (that is, the progression into disciplinary knowledge). The image captures this transitional phase.

Diagrams, maps and charts for lay readers may be ‘naturalized’ in similar ways. Newspaper diagrams and maps, for instance, may be drawn in perspective (see the Gulf War map in figure 4.19). Magazines may add colour and pictorialize pie charts. In com¬pany brochures or annual reports, the bars of bar graphs may become three-dimensional and rise, like featureless skyscrapers, from a clean landscape of undulating hills in strong, fiat colour. This shows that modality is a system of social deixis which ‘addresses’ a particular kind of viewe~ or a particular social/cultural group, and provides through its system of modality markers an image of the cultural, conceptual and cognitive position of the addressee. At the same time it shows the transition across and between such groups, and in doing so demonstrates the social aspect of modality. Most crucially, it shows how modality is motivated, in the close matching of modality and the modal address (location) of specific (and assumed) aspects of the viewer’s subjectivity.

Figure 5.12 shows a drawing by Newton, illustrating the set-up for one of his colour experiments. Figure 5.13 is a modern scientific illustration showing the set-up for an experiment by Stratton which caused him to see himself stretched out in space as indicated in the drawing. To modern eyes, Newton’s drawing has not yet advanced very far in the direction of high technological modality: he uses (inverted) perspective, and shows the Setting. The modern drawing, by contrast, leaves out the setting and simplifies the forms, concentrating on the relation between them, rather than on the representation of the experimenter and the mirrors.

As Halliday has shown (Halliday and Martin, 1993: 54-68), Newton’s writing did not yet have the objective, impersonal stance and the lexical density of modern scientific writing. At the same time he made some decisive moves towards developing the gram¬matical resources that would become characteristic of scientific writing. Clearly the same can be said of his scientific drawings. And that shows that, however great the differences between the verbal and the visual grammar, they derive from similar concerns and orientations.

Powered by WordPress