Papers

Papers relevant to HCI Engineering contributed to this site since 2011

Old Papers Never Die – They Only Fade Away……

 

 

Interacting with the Computer: a Framework

 

John Long Comment 1

The title remains an appropriate one. However, given its subsequent references to: ‘domains’; ‘applications’; ‘application domains’; ‘tasks’ etc., it must be assumed that the interaction is: ‘to do something’; ‘to perform tasks’; ‘to achieve aims or goals’; or some such. Further modeling of such domains/applications, beyond that of text processing, would be required for any re-publication of the paper and in the light of advances in computing technology – see earlier. The issue is pervasive – see also Comments 6, 35, 37, 40 and 41.

 

Comment 2

‘A Framework’ is also considered to be appropriate. Better than ‘a conception’, which promises greater completeness, coherence and fitness-for-purpose (unless, of course, these criteria are explicitly taken on-board). However, the Framework must explicitly declare and own its purpose, as later set out in the paper and referenced in Figure 1. See also Comments 15, 19, 27, and 42.

J. Morton, P. Barnard, N. Hammond* and J.B. Long

M.R.C. Applied Psychology Unit, Cambridge, England *also IBM Scientific Centre, Peterlee, England

Recent technological advances in the development of information processing systems will inevitably lead to a change in the nature of human-computer interaction.

Comment 3

‘Recent technological advances’ in 1979 centred around the personal, as opposed to the main-frame, computer. To-day there is a plethora of advances in computing technology – see Commentary Introduction earlier for a list of examples. A re-publication of the paper would require a major up-date to address these new applications, as well as their associated users. Any up-date would need to include additional models and tools for such address, as well as an assessment of the continued suitability of the models and tools proposed in the ’79 paper.

Direct interactions with systems will no longer be the sole province of the sophisticated data processing professional or the skilled terminal user. In consequence, assumptions underlying human-system communication will have to be re-evaluated for a broad range of applications and users. The central issue of the present paper concerns the way in which this re-evaluation should occur.

First of all, then, we will present a characterisation of the effective model which the computer industry has of the interactive process.

Comment 4

We contrasted our ’79 models/theories with a single computer industry’s model. To-day, there are many types of HCI model/theory. A recent book on the subject listed 9 types of ‘Modern Theories’ and 6 types of ‘Contemporary Theories’ (Rogers, 2012). The ‘industry model’ has, of course, itself evolved and now takes many forms (Harper et al., 2008). Any re-publication of the ’79 paper would have to specify both the HCI models/theories and the current industry models with which it wished to be contrasted.

The shortcoming of the model is that it fails to take proper account of the nature of the user and as such cannot integrate, interpret, anticipate or palliate the kinds of errors which the new user will resent making. For remember that the new user will avoid error by adopting other means of gaining his ends, which can lead either to non-use or to monstrously inefficient use. We will document some user problems in support of this contention and indicate the kinds of alternative models which we are developing in an attempt to meet this need.

The Industry’s Model (IM)

The problem we see with the industry’s model of the human-computer interaction is that it is computer-centric. In some cases, as we shall see, it will have designer-centric aspects as well.

Comment 5

In 1979, all design was carried out by software engineers. Since then, many other professionals have become involved – initially psychologists, then HCI-trained practitioners, graphic designers, ethnomethodologists, technocratic artists etc. However, most design (as opposed to user requirements gathering or evaluation) is still performed by software engineers. Any re-publication of this paper would have to identify the different sorts of design activity, to assess their relative contribution to computer- and designer-centricity respectively and the form of support appropriate to each, which it might offer  – see also Comments 14, 15, 21 (iv) and (v), 27 and 41 (iv).

 

To start off with, consider a system designed to operate in a particular domain of activity.

Comment 6

Any re-published paper would have to develop further the concept of ‘domain’ (see Comment 1). The development would need to address: 1. The computer’s version of the domain and its display thereof. There is no necessary one-to-one relationship (consider the pilot alarm systems in the domain of air traffic management). Software engineer designers might specify the former and HCI designers the latter; and 2. To what extent the domain is an ‘image of the world and its resources’. See Comments 1, 35, 37, 40 and 41.

In the archetypal I.M. the database is neutralised in much the same kind of way that a statistician will ritually neutralise the data on which he operates, stripping his manipulation of any meaning other than the purely numerical one his equations impose upon the underlying reality. This arises because the only version of the domain which exists at the interface is that one which is expressed in the computer. This version, perhaps created by an expert systems analyst on the best logical grounds and the most efficient, perhaps, for the computations which have to be performed, becomes the one to which the user must conform. This singular and logical version of the domain will, at best, be neutral from the point of view of the user. More often it will be an alien creature, isolating the user and mocking him with its image of the world and its resources to which he must haplessly conform.

Florid language? But listen to the user talking.

Comment 7

The ’79 user data are now quite out of date, in terms of their content, means of acquisition and associated technology, compared with more recent data. However, current user-experience continues to have much in common with that of the past. Up-dated data are required to confirm this continuity.

“We come into contact with computer people, a great many of whom talk a very alien language, and you have constant difficulty in trying to sort out this kind of mid-Atlantic jargon.”

“We were slung towards what in my opinion is a pretty inadequate manual and told to get on with it”

“We found we were getting messages back through the terminal saying there’s not sufficient space on the machine. Now how in Hell’s name are we supposed to know whether there’s sufficient space on the machine?”

In addition the industry’s model does not really include the learning process; nor does it always take adequate note of individuals’ abilities and experience:

“Documentation alone is not sufficient; there needs to be the personal touch as well.”

“Social work being much more of an art than a science then we are talking about people who are basically not very numerate beginning to use a machine which seems to be essentially numerate.”

Even if training is included in the final package it is never in the design model. Is there anyone here who, faced with a design choice, asked the questions “Which option will be the easiest to describe to the naive user? Which option will be easiest to understand? Which option will be easiest to learn and remember?”

Comment 8

Naive users, of course, continue to exist to-day. However, there are many more types of users than naive and professional of interest to current HCI researchers. Differences exist between users of associated technologies (robotic versus ambient); from different demographics (old versus young); at different stages of development (nursery versus teenage children); from different cultures (developed versus less developed) etc. These different types of user would need some consideration in any re-publication. 

Let us note again the discrepancy between the I.M. view of error and ours. For us errors are an indication of something wrong with the system or an indication of the way in which training should proceed. In the I.M. errors are an integral part of the interaction. For the onlooker the most impressive part of a D.P. interaction is not that it is error free but that the error recovery procedures are so well practised that it is difficult to recognise them for what they are.

Comment 9

As well as this important distinction, concerning errors, they need to be related to ‘domains’, ‘applications’ and ‘effectiveness’ or ‘performance’ and not just user (or indeed computer) behaviour. See Comment 6 earlier and Comments 35, 36, 37 and 38 later.

Errorless performance may not be acceptable (consider air traffic expedition). Errorful behaviour may be acceptable (consider some e-mail errors). A re-published ’79 paper would have to take an analytic/technical (that is, Framework-grounded) view of error and not just a simple adoption of users’ (lay-language) expression. This problem is ubiquitous in HCI, both past and present.

We would not want it thought that we felt the industry was totally arbitrary. There are a number of natural guiding principles which most designers would adhere to. See also Comment 16.

Comment 10

We contrast here two types of principle, which designers might adhere to: 1. IM principles, as ‘intuitive, non-systematic, not totally arbitrary’; and 2. our proposed principles, as ‘systematic’. In the light of this contrast, we need to set out clearly: 1. What and how are our principles ‘systematic’? and 2. How does this systematicity guarantee/support better design?

Note that in Figure 1 later, there is an ‘output to system designers’. Is this output expressed in (systematic) principles? If not, what would be its form of expression? Any form of expression would raise the same issues raised earlier for ‘systematic principles’.

We do not anticipate meeting a system in which the command DESTROY has the effect of preserving the information currently displayed while PRESERVE has the effect of erasing the operating system. However, the principles employed are intuitive and non-systematic. Above all they make the error of embodying the belief that just as there can only be one appropriate representation of the domain, so there is only one kind of human mind.

A nice example of a partial use of language constraints is provided by a statistical package called GENSTAT. This package permits users to have permanent userfiles and also temporary storage in a workfile. The set of commands associated with these facilities are:

PUT – copies from core to workfile

GET – copies from workfile to core

FILE – defines a userfile

SAVE – copies from workfile to userfile

FETCH – copies from userfile to workfile

The commands certainly have the merit that they have the expected directionality with respect to the user. However to what extent do, for example, FETCH and GET relate naturally to the functions they have been assigned? No doubt the designers have strong intuitions about these assignments. So do users, and they do not concur. We asked 40 people here at the A.P.U. which way round they thought the assignments should go: 19 of these agreed with the system designers, 21 went the other way. The confidence levels of rationalisations were very convincing on both sides!
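The data movements behind these five commands can be made concrete with a small present-day sketch (Python purely for illustration; the class and its method names are ours, not GENSTAT’s):

```python
# Toy model of the three GENSTAT storage levels and the five
# transfer commands described above. A sketch for illustration,
# not the real GENSTAT semantics.

class Session:
    def __init__(self):
        self.core = {}      # working memory
        self.workfile = {}  # temporary storage
        self.userfile = {}  # permanent storage (defined by FILE)

    def put(self, name):    # PUT: core -> workfile
        self.workfile[name] = self.core[name]

    def get(self, name):    # GET: workfile -> core
        self.core[name] = self.workfile[name]

    def save(self, name):   # SAVE: workfile -> userfile
        self.userfile[name] = self.workfile[name]

    def fetch(self, name):  # FETCH: userfile -> workfile
        self.workfile[name] = self.userfile[name]

s = Session()
s.core["A"] = [1, 2, 3]
s.put("A"); s.save("A")    # core -> workfile -> userfile
s.fetch("A"); s.get("A")   # userfile -> workfile -> core
```

Spelled out this way the directionality is explicit; what the sketch cannot settle is precisely the naming question at issue – whether GET or FETCH more naturally labels the userfile-to-workfile transfer.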

The problem then, is not just that systems tend to be designer-centric but that the designers have the wrong model either of the learning process or of the non-D.P. users’ attitude toward error. A part-time user is going to be susceptible to memory failure and, in particular, to interference from outside the computer system. du Boulay and O’Shea [1] note that naive users can use THEN in the sense of ‘next’ rather than as ‘implies’. This is inconceivable to the IM, for THEN is almost certainly a full homonym for most D.P. professionals and the appropriate meaning thoroughly context-determined.

Comment 11

The GENSTAT example was so good for our purposes that it has taken considerable reflection to wonder if there really is a natural language solution, which would avoid memory failure and/or interference. It is certainly not obvious.

The alternative would be to add information to a menu or somesuch (rather like in our example). But this is just the sort of solution IM software engineers might propose. Where would that leave any ‘systematic’ principles? – see Comment 10 earlier.

An Alternative to the Industry Model

The central assumption for the system of the future will be ‘systems match people’ rather than ‘people match systems’. Not entirely so, as we shall elaborate, for in principle, the capacity and perspectives of the user with respect to a task domain could well change through interaction with a computer system.

Comment 12

In general, the alternative aims to those of the IM promise well. The mismatch, however, seems to be expressed at a more abstract level than that of the ‘task domain’ – the ‘alien creature, isolating the user and mocking him with its image of the world and its resources to which he must haplessly conform’ – see earlier in the paper. Suppose the mismatch is at this specific level, where does this leave, for example, the natural language mismatch? Of course, we could characterise domain-specific mismatches, for example, the contrasting references to ambient environment in air- and sea-traffic management, although for professional, not for naive users. Such mismatches would require a form of domain model absent from the original paper. However, the same issue arises in the domains of letter writing and  planning by means of ‘to do’ lists. Either way, the application domain mismatch needs to be addressed, along with that of natural language.

 

But the capacity to change is more limited than the variety available in the system.

Comment 13

The contrast ‘personal versus mainframe computer’ and the parallel contrast ‘occasional/naive versus professional user’ served us very well in ’79. But the explosion of new computing technology (see Comment 3 earlier) and associated users requires a more refined set of contrasts. There are, of course, still occasional naive users; but these are mainly in the older population and constitute a modest percentage of current users. However, with demographic changes and a longer-living older population, it would not be an uninteresting sub-set of all present users. A re-publication, which wanted to restrict its range in the manner of the ’79 paper, might address ‘older users’ and domestic/personal computing technology. An interesting associated domain might be ‘computer-supported co-operative social and health care’. We could be our own ‘subjects, targets, researchers, and designers’, as per Figure 1 later.

Our task, then, is to characterise the mismatch between man and computer in such a way that permits us to direct the designer’s effort.

Comment 14

‘Directing the designer’s effort’ are strong words and need to be linked to the notion and guarantee of principles – see Comment 10 and Figure 1 ‘output to designers’. Such direction of design needs to be aligned with scientific/applied scientific or engineering aims (see Comments 15 and 18).

In doing this we are developing two kinds of tool, conceptual and empirical. These interrelate within an overall scheme for researching human-computer interaction as shown in Figure 1.

 

Comment 15

Figure 1 raises many issues:

1. Empirical studies require their own form of conceptualisation, for example: ‘problems’; ‘variables’; ‘tasks’ etc. These concepts would need specification before they could be conceptualised in the form of multiple models and operationalised for system designers.

2. What is the relationship between ‘hypothesis’ and the theories/knowledge of Psychology? Would the latter inform the former? If so, how exactly? This remains an endemic problem for applied science (see Kuhn, 1970).

3. Are ‘models’, as represented here, in some (or any) sense Psychological theories or knowledge? The point needs to be clarified – see also Comment 15 (1) earlier.

4. What might be the ‘output to system designers’ – guidelines; principles; systematic heuristics; hints and tips; novel design solutions; methods; education/training etc? See also Comment 14.

5. How is the ‘output to system designers’ to be validated? There is no arrow back to either ‘models’ or ‘working hypotheses’. At the very least, validation requires: conceptualisation; operationalisation; test; and generalisation. But with respect to what – hypotheses for understanding phenomena or with respect to designing artefacts?

 

Relating Conceptual and Empirical Tools

Comment 16

The relationship between conceptual and empirical tools and their illustration reads like engineering. In ’79, I thought that we were doing ‘applied science’ (following in the footsteps of Donald Broadbent, the MRC/APU’s director in 1979). The distinction between engineering and applied science needs clarification in any republished version of the original paper.

Interestingly enough, Card, Moran and Newell (1983) claimed to be doing ‘engineering’. Their primary models were the Human Information Processing (HIP) Model and the Goals, Operators, Methods and Selection rules (GOMS) Model. There is some interesting overlap with some of our multiple models; but also important differences. One option for a republished paper would be to keep to the ’79 multiple models. An alternative option would be to augment the HIP and GOMS with the ’79 multiple models (or vice versa), to offer a (more) complete expression of either approach taken separately.

 

The conceptual tools involve the development of a set of analytic frameworks appropriate to human computer interaction. The empirical tools involve the development of valid test procedures both for the introduction of new systems and the proving of the analytic tools. The two kinds of tool are viewed as fulfilling functions comparable to the role of analytic and empirical tools in the development of technology. They may be compared with the analytic role of physics, metallurgy and aerodynamics in the development of aircraft on the one hand and the empirical role of a wind tunnel in simulating flight on the other hand.

Empirical Tools

The first class of empirical tool we have employed is the observational field study, with which we aim to identify some of the variables underlying both the occasional user’s perceptions of the problems he encounters in the use of a computer system, and the behaviour of the user at the terminal itself.

Comment 17

Observational field studies have undergone considerable development since ’79. Many have become ethnomethodological studies, to understand the context of use, others have become front-ends to user-centred design methodologies, intended to be conducted in parallel to those of software engineering. Neither sort of development is addressed by our original paper. Both raise numerous issues, including: the mutation of lay-language into technical language; the relationship between user opinions/attitudes and behaviour; the relationship between the simulation of domains of application and experimental studies; the integration of multiple variables into design; etc.

The opinions cited above were obtained in a study of occasional users discussing the introduction and use of a system in a local government centre [2]. The discussions were collected using a technique which is particularly free from observer influence [3].

In a second field study we obtained performance protocols by monitoring users while they solved a predefined set of problems using a data base manipulation language [4]. We recorded both terminal performance and a running commentary which we asked the user to make, and wedded these to the state of the machine to give a total picture of the interaction. The protocols have proved to be a rich source of classes of user problem from which hypotheses concerning the causes of particular types of mismatch can be generated.

Comment 18

HCI has never given the concept of ‘classes of user problem’ the attention that it deserves. Clearly, HCI has a need for generality (see Comment 10, concerning (systematic) principles with their implications of generalisation). Of course, generalising over user problems is critical; but so more comprehensively is generalising over ‘design problems’. The latter might express the ineffectiveness of users interacting with computers to perform tasks (or somesuch). The original paper does not really say much about generalisation – its conceptualisation; operationalisation; test; and – taken together – validation. Any republication would have to rise to this challenge.

Comment 19

The concept of ’cause’ here is redolent of science, for example, as in Psychology. See also Comment 18, as concerns phenomena and Comment 15 for a contrast with engineering. Science and engineering are very different disciplines. Any re-publication would have to address this difference and to locate the multiple models and their application with respect to it.

 

There is thus a close interplay between these field studies, the generation of working hypotheses and the development of the conceptual frameworks. We give some extracts from this study in a later section.

Comment 20

This claim would hold for both a scientific (or applied scientific) and an engineering endeavour. See also Comments 15 and 18 earlier. However, both would be required to align themselves with Figures 1 and 2 of the original paper. 

A third type of empirical tool is used to test specific predictions of the working hypothesis.

Comment 21

The testing of predictions (which in conjunction with the explanation of phenomena, together constituting understanding) suggests the notion of science (see Comments 18 and 19), which can be contrasted with the prescription of design solutions (which in conjunction with the diagnosis of design problems, together constituting design of artefacts), as engineering (see Comment 15). The difference concerning the purpose of multiple models needs clarification.

The tool is a multi-level interactive system which enables the experimenter to simulate a variety of user interfaces, and is capable of modeling and testing a wide range of variables [5]. It is based on a code-breaking task in which users perform a variety of string-manipulation and editing functions on coded messages.

It allows the systematic evaluation of notational, semantic and syntactic variables. Among the results to be extensively reported elsewhere is that if there is a common argument in a set of commands, each of which takes two arguments, then the common argument must come first for greatest ease of use. Consistency of argument order is not enough: when the common argument consistently comes second no advantage is obtained relative to inconsistent ordering of arguments [6].
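The manipulation reported here can be illustrated with invented two-argument command strings (our notation; the experiment’s actual string-manipulation materials are not reproduced in the paper):

```python
# Three hypothetical command sets over a common argument MSG,
# illustrating the argument-order manipulation described above
# (invented notation, not the actual experimental materials).

common_first = [      # common argument consistently first
    "REPLACE MSG X",
    "INSERT MSG Y",
    "DELETE MSG Z",
]
common_second = [     # common argument consistently second
    "REPLACE X MSG",
    "INSERT Y MSG",
    "DELETE Z MSG",
]
inconsistent = [      # common argument position varies
    "REPLACE MSG X",
    "INSERT Y MSG",
    "DELETE MSG Z",
]
# The reported result: only common_first yields an ease-of-use
# advantage; common_second is no better than inconsistent.
```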

Comment 22

The 2-argument example is persuasive on the face of it; but is it a ‘principle’ (see Comment 10) and might it appear in the ‘output to designers’ (Figure 1 and Comment 15(4))? If so, how is its domain independence established? This point raises again the issue of generalisation – see also Comment 17.

Conceptual Tools

Since we conceive the problem as a cognitive one, the tools are from the cognitive sciences.

Comment 23

The claim is in no way controversial. However, it raises the question of whether the interplay between these cognitive tools and the working hypotheses (see Figure 1) also contributes to Cognitive Science (that is, Psychology). See also Comment 15(3). Such a contribution would be in addition to the ‘output to designers’ of Figure 1.

 

Also we define the problem as one with those users who would be considered intellectually and motivationally qualified by any normal standards. Thus we do not admit as a potential solution that of finding “better” personnel, or simply paying them more, even if such a solution were practicable.

Comment 24

If ‘design problem’ replaced ‘user problem’ (see also Comment 18), then better personnel and/or better pay might indeed contribute to the design (solution) of the design problem. The two types of problem, that is, design problem and user problem, need to be related and grounded in the Framework. The latter, for example, might be conceptualised as a sub-set of the former. Either way, additional conceptualisation of the Framework is required. See also Comment 18.

The cognitive incompatibility we describe is qualitative not quantitative and the mismatch we are looking for is one between the user’s concept of the system structure and the real structure: between the way the data base is organised in the machine and the way it is organised in the head of the user: the way in which system details are usually encountered by the user and his preferred mode of learning.

The interaction of human and computer in a problem-solving environment is a complex matter and we cannot find sufficient theory in the psychological literature to support our intuitive needs. We have found it necessary to produce our own theories, drawing mainly on the spirit rather than the substance of established work.

Comment 25

It sounds like our ‘own’ theories are indeed psychological theories (or would be if constructed). See also Comments 21 and 23.

Further than this, it is apparent that the problem is too complex for us to be able to use a single theoretical representation.

Comment 26

Decomposition (as in multiple models) is a well-tried and trusted solution to complexity. However, re-integration will be necessary at some stage and for some purpose. Understanding (Psychology) and design of artefacts (HCI) would be two such (different) purposes. They need to be distinguished. See also Comment 15(5).

The model should not only be appropriate for design, it should also give a means of characterising errors – so as to understand their origins and enable corrective measures to be taken.

Comment 27

What characterises a ‘model appropriate for design’? (see also Comment 15(4) and (5)). Design would have to be conceptualised for this purpose. Features might be derived from field studies of designer practice (see Figure 1); but a conceptualisation would not be ‘given’; it would have to be constructed (in the manner of the models). This construction would be a non-trivial undertaking. But how else could models be assured to be fit-for-(design)purpose? See also Comment 14.

Take the following protocol.

The user is asked to find the average age of entries in the block called PEOPLE.

“I’ll have a go and see what happens” types: *T <-AVG(AGE,PEOPLE)

machine response: AGE – UNSET BLOCK

“Yes, wrong, we have an unset block. So it’s reading AGE as a block, so if we try AGE and PEOPLE the other way round maybe that’ll work.”

This is very easy to diagnose and correct. The natural language way of talking about the target of the operation is mapped straight into the argument order. The cure would be to reverse the argument order for the function AVG to make it compatible.
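The diagnosis and cure can be sketched as follows (hypothetical Python standing in for the database language; only AVG, AGE and PEOPLE come from the protocol, the rest is ours):

```python
# Sketch of the mismatch: the system expects AVG(block, attribute),
# while the user's natural-language phrasing ("average AGE of
# PEOPLE") maps onto AVG(attribute, block). Hypothetical stand-in
# for the database language in the protocol; ages invented.

blocks = {"PEOPLE": {"AGE": [34, 28, 52, 41]}}

def avg_system(block, attribute):
    """The system's order: block first, attribute second."""
    values = blocks[block][attribute]
    return sum(values) / len(values)

def avg_compatible(attribute, block):
    """The proposed cure: argument order matching natural language."""
    return avg_system(block, attribute)

# The user's first attempt, in natural-language order, fails:
# avg_system("AGE", "PEOPLE")  -> KeyError ("AGE - UNSET BLOCK")
# With the compatible order the same phrasing succeeds:
mean_age = avg_compatible("AGE", "PEOPLE")
```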

Comment 28

Natural language here is used both to diagnose ‘user problems’ and to propose solutions to those problems. Natural language, however, does not appear in the paper as a model, as such. Its extensive nature in psychology/linguistics would prohibit such inclusion. Further, there are many theories of natural language and no agreement as to their state of validation (or rejection). However, the model appears as a block in the BIM (see Figure 2). The model/representation, of course, might be intuitive, in the form and practice of lay-language, which we all possess. However, such intuitions would also be available to software engineers and would not distinguish systematic from non-systematic principles (see Comment 10). The issue would need to be addressed in any re-publication of the ’79 paper.

 

The next protocol is more obscure. The task is the same as in the preceding one.

“We can ask it (the computer) to bring to the terminal the average value of this attribute.”

types: *T <-AVG(AGE)

machine response: AVG(AGE) – ILLEGAL NAME

“And it’s still illegal… (…) I’ve got to specify the block as well as the attribute name.”

Well of course you have to specify the block. How else is the machine going to know what you’re talking about? A very natural I.M. response. How can we be responsible for feeble memories like this?

However, a more careful diagnosis reveals that the block PEOPLE is the topic of the ‘conversation’ in any case.

Comment 29

Is ‘topic of conversation’, as used here, an intuition derived from lay-language or a sub-set of some natural language theory, derived from Psychology/Linguistics? This is a good example of the issue raised by Comment 28. The same question could be asked of the use of ‘natural language conventions’, which follows next.

 

The block has just been used and the natural language conventions are quite clear on the point.

We have similar evidence for the importance of human-machine discourse structures from the experiment using the code-breaking task described above. Command strings seem to be more ‘cognitively compatible’ when the subject of discourse (the common argument) is placed before the variable argument. This is perhaps analogous to the predisposition in sentence expression for stating information which is known or assumed before information which is new [7]. We are currently investigating this influence of natural language on command string compatibility in more detail.

Comment 30

These natural language interpretations and the associated argumentation remain both attractive and plausible. However, command languages in general (with the exception of programmers) have fallen out of favour. Given the concept of the domain of application/tasks and the requirements of the Goal Structure Model, some addition to the natural language model would likely be required for any re-publication of the ’79 paper. Some relevance-related, plan-based speech act theory might commend itself in this case.

 

The Block Interaction Model

Comment 31

The BIM remains a very interesting and challenging model and was (and remains) ahead of its time. For example, the very inclusion of the concept of domain (as a hospital; jobs in an employment agency etc); but, in addition, the associated representations of the user, the computer and the workbase. Thirty-four years later, HCI researchers are still ‘trying to pick the bits/blocks out of that’ in complex domains such as air traffic and emergency services management. Further development of the BIM in the form of more completely modeled exemplars would be required by any republished paper.

Systematic evidence from empirical studies, together with experience of our own, has led us to develop a conceptual analysis of the information in the head of the user (see figure 2). Our aim with one form of analysis is to identify as many separable kinds of knowledge as possible and chart their actual or potential interactions with one another. Our convention here is to use a block diagram with arrows indicating potential forms of interference. This diagram enables us to classify and thus group examples of interference so that they could be counteracted in a coordinated fashion rather than piecemeal. It also enables us to establish a framework within which to recognise the origin of problems which we haven’t seen before. Figure 2 is a simplified form of this model. The blocks with double boundaries, connected by double lines, indicate the blocks of information used by the ideal user. The other lines indicate prime classes of interference. The terminology we have used is fairly straightforward: Domain – the range of the specific application of a system. This could be a hospital, a city’s buildings, a set of knowledge such as jobs in ~n employment agency. Objects – the elements in the particular data base. They could be a relational table, patients’ records. I Representation of domain I Representa ti on of work-base version of domain domain Representation of problem Operations – the computer routines which manipulates the objects. Labels – the letter sequences which activate operators which, together with arguments and syntax, constitute the commands. Work base – in general, people using computer systems for problem solving have had experience of working in a non-computerised work environment either preceding the computerisation or at least in parallel with the computer system. The representation of this experience we call the work-base version. 
There will be overlap between this and the user’s representation of the computer’s version of the domain; but there will be differences as well, and these differences we would count as potential sources of interference. There may be differences in the underlying structure of the data in the two cases, for example, and there will certainly be differences in the objects used. Thus a user found to be indulging in complex checking procedures after using the command FILE turned out to be perplexed that the material filed was still present on the screen. With pieces of paper, things which are filed actually go there rather than being copied. Here are some examples of interference from one of our empirical studies [4]:

Interference on the syntax from other languages. Subject inserts necessary blanks to keep the strings a fixed length.

“Now that’s Matthewson, that’s 4,7, 10 letters, so I want 4 blanks”

types: A+<:S:NAME = ‘MATTHEWSON ‘:>PEOPLE

Generalised interference

“Having learned how reasonably well to manipulate one system, I was presented with a totally different thing which takes months to learn again.”

Interference of other machine characteristics on machine view

“I’m thinking that the bottom line is the line I’m actually going to input. So I couldn’t understand why it wasn’t lit up at the bottom there, because when you’re doing it on (another system) it’s always the bottom line.”

Comment 32

These examples do not do justice to the BIM – see Comment 31. More complete and complex illustrations are required.

The B.I.M. can be used in two ways. We have illustrated its utility in pinpointing the kinds of interference which can occur from inappropriate kinds of information. We could look at the interactions in just the opposite way and seek ways of maximising the benefits of overlap. This is, of course, the essence of ‘cognitive compatibility’, which we have already mentioned. Trivially, the closer the computer version of the domain maps onto the user’s own version of the domain the better. What is less obvious is that any deviations should be systematic where possible.
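The first of these two uses can be sketched in code. This is an illustrative sketch only, not from the paper: the B.I.M. is a diagram, but its classifying role can be mimicked by treating blocks as nodes and potential interference as directed arrows, then filing observed episodes under the arrow they fall under. All block names and episode descriptions here are hypothetical, loosely following the examples in the text.

```python
from collections import defaultdict

# Arrows of the B.I.M.: (source of interference, block interfered with).
# Observed episodes are filed under the arrow they fall under.
interference = defaultdict(list)

def classify(episode, source, target):
    """File an observed episode of interference under its arrow."""
    interference[(source, target)].append(episode)

classify("FILE leaves text on screen, unlike paper filing",
         "work-base version", "computer version of domain")
classify("blanks inserted to keep strings a fixed length",
         "other languages", "syntax")
classify("expects input line at bottom, as on another system",
         "other machine characteristics", "machine view")

# Episodes grouped by arrow can now be counteracted in a coordinated
# fashion rather than piecemeal.
for arrow, episodes in interference.items():
    print(arrow, len(episodes))
```

Grouping by arrow, rather than by surface symptom, is what allows problems not seen before to be traced to a known origin.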

Comment 33

In complex domains (see Comment 31), the user’s own model is almost always implicit. Modeling that representation is itself non-trivial. A re-published paper would have to make at least a good stab at it.

In the same way, it is pointless to design half the commands so that they are compatible with the natural language equivalents and use this as a training point if the other half, for no clear reason, deviate from the principle. If there are deviations then they should form a natural sub-class or the compatibility of the other commands will be wasted.

Information Structures

In the block interaction model we leave the blocks ill-defined as far as their content is concerned. Note that we have used individual examples from user protocols, as well as general principles, in justifying and expanding upon the distinctions we find necessary. What we fail to do in the B.I.M. is to characterise the sum of knowledge which an individual user carries around with him or brings to bear upon the interaction. We have a clear idea of cognitive compatibility at the level of an individual. If this idea is to pay off, then these structures must be more detailed.

There is no single way of talking about information structures. At one extreme there is the picture of the user’s knowledge as it apparently reveals itself in the interaction; the view, as it were, that the terminal has of its interlocutor. From this point of view the motivation for any key press is irrelevant. This is clearly a gross oversimplification.

The next stage can be achieved by means of a protocol. In it we would wish to separate out those actions which spring from the user’s concept of the machine and those actions which were a result of his being forced to do something to keep the interaction going. The latter we call ‘heuristic behaviour’. It can take the form of guessing that the piece of information which is missing will be consistent with some other system or machine. “If in doubt, assume that it is Fortran” would be a good example of this. The user can also attempt to generalise from aspects of the current system he knows about. One example from our study was where the machine apparently failed to provide what the user expected. In fact it had responded, but the information was not what he had expected. The system was ready for another command, but the user thought it was in some kind of pending state, waiting with the information he wanted. In certain other states, in particular where a command had produced a result which filled up the screen, he had had to press the ENTER key to clear the screen. The user then over-generalised from this to the new situation and pressed the ENTER key again, remarking

“Try pressing ENTER again and see what happens.”

We would not want to count the user’s behaviour in this sequence as representing his knowledge of the system, either correct knowledge or incorrect knowledge. He had to do something and couldn’t think of anything else. When the heuristic behaviour is eliminated, we are left with a set of information relevant to the interaction. With respect to the full, ideal set of such information, this set will be deficient at the points at which the user had to trust to heuristic behaviour.

Comment 34

The concept of ‘heuristic behaviour’ has never received the attention that it deserves in HCI research, although it must be recognised that much user interactive behaviour is of this kind. The proliferation of new interactive technologies (see Comment 3) is likely to increase this type of behaviour, as users attempt to generalise across technologies. A re-published paper would have to relate better the dimension of heuristic behaviour to that of correctness, both with respect to user knowledge and user behaviour.

Note that it will also contain incorrect information as well as correct information; all of it would be categorised by the user as what he knew, if not all with complete confidence, then certainly with more confidence than his heuristic behaviour. The thing which is missing from the B.I.M. and the I.S. is any notion of the dynamics of the interaction. We find we need three additional notations at the moment to do this. One of these describes the planning activity of the user, one charts the changes in state of user and machine, and one looks at the general cognitive processes which are mobilised.

Comment 35

The list of models required, in addition to the B.I.M. and the I.S. is comprehensive – planning, user-machine state changes, and cognitive processes. However, it might be argued that yet another model is required – one which maps the changes of the domain as a function of the user-computer interactive behaviours. The domain can be modeled as object-attribute-state (or value) changes, resulting from user-computer behaviours, supported respectively by user-computer structures. Such models currently exist and could be exploited by any re-published paper.

Goal Structure Model

The user does some preparatory work before he presses a key. He must formulate some kind of plan, however rudimentary. This plan can be represented, at least partially, as a hierarchical organisation. At the top might be goals such as “Solve problem p” and at the bottom “Get the computer to display Table T”. The Goal Structure model will show the relationships among the goals.
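The hierarchical organisation described above can be given a minimal sketch in code. This is not from the paper: the goal names and decomposition below are invented for illustration, but they follow the paper's span from "Solve problem P" at the top to display-level actions at the bottom.

```python
# A goal decomposes into sub-goals, bottoming out in directly
# executable actions; leaves() recovers the action level.
class Goal:
    def __init__(self, name, subgoals=()):
        self.name = name
        self.subgoals = list(subgoals)

    def leaves(self):
        """Bottom-level goals, e.g. 'get the computer to display Table T'."""
        if not self.subgoals:
            return [self.name]
        return [leaf for g in self.subgoals for leaf in g.leaves()]

plan = Goal("solve problem P", [
    Goal("inspect the data", [Goal("display Table T")]),
    Goal("edit the document", [Goal("change the line"), Goal("file the text")]),
])

print(plan.leaves())
```

A machine-imposed goal structure for the same work can be built the same way, and the two hierarchies compared node by node to expose structural mismatches.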

Comment 36

The G.S.M. is a requirement for designing human-computer interactions. However, it needs to be related in turn to the domain model (see Comments 31, 32 and 33). In the example, the document in the G.S.M. is transformed by the interactive user-computer behaviours from ‘unedited’ to ‘edited’. Any hierarchy in the G.S.M. must take account of any other type of hierarchy, for example, ‘natural’, represented in the domain model (see also Comment 35). The whole issue of so-called situated plans a la Suchman would have to be addressed and seriously re-assessed (see also Comment 37).

This can be compared with the way of  structuring the task imposed by the computer. For example, a user’s concept of editing might lead to the goal structure:

 

Comment 37

HCI research has never recovered from losing the baby with the bathwater, following Suchman’s proposals concerning so-called ‘situated actions’. Using the G.S.M., a republished paper could bring some much-needed order to the concepts of planning. Even the simple examples provided here make clear that such ordering is possible.

Two problems would arise here. Firstly the new file has to be opened at an ‘unnatural’ place. Secondly the acceptance of the edited text changes from being a part of the editing process to being a part of the filing process.

The goal structure model, then, gives us a way of describing such structural aspects of the user’s performance and the machine’s requirements. Note that such goals might be created in advance or at the time a node is evaluated. Thus the relationship of the G.S.M. to real time is not simple.

The technique for determining the goal structure may be as simple as asking the user “What are you trying to do right now, and why?” This may be sufficient to reveal procedures which are inappropriate for the program being used.

Comment 38

Complex domain models, for example, of air traffic management and control would require more sophisticated elicitation procedures than simple user questioning. User knowledge, supporting highly skilled and complex tasks is notoriously difficult to pin down, given its implicit nature. So-called ‘domain experts’ would be a possible substitute; but that approach raises problems of its own (for example, when experts disagree). A re-published paper would at least have to recognise this problem.

State Transition Model

In the course of an interaction with a system, a number of changes take place in the state of the machine. At the same time the user’s perception of the machine state is changing. It will happen that the user misjudges the effect of one command and thereafter enters others which, from an outside point of view, seem almost random. Our point is, as before, that the interaction can only be understood from the point of view of the user.

 

Comment 39

The S.T.M. needs in turn to be related to the domain model (See Comments 31 and 35). These required linkings raise the whole issue of multiple-model re-integration (see also Comment 26).

This brings us to the third of the dynamic aspects of the interaction: the progress of the user as he learns about the system.

Comment 40

As with the case of ‘heuristic behaviour’, HCI research has never treated seriously enough the issue of ‘user learning’. Most experiments record only initial engagement with an application or at least limited exposure. Observational studies sometimes do better. We are right to claim that users learn (and attempt to generalise). Designers, of course, are doing the same thing, which results in (at least) two moving targets. Given our emphasis on ‘cognitive mismatch’ and the associated concept of ‘interference’, we need to be able to address the issue of user learning in a convincing manner, at least for the purposes in hand.

 

Let us explore some ways of representing such changes. Take first of all the state of the computer. This change is a result of user actions and can thus be represented as a sequence of Machine States (M.S.) consequent on user action.

If the interaction is error free, changes in the representations would follow changes in the machine states in a homologous manner. Errors will occur if the actual machine state does not match its representation.
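The claim above can be sketched directly: step the actual machine state and the user's representation of it through the same action sequence, and flag the first divergence. The state labels and transition rules below are hypothetical, loosely modelled on the ENTER example discussed in the text.

```python
def first_mismatch(actions, machine_rules, user_rules, start="MS1"):
    """Return (index, machine state, represented state) at the first
    divergence, or None if the interaction is error free."""
    machine = represented = start
    for i, action in enumerate(actions):
        # Unknown (state, action) pairs leave the state unchanged.
        machine = machine_rules.get((machine, action), machine)
        represented = user_rules.get((represented, action), represented)
        if machine != represented:
            return i, machine, represented
    return None

machine_rules = {("MS1", "command"): "MS2",  # machine waits, OUTPUT flag set
                 ("MS2", "ENTER"): "MS3"}    # result displayed, prompt shown
user_rules = {("MS1", "command"): "MS3"}     # user expects immediate output

print(first_mismatch(["command", "ENTER"], machine_rules, user_rules))
```

When the two transition tables are homologous the function returns None; any gap in the user's table, such as the missing MS2 here, surfaces as a mismatch from which subsequent errors follow.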

Comment 41

At some stage and for some purpose, the S.T.S surely needs to be related to the G.S.M. (and or the domain model). Such a relationship would raise a number of issues, for example, ‘errors’ (see Comment 9) and the need to integrate multiple-models (see also Comments 26 and 39).

We will now look at errors made by a user of an interactive data enquiry system. We will see errors which reveal either inadequate knowledge of the particular machine state or inadequate knowledge of the actions governing transitions between states. The relevant components of the machine are the information on the terminal display and the state of a flag shown at the bottom right hand corner of the display, which informs the user of some aspects of the machine state (ENTER… or OUTPUT…). In addition there is a prompt, “?”, which indicates that the keyboard is free to be used, and there is a key labelled ENTER. In the particular example the user wishes to list the blocks of data he has in his workspace. The required sequence of machine states and actions is:

 

The machine echoes the command and waits with OUTPUT flag showing.

User: “Nothing happening. We’ve got an OUTPUT there in the corner. I don’t know what that means.”

The user had no knowledge of MS2: we can hypothesise his representation of the transition to be:

 

This is the result of an overgeneralisation. Commands are obeyed immediately if the result is short, unless the result is block data of any size. The point of this is that the data may otherwise wipe everything from the screen. With block data the controlling program has no lookahead to check the size and must itself simply demand the block, putting itself in the hands of some other controlling program. We see here then a case where the user needs to have some fairly detailed and otherwise irrelevant information about the workings of the system in order to make sense of (as opposed to learn by rote) a particular restriction.

The user was told how to proceed, types ENTER, and the list of blocks is displayed together with the next prompt. However, further difficulties arise because the list of blocks includes only one name and the user was expecting a longer listing. Consequently he misconstrues the state of the machine. (continuing from previous example)

User types ENTER

Machine replies with block list and prompt.

Flag set to ENTER …

“Ah, good, so we must have got it right then.

A question mark: (the prompt). It doesn’t give me a listing. Try pressing ENTER again and see what happens.”

User types ENTER

“No? Ah, I see. Is that one absolute block, is that the only blocks there are in the workspace?”

This interaction indicates that the user has derived a general rule for the interaction:

“If in doubt press ENTER”

After this the user realises that there was only one name in the list. Unfortunately his second press of the ENTER key has put the machine into Edit mode and the user thinks he is in command mode. As would be expected the results are strange.

At this stage we can show the machine state transitions and the user’s representation together in a single diagram, figure 3.

This might not be elegant but it captures a lot of features of the interaction which might otherwise be missed.

Comment 42

The S.T.M. includes ‘machine states’ and the user’s representation thereof. Differences between the two are likely to identify both errors and cognitive mismatches. However, the consequences – effective or ineffective interactions and domain transformations – are not represented; but need to be related to the G.S.M. ( and the domain model). This raises, yet again, the issue of the relations between multiple-models required in the design process (see Figure 1 and Comments 26 and 39).

The final model we use calls upon models currently available in cognitive psychology which deal with the dynamics of word recognition and production, language analysis and information storage and retrieval. The use of this model is too complex for us to attempt a summary here.

Comment 43

Address of the I.P.M. is noticeable only by its intended absence. This may have been an appropriate move at the time. However, any re-published paper would have to take the matter further. In so doing, at least the following issues would need to be addressed:

1. The selection of appropriate Psychology/Language I.P.M.s, of which there are very many, all in different states of development and validation (or rejection). (Note Card et al.’s synthesis and simplification of such a model in the form of the HIP – see Comment 15).

2. The relation of the I.P.M. to all other models (see Comments 26, 35, 41 and 42).

3. The need to tailor any I.P.M. to the particular domain of concern to any application, for example, air traffic management (see Comments 6 and 39).

4. The level of description of the I.P.M. See also 1. above.

5. The use of any I.P.M. by designers (see Figure 1).

6. The ‘guarantee’ that Psychology brings to such models in the case of their use in design and the nature of its validation.

Conclusion

We have stressed the shortcomings of what we have called the Industrial Model and have indicated that the new user will deviate considerably from this model. In its place we have suggested an alternative approach involving both empirical evaluations of system use and the systematic development of conceptual analyses appropriate to the domain of person-system interaction. There are, of course, aspects of the I.M. with which we have no reason to disagree, for example, the idea that the computer can beneficially transform the user’s view of the problems with which he is occupied. However, we would appreciate it if someone would take the trouble to support this point with clear documentation. So far as we can see it is simply asserted.

Comment 44

In the 34 years, following publication of our original paper, numerous industry practitioners, trained in HCI models and methods, would claim to have produced ‘clear documentation’, showing that the ‘computer can beneficially transform the user’s view of the problems with which he is occupied’. This raises the whole (and vexed) question of how HCI has moved on since 1979, both in terms of the number and effectiveness of trained/educated HCI practitioners. HCI community progress, clear to everyone, needs to be contrasted with HCI discipline progress, unclear to some.

Finally we would like to stress that nothing we have said is meant to be a solution – other than the methods. We do not take sides for example, on the debate as to whether or not interactions should be in natural language – for we think the question itself is a gross oversimplification. What we do know is that natural language interferes with the interaction and that we need to understand the nature of this interference and to discover principled ways of avoiding it.

Comment 45

Natural language understanding and interference smacks of science. Principled ways of avoiding interference smacks of engineering. What is the relationship between the two? What is the rationale for the relationship? What is the added-value to design (see also Comment 15).

And what we know above all is that the new user is most emphatically not made in the image of the designer.

Comment 46

The original paper essentially conceptualises and illustrates the need for the proposed ‘Framework for HCI’. That was sufficient unto the day thereof. However, what it lacks, thirty-four years later, is any exemplars, for example, following Kuhn’s requirement for knowledge development and validation. The exemplars would be needed for any re-publication of the paper and would require the complete, coherent and fit-for-purpose operationalisation, test and generalisation of the Framework, as set out in Figure 1. A busy time for someone……

References

[1 ] du Boulay, B. and O’Shea, T. Seeing the works: a strategy of teaching interactive programming. Paper presented at Workshop on ‘Computing Skills and Adaptive Systems’, Liverpool, March 1978.

[2] Hammond, N.V., Long, J.B. and Clark, I.A. Introducing the interactive computer at work: the users’ views. Paper presented at Workshop on ‘Computing Skills and Adaptive Systems’, Liverpool, March 1978.

[3] Wilson, T. Choosing social factors which should determine telecommunications hardware design and implementation. Paper presented at Eighth International Symposium on Human Factors in Telecommunications, Cambridge, September 1977.

[4] Documenting Human-computer Mismatch with the occasional interactive user. APU/IBM project report no. 3, MRC Applied Psychology Unit. Cambridge, September 1978.

[5] Hammond, N.V. and Barnard, P.J. An interactive test vehicle for the investigation of man-computer interaction. Paper presented at BPS Mathematical and Statistical Section Meeting on ‘Laboratory Work Achievable only by Using a Computer’, London, September 1978.

[6] An interactive test vehicle for the study of man-computer interaction. APU/IBM project report no. 1, MRC Applied Psychology Unit, Cambridge, September 1978.

[7] Halliday, M.A.K. Notes on transitivity and theme in English. Part 1. Journal of Linguistics, 1967, 3, 199-244.

FIGURE 3: STATE TRANSITION EXAMPLE


A Preliminary Model of the Planning and Control of the Combined Response to Disaster

 

Becky Hill & John Long 

Ergonomics & HCI Unit, University College London,

26 Bedford Way, London WC1H 0AP, England.

Tel: +44 (0)171 387 7050.

Fax: +44 (0)171 580 1100.

E-mail: b.hill@ucl.ac.uk.

ABSTRACT 

The emergency management combined response system is a planning and control system, set up to manage disasters. This paper presents a preliminary model of the planning and control of the combined response system and its domain. The model is based on a framework for modelling planning and control for multiple task work and on data from a training scenario. The application of the framework has extended its scope to the new domain of emergency management combined response. In addition, the preliminary model is used to identify tasks and behaviours of the combined response system which highlight issues of co-ordination.

Keywords 

Planning and control, multiple tasks, emergency management

INTRODUCTION 

This paper presents an informal analytic assessment of a model of the emergency management combined response (EMCR) worksystem with respect to the data from a training scenario. The model was derived by applying a framework for modelling planning and control in multiple task work (PCMT) to the data from an emergency management training scenario. This framework has previously been applied to develop models of three office administration planning and control domains: Medical Reception (Hill, Smith, Long and Whitefield 1995), Secretarial Office Administration (Smith, Hill, Long and Whitefield 1992) and Solicitors’ Legal Work. The intention here was to extend the scope of the PCMT framework by using it to model a domain other than office administration. The first section of the paper characterises the domain to be modelled, identifying issues with the current design of the EMCR worksystem. The following section outlines the PCMT framework. Next, the data collection, training scenario and example data are described. The model of PCMT-EMCR is then presented. Using the model to identify overlap between worksystem behaviours is exemplified. Last, issues related to the use of this framework for modelling the domain of EMCR are identified, along with plans for future work.

EMCR WORKSYSTEM 

The planning and control system, which is set up for emergency response to disaster, is that of the ‘combined’ response (EMCR). The EMCR worksystem has a command and control organisation with a three-tier structure. Different agencies use this structure to organise their own planning behaviours, so that they co-ordinate effectively with one another. The three levels of command and control are operational, tactical and strategic. At each level the agencies have their own commander for co-ordinating the response. At the strategic level, these commanders make up a ‘senior co-ordinating group’. The operational response is carried out by each agency, concentrating on its specific roles (tasks) within its areas of responsibility. For example, the Fire Service fights fires. The tactical response determines the priority in allocating resources, for example additional fire engines. It also plans and co-ordinates the overall response, obtaining other resources as required. The strategic co-ordinating group formulates the overall policy within which the response to a major incident is made. This EMCR worksystem has been specified to ensure better co-ordination of the different agencies involved in the response. Co-ordination in this context is defined as the ‘harmonious integration of the expertise of all the agencies involved with the object of effectively and efficiently bringing the incident to a successful conclusion’ (Emergency Planning College document, 1995). The principal agencies involved are the Police, Fire Service and Ambulance Service. The primary objectives for all the emergency services are: ‘to save life, prevent escalation of the disaster, to relieve suffering, to facilitate investigation of the incident, safeguard the environment, protect property and restore normality’ (Home Office publication, 1994).

However, a succession of public enquiry documents analysing various disasters have identified co-ordination problems within EMCR. The Home Office Emergency Planning Research Group has also identified problems relating to co-ordination within EMCR (see acknowledgements).

Each emergency service has its own major incident plan. Each plan specifies a set of roles/tasks that need to be supported when responding to a disaster, for example, evacuation, setting up a casualty clearing station etc. Some of the tasks, at the most generic level of description, are the same for the three emergency services; for example, the Police and the Fire Service both support evacuation tasks. Most of the behaviours which support these tasks for the two services are different, but some are the same. Sometimes the behaviours overlap between the different services when they are supporting different tasks. The Fire Service may be attempting to set up an inner cordon to contain a hazardous scene; the Ambulance Service may be attempting to access the same scene to locate casualties. It is proposed here that, when behaviours overlap, co-ordination between the services may become a problem. (Behaviours do not necessarily have to be the same to overlap.)

In developing the present preliminary model of EMCR with respect to a training scenario, it is intended to identify such overlapping behaviours to support better the identification of the co-ordination problems raised by the system.

FRAMEWORK 

A framework was developed for modelling the planning and control of multiple task work (PCMT) in office administration. The PCMT framework makes a fundamental distinction between an interactive worksystem, comprising one or more users and computers/devices, and its domain of application, constituting the work carried out by the worksystem. This distinction allows for an expression of the effectiveness (performance) with which the work is carried out, conceptualised as a function of two factors: the quality of the task (i.e. whether the desired goals have been achieved), and resource costs (i.e. the resources required by the worksystem to accomplish the work) (Dowell and Long 1989).

In the framework, a domain of application (or work domain e.g. the disaster) is described in terms of objects, which may be abstract or physical. Objects are defined by their attributes, which have values. The attribute values of an object may be related to the attribute values of one or more other objects. An object at any time is determined by the values of its attributes. The worksystem performs work by changing the value of domain objects (i.e. by transforming their attribute values) to their desired values, as specified by the work goal.
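The object-attribute-value description above lends itself to a short sketch. The attribute names and values below are invented for illustration, not taken from the paper: an object is a set of attribute values, the work goal specifies desired values, and the worksystem performs work by transforming attribute values until none differ from the goal.

```python
# A hypothetical domain object and the work goal specifying its
# desired attribute values (names invented for illustration).
casualty = {"location": "scene", "triaged": False, "treated": False}
work_goal = {"location": "hospital", "triaged": True, "treated": True}

def work_remaining(obj, goal):
    """Attributes whose values still differ from their desired values."""
    return {k for k, v in goal.items() if obj.get(k) != v}

# A worksystem behaviour transforms one attribute value.
casualty["triaged"] = True
print(sorted(work_remaining(casualty, work_goal)))
```

On this description, effectiveness (performance) can be read off directly: task quality is how far `work_remaining` has been emptied, against the resource costs incurred in emptying it.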

The framework defines a number of worksystem structures for the planning and control of multiple task work. These structures are expressed at both abstract and physical levels of description. First, the framework describes the worksystem’s abstract cognitive structures. These structures comprise four processes (planning, controlling, perceiving and executing), and two representational structures (plans and knowledge-of-tasks). The four processes support the behaviours of planning, control, perception and execution respectively. A complete description for this set of structures, and the associated rationale, is to be found elsewhere (Smith, Hill, Long and Whitefield, 1992).

At the first (abstract) level of description, Plans are specifications of required transformations of domain objects and/or of required behaviours (to achieve goals). They may be partial (in the sense that they may specify only some of the behaviours or transformations). They may also be general (in the sense that some behaviours or transformations may be specified only generally, and not at a level which can be directly executed). Planning behaviours thus specify the required transformations and/or behaviours to support domain object transformations.

Perception and execution behaviours are, respectively, those whereby the worksystem acquires information about the domain objects and those whereby it carries out work which changes the value of the attributes for those objects as desired. Information about domain objects from perception behaviours is expressed in the knowledge-of-tasks representation. Control behaviours entail deciding which behaviour to carry out next, both within and between tasks.

The second level of description of planning and control structures is physical, wherein the framework describes the distribution of the abstract cognitive structures across the physically separate user and devices of particular worksystems.

This framework was chosen to model the combined response worksystem because:

(i) the EMCR worksystem is a planning and control system;

(ii) the work involves multiple tasks; and

(iii) there is a need to identify the tasks and behaviours of the worksystem in relationship to the work, to better identify co-ordination problems within the EMCR worksystem.

DATA COLLECTION 

Data were collected by means of a training scenario which took place at the Home Office Emergency Planning Training College. The trainees were members of the emergency services and local authority emergency planning officers. The exercise required the trainees to describe their response to the disaster scenario from initial response to restoration of normality. Data were collected by transcribing the descriptions given by the trainees, and through presentations given by the trainees about their response.

There are various phases in response to the disaster scenario, each of which can be clearly defined, and which involve different worksystem configurations and different worksystem behaviours. In this paper, only one worksystem configuration and its behaviours is modelled – that of the initial response phase to the scenario.

Scenario 

‘At 9.30am on a weekday during school term time, a tanker train en route from a refinery to an airport fuel depot is derailed whilst passing over a railway bridge. The railway bridge carries the railway over the main access road to a town from east to west. The main access road bisects the town in a north/south direction. There are market stalls set up on the roadway on either side of the bridge. At the time of the accident, a tourist bus carrying 45 foreign tourists is passing beneath the railway bridge. One of the tank cars is ruptured during the derailment and aviation fuel flows down the sides of the embankment onto the roadway. Flammable vapours from the fuel have been ignited by an open gas burner from a catering caravan. The explosion has created severe structural damage in a 30 metre radius and moderate structural damage in a 100 metre radius. At least 50 people have been killed, including some of the foreign tourists, many people have received burns, and many people are trapped. A number have also been contaminated with aviation fuel. The leaking fuel has run down the road and is entering the canal and watercourses at the bottom of the incline. There is evidence to believe that vandalism may be responsible for the derailment.’

DESCRIPTION OF THE INITIAL COMBINED RESPONSE DATA 

A major incident was declared within 15 minutes of assessing the situation. Thus, all the emergency services initiated their major incident plans. Each service has a set of defined roles or tasks specified in their individual plans. For this scenario, examples of initial Police roles/tasks were:

setting up and manning an outer cordon around the site (a larger area than the scene).

preserving and managing the scene

logging all personnel entering the outer cordon

establishing routes and rendezvous points and marshalling areas for all the emergency services

Examples of the initial Fire Service roles/tasks were:

set up an inner cordon

control the fire

carry out immediate rescue

stem the flow of the hazardous substance

monitor the health and safety of all inside the inner cordon

Examples of the initial Ambulance Service roles/tasks were:

set up a casualty clearing point

set up a triage point

get medics to the scene

help with evacuation of bedridden people

For each of these tasks, the personnel required were identified and are shown in the model.

PCMT-EMCR MODEL 

The model describes the EMCR worksystem and its domain for the initial response scenario data. The initial response in this scenario had no strategic level of command; thus, although the strategic-level physical worksystem users and devices are represented in the model, they are included only for completeness and no further description is offered. Also, the operational physical users and devices change rapidly over time; the model represents only those entities present for the initial response. The local authority response has not been modelled.

EMCR Domain 

Based on the PCMT framework, the EMCR domain is expressed as those objects whose transformation constitutes the work of the EMCR worksystem.

In the model, the domain is conceptualised as having a single disaster object comprising other abstract objects, such as lives, property and environment. The disaster object and its component sub-objects are abstract. The sub-objects of the domain are based on the identified primary objectives of EMCR (see earlier). In addition, the domain contains physical objects.

The work carried out by the EMCR worksystem is thus the transformation of the disaster object’s attribute values: each task carried out by the worksystem transforms the attribute values of the disaster object. For example, one attribute is stability, which has values along a continuum (i.e. very unstable, slightly unstable, unstable, nearly stable, moderately stable, stable). The work of the EMCR worksystem is to transform this attribute to a desired level of stability. In order to transform this attribute, the attributes of the component sub-objects must be transformed: the stability value changes by manipulating the values of the attributes of the other objects of the domain, for example, transforming the ‘lives object’ attribute survivor from not rescued to rescued. The transformation of the abstract object attributes results from the manipulation of the physical domain objects by the worksystem.

Attributes may be affordant or dispositional. Affordant attributes are transformed by the worksystem; their transformation constitutes the work performed. Dispositional attributes are relevant to the work: they need to be perceived by the worksystem, but are not changed by it. For example, for the ‘lives object’, the attribute survivor has a value of mobility injured or mobility uninjured. This value needs to be perceived by the EMCR worksystem, so that the Ambulance Service knows there is a casualty to be transported; however, the worksystem does not change the injured state of the survivor. In Figure 1, all the starred attributes are dispositional.
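As a rough illustration, the object-attribute structure described above can be sketched in a few lines of code. This is not part of the PCMT framework itself; all class and attribute names are illustrative assumptions, taken only from the example in the text (the ‘lives object’, its affordant rescued attribute and its dispositional mobility attribute).

```python
from dataclasses import dataclass, field

@dataclass
class Attribute:
    name: str
    value: str
    dispositional: bool = False  # dispositional attributes are perceived, never transformed

@dataclass
class DomainObject:
    name: str
    attributes: dict = field(default_factory=dict)

    def transform(self, attr_name, new_value):
        # Only affordant attributes may be transformed; their transformation
        # constitutes the work performed by the worksystem.
        attr = self.attributes[attr_name]
        if attr.dispositional:
            raise ValueError(f"{attr_name} is dispositional: it may be perceived but not changed")
        attr.value = new_value

# The 'lives object': rescue state is affordant, survivor mobility is dispositional.
lives = DomainObject("lives", {
    "survivor rescued": Attribute("survivor rescued", "not rescued"),
    "survivor mobility": Attribute("survivor mobility", "mobility injured", dispositional=True),
})

lives.transform("survivor rescued", "rescued")  # work performed by the worksystem
```

On this sketch, attempting to transform the dispositional mobility attribute raises an error, mirroring the distinction drawn in the text.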

Figure 1 shows the physical domain objects (but not their attributes) in the model. Included are all the objects identified from the scenario data. The objects include those associated with the example which follows.

EMCR Worksystem 

The model of the EMCR worksystem (see Figure 1) shows the cognitive structures of the PCMT framework. The cognitive structures in Figure 1 embody the planning and control behaviours described earlier. The framework comprises four process structures – planning; controlling; perceiving; and executing – and two representation structures: plans and knowledge of tasks. For EMCR, these cognitive structures were identified as follows:

A perceiving process, which acquires information about property, lives and other domain objects, such as their risk status. This information appears in the knowledge-of-tasks representation. A task in EMCR will be different for the different agencies. A task for the Police could be setting up an outer cordon. A task for the Fire Service could be making sure all personnel are safe within the inner cordon.

An executing process which transforms domain object attribute values. For example, moving injured survivors away from the scene transforms the ‘lives object’ attribute survivor from not rescued to rescued.

A controlling process which decides, on the basis of the plans and knowledge-of-tasks, which of the other processes should be carried out next. For example, the controlling process might direct the executing process to perform the execution behaviours of evacuating people next (based on the perceiving process having identified where people were, and the knowledge-of-tasks information that people in this situation are at risk), rather than directing the planning process to specify how to preserve the scene. (Thus, the tasks carried out by EMCR are prioritised.)

A planning process, which constructs plans based on knowledge-of-tasks. For example, using information from knowledge-of-tasks that the scene needs to be preserved for investigation, the planning process constructs a plan of the sequence of behaviours needed to carry out this task.

The Plans representation structure embodies the different types of plan used in the combined response.

A knowledge-of-tasks structure, which represents knowledge of relevant aspects of the work domain. For example, the type of hazardous substance.
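The interplay of the four processes and two representations described above can be sketched as a simple control loop, in which the controlling process chooses, on the basis of plans and knowledge-of-tasks, what happens next. This is a hedged caricature, not the framework itself; every function and variable name below is an illustrative assumption.

```python
def pcmt_cycle(knowledge_of_tasks, plans, perceive, plan, execute, prioritise):
    """Run one pass of the four PCMT processes under the controlling process."""
    perceive(knowledge_of_tasks)                           # perceiving process
    task, action = prioritise(knowledge_of_tasks, plans)   # controlling process
    if action == "plan":                                   # planning process
        plans[task] = plan(task, knowledge_of_tasks)
    elif action == "execute":                              # executing process
        execute(plans[task], knowledge_of_tasks)
    return task, action

# Toy demonstration, loosely based on the evacuation example in the text.
knowledge = {"people at risk": True}
plans = {}

def perceive(k):          # acquire domain information
    k["people at risk"] = True

def prioritise(k, p):     # evacuation is prioritised; plan before executing
    if k["people at risk"] and "evacuate" not in p:
        return "evacuate", "plan"
    return "evacuate", "execute"

def plan(task, k):        # construct a sequence of behaviours for the task
    return ["locate people", "move people out of the area"]

def execute(steps, k):    # transform domain object attribute values
    k["people at risk"] = False

task, action = pcmt_cycle(knowledge, plans, perceive, plan, execute, prioritise)
# the first cycle constructs the evacuation plan; a second cycle would execute it
```

The point of the sketch is only that prioritisation is the controlling process’s decision about which process runs next, as the list above describes.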

The physical worksystem comprises the Police and their devices, the Fire Service and their devices, the Ambulance Service and their devices (plus other agencies and their devices which may be involved). The framework allows the construction of alternative models of the distribution of cognitive structures across the user and devices. Thus, it supports reasoning about allocation of function. In the case of the EMCR worksystem, planning and control are distributed across the different levels of the worksystem hierarchy. In the model, the distribution of the structures across the physical worksystem is not shown. However, these physical worksystem structures support its behaviours. In the example which follows, the worksystem behaviours are identified for a selection of data from the training scenario to illustrate this distribution across the worksystem.

RELATIONSHIP BETWEEN THE DOMAIN AND THE WORKSYSTEM 

This section describes an example of two tasks being carried out by the combined response system, in terms of the planning, control, perception and execution behaviours and the transformations these behaviours perform in the domain. This description helps to identify potential conflicts within the combined response system behaviours which may cause co-ordination problems. It also highlights some issues related to using the PCMT framework for modelling EMCR, which will be discussed in the final section.

The Fire Service operational commander carries out perception behaviours that update his knowledge-of-tasks with the information that there are structurally damaged buildings, fires and leaking hazardous fuels. He then carries out control behaviours that direct him to consult the major incident plan. According to the plan, the Fire Service is responsible for setting up an inner safety cordon when there are hazards and dangers at the scene, and for maintaining the safety of the emergency service personnel within this cordon. Based on this plan, the operational commander then carries out control behaviours that direct him to consult the operational plan for setting up a cordon. The operational plan gives guidance on cordon set-up and regulations.

The operational commander then carries out control behaviours that direct him to carry out planning, based on the operational plan and the knowledge-of-tasks. The planning specifies how the inner cordon should be set up and what the regulations are for entering it. The operational commander then carries out control behaviours that direct him to carry out an execution behaviour of setting up the cordon. This execution behaviour is carried out by the operational personnel setting up the cordon and maintaining the specified safety regulations. It is this manipulation of the physical objects that transforms the abstract ‘disaster scene object’ attribute scene from uncontained to contained.

The operational commander then carries out control behaviours that direct him to inform the tactical incident officer of the inner cordon set up. The tactical incident officer thus carries out perception behaviours which update his knowledge-of-tasks about the inner cordon set up. The tactical incident officer then carries out control behaviours that direct him to consult his plan to assess the resources required for the set up. He then carries out planning behaviours to specify the resources required for this task.

At the same time, the operational Ambulance senior officer is carrying out perception behaviours that update his knowledge-of-tasks with the information that there are a number of casualties at the scene. He then carries out control behaviours that direct him to consult his major incident plan. According to the plan, casualties must be located and then either treated at the scene and/or transported to hospital. He then carries out control behaviours that direct him to carry out planning behaviours to specify in the operational plan what personnel are required and how to access the casualties. This plan then directs him to carry out control behaviours to direct the execution behaviours of accessing, treating and transporting casualties. These execution behaviours are carried out by the Ambulance operational personnel. It is this manipulation of the physical objects which transforms the abstract ‘lives object’ attribute survivor from not transported to transported.

However, the Ambulance operational senior officer has not carried out perception behaviours that update his knowledge-of-tasks with the fact that the scene is now contained. Therefore, when the Ambulance personnel attempt to carry out their execution behaviours, they do not fulfil the proper safety requirements which would allow them to enter the inner cordon, and the execution behaviours of transporting casualties cannot be carried out. The primary objective of EMCR is to save life; here, there is an overlap of behaviours which is hindering this objective. Also, if the Ambulance Service is not allowed to carry out its execution behaviours, then, to increase the saving of life, the Fire Service must carry out rescue execution behaviours to move the casualties to the edge of the inner cordon. These rescue execution behaviours will decrease the resources available for carrying out the execution behaviours of controlling the hazard, thus decreasing the effectiveness of the response to the secondary objective of preventing escalation of the disaster.

To attempt to rectify this situation, the Ambulance senior officer informs the Ambulance incident officer of the inner cordon regulations. Thus, the Ambulance incident officer carries out perception behaviours that update his knowledge-of-tasks about the inner cordon regulations. He then carries out control behaviours that direct him to carry out planning behaviours to specify in the plan the required equipment for all resources directed to the scene. Until these regularised resources arrive, the Ambulance incident officer needs to communicate with the Fire Service incident officer to try to negotiate access for the operational personnel. The Fire Service incident officer should have informed the Ambulance incident officer of the regulations, to obviate the problem of co-ordination.
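The co-ordination failure in this example can be caricatured in a few lines: each service gates its execution behaviours on its own, local knowledge-of-tasks, so work is blocked whenever perception behaviours have not propagated the relevant information between services. The names and the entry check below are illustrative assumptions, not part of the model.

```python
# Each service holds its own knowledge-of-tasks.
fire_knowledge = {"inner cordon regulations": "safety equipment required"}
ambulance_knowledge = {}   # perception behaviours have not updated this

def may_enter_inner_cordon(service_knowledge):
    # Entry is only possible if the service knows (and so can meet) the regulations.
    return "inner cordon regulations" in service_knowledge

assert may_enter_inner_cordon(fire_knowledge)
assert not may_enter_inner_cordon(ambulance_knowledge)  # casualty transport is blocked

# The rectifying communication: the incident officers share the regulations.
ambulance_knowledge.update(fire_knowledge)
```

After the update, the check passes: the sketch makes the point that the blockage lies in the distribution of knowledge across the worksystem, not in the domain itself.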

CONCLUSIONS 

This paper has described the application of the PCMT framework to modelling the domain of Emergency Management Combined Response (EMCR). A preliminary model has been developed based on data from a training scenario. This final section discusses issues related to using the PCMT framework for modelling this domain, and the use of the model for identifying overlapping behaviours that highlight co-ordination issues within EMCR.

The PCMT framework is intended for modelling planning and control in multiple task domains. EMCR is such a domain, but there are differences between it and the other domains already modelled. First, EMCR has a changing worksystem; the PCMT representation does not currently represent a changing worksystem, so the preliminary model represents only one phase of response. Second, EMCR is multiple task but, unlike the previous domains studied, there are not multiple objects at the highest level (i.e. multiple disaster objects); rather, there are multiple sub-objects, such as multiple lives objects. Third, EMCR is made up of multiple agents within a complex three-tier command structure. The PCMT framework has so far only been applied to domains with a single level of operation. Thus, interactions between the different horizontal layers and different vertical layers of the system are difficult to describe in terms of the present framework. The difficulty can be appreciated in the example, where the different commanders liaise at the same horizontal level, and information flows between vertical levels.

However, the model provides a description of the system and its domain that can be used to identify overlapping behaviours, albeit for a simple example and in a limited context. The allocation of the abstract structures across the physical worksystem can be inferred from the description of the physical behaviours, which can then be used to identify possible reconfigurations of the worksystem to support more effective co-ordination.

Future work will develop more complete models of EMCR in response to a disaster scenario and in so doing extend the PCMT framework to accommodate the issues described earlier.

ACKNOWLEDGEMENTS 

This work is part funded by the Home Office Emergency Planning Research group under the EPSRC CASE scheme.

Thanks are due to the Home Office Emergency Planning College for their infinite co-operation with the data collection.

REFERENCES 

Dowell, J. and Long, J. (1989) Towards a conception for an engineering discipline of human factors. Ergonomics, 32, 1513-1536.

Hill, B., Long, J.B., Smith, W. and Whitefield, A.D. (1995) A model of medical reception – the planning and control of multiple task work, Applied Cognitive Psychology, 9, 81-114.

Smith, M.W., Hill, B., Long, J.B. and Whitefield, A.D. (1992) Modelling the Relationship Between Planning, Control, Perception and Execution Behaviours in Interactive Worksystems. In D. Diaper, M. Harrison and A. Monk (Eds) People and Computers VII: Proceedings of HCI ’92. Cambridge University Press.

Dealing with Disaster 2nd Edition (1994). Home Office Publication HMSO London.

Emergency Planning College document (1995) Management levels in response to a major incident – The Principles of Command and Control.

Figure 1 The PCMT-EMCR Model


Engineering Works: What is (and is not) “Engineering” for Interactive Computer Systems – Ann Blandford (2013)


Introduction

I have known Ann for many years. Our paths have crossed and even at times overlapped. For example, I left the MRC Applied Psychology Unit, Cambridge, in 1979 to direct the Ergonomics Unit at UCL. Ann joined the APU in 1991 to work on the Amodeus Project with Phil Barnard, Michael Harrison and others, with whom I continued to keep in contact. Along with Steve Cummaford, Ann and I presented a joint tutorial entitled ‘Introduction to HCI Structured Engineering Design Methods’ at HCI ’02, which signalled a common interest in HCI Engineering.

I retired in 2001, to be replaced by Harold Thimbleby, who wisely brought Ann with him from Middlesex University. Ann, in turn, replaced Harold and directed UCLIC from 2004 to 2011. She and Alistair Sutcliffe endeared themselves to me for ever by editing the John Long Festschrift, even allowing me to contribute, which is unusual in festschrift circles. Indeed, it was the Festschrift, which gave me the idea for setting up <hciengineering.net>.

To complete the circle, I invited Ann to contribute to the Papers Section here and she kindly agreed. Even better, her paper continues the engineering theme of HCI, making an important contribution to the website. Indeed, the issues raised are so relevant that, with Ann’s agreement, I have added a commentary of my own, in an attempt to bring the issues together and to take them forward by means of a listing of Requirements for engineering interactive computer systems. Thank you Ann – much appreciated.

Engineering works: what is (and is not) “engineering” for interactive computer systems?

Ann Blandford, University College London, Dept. of Computer Science, Gower Street, London WC1E 6BT. A.Blandford@ucl.ac.uk

ABSTRACT

What does it mean to “engineer” an interactive computer system? Is it about the team doing the work (that they are engineers), about the process being followed, about the application domain, or what? Is engineering about managing complexity, safety or reliability? For physical artifacts, it may be possible to achieve consensus on how well engineered a product is, but this is more difficult for digital artifacts. In this talk, I will offer some perspectives, both positive and negative, on the nature of engineering for interactive computer systems and, at least implicitly, the nature and future of the EICS conference series.

Comment 1

Blandford’s paper is entirely concerned with Engineering. It is an important paper, as it raises critical issues, concerning ‘the nature of engineering for interactive computer systems’. The aim of this commentary is to suggest how these issues might be carried forward both generally and particularly in the form of Requirements for Engineering Interactive Computer Systems. The needs for such Requirements are identified in the comments and then listed separately.

Author Keywords

Engineering; HCI; safety; reliability; professionalism.

ACM Classification Keywords

H.5.2 [Information Interfaces and Presentation]: User Interfaces – Interaction styles.

General Terms

Human Factors; Design; Reliability; Verification.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.

EICS’13, June 24–27, 2013, London, United Kingdom. Copyright © 2013 ACM 978-1-4503-2138-9/13/06…$15.00.

INTRODUCTION

The aim of this short paper is to facilitate discussion on the role and value of engineering in relation to interactive computer systems.

Comment 2

Engineering, here, is presumably in contrast to the arts and to the sciences, which have their own different roles and values, if and when relevant to interactive computer systems. The contrast is instructive for distinguishing engineering approaches from those of other disciplines, or types of knowledge more generally.


It arises, in part, from an activity at the most recent IFIP WG2.7 / 13.4 (User Interface Engineering) working meeting: to develop a short video to communicate the value of engineering for user interfaces. It also arises from discussions I have had with various people on the nature and scope of the EICS conference. Both activities have generated more heat than light. It is, intentionally, not a well engineered argument for a particular position, but a series of vignettes putting forward different cases, for and against particular views of engineering in relation to interactive computer systems (ICS).


Comment 3

Blandford, indeed, achieves the aim of ‘putting forward different cases for and against particular views of engineering in relation to interactive computer systems’. The question now arises as to how to bring these views together and how to take them forward. The listing of Requirements for engineering interactive computer systems, as proposed here, is one such way.


My intention, which may or may not be realized, is that the community should establish a better shared understanding of the nature, value and role of engineering in the ICS context.

If it’s done by engineers then it is engineering

I am a Chartered Engineer. I started my career as a Graduate Trainee Engineer, at about the time the Finniston report [7] was published. That report emphasized the importance of engineering to the future of the economy, and also argued strongly for the status of professional engineers. Within the UK, that is a battle which has now been lost: “anyone in the UK may describe themselves as an engineer. Seeking to regulate or legislate on the use of a now common term is recognized by the Engineering Council as totally impractical.” [6] So, at least in the UK, anyone can call themselves an engineer, and – by extension – claim that what they are doing is engineering.


Comment 4

The reverse is presumably also the case: anyone who claims to be doing engineering at the same time claims to be an engineer. The latter case would seem to be more suited to the current engineering claims with respect to interactive computer systems.


A subset can make a stronger claim: that we are accredited as professional engineers. But is what any of us do “engineering”? Let us consider definitions of engineering.

Engineering: definitions

A dictionary definition of engineering is: “The application of scientific and mathematical principles (… to practical ends”.)


Comment 5

No-one would question engineering’s application of scientific and mathematical principles, where the latter already exist and are appropriate. However, in their absence, or if they are not relevant, ‘practical knowledge’ can also be claimed to support engineering practices (Wikipedia, 2015), as in ‘craft engineering’. The latter can also be claimed to be ‘rigorous’, in that one craft engineer, or one type of craft engineering, might be said to be more rigorous than another.


Requirement 1

Any approach to engineering interactive computer systems is required to specify the relationship between the explicit and the implicit knowledge used to support its practices.

(… A dictionary definition of engineering is: “The application of scientific and mathematical principles) … to practical ends” [14].

Comment 6

Most definitions of engineering, following Blandford, suppose it to be used for ‘practical ends’ and to solve practical problems. Confirmation is also to be found elsewhere: ‘to find suitable solutions to problems’; ‘to make improvements to the status quo’; etc. (Wikipedia, 2015). Solving practical problems, then, is central to engineering, including diagnosis of the problem in the first place. In the absence of the diagnosis of a problem, it is unclear what can be meant by a solution (see also Comment 5 and Requirement 1).


Requirement 2

Any approach to engineering interactive computer systems is required to specify the implicit and explicit engineering knowledge and practices used in the diagnosis of (engineering) problems and the prescription of (engineering) solutions.

The emphasis in the Engineering HCI Community of ACM is on “the application of scientific knowledge and rigorous design methodology…..

Comment 7

Blandford is, of course, quite right to identify the importance of design for engineering interactive computer systems. However, other definitions suggest the need to include implementation/build/use/research/improve/maintain, or some such (Wikipedia, 2015). Here, engineering is considered to include at least both design and implementation – the latter term being adopted as already in general use as concerns the development of interactive computer systems.


Requirement 3

Any approach to engineering interactive computer systems is required to decompose engineering into its component practices, at least to a level commensurate with that of ‘design’.

….. to reliably predict and, thus, help improve the consistency, usability, economy and safety of solutions to practical problems” [1].

Comment 8

The engineering criteria cited by Blandford for the successful address of practical problems – that is, ‘consistency, usability, economy, and safety’ – are uncontroversial. Additional criteria to be found in the literature are no more controversial: intended function; improvement of structures; economics of operation; performance as expected; efficiency; etc. (Wikipedia, 2015). No limit appears to be placed on the criteria, although they are related, directly or indirectly, to the knowledge and practices of engineering. The criteria can be either absolute (units of interactive work performed within some time limit) or relative (more units of work performed than at present).


Requirement 4

Any approach to engineering interactive computer systems is required to specify the performance criteria, by which the developed system is to be judged (and validated – see also Comment 17 and Requirement 9). Engineering for performance is central to any approach and implicates both design and implementation (see also Requirement 7).


Both of these definitions focus on principles and rigor for addressing practical problems. Clearly, these principles should apply in the design of complex, safety-critical systems [10]. It is much less clear what it means to engineer the user experience, in terms of fun or affect (a theme in this year’s call for papers).

Comment 9

There are many obvious differences, as pointed out by Blandford, between safety-critical systems and fun-, affect- and experience-systems. Both raise the issues of principles (or, more generally, knowledge and practices) and rigour (or, more generally, formality). Both these issues deserve more detailed development by the different approaches to engineering interactive computer systems.


Requirement 5

Any approach to engineering interactive computer systems is required to specify the formal and the empirical nature of its knowledge and practices (and indeed, the type of problem they are able to diagnose and the type of solution they are able to prescribe) – see also Requirement 4.

The science of fun is poorly developed, the mathematics of fun even more so. User experience is not so well understood that it can be reliably predicted or delivered consistently without extensive iterative testing, which is standard ICS development practice, and not particular to an engineering approach.

Comment 10

Blandford correctly asserts that iterative testing is not particular to an engineering approach to developing interactive computer systems. However, iterative testing currently makes an important contribution to the development of such systems, and may be an empirical means of making good some of the current deficit in formal engineering knowledge and practices. The empirical (and its rigour) might be usefully contrasted with the formal. See also Comment 9.


Requirement 6

Any approach to engineering interactive computer systems is required to specify the relationship between the formal and the empirical nature of its knowledge and practices (and, indeed, the type of problem they are able to diagnose and the type of solution they are able to prescribe). See also Comment 10.

The importance of iteration in ICS

Most HCI text books assert that iteration is essential in the design of ICS.


Comment 11

Engineering involves not only design components of the development of interactive computer systems, but also implementation ones (see also Comment 7 and Requirement 3).

Rogers et al [12] focus on four main phases of developing interactive systems: establish requirements; design alternatives; prototype; and evaluate – iteratively.

Comment 12

Iteration, as identified by Blandford, is indeed currently essential to the development of interactive computer systems (although not necessarily as a matter of principle). Empirical iterative knowledge and methods may be making good some of the deficit of formal (analytic) knowledge and methods. See also Comment 10.

Best practice in the development of ICS includes requirements gathering and user testing, neither of which is particularly amenable to the application of scientific or mathematical principles, although both can be done rigorously and are essential to the delivery of systems that are safe and usable.

Comment 13

The four main phases of interactive computer systems development, identified by Blandford and generally agreed within HCI, are: requirements; design alternatives; prototype; and evaluation. Blandford considers that neither requirements gathering nor user testing is ‘particularly amenable to the application of scientific or mathematical principles, although both can be done rigorously’. This is certainly the case at present and may indeed be the case in the longer term and even ultimately; but until the notion of principles is more clearly specified and developed, including its formality, it may be better to leave the matter open at this time. Perhaps that is the thrust of Blandford’s ‘not particularly amenable’.


Requirement 7

Any approach to engineering interactive systems is required to specify the different types of knowledge (for example: principles (both substantive and methodological); guidelines; models; etc.), which support different sorts of design and implementation practices (for example: trial and error; specify then implement; prototype, test and iterate; etc.).


Tools such as CogTool [13] bring an engineering rigor and prediction to important aspects of user interaction with interactive devices, based on task performance.

Comment 14

See Comment 8.

Similarly, model-based approaches to system development [9] support the task-based development of ICS. However, none of these “engineering” approaches take account of the softer, but equally important, aspects of the use of ICS, including the full user experience, how the ICS fits within its broader context of use, and how people conceptualise the activity the ICS is designed to support [2].

Comment 15

Blandford’s notion that certain aspects of interactive computer systems may be ‘softer’ than others (such as full user experience; context of use; and user conceptualisation of computer functionality) is an interesting one. The idea might be carried forward by relating it to engineering knowledge and practices and their problems and solutions. For example, ‘hard’ problems, which can be completely specified (for the purposes in hand), can be prescribed a solution by formal or empirical knowledge and practices. However, ‘soft’ problems, which cannot be completely specified (for the purposes in hand), can only be prescribed a solution by empirical knowledge and practices. The hardness or softness of a problem and solution can only be determined empirically at this time (and maybe for some time into the future too).


Requirement 8

Any approach to engineering interactive computer systems is required to specify the types of problem and solution, which it addresses, as well as their relationship with formal and empirical knowledge and practices. This requirement applies to both design and implementation.

 

Without taking such aspects of use into account, the engineering of ICS runs the risk of delivering solutions to the wrong practical problems.

Comment 16

It is worth noting that, following lay language usage, a problem can be said to exist in the absence of a solution; but a solution can only be said to exist with respect to an actual or a possible problem. In addition, a problem can have many solutions and a solution can resolve many problems. Ignoring ‘softer’ interactive computer system aspects might indeed produce ‘a solution to the wrong practical problem’, as suggested by Blandford; but it might also produce ‘a wrong solution to the right practical problem’. Neither is the right solution to the right problem, which is the aim of engineering.

A case study: CHI+MED

The CHI+MED programme [5] provides an interesting object of study in terms of engineering ICS. CHI+MED is studying the design and use of interactive medical devices: safety-critical devices such as infusion pumps that are themselves moderately complex, and are used in highly complex settings. There are many aspects of these devices that can be subjected to an engineering approach, including modeling their safety properties [4] and formal verification [8].

Comment 17

Blandford is right to identify the importance of formal verification (although other types of verification may be possible). However, a pre-requisite to verification is derivation (or some such). In order to verify two representations, or whatever, both must be available and in a form, which can indeed be verified. This requirement holds for all products and representations involved in the engineering design and implementation processes. For example, for design (following Blandford): establishing requirements; prototypes; and conducting evaluations.

 

Requirement 9

Any approach to engineering interactive computer systems is required to specify how its products and representations are both derived and verified and how the latter relate to the validation of both design and implementation.

Such approaches are necessary, but not sufficient. There are many aspects of the use of such devices in practice [11] that need to be understood and designed for. Without systematic study of the use of devices in context, and rigorous description of the “problem”, which defines requirements for the next generation of systems, and without careful testing of device prototypes, it is easy to deliver solutions that are verified but not validated [3].

Comment 18

Blandford is quite right to identify the importance of the validation, as well as the verification, of interactive computer systems. Validation here cannot be less than evaluating to what extent the interactive computer system, as designed and implemented, meets (or fails to meet) the user requirements, especially those of performance. Such an evaluation would involve the conceptualisation and operationalisation of the user requirements and their test against the interactive computer system, intended (that is, claimed) to meet them.

 

Requirement 10

Any approach to engineering interactive computer systems is required to specify the processes by which the system is validated against its user requirements, especially those of performance, and their relationship with verification of both design and implementation.

 

There is a risk that by separating off “engineering” approaches to ICS, the engineering becomes distanced from the practical problems that it is intended to address.

Conclusion

There is an argument, based on the above, that engineering is the servant of design – that the user needs are identified outside the engineering process, that the engineer’s job is just to make the design as conceived by others work (ensuring that the system performs as intended – traditionally referred to as ‘verification’).

Comment 19

Blandford draws an interesting distinction between a ‘narrower’ and a ‘broader’ view of software development. According to the narrower view, the engineering process makes the design (as conceived by others) ‘work’ and is the object of verification. According to the broader view, software development is iterative and includes usability, utility and user experience and is the object of validation. The question arises, as to the relationship between these two views and how that relationship is to be embodied in an engineering approach to interactive computer system development.

It is tempting to assign the responsibilities associated with the narrower view to software engineering, as implementation, and the responsibilities associated with the broader view to HCI, as design. However, the temptation is to be rejected. First, many interfaces, at this time, are designed (as well as implemented) by software engineers. Some of these engineers have had some HCI training; but others have not. Second, the recent development of interface tools for design includes de facto components of implementation. However, how the narrow and broad responsibilities of design and implementation are to be assigned to practitioners and researchers is critical for both professional and academic education/training matters.

 

Requirement 11

Any approach to engineering interactive computer systems is required to specify the assignment of professional and academic responsibilities for verification (for example, property modelling; verification etc) and validation (for example, user requirements; context of use; user evaluation etc).

 

This seems at odds with the broader view of software development lifecycles that development is iterative [3], and is concerned with considerations of usability, utility and experience (all of which are arguably elements of ‘validation’), which should also be concerns for engineering.

The title of this paper, “Engineering works”, is a play on words. One reading is a claim: that engineering makes things better; that it provides assurance that the proposed solution to a problem (an ICS) is well engineered: that it will not crash or permit the system to get into unsafe states, and will manage complexity well. The second reading is as a noun, “works”, qualified with an adjective, “engineering”; engineering work is needed when things have gone wrong, or need maintenance. In the context of EICS, I suggest that both meanings pertain: that engineering can make complex systems work well, but that the engineering approach needs active maintenance to remain relevant to other aspects of ICS design and to avoid becoming narrow and irrelevant.

ACKNOWLEDGMENTS

This paper has benefitted from discussions with many people, but the viewpoints put forward are my own. CHI+MED is funded by EPSRC EP/059063/01.

REFERENCES

  1. ACM (n.d.) chi2013.acm.org/communities/engineering/ accessed 17/4/13.
  2. Blandford, A., Green, T. R., Furniss, D., & Makri, S. (2008). Evaluating system utility and conceptual fit using CASSM. Int. J. Human-Computer Studies, 66(6), 393-409.
  3. Boehm, B. W. (1988). A spiral model of software development and enhancement. Computer, 21(5), 61-72.
  4. Bowen, J. & Reeves, S. (in press) Modelling Safety Properties of Interactive Medical Systems. Proc. EICS 2013. To appear.
  5. CHI+MED (n.d.). www.chi-med.ac.uk.
  6. Engineering Council (n.d.). Status of Engineers. www.engc.org.uk/statusofengineers.aspx ac. 17/4/13.
  7. Finniston, H. M. (1980). Engineering our future: report. HM Stationery Off.
  8. Masci, P., Ayoub, A., Curzon, P., Harrison, M., Lee, I., Sokolsky, O. & Thimbleby, H. (in press) Verification of Interactive Software for Medical Devices: PCA Infusion Pumps and FDA Regulation as an Example. Proc. EICS 2013. To appear.
  9. Meixner, G., Paternò, F., & Vanderdonckt, J. (2011). Past, Present, and Future of Model-Based User Interface Development. i-com, 10(3), 2-11.

10. Navarre, D., Palanque, P., Ladry, J. F., & Barboni, E. (2009). ICOs: A model-based user interface description technique dedicated to interactive systems addressing usability, reliability and scalability. ACM Trans. CHI, 16(4), 18.

11. Rajkomar, A., & Blandford, A. (2012). Understanding infusion administration in the ICU through Distributed Cognition. J. biomedical informatics, 45(3), 580-590.

12. Rogers, Y., Sharp, H., & Preece, J. (2011). Interaction design: beyond human-computer interaction. Wiley.

13. Teo, L. H., John, B., & Blackmon, M. (2012). CogTool-Explorer: a model of goal-directed user exploration that considers information layout. Proc CHI. 2479-2488.

14. The Free Dictionary (n.d.) www.thefreedictionary.com/engineering accessed 17/4/13.

 


Grand Challenge in HCI: the Quest for Theory-led design – Alistair Sutcliffe (2005)

Grand Challenges in HCI: the Quest for Theory-led Design

Alistair Sutcliffe

Introduction

I have known Alistair (on and off) for about 25 years. We first met, I think, at HCI ’89 in Nottingham. We last met at his UCLIC seminar in February, 2014. I was delighted to see him still at it, so to speak. During those 25 years, we have each, in our own way, promoted an engineering approach to HCI.

It has been part of my good fortune to have been ‘promoted’, along with Alistair’s support for HCI. Alistair, as co-organiser with Linda Macaulay, invited me to present the keynote paper on the conference theme of ‘The Theory and Practice of HCI’. Fortuitously at this time, John Dowell and I were working on a Conception for HCI, which included both its Discipline and its Design Problem. Since the former included both HCI Knowledge and HCI Practice, I was able to interpret the conference theme in terms of the Discipline Conception. The result was: ‘Conceptions of the Discipline of HCI: Craft, Applied Science and Engineering’ (Long and Dowell, 1989). I remain grateful to Alistair for having given me this opportunity.

In 2010, I was again ‘promoted’ by Alistair, when he and Ann Blandford edited the John Long Festschrift (www.hci-site-experiments.co.uk Section 2) as a special issue of the journal Interacting with Computers (Vol. 22, Issue 1, 2010). Indeed, it was the festschrift which gave me the idea for the present website, as the front-end to the Ergonomics Unit legacy archive, which I am in the process of setting up in the UCL Discovery repository.

To complete the circle, I invited Alistair to contribute to the Papers Section of the website and he kindly agreed. Even better, his paper continues both the engineering theme of HCI (as well as the HCI ’89 conference theme of ‘The Theory and Practice of HCI’). The paper, thus, makes an important contribution to this website. Thank you Alistair – much appreciated.

Centre for HCI Design, School of Informatics, University of Manchester, PO Box 88, Manchester, UK

Tel: +44 161 200 3315

Email: ags@manchester.ac.uk

A grand challenge of theory-led design is proposed for HCI. The history and state of the art in HCI theory and knowledge is reviewed, expanding on Long & Dowell’s conception of HCI. A new approach to bridging from theory to design practice is proposed that uses generic task models as a means of locating theory-based design advice. Theory-based knowledge is also transferred to design as bridging models and as critical cognitive aspects which are applied to task models. Design of specific applications uses theory-based knowledge via mappings to generic task models and by the application of bridging models. The approach is illustrated by a case study investigation of the design issues in notifier systems and explained with a design scenario of hospital patient monitoring.

Keywords: theory, cognitive models, claims, bridging representations.

1 Introduction

Grand challenges have been much in vogue in the computer science community [UKCRC 2005]; however, the computational emphasis in the grand challenges competition has sidelined HCI to a lip-service role in the Memories for Life challenge. Fred Brooks [2003], however, was more enlightened and cites design of effective user interfaces as one of three major challenges for the 21st century. The HCI community should not be depressed by this outcome; instead, I argue it should propose more ambitious and wider ranging grand challenges of its own, and in doing so demonstrate that it has a broader ambition than computer science. In this paper I will propose and expound the challenge of Theory-led Design.

The development and application of theory is a sign of intellectual respect in most scientific or engineering disciplines, and HCI should be no exception. Long & Dowell’s conception of HCI as craft, science and engineering urged systematic development of principles, laws and guidelines derived from theory [Long & Dowell 1989; Dowell & Long 1998]. Barnard and colleagues in the AMODEUS project represent the most concerted effort to translate theory into design practice over several years. The cognitive theory of Interacting Cognitive Sub-Systems [Teasdale & Barnard 1993], in combination with more formal system modelling, was proposed as a toolbox for designers [Barnard et al. 2000]. In restricted case studies, these models could be demonstrated to ‘cohabit’, i.e. applications of each model analysed different views on the same problem [Harrison & Barnard 1993]. However, a key problem has been how to translate knowledge in psychological theory into a form that is usable by designers. The task artefact theory has been proposed as a means of developing HCI knowledge [Carroll & Rosson 1992], using claims to express fragments of theory-based knowledge with contextual references to design problems, artefacts and scenarios [Sutcliffe 2000; Sutcliffe & Carroll 1999]. While claims continue to make modest progress [Carroll 2000; Sutcliffe et al. 2003], they have not as yet become widespread as a bridging representation between theory and design practice. The state of the art for HCI advice still remains as principles [Sutcliffe 2003; Dix et al. 2003], or golden rules [Shneiderman & Plaisant 2004] and the legion of, one suspects, under-used ISO guidelines [ISO 1997, 1998, 2000]. More recently HCI patterns have proliferated [Borchers 2001; Van Welie 2005], often without any validation. While patterns can express knowledge in a more contextualized manner, unless they are composed into a pattern language [Alexander et al. 1977], they can present incompatible, inconsistent and fragmentary pieces of HCI knowledge. The standardization process of developing an HCI pattern language has been slow to emerge and one can question whether it would be necessary, since patterns duplicate much knowledge already present in better regulated sets of guidelines.

Some may consider development of exemplar artefacts as a more productive way to advance HCI. However, I believe that without theory, HCI will be doomed to the invention of stimulating yet ephemeral artefacts [e.g. Ishii et al. 2001] without generating generalized long-term knowledge. The thesis in this paper is to urge the HCI community to adopt the Science of Design [NSF 2004] and thereby become more than a sub-discipline of computer science. Instead, HCI can become the Science of User-Centred Technological Design. In the remainder of this paper, I will review the role of theory and HCI knowledge, and then propose an innovative approach based on a new role for familiar task models. This will be followed by application of the approach to investigate a particular area of attentive user interfaces, with a brief discussion to conclude the paper.

2 Theories and HCI Knowledge

HCI is not short of cognitive theories to apply to design; see for example ACT-R [Anderson & Lebiere 1998], EPIC [Kieras & Meyer 1997] and LICAI [Kitajima & Polson 1997]. However, theories of cognition have proved difficult to apply in practice. They can be used to motivate experiments and demonstrate interesting and useful design phenomena, for example Hornof’s work on optimizing menu layout for efficient search, and eyetracking studies that demonstrate that banner adverts are not attended to or remembered but do impose subliminal cognitive cost [Burke et al. 2004]. This work uses EPIC as its theoretical lens. GOMS is the prime exemplar of applicable theory [John & Kieras 1995], and continues to be the focus of many studies, although evidence of its effective use in industry is hard to find. A cut-down version of LICAI has been successfully developed in combination with an information searching algorithm (latent semantic indexing) in the CoLiDes systems that can predict design flaws in website navigation and link cues given a model of the user’s search goals [Blackmon et al. 2003].

Figure 1: Long & Dowell’s conception of HCI, revisited.

The quest for bridging models from theory to design was pursued with some vigour, first via design rationale, providing the glue to synthesize contributions from different modelling approaches [Bellotti et al. 1995], and more convincingly by developing Cognitive Task Models to analyse task sequences and diagnose potential design problems indicated by reasoning with the ICS cognitive architecture [Barnard et al. 2000]. Barnard, in his 1998 plenary address to this conference, proposed ‘syndetic theory’ and a blending of several theoretical influences that could be applied to design problems at different levels of granularity — micro and macro theory. Unfortunately, application of this approach to scaled-up industrial design problems has never been demonstrated.

Interest in theoretical approaches has continued in cut-down approaches such as the Resource Theory that proposes an event modelling view incorporating cognitive issues such as working memory and selective attention [Wright et al. 2000]. Interactors [Dix & Mancini 1998] provided an interesting approach for encapsulating HCI knowledge in what are essentially ‘abstract interaction types’ (cf. ADTs) as specifications or code for generic interactive functions, e.g. multi-level undo. More specialized approaches, such as Information Foraging Theory, continue to motivate research in information retrieval [Pirolli & Card 1999], while Activity Theory [Nardi 1996] has been much in vogue as an analytic framework for ethnographic-style studies of CSCW and CMC systems. Although theory is being used, its prime role seems to be an indirect influence on design via analytic frameworks or general conceptual influences. For instance, it is difficult to extract prescriptive design advice from the concepts of conflict, activity level, artefacts and zone of proximal development in Activity Theory, even though some have attempted to derive heuristics from such opaque theory [Bertelsen & Bødker 2003]. The range of applicable theory has been demonstrated by interest in theories of emotion [e.g. Ortony et al. 1988], as an inspiration to frame design questions [Norman 2004], and more directly as an embedded cognitive model to drive the emotional response that embodies conversational agents [Cassell et al. 1999]. So theory is far from dead, but the pathway for theory application in design research is fragmented, and more critically to design practice, is unclear. The status quo is summarized in Figure 1. The influence of theory-based research on design is tending to be recruited via the scientific route in Long & Dowell’s framework, i.e. theory motivates experiments or case studies, the results of which provide design guidelines.

So is there a consensus of what has been achieved so far? In the majority of HCI text books, for example [Dix et al. 2003; Preece et al. 2002; Shneiderman & Plaisant 2004], theory is present as the Model Human Processor [Card et al. 1983], and as a bridging model in Norman’s [1986] action cycle. What is missing is the meta-level guidance on how to exploit and develop theory in the design process: the grand challenge of theory-led design.

We need to return to the debate on bridging from theory to design. Some promising directions have emerged in cognitive dimensions [Green & Petre 1996] which propose principles for the design and evaluation initially of notations but more recently of interactive systems. Object system models [Connell et al. 2003] attempt to apply cognitive dimensions within design contexts for particular problems, while critical parameters focus attention on key cognitive requirements for interactive systems [Newman et al. 2000]. The important direction, I believe, is to focus on typical classes of problems which are common in HCI, understand the cognitive and social implications of those problem classes and then apply the appropriate theories to solving them. HCI needs to investigate tasks more deeply in order to understand the psychological implications of certain classes of problem. Take monitoring tasks as a specific example. Monitoring tasks are known to be error prone and people find it difficult to concentrate over long periods of time. The types of errors people make and their problem-solving strategies can be predicted using several theories [Norman 1988; Rasmussen 1986; Hollnagel 1998]. Human error theory predicts that interruptions and time pressures produce frequency and recency gambling pathologies resulting in sub-optimal problem-solving strategies [Reason 1990].

Monitoring tasks involve a perceptual process of recognizing significant events, cognitive processes of interpreting events, and domain knowledge to evaluate the significance of events in context. Norman’s [1986] cycle of action can be reversed to set these issues in an interaction context; i.e. once a significant event has been recognized, interpreted, and evaluated, one has to plan, specify and execute the appropriate response. The critical cognitive aspects of monitoring tasks are selective attention and recognition of events, reasoning with domain knowledge to interpret the events, and then planning appropriate responses. To develop effective support systems the designer has to understand the problem, isolate the critical cognitive aspects, then use the appropriate theories or experimental evidence to find appropriate solutions. This approach is summarized in Figure 2.

Figure 2: Process map of theory-led design.

By linking theory to generic task models and bridging representations — be they cognitive dimensions, claims or critical parameters — a library of HCI knowledge can be built which enables effective reuse. Generic task models [Sutcliffe 2002] can play a key role by providing an access point from task analysis to reusable knowledge. If specific tasks could be mapped to generalized tasks, and if the latter are associated with theoretical, sound design advice, then designers will have advice placed in the context of their current problem. They still have to specialize the advice and reason about trade-offs, but this is an advance on ploughing through libraries of design guidelines or patterns. However, generic task models need to be associated with ‘critical aspects’ to place theory in its application context. I have used the aspect-oriented programming analogy since it illustrates the problem. Theory provides knowledge that is applicable in many parts of a design, i.e. cross-cutting concerns. The problem is to define the theory treatment for the aspect and when to apply it at ‘point cuts’ in the design. Linking critical aspects to generic task models indicates the point cuts, but defining the treatment is not so simple. This requires reasoning about trade-offs using bridging models. There is no silver bullet for producing high quality designs; however, theory-led design can make the bullet easier to aim. In the following section I will put this approach to the test with an exploration of design for notifier user interfaces, which support monitoring tasks.

3 Application of Theory-led Design

To illustrate how we should apply theory in HCI, I will investigate a current research area as a ‘gedanken experiment’. Notifiers, attentional user interfaces or dual task displays have attracted considerable attention over recent years [Horvitz et al. 2003; McCrickard et al. 2003]. This research is motivated by a general user need, i.e. to enhance support for multitasking and to monitor information such as stock markets or the weather, while performing another primary task. The research problem is best encapsulated by a scenario. Marilyn, a senior staff nurse, is in charge of a chronic care ward and has to keep an eye on her patients during the night shift. The patients are automatically given medication via drips for management of chronic and post-operative conditions, e.g. pain relievers, anti-coagulants, etc. The patients’ heart and respiratory rates and blood pressure are automatically monitored and displayed on a screen in Marilyn’s office. She has to monitor patients’ condition and adjust the rate of medication if the measured parameters depart from the expected treatment plan. More urgent alerts must be triggered if the patients experience distress, which could be picked up by heart ECG sensors or an emergency call button which can be activated by patients. If all is quiet, Marilyn is expected to catch up on the day’s paperwork to complete duty rosters, treatment records, etc.

Several approaches have been taken to this problem and many systems have been implemented which provide background monitoring displays as information tickers [Maglio & Campbell 2000], ambient wallpaper [Stasko et al. 2004] and radar metaphors to communicate the status of monitored information to the user [McCrickard & Chewar 2003]. However, these designs are based on intuition and give no guarantee of their effectiveness. Horvitz et al. [2003] have taken a more principled approach by proposing a cost of interruption measure expressed as the cost of diverting from the primary task compared with the benefit from the information gained from the monitoring tasks. This enables reasoning about when to interrupt using Bayesian networks with user defined valuation of the task activity, cost of interruption and environmental measures. Chewar et al. [2004] have elaborated this idea with a conceptual framework composed of three dimensions: Interruption, Reaction and Comprehension, that draws attention to the cognitive parameters of the problem, namely, when and how to alert the user to draw their attention away from the primary task, how to communicate information relevant to the secondary task and how to promote comprehension of such information. While these approaches are better than craft-level design, they do not constitute a theory-based approach.
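The cost-benefit reasoning attributed to Horvitz et al. can be caricatured in a few lines: interrupt only when the value of the monitored information exceeds the expected cost of diverting the user from the primary task. The sketch below is a deliberate simplification under assumed names and weights (all illustrative, not from the cited work, which infers these quantities with Bayesian networks over richer evidence):

```python
# Hedged sketch of a cost-of-interruption decision in the spirit of
# Horvitz et al. [2003]. All parameter names and numbers here are
# illustrative assumptions, not the published model.

def should_interrupt(info_value: float,
                     p_user_busy: float,
                     cost_if_busy: float,
                     cost_if_idle: float = 0.1) -> bool:
    """Interrupt when the information's value exceeds the expected
    cost of diverting the user from the primary task."""
    expected_cost = p_user_busy * cost_if_busy + (1 - p_user_busy) * cost_if_idle
    return info_value > expected_cost

# A routine update rarely justifies interrupting a busy user,
# but a safety-critical event does regardless of workload.
print(should_interrupt(info_value=0.2, p_user_busy=0.9, cost_if_busy=1.0))
print(should_interrupt(info_value=5.0, p_user_busy=0.9, cost_if_busy=1.0))
```

The single threshold comparison stands in for what is, in the original, a Bayesian network combining task activity, interruption cost and environmental measures.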

To examine the problem in a more principled manner we have to understand the problem from a psychological viewpoint, and this is the first impact of theory: the ability to conceptualize interaction problems. The first part of the problem is alerting the users to relevant information for the secondary task. This poses the system initiative question. Should the monitored information be accessible to the user all the time, and hence possibly distracting from the primary task, or should it only be made available when the system decides it is critical for the user’s interests? The second issue is how to ensure the user has seen or heard the critical information and responded to it. The third part of the problem is to support the user’s transition from the primary task to the secondary task.

When to alert requires a model of the user’s task, the user and the context. This is part of a user modelling grand challenge, since acquiring a model of the user and their context requires solving the difficult problem of inferring complex states from low-level data streams. I shall finesse this challenge and return to my main theme. Task analysis is the conventional HCI approach to understanding the user’s domain [Diaper 2002; Johnson 1992]. While a goal model and procedural sequences can give us breakpoints where intervention should be more appropriate, it doesn’t go far enough. The cognitive demands of the primary task will influence when the user interface should signal an alert. The role of theory is to provide a cognitive audit so we can understand the critical aspects of a particular task and plan appropriate action. An example of the cognitive task audit is illustrated in Table 1. In the hospital patient monitoring scenario, working memory has to cope with keeping track of several patients, while selective attention will be distributed between the several patient monitoring devices and the foreground tasks of updating drug registers and patient records. Reasoning will be rule-based using the nurse’s training, but knowledge-intensive judgement [Rasmussen 1986] will have to be applied when the unexpected happens.

Table 1: Cognitive task audit: Monitor and Evaluate tasks.

A cognitive task audit requires application of general psychological knowledge to the task in question. While this does utilize theoretical knowledge, it does so only second hand via education. The designer has to know the theory in order to apply it in a cognitive task analysis. However, if a set of generic task models could be developed and annotated with cognitive aspects, then theoretical knowledge could be reused by non-HCI expert designers. A taxonomy of generic tasks exists [Sutcliffe 2002]; however, the process of mapping from specific tasks to these generic models, annotating them with cognitive properties and then applying such knowledge are open research questions.

Three tasks involved in the notifier problem are illustrated in Figure 3, showing the critical cognitive aspects. For recognition, the salience and modality of the stimulus is important. This is set against environmental obstacles such as complexity of the environment and competing, similar stimuli. Precise design advice depends on knowledge of perceptual psychology [e.g. Ware 2000] which can not be encapsulated in simple models, although design principles for choice of media and modalities provide some guidance [Sutcliffe 2003]. These need to be combined with knowledge of critical aspects such as selective attention, to understand possible problems arising from competing stimuli on the same modality, and the effect of distractions on the monitoring task. In the Interpret and Evaluate tasks the critical aspects point to the need to reduce working memory burden by supplying appropriate information for understanding the event within a display appropriate to the user’s task and context.

Figure 3: Monitor, Interpret and Evaluate generic tasks annotated with critical aspects and environment.

Figure 4: Components in the Embedded Interaction Model for system initiative control.


Attention Resource Theory [Wickens 2002] can provide an appropriate framework for analysing the selective attention and when-to-intervene problem. This provides the basis for specifying an embedded cognitive model which predicts when to intervene, given a task model and monitored inputs of the user’s activity. This embedded interaction model (see Figure 4) could be configured for other notifier systems, thereby contributing another conduit by which theory can influence design via reusable computational models that embed theory for solving interaction problems.

Knowledge of the salience of perceptual stimuli and how the stimulus properties

affect attentional executive processing are necessary to design the alerting stimuli.

Such knowledge is only partially formulated even in cognitive theory such as EPIC

[Kieras & Meyer 1997] and experimental derived knowledge [Ware 2000], so the

designer will have to use several sources to understand how the salience of stimuli

interacts with users’ knowledge and motivation in the perceptual mechanism. This

points to a need for new theoretical development. However, sufficient knowledge

already exists to suggest a design solution for the appropriate alerting stimulus and

modality. In our scenario, the alert will depend on the criticality of the information.

If the monitored parameters stray out of a normal range but into a non-critical zone

then a low-level stimulus will be used (e.g. colour change in the information display);

whereas if parameters stray into a critical zone then more salient alerts, such as voice

or sound coupled with visual highlighting, or animated displays, are required. Note

that this design adaptation will also be driven by the embedded interaction model.
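The zone-to-stimulus mapping described above can be sketched as follows; the zone boundaries, parameter ranges and modality names are illustrative assumptions:

```python
# Sketch: choosing alert salience from the criticality of a monitored
# parameter. Zone boundaries and modality names are illustrative assumptions.

def classify(value: float, normal: tuple, non_critical: tuple) -> str:
    """Place a parameter value in the normal, non-critical or critical zone."""
    lo, hi = normal
    nc_lo, nc_hi = non_critical
    if lo <= value <= hi:
        return "normal"
    if nc_lo <= value <= nc_hi:
        return "non-critical"
    return "critical"

def alert_for(zone: str) -> list:
    """Map the zone to progressively more salient stimuli."""
    return {
        "normal": [],
        "non-critical": ["colour change"],          # low-level stimulus
        "critical": ["voice", "visual highlight"],  # salient multimodal alert
    }[zone]

# Assumed example: heart rate, normal 60-100 bpm, non-critical band 50-120 bpm.
assert alert_for(classify(80, (60, 100), (50, 120))) == []
assert alert_for(classify(110, (60, 100), (50, 120))) == ["colour change"]
assert "voice" in alert_for(classify(130, (60, 100), (50, 120)))
```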

Applying theory also helps in understanding how to manage the transition from the

primary to a secondary task, although once again the task has a critical intermediary

role. Marilyn has to monitor several stimuli, interpret the measurements and then

decide on appropriate adjusting action. However, when an emergency occurs she

has to rapidly interpret the event, diagnose the problem and take remedial action.

In both cases her training will prepare her for the correct response. However,

the patient-monitoring task is close to a skill and requires little reasoning; in

contrast, responding to an emergency requires more conscious rule- and knowledge-based reasoning [Rasmussen 1986]. Diagnosis places different cognitive demands and hence requires information displays localizing the problem, while suggesting possible causes and remedial actions.

The storyboard design that might be prompted by the application of theory-led

design is illustrated in Figure 5.

The user’s foreground task occupies most of the screen, as windows with Word

and Excel that Marilyn uses for updating her records, writing reports and planning

rosters. The monitor is placed on continuous display on the side of the screen, since

change in the background monitored system is gradual so Marilyn can sample a

patient’s state on demand. A current and recent past display enables her to glance

at the display and see if the state of any patient has changed in the recent time

period. The display maps to rooms occupied by individual patients on the ward,

with colour coding used for change in parameters: from green within normal range

to red signalling a dangerous digression. Dangerous changes trigger an audio alarm

and highlight the patient’s location. The audio alarm uses canned speech messages

 


Figure 5: Storyboard design for the patient monitoring system.

which can be tailored to reinforce the message. Below the ‘room map’ display a time

line gives the sequence of visits to check on condition, changes to medication, and

reminders when periodic checks are due.

The above design decisions were motivated by several critical aspects. Use of a

continuous display monitor was motivated by the periodic nature of the monitoring

task and the cognitive aspect of reducing working memory burden. Second, the decision of when to use an explicit alert used the embedded model to calculate the trade-off between the criticality of the event and the cost of interrupting the foreground task. Third was

how to alert the user to change; a variety of strategies were indicated linking the

strength of the stimulus to the criticality of the event. This used media design

principles as well as knowledge of language theory and selective attention. The final

design issue is how to make the transition between the foreground and background

tasks. System actions are tuned to the criticality of the monitoring task. For low-level

alerts the foreground task is automatically saved but no further action is taken

apart from displaying more information on the critical patients. For critical alarms

the work is saved, the Word and Excel windows are closed, and patient information

is displayed with appropriate treatment suggestions for the individual concerned.

In this case the system has made the transfer from supporting documentation to a

diagnosis task.
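The criticality-tuned transition behaviour just described might be sketched as follows; the action strings and the two-level alert scheme are illustrative assumptions:

```python
# Sketch: tuning system actions to alert level when transferring the user
# from the foreground (documentation) task to the monitoring/diagnosis task.
# Action names are illustrative assumptions.

def transition_actions(alert_level: str) -> list:
    if alert_level == "low":
        # Preserve the foreground task; just surface more information.
        return ["autosave foreground work",
                "show critical-patient detail"]
    if alert_level == "critical":
        # Hand the display over to the diagnosis task entirely.
        return ["save and close Word/Excel windows",
                "display patient information",
                "display treatment suggestions"]
    return []

assert transition_actions("low")[0] == "autosave foreground work"
assert len(transition_actions("critical")) == 3
assert transition_actions("none") == []
```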

4 Conclusions

This exercise in theory-led design proposed a solution via three complementary

strands for applying theory to design: internalized knowledge of simplified

psychological theory, reuse of knowledge located via mapping generic tasks


applicable to the problem, and design of reusable embedded interaction models. It

will always be difficult to separate the relative contributions to a design solution of internalized and tacitly applied theory, reusable knowledge, and external guidance and advice. To an extent I argue for both as necessary prerequisites for good design.

My quest is to provide different pathways by which knowledge from theory can

be applied to the design process. This is where I believe generic tasks and critical

aspects can play their role, first as a retrieval mechanism and secondly as a method

of structuring HCI to teach it in the first place.

It is germane to reflect whether this approach is really new. For example,

application of Ecological Interface Design [Vicente 1999] could have led to the

same metaphor domain representation of the patient monitor interface, although this

method would not have provided advice on cognitive aspects of media selection.

Claims and the task artefact cycle provide a similar access path to theory-based

knowledge, which I have proposed as a precursor to this approach [Sutcliffe 2000].

In an extended schema [Sutcliffe & Carroll 1999] claims have been linked to

generic task models and organized in taxonomies. Claims could therefore provide

design advice for specific critical aspects. The critical aspects are not dissimilar

to cognitive dimensions, and provide summaries of cognitive knowledge linked to

a particular design concern [Green & Petre 1996]. Generic task models, embedded

cognitive models and critical aspects supplement those views, but bring theory-based

knowledge closer to the designer by providing a more direct mapping to tasks and

design problems.

The example used in this paper has inevitably biased the focus towards cognitive

theory and single user systems. The impact of social theory and critical aspects for

collaborative systems has hardly been researched. For example, are there critical aspects, such as shared identity, mutual awareness, collective motivation and the glue of social relationships, that influence the success of groups? So far we have but a few heuristics to use, although a more well-formed theory, such as Small Groups as

Complex Adaptive Systems [Arrow et al. 2000; Sutcliffe 2005] could provide more

insight into theory-led design for collaborative systems.

In conclusion, theory will continue to play a vital role in design, not only in

HCI but in the wider context of technology design, as computers become part of

everyday products. The quest to translate theory into usable knowledge for design

is still underway, and in this paper I have argued for a multi-path route via reuse

of knowledge in the head and theory embedded in computational models, as well as

localizing knowledge in a task context. This grand challenge has a wide-ranging

scope as a general theoretical approach to design not only of computing-related and computer-embedded artefacts but of all designs that involve human interaction or

use. HCI could transform itself into the Science of Design by rising to this challenge.

References

Alexander, C, Ishikawa, S., Silverstein, M., Jacobson, M., Fiksdahl-King, I. & Angel, S.

[1977], A Pattern Language: Towns, Buildings, Construction, Oxford University Press.

Anderson, J. R. & Lebiere, C. [1998], Representing Cognitive Activity in Complex Tasks,

Lawrence Erlbaum Associates.

502 Alistair Sutcliffe

Arrow, H., McGrath, J. E. & Berdahl, J. L. [2000], Small Groups as Complex Systems:

Formation, Coordination, Development and Adaptation, Sage Publications.

Barnard, P., May, J., Duke, D. & Duce, D. [2000], Systems, Interactions and Macrotheory, ACM

Transactions on Computer-Human Interaction 7(2), 222-62.

Bellotti, V., Buckingham Shum, S., MacLean, A. & Hammond, N. [1995], Multidisciplinary Modelling in HCI Design: In Theory and in Practice, in I. Katz, R. Mack, L. Marks, M. B. Rosson & J. Nielsen (eds.), Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’95), ACM Press, pp.146-53.

Bertelsen, O. W. & Bødker, S. [2003], Activity Theory, in J. M. Carroll (ed.), HCI Models,

Theories and Frameworks: Toward a Multidisciplinary Science, Morgan-Kaufmann,

pp.291-324.

Blackmon, M. H., Kitajima, M. & Polson, P. G. [2003], Repairing Usability Problems Identified by the Cognitive Walkthrough for the Web, in V. Bellotti, T. Erickson, G. Cockton & P. Korhonen (eds.), Proceedings of SIGCHI Conference on Human Factors in Computing Systems (CHI’03), CHI Letters 5(1), ACM Press, pp.497-504.

Borchers, J. [2001], A Pattern Approach to Interaction Design, John Wiley & Sons.

Brooks, F. P. [2003], Three Grand Challenges for Half Century Old Computer Science,

Journal of the ACM 50(10), 25-6.

Burke, M., Gorman, N., Nilsen, E. & Hornof, A. [2004], Banner Ads Hinder Visual Search

and Are Forgotten, in E. Dykstra-Erickson & M. Tscheligi (eds.), CHI’04 Extended Abstracts

of the Conference on Human Factors in Computing Systems, ACM Press, pp.1139-42.

Card, S. K., Moran, T. P. & Newell, A. [1983], The Psychology of Human-Computer

Interaction, Lawrence Erlbaum Associates.

Carroll, J. M. [2000], Making Use: Scenario-based Design of Human-Computer

Interactions, MIT Press.

Carroll, J. M. & Rosson, M. B. [1992], Getting Around the Task-Artefact Framework: How

to Make Claims and Design by Scenario, ACM Transactions on Office Information Systems

10(2), 181-212.

Cassell, J., Bickmore, J., Billinghurst, M., Campbell, L., Chang, K., Vilhjalmsson, H. & Yan,

H. [1999], Embodiment in Conversational Interfaces: REA, in M. G. Williams & M. W.

Altom (eds.), Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: The CHI is the Limit (CHI’99), ACM Press, pp.520-7.

Chewar, C. M., McCrickard, D. S. & Sutcliffe, A. G. [2004], Unpacking Critical Parameters

for Interface Design: Evaluating Notification Systems with the IRC, in D. Gruen & I. McAra-

McWilliams (eds.), Proceedings of the Symposium on Designing Interactive Systems:

Processes, Practices, Methods and Techniques (DIS’04), ACM Press, pp.279-88.

Connell, I., Green, T. R. G. & Blandford, A. [2003], Ontological Sketch Models:

Highlighting User-system Misfits, in E. O’Neill, P. Palanque & P. Johnson (eds.), People and Computers XVII: Designing for Society (Proceedings of HCI’03), Springer-Verlag, pp.163-78.


Diaper, D. [2002], Tasks, Scenarios and Thought, Interacting with Computers 15(4), 629-38.

Dix, A., Finlay, J., Abowd, G. D. & Beale, R. [2003], Human-Computer Interaction, third

edition, Prentice-Hall.

Dix, A. & Mancini, R. [1998], Specifying History and Back Tracking Mechanisms, in

Formal Methods in Human-Computer Interaction, Formal Approaches to Computing and

Information Technology, Springer-Verlag, Chapter 1, pp. 1-23.

Dowell, J. & Long, J. B. [1998], Conception of the Cognitive Engineering Design Problem,

Ergonomics 41(2), 126-39.

Green, T. R. G. & Petre, M. [1996], Usability Analysis of Visual Programming

Environments: A ‘Cognitive Dimensions’ Framework, Journal of Visual Languages and

Computing 7(2), 131-74.

Harrison, M. D. & Barnard, P. J. [1993], On Defining the Requirements for Interaction,

in S. Fickas & A. Finkelstein (eds.), Proceedings of the IEEE International Symposium on

Requirements Engineering (RE’93), IEEE Computer Society Press, pp.50-5.

Hollnagel, E. [1998], Cognitive Reliability and Error Analysis Method (CREAM), Elsevier.

Horvitz, E., Kadie, C. M., Paek, T. & Hovel, D. [2003], Models of Attention in Computing

and Communications: From Principles to Application, Communications of the ACM 46(3), 52-9.

Ishii, H., Mazalek, A. & Lee, J. [2001], Bottles as a Minimal Interface to Access Digital

Information, in J. A. Jacko & A. Sears (eds.), CHI’01 Extended Abstracts of the Conference

on Human Factors in Computing Systems, ACM Press, pp. 187-8.

ISO [1997], ISO 9241 International Standard. Ergonomic Requirements for Office Systems

with Visual Display Terminals (VDTs). International Organization for Standardization,

Geneve, Switzerland.

ISO [1998], ISO 14915 International Standard. Multimedia User Interface Design Software

Ergonomic Requirements — Part 1: Introduction and Framework. International Organization

for Standardization, Geneve, Switzerland.

ISO [2000], ISO 14915 International Standard. Multimedia User Interface Design Software

Ergonomic Requirements — Part 3: Media Combination and Selection. International

Organization for Standardization, Geneve, Switzerland.

John, B. E. & Kieras, D. E. [1995], The GOMS Family of User Interface Analysis

Techniques: Comparison and Contrast, ACM Transactions on Computer-Human Interaction

3(4), 320-51.

Johnson, P. [1992], Human-Computer Interaction: Psychology, Task Analysis and Software

Engineering, McGraw-Hill.

Kieras, D. E. & Meyer, D. E. [1997], An Overview of the EPIC Architecture for Cognition

and Performance with Application to Human-Computer Interaction, Human-Computer

Interaction 12(4), 391-438.


Kitajima, M. & Polson, P. G. [1997], A Comprehension-based Model of Exploration,

Human-Computer Interaction 12(4), 345-89.

Long, J. & Dowell, J. [1989], Conceptions of the Discipline of HCI: Craft, Applied

Science and Engineering, in A. Sutcliffe & L. Macaulay (eds.), People and Computers V

(Proceedings of HCI’89), Cambridge University Press, pp.9-34.

Maglio, P. P. & Campbell, C. S. [2000], Trade-offs in Displaying Peripheral Information, in T. Turner & G. Szwillus (eds.), Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI’00), CHI Letters 2(1), ACM Press, pp.241-8.

McCrickard, D. S., Catrambone, R., Chewar, C. M. & Stasko, J. T. [2003], Establishing

Tradeoffs that Leverage Attention for Utility: Empirically Evaluating Information Display

in Notification Systems, International Journal of Human-Computer Studies 58(5), 547-82.

McCrickard, D. S. & Chewar, C. M. [2003], Attuning Notification Design to User Goals and

Attention Costs, Communications of the ACM 46(3), 67-72.

Nardi, B. A. (ed.) [1996], Context and Consciousness: Activity Theory and Human-Computer Interaction, MIT Press.

Newman, W. M., Taylor, A. S., Dance, C. R. & Taylor, S. A. [2000], Performance

Targets, Model and Innovation in Design of Interactive Systems, in D. Boyarski & W. A.

Kellogg (eds.), Proceedings of the Symposium on Designing Interactive Systems: Processes,

Practices, Methods and Techniques (DIS2000), ACM Press, pp.381-7.

Norman, D. A. [1986], Cognitive Engineering, in D. A. Norman & S. W. Draper (eds.), User

Centered System Design: New Perspectives on Human-Computer Interaction, Lawrence

Erlbaum Associates, pp.31-62.

Norman, D. A. [1988], The Psychology of Everyday Things, Basic Books.

Norman, D. A. [2004], Emotional Design: Why We Love (or Hate) Everyday Things, Basic

Books.

NSF [2004], Science of Design, http://www.nsf.gov/funding/pgm_summ.jsp?pims_id= 12766

(last accessed 2005-04-26). National Science Foundation: Directorate for Computer &

Information Science & Engineering.

Ortony, A., Clore, G. L. & Collins, A. [1988], The Cognitive Structure of Emotions,

Cambridge University Press.

Pirolli, P. & Card, S. [1999], Information Foraging, Psychological Review 106(4), 643-75.

Preece, J., Rogers, Y. & Sharp, H. (eds.) [2002], Interaction Design: Beyond Human-

Computer Interaction, John Wiley & Sons.

Rasmussen, J. [1986], Information Processing and Human-Machine Interaction: An

Approach to Cognitive Engineering, North-Holland.

Reason, J. [1990], Human Error, Cambridge University Press.

Shneiderman, B. & Plaisant, C. [2004], Designing the User Interface: Strategies for Effective

Interaction, fourth edition, Addison-Wesley.


Stasko, J., Miller, T., Pousman, Z., Plaue, C. & Ullah, O. [2004], Personalized Peripheral

Information Awareness through Information Art, in N. Davies, E. Mynatt & I. Siio (eds.),

UbiComp 2004: Ubiquitous Computing (Proceedings of the Sixth International Conference

on Ubiquitous Computing), Vol. 3205 of Lecture Notes in Computer Science, Springer,

pp. 18-35.

Sutcliffe, A. G. [2000], On the Effective Use and Reuse of HCI Knowledge, ACM

Transactions on Computer-Human Interaction 7(2), 197-221.

Sutcliffe, A. G. [2002], The Domain Theory: Patterns for Knowledge and Software Reuse,

Lawrence Erlbaum Associates.

Sutcliffe, A. G. [2003], Multimedia and Virtual Reality: Designing Multisensory User

Interfaces, Lawrence Erlbaum Associates.

Sutcliffe, A. G. [2005], Extending Small Group Theory for Analysing Complex Systems, in

C. W. Johnson (ed.). Proceedings of the Workshop on Complexity in Design and Engineering,

Department of Computer Science, University of Glasgow, pp. 139-48. GIST Report 2005-1.

Sutcliffe, A. G., Fickas, S., Sohlberg, M. M. & Ehlhardt, L. A. [2003], Investigating the

Usability of Assistive User Interfaces, Interacting with Computers 15(4), 577-601.

Sutcliffe, A. G. & Carroll, J. M. [1999], Designing Claims for Reuse in Interactive Systems

Design, International Journal of Human-Computer Interaction 50(3), 213-42.

Teasdale, J. D. & Barnard, P. J. [1993], Affect, Cognition and Change: Re-modelling

Depressive Thought, Lawrence Erlbaum Associates.

UKCRC [2005], Grand Challenges for Computing Research,

http://www.ukcrc.org.uk/grand_challenges/index.cfm (last accessed 2005-04-26). UK

Computing Research Committee.

Van Welie, M. [2005], Patterns in Interaction Design: Web Design Patterns,

http://www.welie.com/pattems (last accessed 2005-04-26).

Vicente, K. J. [1999], Cognitive Work Analysis: Towards Safe, Productive and Healthy

Computer-based Work, Lawrence Erlbaum Associates.

Ware, C. [2000], Information Visualization: Perception for Design, Morgan-Kaufmann.

Wickens, C. [2002], Multiple Resources and Performance Prediction, Theoretical Issues in

Ergonomics Science 3(2), 150-77.

Wright, P. C., Fields, R. E. & Harrison, M. D. [2000], Analysing Human-Computer

Interaction as Distributed Cognition: The Resources Model, Human-Computer Interaction

15(1), 1-41.


Viewpoints – Sabetzadeh, Finkelstein and Goedicke (2009)

Introduction

I have known Anthony Finkelstein on-and-off since his City days. Mostly off, it must be said, although, as I remember (I think), we were both involved some time ago now in a Joint Councils’ research initiative, or some such. These sorts of initiatives always involve much politicking and associated immodesty; but I always considered that Anthony spoke with much good sense and direction. He is one of those people whom you can hear think when they are talking (unlike the many people who talk without thinking). I followed his rapid progress on all fronts during his time at City and now at UCL. He has not peaked by a long chalk. I last bumped into him at a UCLIC prize giving.

I am keen to include on the website software engineering research which has some overlap with, and/or implications for, HCI. We have not heard the last about the relations between SE and HCI – indeed far from it. The symmetry between Cognitive Engineering and Software Engineering is a very enticing notion; but it has yet to be delivered in a combined, never mind an integrated, form. Anthony’s Viewpoints paper allows us at least to reflect on what such possibilities might be, and I am delighted to be able to include it here. Food for thought indeed.

 

Viewpoints

Mehrdad Sabetzadeh

Simula Research Laboratory

Oslo, Norway

mehrdad@simula.no

Anthony Finkelstein

University College London

London, UK

a.finkelstein@cs.ucl.ac.uk

Michael Goedicke

University of Duisburg-Essen, Campus Essen

Essen, Germany

michael.goedicke@s3.uni-due.de

November 8, 2009

Abstract

The construction of any sizable software system involves many

agents, each with their own perspective of the system being built.

Viewpoints provide a framework for guiding and managing development

in a multiple-perspective setting, where a variety of agents with

different areas of concern collaborate towards building a system. In

this article, we explain the main concepts and techniques underlying

viewpoint-based development and illustrate them using a number of

examples.

Keywords. Perspectives, Separation of Concerns, Integration, Inconsistency

Management, Merging, Parallel Composition, Weaving.

 

1 Introduction

Large-scale software development is necessarily a collaborative effort, involving

multiple agents (also called, participants or actors) with different

perspectives of the system they are trying to develop. Individual perspectives

are partial descriptions of the overall system, arising in response to the

specific responsibilities or roles assigned to the agents. These responsibilities

or roles may be organizationally-defined, be governed by the physical

distribution of agents across development sites, follow some pre-specified

structuring of the system at hand, or reflect the agents’ knowledge, experience,

and descriptive capabilities.

Inevitably, the perspectives of those involved in the development process

overlap, interact or compete with one another, giving rise to a need for

coordination. The relationships between the perspectives, however, may be

far from obvious because the agents may use different vocabularies and representation

notations to express their concerns. Further, since development

may be done concurrently, different perspectives may be at different stages

of elaboration and hence subject to different development strategies.

The problem of how to guide and manage development in this setting –

multiple agents, diverse knowledge and expertise, heterogeneous representation

notations, differing development strategies – is known as the “multiple

perspectives problem”. In this article, we introduce viewpoints – a conceptual

framework for addressing the multiple perspectives problem.

Intuitively, a viewpoint is an independent and locally-managed object

encapsulating a particular perspective. The main feature of viewpoint-based

development is its recognition of perspectives and the relationships between

them as first-class artifacts, so that perspectives can be directly defined by

users, and the relationships between them can be specified, modified, and

analyzed explicitly.

Viewpoints have often been studied as part of Requirements Engineering.

Although the multiple-perspective problem is particularly acute during

the requirements gathering stage where a diverse group of stakeholders with

different goals and needs are involved, the problem is not restricted to it. In

fact, viewpoints arise with increasing frequency in software design and implementation

as well, due to such factors as evolution, functional variability,

globalization, etc.

It is not our intention to exhaustively examine all ramifications of viewpoints

in software engineering. Our main goal is instead to describe the

key ideas that motivate the use of viewpoints and to outline the integration

activities that go hand in hand with them. To achieve our goal, we

are going to draw on a number of examples throughout the article. While

we have tried to ensure reasonable breadth in our choice of examples, we

acknowledge that these examples are not reflective of the full range of the

applications of viewpoints reported in the literature. General references for

further details about viewpoints and their broader set of applications are

provided later in the article (see Section 7).

We use software models as the primary context for our examples. This

enables us to build on the growing familiarity of the general software engineering

community with modelling languages such as the UML (Unified

Modeling Language, 2003), hence alleviating the need to provide extensive

background on the notations used.

The remainder of this article is structured as follows:

We begin in Section 2 with an example of a multiple-perspective setting.

In Section 3, we introduce viewpoints as a vehicle for capturing perspectives.

In Section 4, we discuss how different viewpoints can be related using explicit

relationships. We continue in Section 5 with reviewing the core design

principles underpinning viewpoints. In Section 6, we concentrate on the

most essential activity in viewpoint-based development, called integration,

and highlight the different facets of the problem. We conclude the article in

Section 7 with a summary and references for further reading.

 

2 An Example of Multiple Perspectives

To illustrate multiple perspectives, we use a simple example capturing some

basic aspects of a Hospital Information System (HIS). An HIS is a specialized

kind of information system designed to manage the clinical and

administrative functions of a hospital.

An HIS has to meet the needs of a variety of stakeholders. This includes

people at the management levels of the hospital, medical staff, technicians,

administrators, and so on. These people clearly have different areas

of concern, and hence different perspectives. The perspectives are usually

expressed in different ways, and capturing them often necessitates the use

of different representation notations, chosen to be particularly appropriate

to the descriptions provided by each stakeholder.

 

[Figure 1: Representative perspectives (P1–P7) in a Hospital Information System]

Figure 1 shows a few representative perspectives in an HIS. The formal

specification of perspectives as structured models is seldom done directly by

the stakeholders, but rather by the requirements and design analysts. For

simplicity, we ignore this technicality and directly associate the models to

the stakeholders.

The perspectives in Figure 1 describe an HIS along different dimensions

and at different levels of abstraction. The hospital head focuses on the high-level

objectives of the system and expresses her view as a goal model in

KAOS (van Lamsweerde, 2009) notation. In her model (P1 in the figure),

achieving patient satisfaction is declared as a top goal, and a decomposition

of this goal into more concrete and lower-level goals is provided.

The medical director uses a UML activity diagram (P2) to capture the

normal workflow for patient care at the hospital. The process begins with

diagnosis, after which a patient is classified as either an inpatient or an

outpatient. The patient then gets the proper care based on this classification

and is subsequently discharged.

The doctor and the nurse each contribute two perspectives, a UML class

diagram and a UML sequence diagram. The class diagrams (P3 and P5)

capture the concepts and associations relevant to each stakeholder, and the

sequence diagrams (P4 and P6) describe usage scenarios. The usage scenario

for the doctor (P4) concerns the routine visiting of a patient and updating

her medical record. The usage scenario given by the nurse (P6) captures

the daily check up conducted in a hospital ward, where a nurse evaluates

each patient individually, and notes her observations in the patient’s medical

chart.

Finally, the computer systems administrator, who is in charge of ensuring

the integrity and performance of the HIS, expresses a requirement that any

update made to the persistent data of the system should be logged in a file,

so that problems can be tracked in case of a system failure. This requirement

is modelled as a rewrite rule for sequence diagrams (P7). If the left hand side

of the rule (denote L) matches a fragment of a sequence diagram, then that

fragment is rewritten with the right hand side of the rule (denoted R). The

left hand side would be deemed a match if update*() (i.e. a method whose

name begins with the “update” prefix) is called on a persistent data object.

If a match is found, the sequence diagram in question will be modified as

prescribed by R (the right hand side). That is, SystemLog.append() is called

after the update to append an entry to the system log.
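The effect of rule P7 can be approximated over a simplified representation of a sequence diagram; the list-of-messages data model and the set of persistent classes below are illustrative assumptions, not the notation used in the article:

```python
# Sketch of applying the P7 rewrite rule to a sequence diagram represented
# as an ordered list of (caller, callee, method) messages. The data model
# and the 'persistent' class set are illustrative assumptions.

PERSISTENT = {"MedicalRecord", "Chart"}  # assumed persistent data objects

def apply_logging_rule(messages):
    """After any update*() call on a persistent object, insert a call
    appending an entry to the system log (the rule's right-hand side)."""
    out = []
    for caller, callee, method in messages:
        out.append((caller, callee, method))
        if callee in PERSISTENT and method.startswith("update"):
            out.append((caller, "SystemLog", "append()"))
    return out

# P4's doctor scenario: the update() on the medical record matches the
# rule's left-hand side, so a SystemLog.append() is woven in after it.
p4 = [("Doctor", "Patient", "visit()"),
      ("Doctor", "MedicalRecord", "update()")]
assert apply_logging_rule(p4)[-1] == ("Doctor", "SystemLog", "append()")
assert len(apply_logging_rule(p4)) == 3
```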

The models in our example relate to one another in many ways. Some

of these relationships are readily identifiable. For example, the objects in

P4 are instances of the classes declared in P3, and the messages in P4

are occurrences of the class methods in P3. The relationship between P5

and P6 is similar. Another obvious pair of relationships are P7,P4 and

P7,P6. The match between the left hand side of P7 and P4 is {Object1 → :Doctor, Object2 → :MedicalRecord, update*() → update()}. The P7,P6 relationship

is defined similarly.

There are also relationships whose existence is almost certain, but whose

details cannot be established with full certainty because of terminological

and structural differences between the perspectives. For example, an anal-

ysis of P3 and P5 would hint at some possible overlaps between the two

perspectives: the Patient class in P3 is likely to be the same as that in P5;

MedicalRecord in P3 is likely to correspond to Chart in P5; and the aggregation

between Patient and MedicalRecord in P3 may be an alternative representation

of the assigned to link between Patient and Chart in P5. Unless these correspondences

are explicitly validated with the stakeholders, one can never be

entirely sure about their correctness. For example, it may turn out that

the doctor’s notion of Patient encompasses both inpatients and outpatients,

while the nurse may mean only inpatients when she refers to Patient in her

models. Similarly, MedicalRecord and Chart may be distinct entities.
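Such hypothesized correspondences can be recorded with an explicit validation status, so that they are only relied upon once confirmed with the stakeholders; the tuple format and status labels below are illustrative assumptions:

```python
# Sketch: recording hypothesized correspondences between the doctor's (P3)
# and the nurse's (P5) class diagrams. Format and statuses are assumptions.

# (P3 element, P5 element, status)
correspondences = [
    ("Patient", "Patient", "hypothesized"),
    ("MedicalRecord", "Chart", "hypothesized"),
]

def validate(pairs, confirmed):
    """Mark a correspondence 'confirmed' only after stakeholders agree."""
    return [(a, b, "confirmed" if (a, b) in confirmed else status)
            for a, b, status in pairs]

# Suppose only the Patient/Patient overlap has been validated so far.
checked = validate(correspondences, {("Patient", "Patient")})
assert checked[0][2] == "confirmed"
assert checked[1][2] == "hypothesized"
```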

There are several other relationships between the perspectives that are

less obvious:

Since P3 and P5 most likely have overlaps, P4 and P6 will be related

as well, in the sense that they can access and modify shared objects.

For example, it is possible for a doctor and a nurse to both want to

update an individual patient’s chart simultaneously. Hence, P4 and

P6 may need to synchronize on accesses to shared objects.

P2 implicitly characterizes the different states that a patient goes

through at the hospital. Hence P2 would be related to the design

of the Patient class in P3 and P5.

 

[Figure 2: A bird’s eye view of the perspectives and the relationships between them]

Finally, at a higher level of abstraction, all P2–P7 contribute to the

satisfaction of the system’s main objectives, and are, directly or indirectly,

traceable to the goals expressed in P1.

In Figure 2, we have shown a bird’s eye view of the perspectives in our

example along with the relationships that capture the known or hypothesized

interlocking constraints on the structure and behaviour of the perspectives.

The use of perspectives makes it much easier to ensure that all the stakeholders are properly represented, and much less likely that information critical to the success of the system will be missed. At the same time, the flexibility to

have perspectives results in several inter-related models that need to be

maintained independently and may be inconsistent with one another.

Effective use of perspectives in system development requires a systematic

framework for managing and integrating perspectives and for resolving

their inconsistencies. Viewpoints, as we describe in the subsequent sections,

provide the foundation for such a framework.

3 Viewpoints

The basic building block for organizing and supporting multiple perspectives

is a viewpoint. A viewpoint can be thought of as a combination of the idea

of an agent, role, or knowledge source and the idea of a perspective. In

software terms, a viewpoint is a loosely coupled, locally managed object

which encapsulates a perspective about the system and domain, specified

in a particular representation notation. Each viewpoint is composed of the

following components, called slots:

a domain delineating the part of the “world” the viewpoint is concerned

with;

a representation style defining the notation used by the specification

slot (below);

a specification expressing the perspective of interest, represented in

the viewpoint’s style.

Additionally, a viewpoint may contain knowledge of the process of its

design. This knowledge is captured using the following slots:

a work plan describing the process by which the specification can be

built;

a work record giving an account of the history and current state of

development.
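The slot structure just described can be sketched as a plain record type. This is only an illustration of the viewpoint anatomy, not an API from the ViewPoints literature; all names and the example content are invented for the sketch.

```python
from dataclasses import dataclass, field

@dataclass
class Viewpoint:
    """A loosely coupled, locally managed object encapsulating one perspective."""
    domain: str                    # the part of the "world" the viewpoint concerns
    style: str                     # representation notation used by the specification
    specification: str             # the perspective of interest, in the viewpoint's style
    work_plan: str = ""            # how the specification can be built and checked
    work_record: list = field(default_factory=list)   # history and current state

# A cut-down version of the medical director's viewpoint (P2):
p2 = Viewpoint(
    domain="Patient Care",
    style="UML activity diagram",
    specification="Admission -> Diagnosis -> Treatment -> Discharge",
    work_plan="how to build UML activity diagrams and check consistency",
)
p2.work_record.append("initial draft of the admission-to-discharge flow")
```

Keeping the process knowledge (work plan and work record) alongside the specification is what lets each viewpoint be managed locally, independently of the others.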

As an example, let us consider the specification provided by the medical

director (P2) in the motivating example of the previous Section. In Figure 3,

we have shown the complete viewpoint for this specification. The representation

style of the viewpoint is UML activity diagrams, and the domain of

concern is patient care. The specification is the medical director’s current

knowledge about the procedure for processing patients at the hospital, expressed

as a UML activity diagram. The work plan explains how to build an

activity diagram and how, and in what circumstances, to check consistency

with the other viewpoints. The work record gives the current state of the

specification and an account of its development in terms of the work plan.

[Figure 3: the medical director’s viewpoint. Owner: Medical Director; domain: Patient Care; representation style: UML Activity Diagrams; specification: an activity diagram with actions Admission, Diagnosis, Treatment, Ambulatory Care, and Discharge, branching on [inpatient]/[outpatient]; work plan: how to build UML activity diagrams, and how to check consistency with other viewpoints.]

 

4 Relationships Between Viewpoints

The key feature that distinguishes viewpoint-based development from conventional

development methods is that viewpoints are built and maintained

independently of one another. This means that viewpoints are not just

projections of a common underlying specification, but rather distinct and

separately-evolving artifacts with potentially different vocabularies and frames

of reference. As a result, one cannot merely rely on conventions (e.g., name

or identifier equivalence between elements of different viewpoints) to give

the desired relationships between the viewpoints. Instead, one needs to explicitly

specify these relationships. Doing so is a critical prerequisite for any

meaningful integration of the viewpoints, and for ensuring that the impact

of changes made to the viewpoints can be properly analyzed, scoped, and

propagated.

A relationship typically refers to an (explicit) mapping between the contents

or interfaces of the specification slots in different viewpoints. In a

broader sense, a relationship may also encompass the “organizational structure”

of the development team (i.e., the owners of the viewpoints). In this

article, we shall deal only with the former type of relationships, i.e., those

between specifications.

There is no general rule as to what constitutes an inter-viewpoint relationship.

This depends on a variety of factors, most importantly the semantics

of the viewpoints involved, the stage of development the viewpoints are

in, and the degree of detail one wishes to capture in a relationship.

In the remainder of this section, we illustrate some common relationships

between specifications expressed as models. There is no restriction on the

number of specifications an individual relationship can connect, but usually,

relationships are defined pairwise, i.e., they are between two specifications.

Our example relationships, shown in Figure 4, are all pairwise.

In Figure 4(a), we depict the relationship (denoted R) between the perspectives

contributed by the doctor (previously shown in Figure 1). The relationship

specifies how the objects in the sequence diagram (V2) correspond

to the classes in the class diagram (V1) through “instance of” mappings, and

how the interaction messages in V2 correspond to the methods in V1 through

“occurrence of” mappings. If the two diagrams are assumed to be views on

a monolithic model of the system, this relationship is derivable from the

underlying model and can hence be left implicit. But as we stated earlier,

such an assumption is typically not made in viewpoint-based approaches,

and as a result, the relationships are always specified explicitly.

In Figure 4(b), the specifications involved are in different abstraction

layers. V1 is the extends–implements fragment of the meta-model for Java class

diagrams, and V2 is an instance class diagram conforming to the meta-model.

The relationship between these specifications is a type function mapping

each element of V2 to an element of V1. The relationship respects the structure

of the specifications, in the sense that if it maps an edge e of V2 to

an edge u of V1, it will also map the endpoints of e to those of u. Such

preservation of structure ensures that the edges in V2 are well-typed.
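As a sketch of that preservation condition, suppose both models are encoded as dicts of edges (edge id to source/target pair) and the type function as a dict. The encoding and the element names are assumptions for illustration, not taken from the article’s figures.

```python
def well_typed(instance_edges, meta_edges, typ):
    """Structure preservation: if instance edge e of V2 is typed by meta
    edge typ[e] of V1, then e's endpoints must be typed by the endpoints
    of typ[e]."""
    for e, (src, tgt) in instance_edges.items():
        meta_src, meta_tgt = meta_edges[typ[e]]
        if typ[src] != meta_src or typ[tgt] != meta_tgt:
            return False
    return True

# Meta-model fragment (V1): a Class may extend a Class.
meta = {"extends": ("Class", "Class")}
# Instance diagram (V2): Nurse extends Employee, both typed as Class.
inst = {"e1": ("Nurse", "Employee")}
typ = {"e1": "extends", "Nurse": "Class", "Employee": "Class"}
assert well_typed(inst, meta, typ)
```

Mis-typing either endpoint of the instance edge would make the check fail, which is exactly the well-typedness guarantee described above.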

[Figure 4: example relationships between viewpoint specifications, parts (a)–(e)]

 

Our next example, in Figure 4(c), shows the relationship between two

perspectives on the data schema of the hospital’s payroll subsystem. Both

perspectives are described as entity-relationship diagrams. The vocabularies

used by the perspectives differ, but there are conceptual overlaps between

the perspectives. These overlaps are specified by a mapping that equates

the corresponding elements. The relationship in Figure 4(c) states that the

entity referred to as StaffMember in V1 is the same as Employee in V2. Further,

the relationship declares the name attribute of these two entities (as well as

the edges linking name to the respective entities) as being equivalent. For

a problem of this size, the relationship may be created by hand, but larger

problems require automation, usually achieved through heuristic techniques

for finding concept matches. These techniques are almost always inexact,

i.e., they may yield matches that are incorrect, or they may miss correct

matches. Despite this, the techniques can be of significant assistance to a

human expert for exploring the relationships between different stakeholder

vocabularies.
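A minimal illustration of such a heuristic is to propose matches based on name similarity alone. Real schema matchers combine many more signals (structure, synonyms, instance data); the 0.6 threshold below is an arbitrary assumption, and the example deliberately exhibits a missed correct match.

```python
from difflib import SequenceMatcher

def candidate_matches(elems1, elems2, threshold=0.6):
    """Propose (e1, e2, score) tuples whose names look similar; inexact by design."""
    pairs = []
    for a in elems1:
        for b in elems2:
            score = SequenceMatcher(None, a.lower(), b.lower()).ratio()
            if score >= threshold:
                pairs.append((a, b, round(score, 2)))
    return pairs

# StaffMember and Employee are conceptually equal but lexically dissimilar, so
# a purely name-based heuristic misses them -- a human expert stays in the loop.
matches = candidate_matches(["StaffMember", "name"], ["Employee", "name"])
assert matches == [("name", "name", 1.0)]
```

The false negative here (StaffMember/Employee) is exactly the kind of miss the article warns about: heuristic techniques assist exploration, but do not replace the expert.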

The relationships we talked about so far were expressed as collections of

tuples of the form (e1, e2) where e1 and e2 are elements of different specifications.

While tuple-based techniques are very common and widely used,

they are not the only possible way of specifying inter-viewpoint

relationships. For example, a relationship may be expressed declaratively using

logical formulas. An example is given below:

In Figure 4(d), V1 and V2 are two alternative ways of expressing the

concept of a person along with their gender. V1 models gender as an attribute

of the Person class, whereas V2 models gender through sub-classing.

This structural discrepancy between the viewpoints is more conveniently

expressed using first-order logic formulas than tuples of mapped elements.

The formulas shown in the figure specify how Female and Male in V1 relate

to Person in V2. To avoid ambiguity, predicate names in the formulas

are subscripted with a number denoting the model of origin. For example,

Person1 refers to Person in V1.
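To make the idea concrete without reproducing the figure, one plausible form of such a correspondence is: x is a Female in V2 exactly when x is a Person in V1 whose gender attribute is female (and symmetrically for Male). The toy instance data below is invented for illustration.

```python
# V1 encodes gender as an attribute of Person; V2 encodes it by sub-classing.
v1_persons = {"alice": "female", "bob": "male"}      # Person name -> gender attribute
v2_classes = {"Female": {"alice"}, "Male": {"bob"}}  # subclass name -> extension

def consistent(v1, v2):
    """Check the hypothesized correspondence between the two encodings."""
    female_v1 = {p for p, g in v1.items() if g == "female"}
    male_v1 = {p for p, g in v1.items() if g == "male"}
    return v2["Female"] == female_v1 and v2["Male"] == male_v1

assert consistent(v1_persons, v2_classes)
```

This kind of declarative, formula-style relationship is more convenient than tuples precisely because the two models have no element-to-element correspondence: an attribute value on one side corresponds to class membership on the other.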

As a final illustration, we consider, in Figure 4(e), the relationship between

viewpoints capturing different components of a system. In contrast

to all our previous examples, the relationship in Figure 4(e) is not between

the contents of the viewpoints involved, but rather between their interfaces

(viewpoint contents are not shown). Here, V1 denotes a user process that

accesses shared resources during its execution, and V2 is one such resource.

The relationship between V1 and V2 describes how the two components are

supposed to bind together, hence establishing a communication channel between

them. More specifically, the relationship requires that the acquire (resp.

release) action from V1 should synchronize with lock (resp. unlock) from V2.

5 Principles Underlying Viewpoints

The use of viewpoints in software engineering is motivated primarily by

the separation of concerns principle – the observation that, for a large and

complex system, it is more effective to build several partial specifications

that focus on individual concerns, rather than to attempt to construct a

single monolithic specification for the whole system.

Separation of concerns may be carried out along different dimensions.

For example, in the motivating problem described in Section 2, the areas of

concern were defined along the organizational roles of the involved participants

(hospital head, doctors, nurses, etc.). A finer-grained separation could

result in developing independent viewpoints for every relevant participant,

e.g., individual doctors and nurses.

The factors influencing the choice of dimensions along which viewpoints

are built are many. They include, among others, the nature of the problem

at hand, organizational considerations, the degree of involvement of the

stakeholders, component architecture of the system, the development teams’

dynamics, and policies for future maintenance of the system.

Separation of concerns naturally leads to the related principle of

heterogeneity of representation. Viewpoints reinforce this principle by allowing

participants to choose the notation in which they want to describe their perspectives.

The choice of notation for each viewpoint is recorded explicitly

in the representation style slot of that viewpoint.

The next major principle underlying viewpoints is decentralization – the

idea that the knowledge within each viewpoint is maintained and manipulated

locally. Decentralization moves away from any notion of a monolithic

system specification that can be managed globally. Such flexibility becomes

particularly crucial for development teams that are spread across multiple

geographical sites. Decentralization makes it possible for the members of

these teams to function independently without having to constantly coordinate

their work with one another.

A consequence of having multiple viewpoints in a project is the inevitable

inconsistencies that arise between them. The approach taken by viewpoints

is that of living with inconsistency. In other words, there is no requirement

that a set of viewpoints should always be consistent. This stance towards

inconsistencies draws on two important observations: Firstly, immediate

resolution of inconsistencies can be highly intrusive, particularly in early

stages of development, e.g., the requirements elicitation phase, where ambiguities

and conflicts tend to occur too frequently. Having to address every

inconsistency as soon as it arises can adversely affect productivity.

Secondly, maintaining consistency at all times can lead to premature

commitment to decisions that have not yet been sufficiently analyzed. For

example, resolving an inconsistency identified at the requirements stage may

require information from the detailed design stage which might not have yet

even started. Hence, any attempt to fix the inconsistency early on may fail

to amend the problem, and can in fact even worsen it.

Obviously, living with inconsistency should not be viewed as suggesting

living with defects in the final system. What this principle does promote is

the flexibility to deal with each inconsistency at the right time.

6 Viewpoint Integration

Viewpoint integration is primarily concerned with ensuring that a set of

viewpoints fit together in a consistent way while retaining their original

structure. In this sense, viewpoint integration is often synonymous with inconsistency

management. Despite the general desire to maintain viewpoints

as independent objects, there are situations where a collection of viewpoints

need to be combined into a single larger specification, e.g., to gain a unified

perspective, to build a complete blueprint for the system, or to better explore

the interactions among viewpoints. To this end, viewpoint integration

may further encompass three other activities, called merging, parallel composition

and weaving. We briefly explain and exemplify each of these four

facets of integration.

6.1 Inconsistency Management

Consistency is usually defined using inter-viewpoint rules expressing the

desired constraints that must hold between viewpoints. As an example,

a consistency rule that one might want to express over the viewpoints in

Figure 4(a) is that each message in the sequence diagram has a corresponding

method in the class diagram. This is captured by the following rule:

Φ = ∀ msg, obj, cls · Target(msg, obj) ∧ R(obj, cls) ⇒

∃ mtd · MethodOf(mtd, cls) ∧ R(msg, mtd).

In the above formula, R(x, y) holds if x and y are related by an inter-viewpoint

relationship (see the relationship R in Figure 4(a)); Target(x, y)

holds if message x is invoked on object y in the sequence diagram; and

MethodOf(x, y) holds if x is a method of class y in the class diagram. In this

example, the viewpoints satisfy Φ; but if we, say, remove the getRecord()

method from the Patient class in V1, the rule would no longer hold.
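Once the diagram contents and the relationship R are available as data, a rule of this form can be checked mechanically. The sketch below uses invented element names in the spirit of Figure 4(a); it is not the notation of any particular consistency-checking tool.

```python
def satisfies_phi(targets, methods, R):
    """targets: (msg, obj) pairs from the sequence diagram.
    methods: (mtd, cls) pairs from the class diagram.
    R: the inter-viewpoint relationship, a set of (x, y) tuples."""
    for msg, obj in targets:
        for o, cls in R:
            if o != obj:
                continue
            # The rule demands some method of cls that the message maps to.
            if not any((mtd, cls) in methods and (msg, mtd) in R
                       for mtd, c in methods if c == cls):
                return False
    return True

targets = {("getRecord()", "p:Patient")}          # message invoked on an object
methods = {("getRecord()", "Patient")}            # method declared on a class
R = {("p:Patient", "Patient"),                    # "instance of" mapping
     ("getRecord()", "getRecord()")}              # "occurrence of" mapping
assert satisfies_phi(targets, methods, R)
# Removing the method from the class diagram violates the rule:
assert not satisfies_phi(targets, set(), R)
```

The second assertion reproduces the scenario described in the text: deleting getRecord() from the Patient class makes the rule fail.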

Inconsistency management is the process of handling the potential plethora

of consistency rules, an example of which was shown above, and dealing with

the viewpoints when the rules fail to hold. Inconsistency management involves

several steps:

Consistency checking, focusing on monitoring, detecting, and reporting

of consistency rule violations.

Inconsistency classification, focusing on identifying the kinds of inconsistencies

detected. Inconsistencies may be classified according to their

causes, or according to pre-defined kinds prescribed by developers.

Inconsistency handling, focusing on acting in the presence of inconsistencies.

For example, when an inconsistency is detected, the appropriate

response may be to ignore, tolerate, resolve, or circumvent

it.

Inconsistency measurement, focusing on calculating the impact of inconsistencies

on different aspects of development, and prioritizing the

inconsistencies according to the severity of their impact. The actions

taken to handle an inconsistency typically depend on the results of

inconsistency measurement.
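The four steps can be pictured as a small triage pipeline over detected rule violations. The classification, severity scores, and handling policies below are invented purely to show the flow from detection through measurement to handling.

```python
def manage(violations, classify, severity, handlers):
    """Classify each detected violation, measure its impact, and choose an
    action; returns (violation, kind, score, action) tuples, worst first."""
    triaged = [(v, classify(v), severity(classify(v)), handlers[classify(v)])
               for v in violations]
    return sorted(triaged, key=lambda t: -t[2])

result = manage(
    ["StaffMember vs Employee naming clash", "message without a method"],
    classify=lambda v: "naming" if "naming" in v else "behavioural",
    severity=lambda kind: {"naming": 1, "behavioural": 3}[kind],
    handlers={"naming": "tolerate", "behavioural": "resolve"},
)
assert [t[3] for t in result] == ["resolve", "tolerate"]
```

Note how measurement drives handling, as in the text: the high-impact behavioural inconsistency is resolved, while the low-impact naming clash is merely tolerated for now.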

 

[Figure 5: the merge of the viewpoints of Figure 4(c)]

 

 

6.2 Merging of Viewpoints

The main goal of merging is unification of overlaps between viewpoints.

More precisely, given a set of viewpoints, merge combines the viewpoints

together in a way that only one copy of the overlapping parts is included in

the result. This property is known as non-redundancy.

To illustrate merge, consider the two viewpoints in Figure 4(c) and the

relationship that specifies their overlaps. Figure 5 shows the merge of these

viewpoints (with respect to the relationship between them). This merge is

redundancy-free because it has only one copy of the common parts. This

ensures that database instances created over the merged schema will not

have data redundancies.
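For the name-equivalence case, non-redundant merging can be sketched by mapping each element of one schema to its equivalent (if any) in the other before taking the union. Element names follow Figure 4(c); the set-based encoding of schemas is an assumption for the sketch.

```python
def merge(elems1, elems2, equiv):
    """equiv maps elements of the second schema to their equivalents in the
    first; overlapping parts therefore appear only once in the result."""
    return set(elems1) | {equiv.get(e, e) for e in elems2}

v1 = {"StaffMember", "name"}
v2 = {"Employee", "name", "salary"}
equiv = {"Employee": "StaffMember", "name": "name"}
merged = merge(v1, v2, equiv)
assert merged == {"StaffMember", "name", "salary"}   # one copy of the overlap
```

Because Employee collapses onto StaffMember and the shared name attribute appears once, the result satisfies the non-redundancy property described above.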

Non-redundancy is only the most basic requirement for merge. Viewpoint

merging is often subject to additional correctness criteria. The most

notable of these criteria are:

Completeness: Merge should not lose information, i.e., it should represent

all the source viewpoints completely.

Minimality: Merge should not introduce information that is not present

in or implied by the source viewpoints.

Semantic Preservation: Merge should support the expression and preservation

of semantic properties. For example, if viewpoints are expressed

as state machines, one may want to preserve their temporal behaviours

to ensure that the merge properly captures the intended meaning of

the source viewpoints.

Totality: Merge should be well-defined for any set of viewpoints,

whether or not they are consistent. This property is of particular

importance if one wants to tolerate inconsistency between the source

viewpoints.

These additional criteria are not universal to all model merging problems.

For example, completeness and minimality, in the strong sense described

above, may be undesirable if viewpoint merging involves conflict resolution

as well, in which case the final merge can potentially add or delete information.

Semantic preservation may be undesirable when one wants to induce a

design drift or perform an abstraction during merge. Such manipulations are

usually not semantics-preserving. And, totality may be undesirable when

the source viewpoints are expected to seamlessly fit together. In such a case,

the source viewpoints should be made consistent before they are merged.

6.3 Parallel Composition of Behavioural Viewpoints

Parallel composition refers to the process of assembling a set of behavioural

viewpoints that capture different components of a system, and verifying that

the result fulfills the properties of interest.

In contrast to merge, the inter-viewpoint relationships built for parallel

composition are between the interfaces of the viewpoints, and not between

their internal contents. As such, the line of sight of the relationships into

the viewpoints is limited to the interfaces that the viewpoints expose to the

outside world. A second difference between parallel composition and merge

is that, in parallel composition, multiple copies of the same viewpoint may

be participating to denote the fact that the system has multiple components

of the same type.

Figure 6 shows a simple component diagram, expressed in the visual

modelling notation of the Darwin architectural language (Magee et al.,

1995).

[Figure 6: a component diagram of the three concurrent hospital processes]

The diagram describes the interconnections between three concurrent processes in the hospital system: C1, the process taken by a doctor to

add a prescription to a patient’s chart; C2, the process taken by a nurse to

update a patient’s chart after a checkup; and C3, the process representing a

patient’s chart as a resource. The semantics of the relationships between the

processes in Figure 6 is the same as that of the relationship R in Figure 4(e),

discussed earlier.

The behaviours of C1–C3 are shown in Figures 7(a)–(c), respectively.

The notation used to describe these behaviours is Labelled Transition Systems

(LTSs) (Milner, 1989). To make sure that patients’ charts are protected

against concurrent changes, accesses to C3 by C1 and C2 need to be mutually

exclusive.

The parallel composition of C1–C3 is shown in Figure 7(d). We have

used the LTS Analyzer (LTSA) tool (Magee & Kramer, 2006) for expressing

and computing the parallel composition.

[Figure 7: the behaviours of C1–C3, their parallel composition, and the LTSA specification]

In Figure 7(e), we show the specification of the processes and their parallel composition in the input language

of the LTSA tool.

As seen from Figure 7(d), an update made by the doctor is preceded

by doctor.chart.lock (i.e., doctor’s request to acquire access to the chart), and

followed by doctor.chart.unlock (i.e., doctor’s request to release the chart). The

same is true for the nurse: her update is preceded by nurse.chart.lock and

followed by nurse.chart.unlock. By computing a parallel composition like this,

one can verify that the updates are mutually exclusive.
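The construction behind such a verification can be approximated in a few lines: components move jointly on shared labels and interleave on the rest. The two toy processes below are far simpler than the LTSA models of Figure 7, and all state and label names are invented for the sketch.

```python
def compose(lts1, lts2):
    """lts = (initial_state, set of (state, label, next_state) transitions).
    Labels common to both components synchronize; all others interleave."""
    (i1, t1), (i2, t2) = lts1, lts2
    shared = {l for _, l, _ in t1} & {l for _, l, _ in t2}
    init = (i1, i2)
    trans, seen, frontier = set(), {init}, [init]
    while frontier:
        s1, s2 = frontier.pop()
        moves = [(l, (b, s2)) for a, l, b in t1 if a == s1 and l not in shared]
        moves += [(l, (s1, b)) for a, l, b in t2 if a == s2 and l not in shared]
        moves += [(l, (b1, b2)) for a1, l, b1 in t1 if a1 == s1 and l in shared
                  for a2, l2, b2 in t2 if a2 == s2 and l2 == l]
        for l, nxt in moves:
            trans.add(((s1, s2), l, nxt))
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return init, trans

# The doctor must hold the chart's lock around the update; the chart itself
# only permits lock/unlock cycles.
doctor = (0, {(0, "lock", 1), (1, "update", 2), (2, "unlock", 0)})
chart = (0, {(0, "lock", 1), (1, "unlock", 0)})
init, trans = compose(doctor, chart)
assert ((1, 1), "update", (2, 1)) in trans   # update happens only while locked
```

Exploring the composed state space shows that every update transition occurs between a lock and an unlock, which is the mutual-exclusion property the text verifies with LTSA.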

6.4 Weaving of Aspectual Viewpoints

Weaving is the predominant notion of integration in Aspect-Oriented Software

Development (AOSD) (Filman et al., 2004). AOSD is an attempt to

provide better functional decomposition in complex systems. This can in

part be achieved by means of encapsulating different concerns into distinct

system components. However, some concerns defy encapsulation as they

cut across many parts of the system. A classic example of a cross-cutting

concern is logging, which affects all logged activities in a system. The perspective

of the Computer Systems Administrator (P7) in Figure 1 illustrated

this concern for sequence diagrams.

Given a set of cross-cutting concerns and a base system, weaving is

the process of incorporating the concerns into the base system. If no base

system is provided, weaving would refer to the process of incorporating the

given concerns into one another. Weaving operators may be implemented in

various ways depending on the nature of the base system and the concerns

involved, and whether weaving is performed statically (at compile time) or

dynamically (at runtime).

Aspect-oriented languages usually come equipped with built-in constructs

for defining the weaving operator. For example, aspect-oriented programming

languages provide pointcut constructs by which programmers specify

where and when additional code (i.e., an aspect) should be executed in place

of or in addition to an already-defined behaviour (i.e., the base program).

In aspect-oriented modelling, weaving is usually defined by patterns, to be

chosen either manually or automatically using pattern matching techniques.
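As a rough code analogue of pointcuts and advice, a logging concern can be woven into a set of base operations by wrapping every operation whose name matches the pointcut. The operation names and the pointcut predicate here are invented; real aspect languages provide far richer pattern-matching machinery.

```python
log = []

def update_chart(entry):
    return f"chart += {entry}"

def read_clock():
    return "09:00"

def logged(fn):
    """Advice: record the call, then run the original behaviour."""
    def wrapper(*args):
        log.append(fn.__name__)
        return fn(*args)
    return wrapper

def weave(base_ops, pointcut, advice):
    """Static weaving: return new operations with the advice applied
    wherever the pointcut matches; the base operations are untouched."""
    return {f.__name__: (advice(f) if pointcut(f.__name__) else f)
            for f in base_ops}

ops = weave([update_chart, read_clock],
            pointcut=lambda name: "chart" in name, advice=logged)
ops["update_chart"]("penicillin")
ops["read_clock"]()
assert log == ["update_chart"]   # only the chart operation was logged
```

The base operations never mention logging; the cross-cutting concern lives entirely in the advice and is injected by the weaver, which is the essence of the aspect-oriented decomposition described above.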

Similar to other integration activities, weaving may result in undesirable

side effects. Thus, automated analysis techniques may be required to ensure

that the result of weaving satisfies the desired correctness properties.

As an example, Figure 8 shows the result of weaving P7 in Figure 1

into P4 and P6 in the same figure. For this problem, weaving is performed

statically, and the weaving operator can be most conveniently implemented

using graph rewriting.

7 Conclusion and Further Reading

In this article, we gave a brief introduction to viewpoints as a vehicle for

addressing the multiple perspectives problem.

[Figure 8: the result of weaving P7 into P4 and P6]

We discussed the basic concepts involved in building software systems as a collection of inter-related

viewpoints, and explained the core integration activities associated with this

kind of development.

The principles upon which viewpoints are founded continue to be valuable

and relevant in many areas of software engineering today. In particular,

several major challenges faced in the areas of global software engineering

(Herbsleb, 2007), and model-based development (France & Rumpe, 2007)

are intimately related to the problems that have been studied under the

umbrella of viewpoints for the past two decades. The thread that ties these

areas to viewpoints is the pervasive need to manipulate systems made up

of many tangled artifacts, with potential inconsistencies in their design and

usage. This is indeed what viewpoints research has been aimed at all along.

In the light of the arguments presented in this article, viewpoint concepts

seem to be naturally applicable for expressing many properties and

structures in software development processes. Thus, the question arises as to why

no production-quality tools incorporating all the core viewpoint principles

have been developed to date. There are a number of tools with support

for multiple development views, e.g. UML-based tools; however, these tools

do not implement viewpoints in the sense we described here.

Viewpoints have inherent properties that complicate their implementation.

In particular, implementing the various operators for matching

partial structures and creating and maintaining connections between

viewpoints is a challenging task. This is mainly due to the complexity of

the graph algorithms involved in the operators, and also to the potential

existence of structural and terminological mismatches between viewpoints,

which makes establishing precise inter-viewpoint relationships difficult.

In addition, it appears that the decentralized nature of viewpoints is

not fully compatible with existing project management practices.

Particularly, for project management purposes, one would often like to define

milestones and demand progress reports from the analysts and developers.

Distributing the work across multiple viewpoints means that a distributed

consensus has to be reached if something like a milestone is required, but arriving

at a consensus in a distributed setting is not always straightforward.

Another property of distributed systems applies here as well: the difficulty

of building a global state. In the case of viewpoints, the notion of

an overarching document containing all specification information in a linear

form is lacking. Of course, the local view expressed by a viewpoint

and the connections to other viewpoints are a perfect basis for a hypertext

document, but this does not automatically ensure that the viewpoints fit together

properly and that they are consistent with one another. Developing

a more comprehensive and usable suite of tools based on viewpoints requires

first addressing the challenges discussed above. These challenges map out a

vision and an agenda for future research on viewpoints.

Numerous papers are available on viewpoints for further reading. Providing

links to all these papers is not possible here due to space limitations.

Below, we outline the expository and technical references used as the basis

for this article. These references together provide an extensive body of

bibliographic information on viewpoints.

Specification of viewpoints and their relationships. Several frameworks

for specifying and inter-relating viewpoints in requirements engineering

are surveyed in (Darke & Shanks, 1996). A more up-to-date bibliography

of viewpoints in requirements engineering is maintained by J. Leite at

http://www.requirementsviewpoints.blogspot.com/.

To read more about the applications of viewpoints in the broader context

of software engineering, one can refer to (Spanoudakis et al., 1996) where a

diverse agenda on viewpoints is provided.

The detailed anatomy of viewpoints and inter-viewpoint relationships,

as described in this article, is based on work on the ViewPoints framework

(Finkelstein et al., 1992; Nuseibeh et al., 1994; Nuseibeh et al., 2003). This

framework is largely attributed as having set forth the principal ideas of

viewpoint-based development as they exist today.

Inconsistency management. Several inconsistency management approaches

have been proposed, generally building on the success of the ViewPoints framework.

The main questions in this work centre on appropriate notations for

expressing consistency rules, and automated support for monitoring, diagnosis,

and resolution of inconsistencies. A direct successor to the ViewPoints

framework is xlinkit – a lightweight application service that provides rule-based

consistency checking and diagnostics generation for distributed web

content. This tool has been described in (Nentwich et al., 2003).

An overview of inconsistency management activities for viewpoints is

given in (Nuseibeh et al., 2000). Our discussion in Section 6.1 was based

on this reference. A detailed survey of existing inconsistency management

techniques is available in (Spanoudakis & Zisman, 2001).

Viewpoint merging. Several papers study merging of partial viewpoints

in specific domains including database schemata, requirements models, state

machines, and web ontologies. A survey of recent approaches to viewpoint

merging is available in (Sabetzadeh, 2008). The description in Section 6.2

was based on this reference.

Parallel composition. Parallel composition is a well-studied notion for

combining the behaviours of distributed system components, and has been

formally characterized for a variety of behavioural formalisms. Further detail

on analysis tools for concurrent distributed systems is available in (Magee

& Kramer, 2006).

Aspect weaving. Aspect-oriented concepts were originally coined for programming

languages, e.g. see AspectJ (Kiczales et al., 2001). Recently,

these concepts have been adapted to software models and applied to software

requirements and design viewpoints. The treatment of cross-cutting

viewpoints in Section 6.4 is similar to the approach of (Whittle et al., 2007).

For further references on aspect-oriented modelling and aspect weaving,

see (France et al., 2007; Morin et al., 2009).

Acknowledgments. The description of viewpoints in this article is based

on the ideas developed in the ViewPoints framework (Finkelstein et al.,

1992), to which many colleagues have contributed over the years. In particular,

we wish to acknowledge Bashar Nuseibeh and Jeff Kramer for their

significant contributions. We thank Steve Easterbrook, Shiva Nejati, and

Marsha Chechik for providing valuable comments on earlier versions of this

work. The first author acknowledges financial support from the Natural

Sciences and Engineering Research Council of Canada (NSERC), and is

grateful to University College London for hosting a visit during which this

article was shaped.

References

Darke, P., & Shanks, G. 1996. Stakeholder viewpoints in requirements

definition: A framework for understanding viewpoint development approaches.

Requirements Engineering, 1(2), 88–105.

Filman, R., Elrad, T., Clarke, S., & Aksit, M. 2004. Aspect-Oriented Soft-

ware Development. Reading, USA: Addison-Wesley.

Finkelstein, A., Kramer, J., Nuseibeh, B., Finkelstein, L., & Goedicke, M.

1992. Viewpoints: A Framework for Integrating Multiple Perspectives

in System Development. International Journal of Software Engineering

and Knowledge Engineering, 2(1), 31–58.

France, R., & Rumpe, B. 2007. Model-Driven Development of Complex

Software: A Research Roadmap. Pages 37–55 of: Future of Software

Engineering Track of the 29th International Conference on Software

Engineering.

France, R., Fleurey, F., Reddy, R., Baudry, B., & Ghosh, S. 2007. Providing

Support for Model Composition in Metamodels. Pages 253–266

of: Proceedings of the 11th IEEE International Enterprise Distributed

Object Computing Conference.

Herbsleb, J. 2007. Global Software Engineering: The Future of Socio-

Technical Coordination. Pages 188–198 of: Future of Software Engineering

Track of the 29th International Conference on Software Engineering.

Kiczales, G., Hilsdale, E., Hugunin, J., Kersten, M., Palm, J., & Griswold,

W. 2001. An Overview of AspectJ. Pages 327–353 of: Proceedings of

the 15th European Conference on Object-Oriented Programming.

Magee, J., & Kramer, J. 2006. Concurrency: State models and Java Programming:

2nd Edition. Wiley.

Magee, J., Dulay, N., Eisenbach, S., & Kramer, J. 1995. Specifying Distributed

Software Architectures. Pages 137–153 of: Proceedings of the

5th European Software Engineering Conference.

Milner, R. 1989. Communication and Concurrency. New York, USA:

Prentice-Hall.

Morin, B., Barais, O., Nain, G., & Jézéquel, J. 2009. Taming Dynamically

Adaptive Systems using Models and Aspects. Pages 122–132 of: Proceedings

of the 31st International Conference on Software Engineering.

Nentwich, C., Emmerich, W., Finkelstein, A., & Ellmer, E. 2003. Flexible

Consistency Checking. ACM Transactions on Software Engineering and

Methodology, 12(1), 28–63.

Nuseibeh, B., Kramer, J., & Finkelstein, A. 1994. A Framework for Expressing

the Relationships Between Multiple Views in Requirements Specification.

IEEE Transactions on Software Engineering, 20(10), 760–773.

Nuseibeh, B., Easterbrook, S., & Russo, A. 2000. Leveraging Inconsistency

in Software Development. IEEE Computer, 33(4), 24–29.

Nuseibeh, B., Kramer, J., & Finkelstein, A. 2003. ViewPoints: Meaningful

Relationships Are Difficult! Pages 676–683 of: Proceedings of the 25th

International Conference on Software Engineering.

Sabetzadeh, M. 2008. Merging and Consistency Checking of Distributed

Models. Ph.D. thesis, University of Toronto.

Spanoudakis, G., & Zisman, A. 2001. Handbook of Software Engineering and

Knowledge Engineering. World Scientific. Chap. Inconsistency management

in software engineering: Survey and open research issues, pages

329–380.

Spanoudakis, G., Finkelstein, A., & Emmerich, W. (eds). 1996. Viewpoints

96: International Workshop on Multiple Perspectives in Software Development.

Unified Modeling Language. 2003. The Unified Modeling Language.

http://www.uml.org/.

van Lamsweerde, A. 2009. Requirements Engineering: From System Goals

to UML Models to Software Specifications. Wiley.

Whittle, J., Moreira, A., Ara´ujo, J., Jayaraman, P., Elkhodary, A., & Rabbi,

R. 2007. An Expressive Aspect Composition Language for UML State

Diagrams. Pages 514–528 of: Proceedings of the 10th International

Conference on Model Driven Engineering Languages and Systems.

 

Bridging between Basic Theories and the Artifacts of Human-Computer Interaction – Phil Barnard (1991)


Introduction to Phil Barnard

Phil Barnard, of course, needs no introduction to the HCI community. His place in (the Valhalla of) HCI is assured (ahead of time) by his work on Interacting Cognitive Sub-systems (ICS), slayer of the Single Channel Model (SCM). More personally, however, I first met Phil at the Medical Research Council’s Applied Psychology Unit at Cambridge in the early seventies (OK, about 40 or so years ago). We were both studying for a PhD – Phil under Phil Johnson-Laird and myself under Donald Broadbent (see Long, 2010 – Some Celebratory Remarks… and here – Section 3, EU Student Reflections 1979/80). No short straws there, at least as far as concerns the supervisors. John Morton, Phil and I (in that order) started the MRC APU HCI Research Group and, along with others, proposed and pursued an applied psychology approach to HCI (see Interacting with the Computer: a Framework).

I have known Phil, both as a colleague and as a friend, since that time. Phil claimed recently (yes, Phil, John and I still meet for lunch) that I have the lowest threshold for interrupting people of anyone he knows and indeed has ever known. My counterclaim is that, once ‘launched’, he has the highest threshold for being interrupted. Obviously, we were made for each other (although this was never made clear to Doris and Geraldine), and this has led, over the years, to (sometimes interrupted) lively and productive exchanges. A proper introduction to Phil as a colleague and as a friend would require a book, and this is neither the time nor the place.

Let’s give Phil the last word. Here is what he wrote in Interfaces (2010) on my retirement from UCL:

‘In summing up and passing judgement on John’s career in HCI, I could easily generate a list of several hundred positive memories, comments and analyses.

But I am simply not going to do that.

He would, of course, question the memories, deconstruct the comments, dispute the analyses and appeal any overall judgement.

That is precisely why it has been so cool to know him as a colleague, to count on him as a friend and to have had so much fun with him both at work and outside of it over the last 37 years…..’

Although describing me, it is indeed Phil’s last word, because in describing me he implies himself (and, if I have to admit it, on balance, he is right, as concerns me).

(On the other hand, ‘several hundred’ (?); ‘question/deconstruct/dispute/appeal’; judgement; cool; fun – moi (?)………).

Who wears the wig in our CHI team?

 

Introduction to the paper

I studied Psychology at Hull University (1967-70), learned Applied Psychology at the MRC, APU, Cambridge (1970-79) and was appointed Director of the Ergonomics Unit/UCL (1979-01). What to make of it all, I asked myself?

The best sense that I could make of it at the time was:

1. To distinguish Cognitive from Physical Ergonomics;

2. To assimilate Applied Psychology into Cognitive Ergonomics;

3. To equate Cognitive Ergonomics with Human-Computer Interaction.

I incorporated the above into a framework, which was published in a paper entitled ‘Cognitive Ergonomics and Human-Computer Interaction’ (1987) and developed further in a book of the same name (1989).

 

I am delighted, then, to post Phil’s paper (1991) on the website. First, because it is an excellent paper in its own right; indeed, it is the most complete and coherent articulation of Applied Psychology that it has been my pleasure to read. Second, because it is based on and develops further the frameworks cited earlier.

It is an irony, of course, that at about the same time as Phil’s paper appeared, I was forsaking an applied psychology approach to HCI for an engineering one (1989), although the former was always retained as a possible alternative option.

 

 

 

Bridging between Basic Theories and the

Artifacts of Human-Computer Interaction

Phil Barnard

In: Carroll, J.M. (Ed.). Designing Interaction: psychology at the human-computer interface.

New York: Cambridge University Press, Chapter 7, 103-127. This is not an exact copy of the

paper as it appeared but a DTP lookalike with very slight differences in pagination.

Psychological ideas on a particular set of topics go through something very much

like a product life cycle. An idea or vision is initiated, developed, and

communicated. It may then be exploited, to a greater or lesser extent, within the

research community. During the process of exploitation, the ideas are likely to

be the subject of critical evaluation, modification, or extension. With

developments in basic psychology, the success or penetration of the scientific

product can be evaluated academically by the twin criteria of citation counts and

endurance. As the process of exploitation matures, the idea or vision stimulates

little new research either because its resources are effectively exhausted or

because other ideas or visions that incorporate little from earlier conceptual

frameworks have taken over. At the end of their life cycle, most ideas are

destined to become fossilized under the pressure of successive layers of journals

opened only out of the behavioral equivalent of paleontological interest.

In applied domains, research ideas are initiated, developed, communicated,

and exploited in a similar manner within the research community. Yet, by the

very nature of the enterprise, citation counts and endurance are of largely

academic interest unless ideas or knowledge can effectively be transferred from

research to development communities and then have a very real practical impact

on the final attributes of a successful product.

If we take the past 20-odd years as representing the first life cycle of research

in human-computer interaction, the field started out with few empirical facts and

virtually no applicable theory. During this period a substantial body of work

was motivated by the vision of an applied science based upon firm theoretical

foundations. As the area was developed, there can be little doubt, on the twin

academic criteria of endurance and citation, that some theoretical concepts have

been successfully exploited within the research community. GOMS, of course,

is the most notable example (Card, Moran, & Newell, 1983; Olson & Olson,

1990; Polson, 1987). Yet, as Carroll (e.g., 1989a,b) and others have pointed

out, there are very few examples where substantive theory per se has had a major

and direct impact on design. On this last practical criterion, cognitive science can

more readily provide examples of impact through the application of empirical

methodologies and the data they provide and through the direct application of

psychological reasoning in the invention and demonstration of design concepts

(e.g., see Anderson & Skwarecki, 1986; Card & Henderson, 1987; Carroll,

1989a,b; Hammond & Allinson, 1988; Landauer, 1987).

As this research life cycle in HCI matures, fundamental questions are being

asked about whether or not simple deductions based on theory have any value at

all in design (e.g. Carroll, this volume), or whether behavior in human-computer

interactions is simply too complex for basic theory to have anything other than a

minor practical impact (e.g., see Landauer, this volume). As the next cycle of

research develops, the vision of a strong theoretical input to design runs the risk

of becoming increasingly marginalized or of becoming another fossilized

laboratory curiosity. Making use of a framework for understanding different

research paradigms in HCI, this chapter will discuss how theory-based research

might usefully evolve to enhance its prospects for both adequacy and impact.

Bridging Representations

In its full multidisciplinary context, work on HCI is not a unitary enterprise.

Rather, it consists of many different sorts of design, development, and research

activities. Long (1989) provides an analytic structure through which we can

characterize these activities in terms of the nature of their underlying concepts

and how different types of concept are manipulated and interrelated. Such a

framework is potentially valuable because it facilitates specification of,

comparison between, and evaluation of the many different paradigms and

practices operating within the broader field of HCI.

With respect to the relationship between basic science and its application,

Long makes three points that are fundamental to the arguments to be pursued in

this and subsequent sections. First, he emphasizes that the kind of

understanding embodied in our science base is a representation of the way in

which the real world behaves. Second, any representation in the science base

can only be mapped to and from the real world by what he called “intermediary”

representations. Third, the representations and mappings needed to realize this

kind of two-way conceptual traffic are dependent upon the nature of the activities

they are required to support. So the representations called upon for the purposes

of software engineering will differ from the representations called upon for the

purposes of developing an applicable cognitive theory.

Long’s framework is itself a developing one (1987, 1989; Long & Dowell,

1989). Here, there is no need to pursue the details; it is sufficient to emphasize

that the full characterization of paradigms operating directly with artifact design

differs from those characterizing types of engineering support research, which,

in turn, differ from more basic research paradigms. This chapter will primarily

be concerned with what might need to be done to facilitate the applicability and

impact of basic cognitive theory. In doing so it will be argued that a key role

needs to be played by explicit “bridging” representations. This term will be used

to avoid any possible conflict with the precise properties of Long’s particular

conceptualization.

Following Long (1989), Figure 7.1 shows a simplified characterization of an

applied science paradigm for bridging from the real world of behavior to the

science base and from these representations back to the real world. The blocks

are intended to characterize different sorts of representation and the arrows stand

for mappings between them (Long’s terminology is not always used here). The

real world of the use of interactive software is characterized by organisational,

group, and physical settings; by artifacts such as computers, software, and

manuals; by the real tasks of work; by characteristics of the user population; and

so on. In both applied and basic research, we construct our science not from the

real world itself but via a bridging representation whose purpose is to support

and elaborate the process of scientific discovery.

Obviously, the different disciplines that contribute to HCI each have their

own forms of discovery representation that reflect their paradigmatic

perspectives, the existing contents of their science base, and the target form of

their theory. In all cases the discovery representation incorporates a whole range

of explicit, and more frequently implicit, assumptions about the real world and

methodologies that might best support the mechanics of scientific abstraction. In

the case of standard paradigms of basic psychology, the initial process of

analysis leading to the formation of a discovery representation may be a simple

observation of behavior on some task. For example, it may be noted that

ordinary people have difficulty with particular forms of syllogistic reasoning. In

more applied research, the initial process of analysis may involve much more

elaborate taxonomization of tasks (e.g., Brooks, this volume) or of errors

observed in the actual use of interactive software (e.g., Hammond, Long, Clark,

Barnard, & Morton, 1980).

Conventionally, a discovery representation drastically simplifies the real

world. For the purposes of gathering data about the potential phenomena, a

limited number of contrastive concepts may need to be defined, appropriate

materials generated, tasks selected, observational or experimental designs

determined, populations and metrics selected, and so on. The real world of

preparing a range of memos, letters, and reports for colleagues to consider

before a meeting may thus be represented for the purposes of initial discovery by

 

an observational paradigm with a small population of novices carrying out a

limited range of tasks with a particular word processor (e.g., Mack, Lewis, &

Carroll, 1983). In an experimental paradigm, it might be represented

noninteractively by a paired associate learning task in which the mappings

between names and operations need to be learned to some criterion and

subsequently recalled (e.g., Scapin, 1981). Alternatively, it might be

represented by a simple proverb-editing task carried out on two alternative

versions of a cut-down interactive text editor with ten commands. After some

form of instructional familiarization appropriate to a population of computer-naive

members of a Cambridge volunteer subject panel, these commands may be

used an equal number of times with performance assessed by time on task,

errors, and help usage (e.g., Barnard, Hammond, MacLean, & Morton, 1982).

Each of the decisions made contributes to the operational discovery

representation.
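The design decisions enumerated above can be made concrete. Purely as an illustration (the structure and field names below are my own, not part of the original paper), a discovery representation can be thought of as a record of the simplifying choices made before any data are gathered:

```python
from dataclasses import dataclass

@dataclass
class DiscoveryRepresentation:
    """A record of the simplifying choices that turn some real-world
    activity into a studiable laboratory paradigm."""
    population: str        # who is observed
    tasks: list            # what they are asked to do
    familiarization: str   # how they are prepared
    metrics: list          # how performance is assessed

# The proverb-editing study described in the text, encoded this way:
editing_study = DiscoveryRepresentation(
    population="computer-naive Cambridge volunteer subject panel",
    tasks=["proverb editing with a cut-down ten-command text editor"],
    familiarization="instructional familiarization",
    metrics=["time on task", "errors", "help usage"],
)
```

Each field constrains what can later be assimilated to the science base, which is the sense in which the discovery representation limits the scope of all subsequent mappings.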

The resulting characterizations of empirical phenomena are potential

regularities of behavior that become, through a process of assimilation,

incorporated into the science base where they can be operated on, or argued

about, in terms of the more abstract, interpretive constructs. The discovery

representations constrain the scope of what is assimilated to the science base and

all subsequent mappings from it.

The conventional view of applied science also implies an inverse process

involving some form of application bridge whose function is to support the

transfer of knowledge in the science base into some domain of application.

Classic ergonomics-human factors relied on the handbook of guidelines. The

relevant processes involve contextualizing phenomena and scientific principles

for some applications domain – such as computer interfaces, telecommunications

apparatus, military hardware, and so on. Once explicitly formulated, say in

terms of design principles, examples and pointers to relevant data, it is left up to

the developers to operate on the representation to synthesize that information

with any other considerations they may have in the course of taking design

decisions. The dominant vision of the first life cycle of HCI research was that

this bridging could effectively be achieved in a harder form through engineering

approximations derived from theory (Card et al., 1983). This vision essentially

conforms to the full structure of Figure 7.1.

The Chasm to Be Bridged

The difficulties of generating a science base for HCI that will support effective

bridging to artifact design are undeniably real. Many of the strategic problems

theoretical approaches must overcome have now been thoroughly aired. The life

cycle of theoretical enquiry and synthesis typically postdates the life cycle of

products with which it seeks to deal; the theories are too low level; they are of

restricted scope; as abstractions from behavior they fail to deal with the real

context of work and they fail to accommodate fine details of implementations and

interactions that may crucially influence the use of a system (see, e.g.,

discussions by Carroll & Campbell, 1986; Newell & Card, 1985; Whiteside &

Wixon, 1987). Similarly, although theory may predict significant effects and

receive empirical support, those effects may be of marginal practical consequence

in the context of a broader interaction or less important than effects not

specifically addressed (e.g., Landauer, 1987).

Our current ability to construct effective bridges across the chasm that

separates our scientific understanding and the real world of user behavior and

artifact design clearly falls well short of requirements. In its relatively short

history, the scope of HCI research on interfaces has been extended from early

concerns with the usability of hardware, through cognitive consequences of

software interfaces, to encompass organizational issues (e.g., Grudin, 1990).

Against this background, what is required is something that might carry a

volume of traffic equivalent to an eight-lane cognitive highway. What is on offer

is more akin to a unidirectional walkway constructed from a few strands of rope

and some planks.

In Taking artifacts seriously Carroll (1989a) and Carroll, Kellogg, and

Rosson in this volume, mount an impressive case against the conventional view

of the deductive application of science in the invention, design, and development

of practical artifacts. They point to the inadequacies of current information-processing

psychology, to the absence of real historical justification for

deductive bridging in artifact development, and to the paradigm of craft skill in

which knowledge and understanding are directly embodied in artifacts.

Likewise, Landauer (this volume) foresees an equally dismal future for theory-based

design.

Whereas Landauer stresses the potential advances that may be achieved

through empirical modeling and formative evaluation, Carroll and his colleagues

have sought a more substantial adjustment to conventional scientific strategy

(Carroll, 1989a,b, 1990; Carroll & Campbell, 1989; Carroll & Kellogg, 1989;

Carroll et al., this volume). On the one hand they argue that true “deductive”

bridging from theory to application is not only rare, but when it does occur, it

tends to be underdetermined, dubious, and vague. On the other hand they argue

that the form of hermeneutics offered as an alternative by, for example,

Whiteside and Wixon (1987) cannot be systematized for lasting value. From

Carroll’s viewpoint, HCI is best seen as a design science in which theory and

artifact are in some sense merged. By embodying a set of interrelated

psychological claims concerning a product like HyperCard or the Training

Wheels interface (e.g., see Carroll & Kellogg, 1989), the artifacts themselves

take on a theorylike role in which successive cycles of task analysis,

interpretation, and artifact development enable design-oriented assumptions

about usability to be tested and extended.

This viewpoint has a number of inviting features. It offers the potential of

directly addressing the problem of complexity and integration because it is

intended to enable multiple theoretical claims to be dealt with as a system

bounded by the full artifact. Within the cycle of task analysis and artifact

development, the analyses, interpretations, and theoretical claims are intimately

bound to design problems and to the world of “real” behavior. In this context,

knowledge from HCI research no longer needs to be transferred from research

into design in quite the same sense as before and the life cycle of theories should

also be synchronized with the products they need to impact. Within this

framework, the operational discovery representation is effectively the rationale

governing the design of an artifact, whereas the application representation is a

series of user-interaction scenarios (Carroll, 1990).

The kind of information flow around the task – artifact cycle nevertheless

leaves somewhat unclear the precise relationships that might hold between the

explicit theories of the science base and the kind of implicit theories embodied in

artifacts. Early on in the development of these ideas, Carroll (1989a) points out

that such implicit theories may be a provisional medium for HCI, to be put aside

when explicit theory catches up. In a stronger version of the analysis, artifacts

are in principle irreducible to a standard scientific medium such as explicit

theories. Later it is noted that “it may be simplistic to imagine deductive relations

between science and design, but it would be bizarre if there were no relation at

all” (Carroll & Kellogg, 1989). Most recently, Carroll (1990) explicitly

identifies the psychology of tasks as the relevant science base for the form of

analysis that occurs within the task-artifact cycle (e.g. see Greif, this volume;

Norman this volume). The task-artifact cycle is presumed not only to draw upon

and contextualize knowledge in that science base, but also to provide new

knowledge to assimilate to it. In this latter respect, the current view of the task

artifact cycle appears broadly to conform with Figure 7.1. In doing so it makes

use of task-oriented theoretical apparatus rather than standard cognitive theory

and novel bridging representations for the purposes of understanding extant

interfaces (design rationale) and for the purposes of engineering new ones

(interaction scenarios).

In actual practice, whether the pertinent theory and methodology is grounded

in tasks, human information-processing psychology or artificial intelligence,

those disciplines that make up the relevant science bases for HCI are all

underdeveloped. Many of the basic theoretical claims are really provisional

claims; they may retain a verbal character (to be put aside when a more explicit

theory arrives), and even if fully explicit, the claims rarely generalize far beyond

the specific empirical settings that gave rise to them. In this respect, the wider

problem of how we go about bridging to and from a relevant science base

remains a long-term issue that is hard to leave unaddressed. Equally, any

research viewpoint that seeks to maintain a productive role for the science base in

artifact design needs to be accompanied by a serious reexamination of the

bridging representations used in theory development and in their application.

Science and design are very different activities. Given Figure 7.1, theory-based

design can never be direct; the full bridge must involve a transformation of

information in the science base to yield an applications representation, and

information in this structure must be synthesized into the design problem. In

much the same way that the application representation is constructed to support

design, our science base, and any mappings from it, could be better constructed

to support the development of effective application bridging. The model for

relating science to design is indirect, involving theoretical support for

engineering representations (both discovery and applications) rather than one

involving direct theoretical support in design.

The Science Base and Its Application

In spite of the difficulties, the fundamental case for the application of cognitive

theory to the design of technology remains very much what it was 20 years ago,

and indeed what it was 30 years ago (e.g., Broadbent, 1958). Knowledge

assimilated to the science base and synthesized into models or theories should

reduce our reliance on purely empirical evaluations. It offers the prospect of

supporting a deeper understanding of design issues and how to resolve them.

Indeed, Carroll and Kellogg’s (1989) theory nexus has developed out of a

cognitive paradigm rather than a behaviorist one. Although theory development

lags behind the design of artifacts, it may well be that the science base has more

to gain than the artifacts. The interaction of science and design nevertheless

should be a two-way process of added value.

Much basic theoretical work involves the application of only partially explicit

and incomplete apparatus to specific laboratory tasks. It is not unreasonable to

argue that our basic cognitive theory tends only to be successful for modeling a

particular application. That application is itself behavior in laboratory tasks. The

scope of the application is delimited by the empirical paradigms and the artifacts

it requires – more often than not these days, computers and software for

presentation of information and response capture. Indeed, Carroll’s task-artifact

and interpretation cycles could very well be used to provide a neat description of

the research activities involved in the iterative design and development of basic

theory. The trouble is that the paradigms of basic psychological research, and

the bridging representations used to develop and validate theory, typically

involve unusually simple and often highly repetitive behavioral requirements

atypical of those faced outside the laboratory.

Although it is clear that there are many cases of invention and craft where the

kinds of scientific understanding established in the laboratory play little or no

role in artifact development (Carroll, 1989b), this is only one side of the story.

The other side is that we should only expect to find effective bridging when what

is in the science base is an adequate representation of some aspect of the real

world that is relevant to the specific artifact under development. In this context it

is worth considering a couple of examples not usually called into play in the HCI

domain.

Psychoacoustic models of human hearing are well developed. Auditory

warning systems on older generations of aircraft are notoriously loud and

unreliable. Pilots don’t believe them and turn them off. Using standard

techniques, it is possible to measure the noise characteristics of the environment

on the flight deck of a particular aircraft and to design a candidate set of warnings

based on a model of the characteristics of human hearing. This determines

whether or not pilots can be expected to “hear” and identify those warnings over

the pattern of background noise without being positively deafened and distracted

(e.g., Patterson, 1983). Of course, the attention-getting and discriminative

properties of members of the full set of warnings still have to be crafted. Once

established, the extension of the basic techniques to warning systems in hospital

intensive-care units (Patterson, Edworthy, Shailer, Lower, & Wheeler, 1986)

and trains (Patterson, Cosgrove, Milroy, & Lower, 1989) is a relatively routine

matter.
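The logic of the audibility check can be sketched very simply. The code below is only an illustrative simplification of this kind of analysis, not Patterson's actual auditory model: it assumes the background noise and candidate warning components have already been measured as levels (dB SPL) per frequency band, and the 15 dB and 25 dB margins defining the target window are assumed figures, chosen so that a warning is predicted to be heard without being aversively loud.

```python
def audible_window(noise_db, lo_margin=15.0, hi_margin=25.0):
    """Per-band (min, max) warning levels: loud enough to be heard
    over the measured background noise, not so loud as to deafen
    or distract."""
    return [(n + lo_margin, n + hi_margin) for n in noise_db]

def warning_ok(warning_db, noise_db, lo_margin=15.0, hi_margin=25.0):
    """True if every component of the candidate warning falls inside
    the target window for its frequency band."""
    return all(n + lo_margin <= w <= n + hi_margin
               for w, n in zip(warning_db, noise_db))
```

So `warning_ok([80.0, 82.0], [62.0, 60.0])` accepts a candidate whose components sit 18 and 22 dB above the measured flight-deck noise, while a component 40 dB above its band would be rejected as aversively loud.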

Developed further and automated, the same kind of psychoacoustic model

can play a direct role in invention. As the front end to a connectionist speech

recognizer, it offers the prospect of a theoretically motivated coding structure that

could well prove to outperform existing technologies (e.g., see ACTS, 1989).

As used in invention, what is being embodied in the recognition artifact is an

integrated theory about the human auditory system rather than a simple heuristic

combination of current signal-processing technologies.

Another case arises out of short-term memory research. Happily, this one

does not concern limited capacity! When the research technology for short-term

memory studies evolved into a computerized form, it was observed that word

lists presented at objectively regular time intervals (onset to onset times for the

sound envelopes) actually sounded irregular. In order to be perceived as regular

the onset to onset times need to be adjusted so that the “perceptual centers” of the

words occur at equal intervals (Morton, Marcus, & Frankish, 1976). This

science base representation, and algorithms derived from it, can find direct use in

telecommunications technology or speech interfaces where there is a requirement

for the automatic generation of natural sounding number or option sequences.
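A minimal version of such an algorithm is easy to state. Assuming each word's perceptual center has been measured as an offset from its acoustic onset (the measurement itself is the hard, empirical part), onsets are scheduled so that successive p-centers, rather than onsets, fall at equal intervals:

```python
def schedule_onsets(p_center_offsets, interval):
    """Return acoustic onset times (seconds) such that successive
    perceptual centers are exactly `interval` seconds apart.
    `p_center_offsets[i]` is word i's p-center measured from its onset."""
    onsets = [i * interval - off for i, off in enumerate(p_center_offsets)]
    shift = -min(onsets)  # shift so the earliest onset is at time zero
    return [round(t + shift, 6) for t in onsets]
```

For offsets of 0.05, 0.12 and 0.08 s at a 0.5-s interval this yields onsets of 0.0, 0.43 and 0.97 s, whose p-centers (0.05, 0.55, 1.05 s) are perceptually regular even though the onsets are not.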

Of course, both of these examples are admittedly relatively “low level.” For

many higher level aspects of cognition, what is in the science base are

representations of laboratory phenomena of restricted scope and accounts of

them. What would be needed in the science base to provide conditions for

bridging are representations of phenomena much closer to those that occur in the

real world. So, for example, the theoretical representations should be topicalized

on phenomena that really matter in applied contexts (Landauer, 1987). They

should be theoretical representations dealing with extended sequences of

cognitive behavior rather than discrete acts. They should be representations of

information-rich environments rather than information-impoverished ones. They

should relate to circumstances where cognition is not a pattern of short repeating

(experimental) cycles but where any cycles that might exist have meaning in

relation to broader task goals and so on.

It is not hard to pursue points about what the science base might incorporate

in a more ideal world. Nevertheless, it does contain a good deal of useful

knowledge (cf. Norman, 1986), and indeed the first life cycle of HCI research

has contributed to it. Many of the major problems with the appropriateness,

scope, integration, and applicability of its content have been identified. Because

major theoretical perestroika will not be achieved overnight, the more productive

questions concern the limitations on the bridging representations of that first

cycle of research and how discovery representations and applications

representations might be more effectively developed in subsequent cycles.

An Analogy with Interface Design Practice

Not surprisingly, those involved in the first life cycle of HCI research relied very

heavily in the formation of their discovery representations on the methodologies

of the parent discipline. Likewise, in bridging from theory to application, those

involved relied heavily on the standard historical products used in the verification

of basic theory, that is, prediction of patterns of time and/or errors. There are

relatively few examples where other attributes of behavior are modeled, such as

choice among action sequences (but see Young & MacLean, 1988). A simple

bridge, predictive of times or errors, provides information about the user of an

interactive system. The user of that information is the designer, or more usually

the design team. Frameworks are generally presented for how that information

might be used to support design choice either directly (e.g., Card et al., 1983) or

through trade-off analyses (e.g., Norman, 1983). However, these forms of

application bridge are underdeveloped to meet the real needs of designers.

Given the general dictum of human factors research, “Know the user”

(Hanson, 1971), it is remarkable how few explicitly empirical studies of design

decision making are reported in the literature. In many respects, it would not be

entirely unfair to argue that bridging representations between theory and design

have remained problematic for the same kinds of reasons that early interactive

interfaces were problematic. Like glass teletypes, basic psychological

technologies were underdeveloped and, like the early design of command

languages, the interfaces (application representations) were heuristically

constructed by applied theorists around what they could provide rather than by

analysis of requirements or extensive study of their target users or the actual

context of design (see also Bannon & Bødker, this volume; Henderson, this

volume).

Equally, in addressing questions associated with the relationship between

theory and design, the analogy can be pursued one stage further by arguing for

the iterative design of more effective bridging structures. Within the first life

cycle of HCI research a goodly number of lessons have been learned that could

be used to advantage in a second life cycle. So, to take a very simple example,

certain forms of modeling assume that users naturally choose the fastest method

for achieving their goal. However, there is now some evidence that this is not

always the case (e.g., MacLean, Barnard, & Wilson, 1985). Any role for the

knowledge and theory embodied in the science base must accommodate, and

adapt to, those lessons. For many of the reasons that Carroll and others have

elaborated, simple deductive bridging is problematic. To achieve impact,

behavioral engineering research must itself directly support the design,

development, and invention of artifacts. On any reasonable time scale there is a

need for discovery and application representations that cannot be fully justified

through science-base principles or data. Nonetheless, such a requirement simply

restates the case for some form of cognitive engineering paradigm. It does not in

and of itself undermine the case for the longer-term development of applicable

theory.

Just as impact on design has most readily been achieved through the

application of psychological reasoning in the invention and demonstration of

artifacts, so a meaningful impact of theory might best be achieved through the

invention and demonstration of novel forms of applications representations. The

development of representations to bridge from theory to application cannot be

taken in isolation. It needs to be considered in conjunction with the contents of

the science base itself and the appropriateness of the discovery representations

that give rise to them.

Without attempting to be exhaustive, the remainder of this chapter will

exemplify how discovery representations might be modified in the second life

cycle of HCI research; and illustrate how theory might drive, and itself benefit

from, the invention and demonstration of novel forms of applications bridging.

Enhancing Discovery Representations

Although disciplines like psychology have a formidable array of methodological

techniques, those techniques are primarily oriented toward hypothesis testing.

Here, greatest effort is expended in using factorial experimental designs to

confirm or disconfirm a specific theoretical claim. Often wider characteristics of

phenomena are only charted as and when properties become a target of specific

theoretical interest. Early psycholinguistic research did not start off by studying

what might be the most important factors in the process of understanding and

using textual information. It arose out of a concern with transformational

grammars (Chomsky, 1957). In spite of much relevant research in earlier

paradigms (e.g., Bartlett, 1932), psycholinguistics itself only arrived at this

consideration after progressing through the syntax, semantics, and pragmatics of

single-sentence comprehension.

As Landauer (1987) has noted, basic psychology has not been particularly

productive at evolving exploratory research paradigms. One of the major

contributions of the first life cycle of HCI research has undoubtedly been a

greater emphasis on demonstrating how such empirical paradigms can provide

information to support design (again, see Landauer, 1987). Techniques for

analyzing complex tasks, in terms of both action decomposition and knowledge

requirements, have also progressed substantially over the past 20 years (e.g.,

Wilson, Barnard, Green, & MacLean, 1988).

A significant number of these developments are being directly assimilated

into application representations for supporting artifact development. Some can

also be assimilated into the science base, such as Lewis’s (1988) work on

abduction. Here observational evidence in the domain of HCI (Mack et al.,

1983) leads directly to theoretical abstractions concerning the nature of human

reasoning. Similarly, Carroll (1985) has used evidence from observational and

experimental studies in HCI to extend the relevant science base on naming and

reference. However, not a lot has changed concerning the way in which

discovery representations are used for the purposes of assimilating knowledge to

the science base and developing theory.

In their own assessment of progress during the first life cycle of HCI

research, Newell and Card (1985) advocate continued reliance on the hardening

of HCI as a science. This implicitly reinforces classic forms of discovery

representations based upon the tools and techniques of parent disciplines. Heavy


reliance on the time-honored methods of experimental hypothesis testing in

experimental paradigms does not appear to offer a ready solution to the two

problems dealing with theoretical scope and the speed of theoretical advance.

Likewise, given that these parent disciplines are relatively weak on exploratory

paradigms, such an approach does not appear to offer a ready solution to the

other problems of enhancing the science base for appropriate content or for

directing its efforts toward the theoretical capture of effects that really matter in

applied contexts.

The second life cycle of research in HCI might profit substantially by

spawning more effective discovery representations, not only for assimilation to

applications representations for cognitive engineering, but also to support

assimilation of knowledge to the science base and the development of theory.

Two examples will be reviewed here. The first concerns the use of evidence

embodied in HCI scenarios (Young & Barnard, 1987; Young, Barnard, Simon,

& Whittington, 1989). The second concerns the use of protocol techniques to

systematically sample what users know and to establish relationships between

verbalizable knowledge and actual interactive performance.

Test-driving Theories

Young and Barnard (1987) have proposed that more rapid theoretical advance

might be facilitated by “test driving” theories in the context of a systematically

sampled set of behavioral scenarios. The research literature frequently makes

reference to instances of problematic or otherwise interesting user-system

exchanges. Scenario material derived from that literature is selected to represent

some potentially robust phenomenon of the type that might well be pursued in

more extensive experimental research. Individual scenarios should be regarded

as representative of the kinds of things that really matter in applied settings. So

for example, one scenario deals with a phenomenon often associated with

unselected windows. In a multiwindowing environment a persistent error,

frequently committed even by experienced users, is to attempt some action in

an inactive window. The action might be an attempt at a menu selection. However,

pointing and clicking over a menu item does not cause the intended result; it

simply leads to the window being activated. Very much like linguistic test

sentences, these behavioral scenarios are essentially idealized descriptions of

such instances of human-computer interactions.

If we are to develop cognitive theories of significant scope they must in

principle be able to cope with a wide range of such scenarios. Accordingly, a

manageable set of scenario material can be generated that taps behaviors that

encompass different facets of cognition. So, a set of scenarios might include

instances dealing with locating information in a directory entry, selecting

alternative methods for achieving a goal, lexical errors in command entry, the

unselected windows phenomenon, and so on (see Young, Barnard, Simon, &

Whittington, 1989). A set of contrasting theoretical approaches can likewise be

selected and the theories and scenarios organized into a matrix. The activity

involves taking each theoretical approach and attempting to formulate an account

114 Barnard

of each behavioral scenario. The accuracy of the account is not at stake. Rather,

the purpose of the exercise is to see whether a particular piece of theoretical

apparatus is even capable of giving rise to a plausible account. The scenario

material is effectively being used as a set of sufficiency filters and it is possible to

weed out theories of overly narrow scope. If an approach is capable of

formulating a passable account, interest focuses on the properties of the account

offered. In this way, it is also possible to evaluate and capitalize on the

properties of theoretical apparatus that do provide appropriate sorts of analytic

leverage over the range of scenarios examined.
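The scenario/theory matrix described above can be caricatured in a few lines of code. This is a hypothetical sketch only: the scenario names follow the chapter, but the theories and their coverage are invented placeholders, and the "sufficiency filter" is reduced to a simple coverage threshold.

```python
# Hypothetical sketch of the scenario/theory "sufficiency matrix".
# Theory names and their coverage are illustrative, not from the chapter.

scenarios = [
    "locating information in a directory entry",
    "choosing between alternative methods",
    "lexical errors in command entry",
    "unselected-windows phenomenon",
]

# Each "theory" is reduced to the set of scenarios for which its apparatus
# can formulate a plausible account (accuracy is not at stake).
theories = {
    "keystroke-level model": {"lexical errors in command entry"},
    "grammar-based model": {"lexical errors in command entry",
                            "locating information in a directory entry"},
    "architectural model": set(scenarios),
}

def sufficiency_matrix(theories, scenarios):
    """Return {theory: [True/False per scenario]} — the theory x scenario matrix."""
    return {name: [s in covered for s in scenarios]
            for name, covered in theories.items()}

def broad_enough(theories, scenarios, threshold=0.75):
    """Filter out theories of overly narrow scope."""
    return [name for name, covered in theories.items()
            if len(covered & set(scenarios)) / len(scenarios) >= threshold]

matrix = sufficiency_matrix(theories, scenarios)
print(broad_enough(theories, scenarios))
```

The point of the exercise survives even this caricature: theories are compared not on predictive accuracy but on whether their apparatus can formulate any account at all across the sampled scenarios.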

Traditionally, theory development places primary emphasis on predictive

accuracy and only secondary emphasis on scope. This particular form of

discovery representation goes some way toward redressing that balance. It

offers the prospect of getting appropriate and relevant theoretical apparatus in

place on a relatively short time cycle. As an exploratory methodology, it at least

addresses some of the more profound difficulties of interrelating theory and

application. The scenario material makes use of known instances of human-computer

interaction. Because these scenarios are by definition instances of

interactions, any theoretical accounts built around them must of necessity be

appropriate to the domain. Because scenarios are intended to capture significant

aspects of user behavior, such as persistent errors, they are oriented toward what

matters in the applied context. As a quick and dirty methodology, it can make

effective use of the accumulated knowledge acquired in the first life cycle of HCI

research, while avoiding some of the worst “tar pits” (Norman, 1983) of

traditional experimental methods.

As a form of discovery bridge between application and theory, the real world

is represented, for some purpose, not by a local observation or example, but by a

sampled set of material. If the purpose is to develop a form of cognitive

architecture, then it may be most productive to select a set of scenarios that

encompass different components of the cognitive system (perception, memory,

decision making, control of action). Once an applications representation has

been formed, its properties might be further explored and tested by analyzing

scenario material sampled over a range of different tasks, or applications

domains (see Young & Barnard, 1987). At the point where an applications

representation is developed, the support it offers may also be explored by

systematically sampling a range of design scenarios and examining what

information can be offered concerning alternative interface options (AMODEUS,

1989). By contrast with more usual discovery representations, the scenario

methodology is not primarily directed at classic forms of hypothesis testing and

validation. Rather, its purpose is to support the generation of more readily

applicable theoretical ideas.

Verbal Protocols and Performance

One of the most productive exploratory methodologies utilized in HCI research

has involved monitoring user action while collecting concurrent verbal protocols

to help understand what is actually going on. Taken together these have often


given rise to the best kinds of problem-defining evidence, including the kind of

scenario material already outlined. Many of the problems with this form of

evidence are well known. Concurrent verbalization may distort performance and

significant changes in performance may not necessarily be accompanied by

changes in articulatable knowledge. Because it is labor intensive, the

observations are often confined to a very small number of subjects and tasks. In

consequence, the representativeness of isolated observations is hard to assess.

Furthermore, getting real scientific value from protocol analysis is crucially

dependent on the insights and craft skill of the researcher concerned (Barnard,

Wilson, & MacLean, 1986; Ericsson & Simon, 1980).

Techniques of verbal protocol analysis can nevertheless be modified and

utilized as a part of a more elaborate discovery representation to explore and

establish systematic relationships between articulatable knowledge and

performance. The basic assumption underlying much theory is that a

characterization of the ideal knowledge a user should possess to successfully

perform a task can be used to derive predictions about performance. However,

protocol studies clearly suggest that users really get into difficulty when they

have erroneous or otherwise nonideal knowledge. In terms of the precise

relationships they have with performance, ideal and nonideal knowledge are

seldom considered together.

In an early attempt to establish systematic and potentially generalizable

relationships between the contents of verbal protocols and interactive

performance, Barnard et al., (1986) employed a sample of picture probes to elicit

users’ knowledge of tasks, states, and procedures for a particular office product

at two stages of learning. The protocols were codified, quantified, and

compared. In the verbal protocols, the number of true claims about the system

increased with system experience, but surprisingly, the number of false claims

remained stable. Individual users who articulated a lot of correct claims

generally performed well, but the amount of inaccurate knowledge did not appear

related to their overall level of performance. There was, however, some

indication that the amount of inaccurate knowledge expressed in the protocols

was related to the frequency of errors made in particular system contexts.

A subsequent study (Barnard, Ellis, & MacLean, 1989) used a variant of the

technique to examine knowledge of two different interfaces to the same

application functionality. High levels of inaccurate knowledge expressed in the

protocols were directly associated with the dialogue components on which

problematic performance was observed. As with the earlier study, the amount of

accurate knowledge expressed in any given verbal protocol was associated with

good performance, whereas the amount of inaccurate knowledge expressed bore

little relationship to an individual’s overall level of performance. Both studies

reinforced the speculation that it is specific interface characteristics that give rise

to the development of inaccurate or incomplete knowledge from which false

inferences and poor performance may follow.

Just as the systematic sampling and use of behavioral scenarios may facilitate

the development of theories of broader scope, so discovery representations

designed to systematically sample the actual knowledge possessed by users


should facilitate the incorporation into the science base of behavioral regularities

and theoretical claims that are more likely to reflect the actual basis of user

performance rather than a simple idealization of it.

Enhancing Application Representations

The application representations of the first life cycle of HCI research relied very

much on the standard theoretical products of their parent disciplines.

Grammatical techniques originating in linguistics were utilized to characterize the

complexity of interactive dialogues; artificial intelligence (AI)-oriented models

were used to represent and simulate the knowledge requirements of learning;

and, of course, derivatives of human information-processing models were used

to calculate how long it would take users to do things. Although these

approaches all relied upon some form of task analysis, their apparatus was

directed toward some specific function. They were all of limited scope and made

numerous trade-offs between what was modeled and the form of prediction made

(Simon, 1988).

Some of the models were primarily directed at capturing knowledge

requirements for dialogues for the purposes of representing complexity, such as

BNF grammars (Reisner, 1982) and Task Action Grammars (Payne & Green,

1986). Others focused on interrelationships between task specifications and

knowledge requirements, such as GOMS analyses and cognitive-complexity

theory (Card et al., 1983; Kieras & Polson, 1985). Yet other apparatus, such as

the model human information processor and the keystroke level model of Card et

al. (1983) were primarily aimed at time prediction for the execution of error-free

routine cognitive skill. Most of these modeling efforts idealized either the

knowledge that users needed to possess or their actual behavior. Few models

incorporated apparatus for integrating over the requirements of knowledge

acquisition or use and human information-processing constraints (e.g., see

Barnard, 1987). As application representations, the models of the first life cycle

had little to say about errors or the actual dynamics of user-system interaction as

influenced by task constraints and information or knowledge about the domain of

application itself.

Two modeling approaches will be used to illustrate how applications

representations might usefully be enhanced. They are programmable user

models (Young, Green, & Simon, 1989) and modeling based on Interacting

Cognitive Subsystems (Barnard, 1985). Although these approaches have

different origins, both share a number of characteristics. They are both aimed at

modeling more qualitative aspects of cognition in user-system interaction; both

are aimed at understanding how task, knowledge, and processing constraint

intersect to determine performance; both are aimed at exploring novel means of

incorporating explicit theoretical claims into application representations; and both

require the implementation of interactive systems for supporting decision making

in a design context. Although they do so in different ways, both approaches

attempt to preserve a coherent role for explicit cognitive theory. Cognitive theory

is embodied, not in the artifacts that emerge from the development process, but

in demonstrator artifacts that might support design. This is almost directly

analogous to achieving an impact in the marketplace through the application of

psychological reasoning in the invention of artifacts. Except in this case, the

target user populations for the envisaged artifacts are those involved in the design

and development of products.

Programmable User Models (PUMs)

The core ideas underlying the notion of a programmable user model have their

origins in the concepts and techniques of AI. Within AI, cognitive architectures

are essentially sets of constraints on the representation and processing of

knowledge. In order to achieve a working simulation, knowledge appropriate to

the domain and task must be represented within those constraints. In the normal

simulation methodology, the complete system is provided with some data and,

depending on its adequacy, it behaves with more or less humanlike properties.

Using a simulation methodology to provide the designer with an artificial

user would be one conceivable tactic. Extending the forms of prediction offered

by such simulations (cf. cognitive complexity theory; Polson, 1987) to

encompass qualitative aspects of cognition is more problematic. Simply

simulating behavior is of relatively little value. Given the requirements of

knowledge-based programming, it could, in many circumstances, be much more

straightforward to provide a proper sample of real users. There needs to be

some mechanism whereby the properties of the simulation provide information

of value in design. Programmable user models provide a novel perspective on

this latter problem. The idea is that the designer is provided with two things, an

“empty” cognitive architecture and an instruction language for providing it with all

the knowledge it needs to carry out some task. By programming it, the designer

has to get the architecture to perform that task under conditions that match those

of the interactive system design (i.e., a device model). So, for example, given a

particular dialog design, the designer might have to program the architecture to

select an object displayed in a particular way on a VDU and drag it across that

display to a target position.

The key, of course, is that the constraints that make up the architecture being

programmed are humanlike. Thus, if the designer finds it hard to get the

architecture to perform the task, then the implication is that a human user would

also find the task hard to accomplish. To concretize this, the designer may find

that the easiest form of knowledge-based program tends to select and drag the

wrong object under particular conditions. Furthermore, it takes a lot of thought

and effort to figure out how to get round this problem within the specific

architectural constraints of the model. Now suppose the designer were to adjust

the envisaged user-system dialog in the device model and then found that

reprogramming the architecture to carry out the same task under these new

conditions was straightforward and the problem of selecting the wrong object no

longer arose. Young and his colleagues would then argue that this constitutes


direct evidence that the second version of the dialog design tried by the designer

is likely to prove more usable than the first.
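The core PUM intuition — that the difficulty of programming a humanly constrained architecture diagnoses the difficulty of the interface itself — can be caricatured in miniature. This toy is in no way the actual PUM/SOAR work: the single "memory limit" constraint, the instruction format, and both dialog designs are invented for illustration.

```python
# A toy sketch (not the actual PUM/SOAR architecture) of the idea that
# programming a constrained "user model" reveals usability problems: if the
# architecture is hard to program for a task, users will likely find it hard.

class ToyUserModel:
    """An 'empty' architecture with one humanlike constraint: it can hold
    only a few distinct items in mind while executing a procedure."""
    MEMORY_LIMIT = 4

    def __init__(self):
        self.steps = []

    def instruct(self, step, items_in_mind):
        """Instruction language: each step declares what must be held in mind."""
        if len(items_in_mind) > self.MEMORY_LIMIT:
            raise ValueError(
                f"step '{step}' needs {len(items_in_mind)} items; "
                f"the architecture cannot hold more than {self.MEMORY_LIMIT}")
        self.steps.append(step)

# Dialog design A: the user must remember object identity, position, target,
# modifier key, and current mode simultaneously — hard to program.
design_a_step = ("drag object",
                 ["object", "position", "target", "modifier key", "mode"])

# Dialog design B: the display tracks position, mode, and modifier for the user.
design_b_step = ("drag object", ["object", "target"])

def programmable(step):
    """Can the architecture be programmed to perform this step at all?"""
    model = ToyUserModel()
    try:
        model.instruct(*step)
        return True
    except ValueError:
        return False

print(programmable(design_a_step))  # design A strains the architecture
print(programmable(design_b_step))
```

In the real proposal the diagnosis is far subtler — effortful programming, not outright failure — but the allocation of roles is the same: the model supplies constraints, the designer supplies task knowledge.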

The actual project to realize a working PUM remains at an early stage of

development. The cognitive architecture being used is SOAR (Laird, Newell, &

Rosenbloom, 1987). There are many detailed issues to be addressed concerning

the design of an appropriate instruction language. Likewise, real issues are

raised about how a model that has its roots in architectures for problem solving

(Newell & Simon, 1972) deals with the more peripheral aspects of human

information processing, such as sensation, perception, and motor control.

Nevertheless as an architecture, it has scope in the sense that a broad range of

tasks and applications can be modeled within it. Indeed, part of the motivation

of SOAR is to provide a unified general theory of cognition (Newell, 1989).

In spite of its immaturity, additional properties of the PUM concept as an

application bridging structure are relatively clear (see Young et al., 1989). First,

programmable user models embody explicit cognitive theory in the form of the

to-be-programmed architecture. Second, there is an interesting allocation of

function between the model and the designer. Although the modeling process

requires extensive operationalization of knowledge in symbolic form, the PUM

provides only the constraints and the instruction language, whereas the designer

provides the knowledge of the application and its associated tasks. Third,

knowledge in the science base is transmitted implicitly into the design domain via

an inherently exploratory activity. Designers are not told about the underlying

cognitive science; they are supposed to discover it. By doing what they know

how to do well – that is, programming – the relevant aspects of cognitive

constraints and their interactions with the application should emerge directly in

the design context.

Fourth, programmable user models support a form of qualitative predictive

evaluation that can be carried out relatively early in the design cycle. What that

evaluation provides is not a classic predictive product of laboratory theory, rather

it should be an understanding of why it is better to have the artifact constructed

one way rather than another. Finally, although the technique capitalizes on the

designer’s programming skills, it clearly requires a high degree of commitment

and expense. The instruction language has to be learned and doing the

programming would require the development team to devote considerable

resources to this form of predictive evaluation.

Approximate Models of Cognitive Activity

Interacting Cognitive Subsystems (Barnard, 1985) also specifies a form of

cognitive architecture. Rather than being an AI constraint-based architecture,

ICS has its roots in classic human information-processing theory. It specifies

the processing and memory resources underlying cognition, the organization of

these resources, and principles governing their operation. Structurally, the

complete human information-processing system is viewed as a distributed

architecture with functionally distinct subsystems each specializing in, and

supporting, different types of sensory, representational, and effector processing


activity. Unlike many earlier generations of human information-processing

models, there are no general purpose resources such as a central executive or

limited capacity working memory. Rather the model attempts to define and

characterize processes in terms of the mental representations they take as input

and the representations they output. By focusing on the mappings between

different mental representations, this model seeks to integrate a characterization

of knowledge-based processing activity with classic structural constraints on the

flow of information within the wider cognitive system.

A graphic representation of this architecture is shown in the right-hand panel

of Figure 7.2, which instantiates Figure 7.1 for the use of the ICS framework in

an HCI context. The architecture itself is part of the science base. Its initial

development was supported by using empirical evidence from laboratory studies

of short-term memory phenomena (Barnard, 1985). However, by concentrating

on the different types of mental representation and process that transform them,

rather than task and paradigm specific concepts, the model can be applied across

a broad range of settings (e.g., see Barnard & Teasdale, 1991). Furthermore,

for the purposes of constructing a representation to bridge between theory and

application it is possible to develop explicit, yet approximate, characterizations of

cognitive activity.

In broad terms, the way in which the overall architecture will behave is

dependent upon four classes of factor. First, for any given task it will depend on

the precise configuration of cognitive activity. Different subsets of processes

and memory records will be required by different tasks. Second, behavior will

be constrained by the specific procedural knowledge embodied in each mental

process that actually transforms one type of mental representation to another.

Third, behavior will be constrained by the form, content, and accessibility of any

memory records that are needed in that phase of activity. Fourth, it will depend on

the overall way in which the complete configuration is coordinated and

controlled.

Because the resources are relatively well defined and constrained in terms of

their attributes and properties, interdependencies between them can be motivated

on the basis of known patterns of experimental evidence and rendered explicit.

So, for example, a complexity attribute of the coordination and control of

cognitive activity can be directly related to the number of incompletely

proceduralized processes within a specified configuration. Likewise, a strategic

attribute of the coordination and control of cognitive activity may be dependent

upon the overall amount of order uncertainty associated with the mental

representation of a task stored in a memory record. For present purposes the

precise details of these interdependencies do not matter, nor does the particularly

opaque terminology shown in the rightmost panel of Figure 7.2 (for more

details, see Barnard, 1987). The important point is that theoretical claims can be

specified within this framework at a high level of abstraction and that these

abstractions belong in the science base.

Although these theoretical abstractions could easily have come from classic

studies of human memory and performance, they were in fact motivated by

experimental studies of command naming in text editing (Grudin & Barnard,


1984) and performance on an electronic mailing task (Barnard, MacLean, &

Hammond, 1984). The full theoretical analyses are described in Barnard (1987)

and extended in Barnard, Grudin, and MacLean (1989). In both cases the tasks

were interactive, involved extended sequences of cognitive behavior, involved

information-rich environments, and the repeating patterns of data collection were

meaningful in relation to broader task goals, not atypical of interactive tasks in the

real world. In relation to the arguments presented earlier in this chapter, the

information being assimilated to the science base should be more appropriate and

relevant to HCI than that derived from more abstract laboratory paradigms. It

will nonetheless be subject to interpretive restrictions inherent in the particular

form of discovery representation utilized in the design of these particular

experiments.

Armed with such theoretical abstractions, and accepting their potential

limitations, it is possible to generate a theoretically motivated bridge to

application. The idea is to build approximate models that describe the nature of

cognitive activity underlying the performance of complex tasks. The process is

actually carried out by an expert system that embodies the theoretical knowledge

required to build such models. The system “knows” what kinds of

configurations are associated with particular phases of cognitive activity; it

“knows” something about the conditions under which knowledge becomes

proceduralized, and the properties of memory records that might support recall

and inference in complex task environments. It also “knows” something about

the theoretical interdependencies between these factors in determining the overall

patterning, complexity, and qualities of the coordination and dynamic control of

cognitive activity. Abstract descriptions of cognitive activity are constructed in

terms of a four-component model specifying attributes of configurations,

procedural knowledge, record contents, and dynamic control. Finally, in order

to produce an output, the system “knows” something about the relationships

between these abstract models of cognitive activity and the attributes of user

behavior.


Figure 7.2. The applied science paradigm instantiated for the use of interacting cognitive subsystems as a theoretical basis for the development

Obviously, no single model of this type can capture everything that goes on

in a complex task sequence. Nor can a single model capture different stages of

user development or other individual differences within the user population. It is

therefore necessary to build a set of interrelated models representing different

phases of cognitive activity, different levels and forms of user expertise, and so

on. The basic modeling unit uses the four-component description to characterize

cognitive activity for a particular phase, such as establishing a goal, determining

the action sequence, and executing it. Each of these models approximates over

the very short-term dynamics of cognition. Transitions between phases

approximate over the short-term dynamics of tasks, whereas transitions between

levels of expertise approximate over different stages of learning. In Figure 7.2,

the envisaged application representation thus consists of a family of interrelated

models depicted graphically as a stack of cards.
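The "stack of cards" organization can be sketched as a data structure: each card is a four-component description of one phase of cognitive activity at one stage of expertise. The field names follow the chapter's four components, but every value below is an illustrative placeholder, not ICS theory.

```python
# Hypothetical data-structure sketch of the "family of interrelated models".
# Field values are invented placeholders, not actual ICS content.

from dataclasses import dataclass, field

@dataclass
class PhaseModel:
    phase: str                 # e.g. "establish goal", "determine action sequence"
    expertise: str             # stage of learning, e.g. "novice", "expert"
    configuration: list        # subset of processes/memory records in use
    procedural_knowledge: str  # degree of proceduralization of the processes
    record_contents: str       # form/content/accessibility of memory records
    dynamic_control: str       # coordination of the complete configuration

# The "stack of cards": interrelated models indexed by phase and expertise.
stack = [
    PhaseModel("establish goal", "novice",
               ["propositional", "implicational"], "low", "sparse", "complex"),
    PhaseModel("determine action sequence", "novice",
               ["propositional", "object"], "low", "order-uncertain", "complex"),
    PhaseModel("execute action sequence", "expert",
               ["object", "limb"], "high", "rich", "streamlined"),
]

def models_for(stack, phase=None, expertise=None):
    """Select the cards relevant to a phase and/or stage of learning."""
    return [m for m in stack
            if (phase is None or m.phase == phase)
            and (expertise is None or m.expertise == expertise)]

print(len(models_for(stack, expertise="novice")))
```

Transitions down the stack then approximate the short-term dynamics of the task, and transitions across expertise levels approximate learning, as the text describes.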

Like the concept of programmable user models, the concept of approximate

descriptive modeling is in the course of development. A running demonstrator

system exists that effectively replicates the reasoning underlying the explanation

of a limited range of empirical phenomena in HCI research (see Barnard,

Wilson, & MacLean, 1987, 1988). What actually happens is that the expert

system elicits, in a context-sensitive manner, descriptions of the envisaged

interface, its users, and the tasks that interface is intended to support. It then

effectively “reasons about” cognitive activity, its properties, and attributes in that

applications setting for one or more phases of activity and one or more stages of

learning. Once the models have stabilized, it then outputs a characterization of

the probable properties of user behavior. In order to achieve this, the expert

system has to have three classes of rules: those that map from descriptions of

tasks, users, and systems to entities and properties in the model representation;

rules that operate on those properties; and rules that map from the model

representation to characterizations of behavior. Even in its somewhat primitive

current state, the demonstrator system has interesting generalizing properties.

For example, theoretical principles derived from research on rather antiquated

command languages support limited generalization to direct manipulation and

iconic interfaces.
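The three classes of rules might be sketched, purely illustratively, as follows. All rule contents, property names and behavioural characterizations below are invented for the example; they are not drawn from the actual demonstrator described by Barnard, Wilson, and MacLean.

```python
# Hypothetical sketch of the expert system's three rule classes:
# (1) map descriptions of tasks, users, and systems to model properties,
# (2) reason over those properties, (3) map properties to behaviour.
# All names and rules are illustrative assumptions, not the real system.

# Class 1: map task/user/system descriptions to entities and properties
# in the model representation.
def mapping_rules(description):
    properties = set()
    if description.get("command_names") == "unfamiliar":
        properties.add("high_procedural_load")
    if description.get("user_stage") == "novice":
        properties.add("record_contents_sparse")
    return properties

# Class 2: operate on those properties ("reason about" cognitive activity).
def reasoning_rules(properties):
    if {"high_procedural_load", "record_contents_sparse"} <= properties:
        properties.add("control_unstable")
    return properties

# Class 3: map the model representation to characterizations of behaviour.
def output_rules(properties):
    if "control_unstable" in properties:
        return "errors and slow, effortful performance likely"
    return "fluent performance likely"

interface = {"command_names": "unfamiliar", "user_stage": "novice"}
print(output_rules(reasoning_rules(mapping_rules(interface))))
```

The point of the sketch is only the three-stage chaining: descriptions enter at one end, a characterization of probable user behaviour comes out the other, and every assumption in between is encoded as an explicit rule.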

As an applications representation, the expert system concept is very different

from programmable user models. Like PUMs, the actual tool embodies explicit

theory drawn from the science base. Likewise, the underlying architectural

concept enables a relatively broad range of issues to be addressed. Unlike

PUMs, it more directly addresses a fuller range of resources across perceptual,

cognitive, and effector concerns. It also applies a different trade-off in when and

by whom the modeling knowledge is specified. At the point of creation, the

expert system must contain a complete set of rules for mapping between the

world and the model. In this respect, the means of accomplishing and

expressing the characterizations of cognition and behavior must be fully and

comprehensively encoded. This does not mean that the expert system must

necessarily “know” each and every detail. Rather, within some defined scope,

the complete chain of assumptions from artifact to theory and from theory to

behavior must be made explicit at an appropriate level of approximation.


Equally, the input and output rules must obviously be grounded in the language

of interface description and user-system interaction. Although some of the

assumptions may be heuristic, and many of them may need crafting, both

theoretical and craft components are there. The how-to-do-it modeling

knowledge is laid out for inspection.

However, at the point of use, the expert system requires considerably less

precision than PUMs in the specification and operationalization of the knowledge

required to use the application being considered. The expert system can build a

family of models very quickly and without its user necessarily acquiring any

great level of expertise in the underlying cognitive theory. In this way, it is

possible for that user to explore models for alternative system designs over the

course of something like one afternoon. Because the system is modular, and the

models are specified in abstract terms, it is possible in principle to tailor the

system's input and output rules without modifying the core theoretical reasoning.

The development of the tool could then respond to requirements that might

emerge from empirical studies of the real needs of design teams or of particular

application domains.

In a more fully developed form, it might be possible to address the issue of

which type of tool might prove more effective in what types of applications

context. However, strictly speaking, they are not direct competitors; they are

alternative types of application representation that make different forms of trade-off

about the characteristics of the complete chain of bridging from theory to

application. By contrast with the kinds of theory-based techniques relied on in

the first life cycle of HCI research, both PUMs and the expert-system concept

represent more elaborate bridging structures. Although underdeveloped, both

approaches are intended ultimately to deliver richer and more integrated

information about properties of human cognition into the design environment in

forms in which it can be digested and used. Both PUMs and the expert system

represent ways in which theoretical support might be usefully embodied in future

generations of tools for supporting design. In both cases the aim is to deliver

within the lifetime of the next cycle of research a qualitative understanding of

what might be going on in a user’s head rather than a purely quantitative estimate

of how long the average head is going to be busy (see also Lewis, this volume).

Summary

The general theme that has been pursued in this chapter is that the relationship

between the real world and theoretical representations of it is always mediated by

bridging representations that subserve specific purposes. In the first life cycle of

research on HCI, the bridging representations were not only simple, they were

only a single step away from those used in the parent disciplines for the

development of basic theory and its validation. If cognitive theory is to find any

kind of coherent and effective role in forthcoming life cycles of HCI research, it

must seriously reexamine the nature and function of these bridging

representations as well as the content of the science base itself.


This chapter has considered bridging between specifically cognitive theory

and behavior in human-computer interaction. This form of bridging is but one

among many that need to be pursued. For example, there is a need to develop

bridging representations that will enable us to interrelate models of user cognition

with the formal models being developed to support design by software

engineers (e.g., Dix, Harrison, Runciman, & Thimbleby, 1987; Harrison,

Roast, & Wright, 1989; Thimbleby, 1985). Similarly there is a need to bridge

between cognitive models and aspects of the application and the situation of use

(e.g., Suchman, 1987). Truly interdisciplinary research formed a large part of

the promise, but little of the reality of early HCI research. Like the issue of

tackling nonideal user behavior, interdisciplinary bridging is now very much on

the agenda for the next phase of research (e.g., see Barnard & Harrison, 1989).

The ultimate impact of basic theory on design can only be indirect – through

an explicit application representation. Alternative forms of such representation

that go well beyond what has been achieved to date have to be invented,

developed, and evaluated. The views of Carroll and his colleagues form one

concrete proposal for enhancing our application representations. The design

rationale concept being developed by MacLean, Young, and Moran (1989)

constitutes another potential vehicle for expressing application representations.

Yet other proposals seek to capture qualitative aspects of human cognition while

retaining a strong theoretical character (Barnard et al., 1987; 1988; Young,

Green, & Simon, 1989).

On the view advocated here, the direct theory-based product of an applied

science paradigm operating in HCI is not an interface design. It is an application

representation capable of providing principled support for reasoning about

designs. There may indeed be very few examples of theoretically inspired

software products in the current commercial marketplace. However, the first life

cycle of HCI research has produced a far more mature view of what is entailed in

the development of bridging representations that might effectively support design

reasoning. In subsequent cycles, we may well be able to look forward to a

significant shift in the balance of added value within the interaction between

applied science and design. Although future progress will in all probability

remain less than rapid, theoretically grounded concepts may yet deliver rather

more in the way of principled support for design than has been achieved to date.

Acknowledgments

The participants at the Kittle Inn workshop contributed greatly to my

understanding of the issues raised here. I am particularly indebted to Jack

Carroll, Wendy Kellogg, and John Long, who commented extensively on an

earlier draft. Much of the thinking also benefited substantially from my

involvement with the multidisciplinary AMODEUS project, ESPRIT Basic

Research Action 3066.

References

ACTS (1989). Connectionist Techniques for Speech (ESPRIT Basic Research

Action 3207), Technical Annex. Brussels: CEC.


AMODEUS (1989). Assimilating models of designers, users and systems (ESPRIT

Basic Research Action 3066), Technical Annex. Brussels: CEC.

Anderson, J. R., & Skwarecki, E. (1986). The automated tutoring of

introductory computer programming. Communications of the ACM, 29,

842-849.

Barnard, P. J. (1985). Interacting cognitive subsystems: A psycholinguistic

approach to short term memory. In A. Ellis (Ed.), Progress in the

psychology of language (Vol. 2, chapter 6, pp. 197-258). London:

Lawrence Erlbaum Associates.

Barnard, P. J. (1987). Cognitive resources and the learning of human-computer

dialogs. In J.M. Carroll (Ed.), Interfacing thought: Cognitive aspects of

human-computer interaction (pp. 112-158). Cambridge MA: MIT Press.

Barnard, P. J., & Harrison, M. D. (1989). Integrating cognitive and system

models in human-computer interaction. In A. Sutcliffe & L. Macaulay,

(Ed.), People and computers V (pp. 87-103). Cambridge: Cambridge

University Press.

Barnard, P. J., Ellis, J., & MacLean, A. (1989). Relating ideal and non-ideal

verbalised knowledge to performance. In A. Sutcliffe & L. Macaulay

(Eds.), People and computers V (pp. 461-473). Cambridge: Cambridge

University Press.

Barnard, P. J., Grudin, J., & MacLean, A. (1989). Developing a science base

for the naming of computer commands. In J. B. Long & A. Whitefield

(Eds.), Cognitive ergonomics and human-computer interaction (pp. 95-

133). Cambridge: Cambridge University Press.

Barnard, P. J., Hammond, N., MacLean, A., & Morton, J. (1982). Learning

and remembering interactive commands in a text-editing task. Behaviour

and Information Technology, 1, 347-358.

Barnard, P. J., MacLean, A., & Hammond, N. V. (1984). User representations

of ordered sequences of command operations. In B. Shackel (Ed.),

Proceedings of Interact ’84: First IFIP Conference on Human-Computer

Interaction, (Vol. 1, pp. 434-438). London: IEE.

Barnard, P. J., & Teasdale, J. (1991). Interacting cognitive subsystems: A

systematic approach to cognitive-affective interaction and change.

Cognition and Emotion, 5, 1-39.

Barnard, P. J., Wilson, M., & MacLean, A. (1986). The elicitation of system

knowledge by picture probes. In M. Mantei & P. Orbeton (Eds.),

Proceedings of CHI ’86: Human Factors in Computing Systems (pp.

235-240). New York: ACM.

Barnard, P. J., Wilson, M., & MacLean, A. (1987). Approximate modelling of

cognitive activity: Towards an expert system design aid. In J. M. Carroll

& P. P. Tanner (Eds.), Proceedings of CHI + GI ’87: Human Factors in

Computing Systems and Graphics Interface (pp. 21-26). New York:

ACM.

Barnard, P. J., Wilson, M., & MacLean, A. (1988). Approximate modelling of

cognitive activity with an expert system: A theory-based strategy for

developing an interactive design tool. The Computer Journal, 31, 445-

456.

Bartlett, F. C. (1932). Remembering: A study in experimental and social

psychology. Cambridge: Cambridge University Press.

Broadbent, D. E. (1958). Perception and communication. London: Pergamon

Press.

Card, S. K., & Henderson, D. A. (1987). A multiple virtual-workspace

interface to support user task-switching. In J. M. Carroll & P. P. Tanner

(Eds.), Proceedings of CHI + GI ’87: Human Factors in Computing

Systems and Graphics Interface (pp. 53-59). New York: ACM.

Card, S. K., Moran, T. P., & Newell, A. (1983). The psychology of human-computer

interaction. Hillsdale, NJ: Lawrence Erlbaum Associates.

Carroll, J. M. (1985). What’s in a name? New York: Freeman.

Carroll, J. M. (1989a). Taking artifacts seriously. In S. Maas & H. Oberquelle

(Eds.), Software-Ergonomie ’89 (pp. 36-50). Stuttgart: Teubner.

Carroll, J. M. (1989b). Evaluation, description and invention: Paradigms for

human-computer interaction. In M. C. Yovits (Ed.), Advances in

computers (Vol. 29, pp. 44-77). London: Academic Press.

Carroll, J. M. (1990). Infinite detail and emulation in an ontologically

minimized HCI. In J. Chew & J. Whiteside (Eds.), Proceedings of CHI

’90: Human Factors in Computing Systems (pp. 321-327). New York:

ACM.

Carroll, J. M., & Campbell, R. L. (1986). Softening up hard science: Reply to

Newell and Card. Human-Computer Interaction, 2, 227-249.

Carroll, J. M., & Campbell, R. L. (1989). Artifacts as psychological theories:

The case of human-computer interaction. Behaviour and Information

Technology, 8, 247-256.

Carroll, J. M., & Kellogg, W. A. (1989). Artifact as theory-nexus:

Hermeneutics meets theory-based design. In K. Bice & C. H. Lewis

(Eds.), Proceedings of CHI ’89: Human Factors in Computing Systems

(pp. 7-14). New York: ACM.

Chomsky, N. (1957). Syntactic structures. The Hague: Mouton.

Dix, A. J., Harrison, M. D., Runciman, C., & Thimbleby, H. W. (1987).

Interaction models and the principled design of interactive systems. In

H. Nicholls & D. S. Simpson (Eds.), European software engineering

conference, (pp. 127-135). Berlin: Springer Lecture Notes.

Ericsson, K. A., & Simon, H. A. (1980). Verbal reports as data.

Psychological Review, 87, 215-251.

Grudin, J. T. (1990). The computer reaches out: The historical continuity of

interface design. In J. Chew & J. Whiteside (Eds.), Proceedings of CHI

’90: Human Factors in Computing Systems (pp. 261-268). New York:

ACM.

Grudin, J. T., & Barnard, P. J. (1984). The cognitive demands of learning

command names for text editing. Human Factors, 26, 407-422.

Hammond, N., & Allinson, L. (1988). Travels around a learning support

environment: rambling, orienteering or touring? In E. Soloway, D.


Frye, & S. B. Sheppard (Eds.), Proceedings of CHI ’88: Human

Factors in Computing Systems (pp. 269-273). New York: ACM.

Hammond, N. V., Long, J., Clark, I. A., Barnard, P. J., & Morton, J. (1980).

Documenting human-computer mismatch in interactive systems. In

Proceedings of the Ninth International Symposium on Human Factors in

Telecommunications (pp. 17-24). Red Bank, NJ.

Hanson, W. (1971). User engineering principles for interactive systems.

AFIPS Conference Proceedings, 39, 523-532.

Harrison, M. D., Roast, C. R., & Wright, P. C. (1989). Complementary

methods for the iterative design of interactive systems. In G. Salvendy

& M. J. Smith (Eds.), Proceedings of HCI International ’89 (pp. 651-

658). Boston: Elsevier Scientific.

Kieras, D. E., & Polson, P. G. (1985). An approach to formal analysis of user

complexity. International Journal of Man- Machine Studies, 22, 365-

394.

Laird, J.E., Newell, A., & Rosenbloom, P. S. (1987). SOAR: An architecture

for general intelligence. Artificial Intelligence, 33, 1-64.

Landauer, T. K. (1987). Relations between cognitive psychology and computer

systems design. In J. M. Carroll (Ed.), Interfacing thought: Cognitive

aspects of human-computer interaction (pp. 1-25). Cambridge, MA:

MIT Press.

Lewis, C. H. (1988). Why and how to learn why: Analysis-based

generalization of procedures. Cognitive Science, 12, 211-256.

Long, J. B. (1987). Cognitive ergonomics and human-computer interaction. In

P. Warr (Ed.), Psychology at Work (3rd ed.). Harmondsworth,

Middlesex: Penguin.

Long, J. B. (1989). Cognitive ergonomics and human-computer interaction: An

introduction. In J. B. Long & A. Whitefield (Eds.), Cognitive

ergonomics and human-computer interaction (pp. 4-34). Cambridge:

Cambridge University Press.

Long, J. B., & Dowell, J. (1989). Conceptions of the discipline of HCI: Craft,

applied science and engineering. In A. Sutcliffe & L. Macaulay (Eds.),

People and computers V (pp. 9-32). Cambridge: Cambridge University

Press.

MacLean, A., Barnard, P., & Wilson, M. (1985). Evaluating the human

interface of a data entry system: User choice and performance measures

yield different trade-off functions. In P. Johnson & S. Cook (Eds.),

People and computers: Designing the interface (pp. 172-185).

Cambridge: Cambridge University Press.

MacLean, A., Young, R. M., & Moran, T. P. (1989). Design rationale: The

argument behind the artefact. In K. Bice & C.H. Lewis (Eds.),

Proceedings of CHI ’89: Human Factors in Computing Systems (pp.

247-252). New York: ACM.

Mack, R., Lewis, C., & Carroll, J.M. (1983). Learning to use word

processors: Problems and prospects. ACM Transactions on Office

Information Systems, 1, 254-271.

Morton, J., Marcus, S., & Frankish, C. (1976). Perceptual centres: P-centres.

Psychological Review, 83, 405-408.

Newell, A. (1989). Unified Theories of Cognition: The 1987 William James

Lectures. Cambridge, MA: Harvard University Press.

Newell, A., & Card, S. K. (1985). The prospects for psychological science in

human computer interaction. Human-Computer Interaction, 1, 209-242.

Newell, A., & Simon, H. A. (1972). Human Problem Solving. Englewood

Cliffs, NJ: Prentice-Hall.

Norman, D. A. (1983). Design principles for human-computer interaction. In

Proceedings of CHI ’83: Human Factors in Computing Systems (pp. 1-

10). New York: ACM.

Norman, D. A. (1986). Cognitive engineering. In D. A. Norman & S. W.

Draper (Eds.), User centered system design: New perspectives on

human-computer interaction (pp. 31-61). Hillsdale, NJ: Lawrence

Erlbaum Associates.

Olson, J. R., & Olson, G. M. (1990). The growth of cognitive modelling since

GOMS. Human Computer Interaction 5, 221-265.

Patterson, R. D. (1983). Guidelines for auditory warnings on civil aircraft: A

summary and prototype. In G. Rossi (Ed.), Noise as a Public Health

Problem (Vol. 2, pp. 1125-1133). Milan: Centro Richerche e Studi

Amplifon.

Patterson, R. D., Cosgrove, P., Milroy, R., & Lower, M.C. (1989). Auditory

warnings for the British Rail inductive loop warning system. In

Proceedings of the Institute of Acoustics, Spring Conference (Vol. 11,

pp. 5-51-58). Edinburgh: Institute of Acoustics.

Patterson, R. D., Edworthy, J., Shailer, M.J., Lower, M.C., & Wheeler, P. D.

(1986). Alarm sounds for medical equipment in intensive care areas and

operating theatres. Institute of Sound and Vibration (Research Report AC

598).

Payne, S., & Green, T. (1986). Task action grammars: A model of the mental

representation of task languages. Human-Computer Interaction, 2, 93-

133.

Polson, P. (1987). A quantitative theory of human-computer interaction. In J.

M. Carroll (Ed.), Interfacing thought: Cognitive aspects of human-computer

interaction (pp. 184-235). Cambridge, MA: MIT Press.

Reisner, P. (1982). Further developments towards using formal grammar as a

design tool. In Proceedings of Human Factors in Computer Systems

Gaithersburg (pp. 304-308). New York: ACM.

Scapin, D. L. (1981). Computer commands in restricted natural language: Some

aspects of memory and experience. Human Factors, 23, 365-375.

Simon, T. (1988). Analysing the scope of cognitive models in human-computer

interaction. In D. M. Jones & R. Winder (Eds.), People and computers

IV (pp. 79-93). Cambridge: Cambridge University Press.

Suchman, L. (1987). Plans and situated actions: The problem of human-machine

communication. Cambridge: Cambridge University Press.


Thimbleby, H. W. (1985). Generative user-engineering principles for user

interface design. In B. Shackel (Ed.), Human computer interaction:

Interact ’84 (pp. 661-665). Amsterdam: North-Holland.

Whiteside, J., & Wixon, D. (1987). Improving human-computer interaction: A

quest for cognitive science. In J. M. Carroll (Ed.), Interfacing thought:

Cognitive aspects of human-computer interaction (pp. 353-365).

Cambridge, MA: MIT Press.

Wilson, M., Barnard, P. J., Green, T. R. G., & MacLean, A. (1988).

Knowledge-based task analysis for human-computer systems. In G. Van

der Veer, J-M Hoc, T. R. G. Green, & D. Murray (Eds.), Working with

computers (pp. 47-87). London: Academic Press.

Young, R. M., & Barnard, P. J. (1987). The use of scenarios in human-computer

interaction research: Turbocharging the tortoise of cumulative

science. In J. M. Carroll & P. P. Tanner (Eds.), Proceedings of CHI +

GI ’87: Human Factors in Computing Systems and Graphics Interface

(Toronto, April 5-9) (pp. 291-296). New York: ACM.

Young, R. M., Barnard, P.J., Simon, A., & Whittington, J. (1989). How

would your favourite user model cope with these scenarios? SIGCHI

Bulletin, 20(4), 51-55.

Young, R. M., Green, T. R. G., & Simon, T. (1989). Programmable user

models for predictive evaluation of interface designs. In K. Bice and C.

H. Lewis (Eds.), Proceedings of CHI ’89: Human Factors in Computing

Systems (pp. 15-19). New York: ACM.

Young, R.M., & MacLean, A. (1988). Choosing between methods: Analysing

the user’s decision space in terms of schemas and linear models. In E.

Soloway, D. Frye, & S. B. Sheppard (Eds.), Proceedings of CHI ’88:

Human Factors in Computing Systems (pp. 139-143). New York:

ACM.


The Hexagon-Spindle Model for Ergonomics

Introduction

I took over the Directorship of the Ergonomics Unit in 1979. By then Rachel had already completed the MSc in 1975 (for her MSc Reflections – see Present and Future, Section 9) and had been appointed as lecturer by my predecessor Harry Maule. Rachel is still there to-day. That says much for her loyalty, tenacity and sheer survivability (Harry; John; Harold; Ann…..). Rachel and I were (and still are) a ‘rainbow’ of contrasts – Rachel and myself respectively – physiologist and psychologist by background; teacher and researcher by academic leaning; physical and cognitive ergonomists by preference; intellectual and pastoral carers of students by nature and so on….. For that reason, I think that we made a good team. We covered the waterfront, so to speak, and supported the Unit and the MSc through difficult and sometimes stormy times. (For a personal view on the origins and survival of the Ergonomics Unit – see Rachel’s interview Present and Future, Section 4.) We also had our moments, of course, given the contrasts outlined earlier and the going was not always smooth; but our unquestionable loyalty to the Unit and commitment to (Cognitive) Ergonomics more generally ensured success. I am very pleased to include Rachel’s (and Andre’s (another ex-MSc student)) paper on the website.

PS Thirty-six years on, and Rachel and I are still working together, if only to sort out Unit documents for archiving….. Time passes quickly, when you are enjoying yourself…..

 

 

The Hexagon-Spindle Model for Ergonomics

 

 

Rachel Benedyk1 and Andrée Woodcock2

 

 

1 University College London Interaction Centre, UK 

r.benedyk@ucl.ac.uk

2 Design and Ergonomics Applied Research Group, Coventry University, UK 

adx974@coventry.ac.uk

 

 

 

 

 

The concept of ‘work’ in ergonomics may now be applied to the satisfactory completion of any task. Using a systems view of the process, the task consists of a number of transformative interactions, performed simultaneously or successively, to a given level of effectiveness. Such interactions take place at the interface, between the user (or operator) and key components of the system; these encompass not only traditional hardware, software and workstations, but social, managerial, infrastructure and peripheral factors. This paper proposes a new model for Ergonomics, known as the Hexagon-Spindle Model, which represents the multiple factors influencing the effectiveness of interactions at the interface. The model is holistic, multi-dimensional, task-related and transferable across a range of settings. It extends to characterise a time base for serial and simultaneous tasks and highlights areas where operator/system conflicts may arise. The paper illustrates analysis tools for the application of the model in evaluation and design.

 

 

Introduction

 

Traditional work systems approaches in ergonomics (as described originally by Singleton, 1967 and 1974, and more recently by Dowell and Long, 1998) may be applied to the satisfactory completion of any task, not just those conducted as part of ‘formal work’. Using a systems view of the process, the task consists of a number of transformative interactions, performed simultaneously or successively, to a given level of effectiveness. The tasks are accomplished in a ‘workplace’ – the immediate environment in which the user and the work system interact. Each task is defined as one or more processes undertaken by a user or operator while in a dynamic exchange of information (an ‘interaction’) with another or with the key design features of the interface. Here the wider definition of an interface is used, referring to the ‘point of communication between two or more processes, persons, or other (physical) entities’ (Santoro, 2001). This would include concrete or virtual objects such as tools, equipment, networks, furniture, and formal and informal organisational structures.

 

Task interactions occur in a particular setting, whether in formal spaces such as offices or factories, or informal ones such as sports venues, coffee shops, cyber cafes, or the home. They are task dependent, user specific, and subject to influence from each component of the system; not only from traditional hardware, software and workstations, but also social, managerial, infrastructure and peripheral factors. The overall accomplishment of the task is achieved by the successful and effective completion of task interactions, each of which may be influenced by wider factors beyond the immediate control of the user. The challenge for ergonomists is to find the optimal conditions of all these factors to maximise this effectiveness.

 

Previous ergonomic models

 

Traditional models of ergonomics have either represented the interactive nature of the interface or the multiple sources of influential factors. The earliest model, commonly known as Concentric Rings (Shackel, 1969; Galer, 1987), represented an operator-centred focus, and the influence on the operator of factors at levels moving out from there; however, there is no further elaboration in the model of these factors in the system. Girling and Birnbaum (1988) took this further with the Elaborated Concentric Rings model, in which the many influential factors in the outer rings are seen to originate from management, situational or personal sources. These divisions facilitate the diagnosis of multiple sources of design problems and the directions needed for rectification. The enhanced model has since been applied in design problems, ensuring all relevant contributing factors are included (eg Dubois and Benedyk, 1991) and in evaluation problems, providing a holistic framework for scoping the evaluation (eg Naki and Benedyk, 2002).

 

In parallel, the development of the systems approach to ergonomics has emphasised the purpose (ie the task) and the functions to support that purpose; thus Singleton’s work (1967, 1974) began the ‘closed loop’ system representation of the task interaction at the interface. This exchange of data or information between operator and interface holds true whether operator/machine or practitioner/patient or user/tool. Later authors have developed detailed structures and behaviours to further understand the interface and the nature of this interaction (eg Dowell and Long, 1998).

 

Combining the two approaches to form a holistic representation would seem to have some advantage: as Wilson (2005, page 10) emphasises: “Human-machine interaction does not take place in a vacuum;…ergonomic techniques are needed to predict, investigate or develop each of the possible interactions, between person-task, person-process, person-environment, person-job, person-person, person-organisation and person-outside world.” In this regard, Singleton (1967) makes passing reference to groups of external factors that could ‘interrupt the closed loop’, and Grey (1987) expands on sources within the rings, placing the task as the innermost ring. Leamon’s (1980) model comes closest to integration, in that it places the Closed Loop inside the Concentric Rings, and attempts to indicate criteria for system effectiveness; however, it elaborates no further on the rings themselves.

 

In addition to the need for a holistic representation, there is a pressing need for an ergonomic model that can move beyond a systematic construct, to act as a useful tool for practitioners, able to demonstrate the utility of the ergonomic approach. Moreover, no models as yet embrace the complex ergonomic challenges arising from multiple users, from simultaneous or successive tasks, from peripatetic users or from changes in user characteristics over time. The Hexagon-Spindle model presented here begins to address all of these.

 

 

The Hexagon Model

 

The Hexagon Model has elaborated the Enhanced Concentric Rings model by firstly providing a clearer division of the sectors and the levels; secondly by making explicit the six directions for task interactions and the factors of influence; and thirdly, by differentiating between the constant characteristics of the user (common to any task), which are in the central hexagon of the model, and the individual variables that arise only for that task, which feature in the personal sector. The model was developed in order to elucidate the factors which affect task interactions in learning environments. Using this domain provided a series of specific tasks, interactions and multi-level factors which could be used to validate the model. These developments have been reported by Benedyk, Woodcock and Harder (2006, 2009), and Woodcock, Woolner and Benedyk (2009). In this paper, we wish to test the viability of the generic model by returning it to more traditional domains, using the example of a re-design of a cockpit simulator instructor workstation.

 

The generic form of the Hexagon Model is shown in Figure 1. This shows that for a specific task, the user performs transformative interactions (in line with his/her requirements) with the interface at the level of the workstation. The workstation consists of all the immediate and relevant tools, materials,

 


 

Legend for Figure 1

 

Shaded areas represent levels

Dotted lines represent porous boundaries

Hashed area represents task interactions at the interface between

user and the workstation level

Sectors provide groupings for sources of factors that potentially

influence the task interactions

 

Figure 1  A Generic Hexagon Model

 

technology, furniture, co-workers and other people in the vicinity, facilities and resources. This wider definition of the interface is compound and complex; it is not just limited to the human-machine interface, and its interactions are subject to influence (both positive and negative) from multiple factors at the workplace, work setting and external levels. The subdivisions of the outer rings are intended to ensure a sharp focus of analysis, and greater rigour and consistency across applications. The model further encourages breadth of consideration, so that all possible influences are addressed. Contextual factors need to be set each time the model is applied, specific to the application under consideration. For a more detailed description of the parts of the model, the reader is referred to Benedyk et al (2009).

 

To perform a Hexagon Analysis, which is the first stage of the tool for practice, the model is transformed into a matrix (Figure 2) and the key factors of potential ergonomic influence are noted for each cell, i.e. any factor likely to impact on the effectiveness of the task interaction. Every cell in the Analysis Table must be given consideration; this is one of the benefits of the technique, as it encourages a holistic approach. Note that there is no attempt at quantification or comparison here; the inclusion of a factor simply gives it potential significance. If the analysis has an evaluation purpose, the factors noted may be rooted in actual observations of the task in context. More often, however, the factors must be extracted from a focused creative discussion between ergonomist and user or designer.
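As an illustration of this first stage, the matrix can be sketched in code. The level and sector names below follow Figure 2, but the code itself is a minimal sketch assumed for illustration, not the authors' tool:

```python
# Sketch of a Hexagon Analysis table: one cell per level/sector pair,
# each cell holding the influencing factors noted for the task.
# Level and sector names follow Figure 2; the data structure is assumed.

LEVELS = ["External", "Task Setting", "Task Workplace", "Task Workstation"]
SECTORS = ["Management", "Infrastructure", "Task", "Tool", "Individual", "Social"]

def new_analysis_table():
    """One empty cell per level/sector pair (24 cells here)."""
    return {(level, sector): [] for level in LEVELS for sector in SECTORS}

def unconsidered_cells(table):
    """Cells with no factors noted yet. Every cell must be considered,
    even if the conclusion is that no significant factor applies."""
    return [cell for cell, factors in table.items() if not factors]

table = new_analysis_table()
table[("Task Workstation", "Individual")].append(
    "Static loading imposed by current workstation configuration")
# unconsidered_cells(table) now lists the remaining cells to review
```

The point of the structure is the exhaustiveness check: the analysis is complete only when every cell has been explicitly reviewed, which is what gives the technique its holistic character.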

 

As each analysis is task dependent, the choice and relative weight of factors in the different cells will vary. The acceptable level of effectiveness also needs to be set in advance, using relevant ergonomic criteria and priorities appropriate to that work system. The ergonomic practitioner’s judgment is needed here.

 

From the range of influencing factors identified, those most critical to task effectiveness are extracted, and an Application Format table (Figure 3) constructed, in which the potential ergonomic problems due to each factor are identified, and solutions suggested. The emphasis of the analysis is on a holistic approach to design and evaluation which recognizes where there are mismatches between ergonomic requirements for effective interaction, and features of the actual system. These issues can then be given professional consideration, with the application of ergonomic techniques as necessary, and recommendations for improvement put forward.

 

Figure 2 presents an exemplar of the Hexagon Analysis Table; the data here were used for the ergonomic evaluation and re-design of the physical ergonomic aspects of an instructor workstation in a cockpit motion simulator. Figure 3 presents an Application Format table, with an illustrative analysis for the same example.

 

 

The Spindle Model  

 

In the second stage of the analysis, multiple Hexagons are combined and compared, using a Spindle Model (Figure 4). The spindle allows the hexagon plates to be stacked or spaced, and the influencing factors on them compared, in order to integrate user requirements over different tasks over time.

 

Tasks are rarely undertaken singly. The user might move from task to task consecutively, or perform more than one task concurrently. These tasks have the same user in common, and often the same external environment, perhaps even the same organisational management, but they may require different equipment, different settings and entail different user requirements, at different times. Because they are not entirely independent of each other, and relate along a time dimension, it can be useful to see their hexagons arranged as ‘plates’ on a ‘time spindle’ as in Figure 4. The commonality of the user links them at the centre.

 

For serial tasks, the hexagons will be arranged spaced out along the time dimension as in Figure 4, and for concurrent tasks they will be ‘stacked’ at a single time point. Analysis involves a comparison of the key priority factors (the X’s in Figure 4) on each of the hexagons on the spindle, in order to identify conflict or congruence. The spindle can thus hold the user’s task-independent needs constant while varying the user requirements that are task-dependent.
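The comparison across plates can be sketched as set operations over each hexagon's critical factors. This is a hypothetical illustration (the task names and factor labels are invented), not part of the published model:

```python
# Sketch of the Spindle comparison: each hexagon 'plate' carries the
# critical factors (the X's) for one task; comparing plates separates
# congruent factors (shared by all tasks) from task-specific ones,
# which are the candidates for conflict.

def compare_plates(plates):
    """plates: mapping of task name -> set of critical factor labels."""
    factor_sets = list(plates.values())
    congruent = set.intersection(*factor_sets)            # shared by every task
    task_specific = set.union(*factor_sets) - congruent   # candidate conflicts
    return congruent, task_specific

# Invented example: two instructor tasks on the same spindle.
plates = {
    "brief pilot": {"sightline to pilot", "task lighting", "adjustable seat"},
    "run simulation": {"sightline to pilot", "vibration damping"},
}
shared, specific = compare_plates(plates)
```

Factors in `shared` can be designed for once; each factor in `specific` must be checked against the other tasks' requirements for conflict.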

For time t1. Sector groupings: Organisational Sector (Management Factors, Infrastructure Factors); Contextual Sector (Task Factors, Tool Factors); Personal Sector (Individual Factors, Social Factors).

External Level
- Management: Business funding for training; aviation training policy and standards
- Infrastructure: Developments in integration of instructor workstations in simulators
- Task: Best practice in simulator training facilities and in instructor facilities
- Tool: Trends and standards in use of integrated cockpit instructor workstations
- Individual: Instructor recruitment and training in relation to use of simulator workstations
- Social: Cultural (national and business) expectations of simulator facilities

Task Setting Level (Pilot training centre)
- Management: Allocation of budget for simulator facility; costs and benefits of upgrading
- Infrastructure: Provision of facilities for simulators; technical support
- Task: Syllabus and content of instruction sessions, in relation to use of equipment and interaction of instructor with pilot
- Tool: Age and antiquity of simulator facilities; flexibility of use of space
- Individual: Attitudes of instructors to use of different types of simulator workstations
- Social: Attitudes and preferences of pilots interacting with instructors in simulators

Task Workplace Level (Cockpit simulator)
- Management: Procurement and planning choices for design and facilities in training simulators
- Infrastructure: Overall space, accessibility, temperature, acoustics, lighting, vibration
- Task: Style of delivery of instruction for these tasks; equipment and sightline requirements for these tasks
- Tool: Suitability of workplace configuration to accommodate instructor workstation; effect of environmental factors on workstation usability
- Individual: Crowding factor; relation of instructor workstation to pilot workstation
- Social: Potential interference between instructor’s workstation and fidelity of cockpit simulation

Task Workstation Level (Instructor’s workstation)
- Management: Procurement choice of individual workstation, e.g. adjustability, orientation and size; suitability for this cockpit
- Infrastructure: Installation, support and power supplies affecting individual workstation; task lighting
- Task: Instructor workstation and interface design functionality in relation to required task performance; frequency and extent of use
- Tool: Instructor workstation and interface design features and their usability; fit to this cockpit; sightlines between instructor and pilot’s instruments
- Individual: Posture, flexibility of posture, and static loading factors imposed by current configuration; effect of vibration on balance, vision and well-being during these tasks
- Social: Sightlines and engagement (both required and unwanted) between instructor and pilot

Interaction Level (Task interface)

Interface User Level (Instructor): Stable personal characteristics, e.g. anthropometry, skills, abilities, preferences, expectations and knowledge of the particular instructor, in relation to these tasks in this cockpit simulator; individual susceptibility to effects of vibration

 

 

 

Figure 2 The Hexagon Analysis Table: an analysis table for a cockpit flight instructor’s tasks – this figure shows a partial analysis, giving examples filled into the table for physical ergonomics factors relevant to the ergonomic design of a flight instructor workstation

 

Note the hashed boundaries indicating porosity of influence.

Note also the stable user characteristics (those that are not task dependent) underpinning all sector columns.

Note the interaction level occurs between the user and the workstation, in a direct sense, but is under influence from factors at any level and from any sector.

 

 

 

 

The table compares five levels: External, Work-Setting, Workplace, Workstation and Operator.

External Level
- Influencing factors at time t1: Clients from different nations hire this training facility and require different configurations of cockpit to suit.
- Ergonomics issues: Different configurations impose variations in requirements which, if not met, could impact optimal use of the facility by some users.
- Approaches to solutions: Ensure use of instructor workstation is compatible with all possible client-preferred cockpit configurations without loss of task quality.

Work-Setting Level
- Influencing factors: Motion simulator instruction must replicate static simulator instruction.
- Ergonomics issues: Do the ergonomic requirements for the instructor in the different facilities cause a conflict?
- Approaches to solutions: Ensure comparison of motion to static simulator is not compromised by ergonomic interventions.

Workplace Level
- Influencing factors: Simulator must be as similar to cockpit as possible, so space for instructor is very limited. Movement of flight causes vibration.
- Ergonomics issues: Are postures too constrained? Could emergency egress be compromised? Can the operator still make fine eye and finger movements without error?
- Approaches to solutions: Assess potential risk of postures and problems with egress, and reduce risk by operator adjustment of layout. Design displays and controls that do not require fine movements. Provide limb supports. Ensure rest breaks are taken outside the work environment.

Workstation Level
- Influencing factors: Instructor must monitor both instructor displays and pilot displays. Instruction takes place in darkness and during flight motion.
- Ergonomics issues: Can the instructor keep sightlines and focus on both displays despite obstructions and distance? Do the darkness, flight motion and isolation of this workplace cause ill effects for these operators?
- Approaches to solutions: Carry out user trials to test that proposed design and layout meet visual task requirements, and build in adjustability where possible. Ensure potential users test their tolerance to the environmental conditions. Ensure operators are adapted to reduce motion sickness.

Operator Level
- Influencing factors: Instructor workstation is used by many different instructors on different occasions.
- Ergonomics issues: Given the constrained space in the work area, can the full range of sizes of people who may be instructors be accommodated?
- Approaches to solutions: Ensure anthropometry applied to this design is valid for this user population, and build in adjustability where possible.
Figure 3 – The Application Format for the Hexagon-Spindle model: an illustrative example, using the exemplar from Figure 2

 


 

Legend for Figure 4

 

t = point in time

x = critical factors on each hexagon

hashed area = common task interactions

 

Figure 4 A Spindle Model

 

Provision of generic facilities can sometimes compromise the specific facilities the user needs to perform a variety of tasks. Thus, moving from one task to another or attempting consecutive tasks in the same workstation may entail different user requirements  – with the result that the workstation or the materials may be ergonomically optimal for this user for one task but not for others. Similarly, moving a peripatetic workstation (such as a laptop) from one setting or context to another can reveal conflicting requirements for optimal use, even for the same user.

 

The same analysis technique, stacking the hexagons, can compare different users doing the same task in the same workstation, and accommodate individual differences, highlighting the need for adjustability. For example, in the analysis in Figure 3, if there were two cockpit instructors using the same workstation at different times, their individual needs would both have to be accommodated by the workstation design. This perspective can also allow us to return to the wider focus at the workplace level. A single workplace supports not only the tasks of the system user, but also the tasks of the supervisor, the cleaning and maintenance tasks of the janitor, etc.: one environment, several users. They have contrasting tasks, and thus demand contrasting user requirements from that environment. The Hexagon-Spindle model would allow us to identify points of conflict that would lead to ergonomic concern (such as increased risk) in such shared environments.

 

Further, the model can examine the same user over time by comparing factors in the central user hexagon, and point to the changes in the interface necessary to accommodate time-dependent factors such as fatigue, aging, or pregnancy.

 

Any of the factors in the model can be held constant, and any others varied, and the effect on the task interaction quality or the costs to the user can be examined.

 

 

Conclusion

 

The Hexagon-Spindle model provides a generic framework that enables a practitioner to take a multi-factorial, holistic approach to the design and evaluation of different forms of task material, aids, devices and environments whilst taking into account the requirements of different types of users and tasks. Its approach will lead to a more integrated understanding of each scenario, and an increased confidence in the likely effectiveness of ergonomic interventions and design changes. This is particularly critical in complex settings with multiple tasks, wide ranges of users and budget-limited facilities, which can lack a holistic ergonomic focus.

 

The flexibility of the spindle provides an opportunity to think outside the rigid box that can sometimes limit other models. A working day for many people now comprises sets of activities, which may or may not occur in the same place, with the same or different colleagues or using the same or different equipment. To create a truly effective working environment requires not just one, but a series of effective environments to be constructed.

 

The Hexagon-Spindle model explicitly focuses on the effectiveness of the interactions for a single task, thus clarifying conflicts between different task needs. It evaluates the whole complex, multifactorial space as defined by the extended concentric rings and the influence of all factors, whether supportive or limiting, on the task interactions. The Spindle allows for the inclusion of an extra dimension in which to explore and resolve conflicts and priorities over time, over space, or between tasks or users. Not only may the same environment have to cater for the needs of varying users, but also for the other work undertaken in the same environment by other stakeholders. Most importantly, the model extends to provide a usable ergonomics analysis and demonstration tool.

 

 

References

 

Benedyk, R., Woodcock, A. and Harder, A., 2006, Towards a new model of Educational Ergonomics. In: Proc. 2nd Conf. on Access and Integration in Schools, (Coventry University, UK)

Benedyk, R., Woodcock, A. and Harder, A., 2009, The Hexagon-Spindle Model for Educational Ergonomics, accepted for publication in Work Journal

Dowell, J. and Long, J., 1998, Conception of the cognitive engineering design problem, Ergonomics, 41.2, 126-139.

Dubois, S. and Benedyk, R., 1991, An ergonomic review of the usability of the postal delivery bag. In: Designing for Everyone, I. Queinnec and F. Daniellou, Eds., (Taylor and Francis, London)

Galer, I., Ed., 1987, Applied Ergonomics Handbook, (Butterworths Scientific, Guildford)

Girling, B., and Birnbaum, R., 1988, An ergonomic approach to training for prevention of musculoskeletal stress at work, Physiotherapy 74.9

Grey, S., Norris, B. and Wilson, J., 1987, Ergonomics in the electronic retail environment (ICL UK Ltd, Slough, UK)

Leamon, T., 1980, The organisation of industrial ergonomics – a human machine model, Applied Ergonomics, 11, 223-226

Naki, M., and Benedyk, R., 2002, Integrating new technology into nursing workstations: can ergonomics reduce risks? In: Contemporary Ergonomics 2002, P. McCabe, Ed., (Taylor and Francis, London), 57-61.

Santoro, D., Snyder, R., Snyder E., 2001, FDA-Speak: A Glossary and Agency Guide, Informa Health Care, ISBN 1574911295, 9781574911299

Singleton, W., 1967, The systems prototype and his design problems, Ergonomics 10.2, 120-124

Singleton, W., 1974, Man-Machine Systems, Penguin.

Shackel, B., 1969, Workstation analysis – turning cartons by hand, Applied Ergonomics 1.1, 45-51

Woodcock, A., Woolner, A. and Benedyk, R., 2009, Applying the Hexagon-Spindle Model for Educational Ergonomics to the design of school environments for children with autistic spectrum disorders, accepted for publication in Work Journal

Wilson, J., 2005, Methods in the understanding of human factors, Chapter 1 in Wilson, J. and Corlett, N., (eds) Evaluation of Human Work, 3rd Edition Taylor and Francis

 


Paper – Obrist et al. Opportunities for Odor: Experience with Smell and Implications for Technology

Opportunities for Odor: Experiences with Smell and Implications for Technology

Marianna Obrist1,2, Alexandre N. Tuch3,4, Kasper Hornbæk4

m.obrist@sussex.ac.uk | a.tuch@unibas.ch | kash@diku.dk

1 Culture Lab, School of Computing Science, Newcastle University, UK

2 School of Engineering and Informatics, University of Sussex, UK

3 Department of Psychology, University of Basel, CH

4 Department of Computer Science, University of Copenhagen, DK

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.

CHI 2014, April 26 – May 01 2014, Toronto, ON, Canada
Copyright 2014 ACM 978-1-4503-2473-1/14/04…$15.00.
http://dx.doi.org/10.1145/2556288.2557008
ABSTRACT

Technologies for capturing and generating smell are emerging, and our ability to engineer such technologies and use them in HCI is rapidly developing. Our understanding of how these technologies match the experiences with smell that people have or want to have is surprisingly limited. We therefore investigated the experience of smell and the emotions that accompany it. We collected stories from 439 participants who described personally memorable smell experiences in an online questionnaire. Based on the stories we developed 10 categories of smell experience. We explored the implications of the categories for smell-enhanced technology design by (a) probing participants to envision technologies that match their smell story and (b) having HCI researchers brainstorm technologies using the categories as design stimuli. We discuss how our findings can benefit research on personal memories, momentary and first time experiences, and wellbeing.

Author Keywords

Smell; smell experiences; odor; olfaction; user experience; smell-enhanced technology; narratives; smell stories; crowdsourcing; design brainstorming; designing for smell.

ACM Classification Keywords

H.5.2 Information interfaces and presentation (e.g., HCI): Miscellaneous.

General Terms

Experimentation, Human Factors, Design.

INTRODUCTION

Smell plays an important role for memories and emotions. Compared to other modalities, memories evoked by smell give stronger feelings of being brought back in time, are more emotionally loaded, are experienced more vividly, feel more pleasant, and are autobiographically older (ranging back to childhood) [15,33]. Smell is incredibly powerful in connecting humans to past events and experiences.

Matsukura et al. [22] recently proposed the Smelling Screen, an olfactory display system that can distribute smells. Earlier work in HCI has proposed other systems that capture and generate smells. For example, Brewster et al. [5] developed a smell-based photo-tagging tool, and Bodnar et al. [4] showed smell to be a less disruptive notification mechanism than visual and auditory modalities. Thus, smell technologies are already emerging.

Our understanding of how these technologies match the experiences with smell that people have or want to have is surprisingly limited. First, while technologies such as those mentioned above are often evaluated, the results mainly concern the perception of smell. The evaluations say little about the general potential of smell technologies for humans or their ability to generate particular experiences. Second, whereas earlier work states that the subjective experience of smell stimulation is crucial for the success of a system (e.g., [5]), we are unaware of work in HCI that studies the subjective experience of smell (though see [17]). Third, several hundred receptors exist for smell and we cannot rely on any primary smells to stimulate a particular experience, as might be imagined for other human senses [2]. Taken together, these points suggest that we can only link smell tenuously to particular experiences or emotions. This limits our ability to design for a spectrum of experiences.

 

The present paper focuses instead on experiences and emotions related to smell and links them to potential technologies. Inspired by work on user experience [14,34], we concentrate on personal memorable smell experiences and their links to emotion. From the focus on experience we developed design guidance for smell-enhanced technologies. The goal is to contribute knowledge on subjective smell experiences and their potential for design.

 

We collected 439 smell stories, that is, descriptions of personal memorable experiences involving smell. We distributed a questionnaire through crowdsourcing, ensuring a large-scale coverage and variety of smell stories. We analyzed the stories and identified 10 main categories and 36 sub-categories. Each category was described with respect to its experiential and emotional characteristics and specific smell qualities. Besides smell stories associated with the past (e.g., memory of loved people, places, life events) we identify stories where smell played an important role in stimulating action, creating expectations, and supporting change (e.g., of behavior, attitude, mood). Smell can sometimes also be invasive and overwhelming, and can affect people’s interaction and communication. Within the categories, we identify common smell qualities and emotions, which support the exploration of opportunities for design. In particular, we discuss the implications for technology based on feedback from participants and on a brainstorming session with HCI researchers working on smell technologies.

The main contributions of this paper are (1) an experience-focused understanding of smell experiences grounded in a large sample of personal smell stories, which allowed us (2) to establish a systematic categorization and description scheme for smell experiences, leading to (3) the identification of technology implications by participants, and (4) the exploration of design potentialities by HCI researchers.

THE HUMAN SENSE OF SMELL

The sense of smell is the most complex and challenging human sense. Hundreds of receptors for smell exist and the mixing of the sensations, in particular with our sense of taste, is immense [2]. The sense of smell is further influenced by other senses such as vision, hearing, and touch; plays a significant role for memory and emotion; and shows strong subjective preferences. Willander and Larsson [33] showed that autobiographical memories triggered by smell were older (mostly from the first decade of life) than memories associated with verbal and visual cues (mostly from early adulthood). Moreover, smell-evoked memories are associated with stronger feelings of being brought back in time, are more emotionally loaded, and are experienced more vividly than memories elicited through other modalities [15,33]. No other sensory system makes such direct and intense contact with the neural substrates of emotion and memory, which may explain why smell-evoked memories are usually emotionally potent [15].

The emotion-eliciting effect of smell is not restricted to the context of autobiographical memories. Smell is particularly useful in inducing mood changes because smells are almost always experienced clearly as either pleasant or unpleasant [8]. For instance, Alaoui-Ismaïli et al. [1] used ‘vanilla’ and ‘menthol’ smells to trigger positive emotions in their participants (mainly happiness and surprise) and ‘methyl methacrylate’ and ‘propionic acid’ to trigger negative emotions (mainly disgust and anger). Interestingly, Herz and Engen [15] pointed out that almost all responses to smell are based on associative learning principles. They argued that only smells learned to be positive or negative can elicit the corresponding hedonic response and that people, therefore, should not have any hedonic preference for novel smells. The only exceptions are smells of irritating quality that strongly stimulate intranasal trigeminal structures. Such smells often indicate toxicity.

While neuroscientists and psychologists have established a detailed understanding of the human sense of smell, insight into the subjective characteristics of smell and related experiences is lacking. The exploration of this subjective layer of smell is often understood as going beyond the interest of these disciplines, but is highly relevant for HCI and user experience research.

SMELL IN HUMAN-COMPUTER INTERACTION

Ten years ago, Kaye [17] encouraged the HCI community to think about particular topics that need to be studied and understood about smell. While some attempts have been made to explore smell during recent years, the potential of smell in HCI remains underexplored.

Most work on smell in HCI focuses on developing and evaluating smell-enhanced technologies. Brewster et al. [5] used smell to elicit memories, and developed a smell-based photo-tagging tool (Olfoto). Bodnar et al. [4] showed smell to be less disruptive as a notification mechanism than visual and auditory modalities. Emsenhuber et al. [9] discussed scent marketing, highlighting the technological challenges for HCI and pervasive technologies. Ranasinghe et al. [24] further investigated the use of smell for digital communication, enabling the sharing of smell over the Internet. More examples of smell-enhanced technologies can be found in multimedia applications [13], games [16], online search interfaces [19], health and wellbeing tools (e.g., http://www.myode.org/), and ambient displays [22].

The exploration of smell-enhanced technologies is mostly limited to development efforts and the evaluation of users’ smell perception of single smell stimuli. The smells used are often arbitrary and not related to experiences. This is because of the lack of knowledge pertaining to the description and classification of smells required for HCI [17]. Kaye points out that “There are specific ones [classification and description schemes] for the perfume, wine and beer industries, for example, but these do not apply to the wide range of smells that we might want to use in a user interface” (p. 653). Thus, previous work has a general and quite simple usage of smell.

 

THE POTENTIAL OF STUDYING SMELL EXPERIENCES

In contrast to the work reported above, the present paper focuses on experiences with smell and links them to potential technologies. We do so through stories of experiences with smell. Stories are increasingly used within user experience research to explore personal memories of past experiences, but also to facilitate communication in a design process [3,34]. Stories are concrete accounts of particular people and events, in specific situations [10], and are more likely to stimulate empathy and inspire design thinking than, for example, scenarios.

STUDY METHOD

We asked a large sample of participants to report smell experiences that were personal and meaningful. We refer to the descriptions of these experiences as smell stories. These stories were captured through a questionnaire described below, which included inspirational examples of smell-enhanced technologies at its end. Based on the examples we asked participants to reflect on their experience and future technologies. The rationale of this approach was to begin from smell experiences that matter to participants, instead of starting from an application or a particular technology.

Questionnaire

We created a web-based questionnaire consisting of six parts. We started with an open question to stimulate the report of a personal memorable smell experience. This was followed by closed questions aiming to elucidate the relevant emotional and experiential characteristics, as well as the smell qualities. Participants could freely choose the story to report. The questionnaire was administered through a crowdsourcing platform to obtain a large sample of smell stories. Crowdsourcing provides valid and reliable data [20] and has been used for capturing user experiences [31].

Part 1: Smell Story

The smell stories were elicited through an initial exercise, where participants were asked to think about situations and experiences where smell played an important role. The aim was to get participants into the right frame of mind and sensitize them to smell. Next, participants were asked to describe one memorable smell experience in as much detail as possible, inspired by the questioning approach used in explicitation interviews [23]. This questioning technique is used to reconstruct a particular moment and aims to place a person back in a situation to relive and recount it. Part 1 of the survey was introduced as follows: “Bring to your mind one particular memorable moment of a personal smell experience. The experience can be negative or positive. Please try to describe this particular smell experience in as much detail as possible. You can use as many sentences as you like, so we can easily understand why this moment is a memorable experience involving smell for you.”

Participants were asked to give a title to their story (reflecting its meaning) and indicate if the experience was positive, negative, or ambivalent (i.e., equally positive and negative). They were also asked to indicate how personally relevant the experience was (from ‘not personally relevant at all’ to ‘very personally relevant’).

Part 2: Context

Part 2 asked participants to give further details of their reported experience via open and closed follow-up questions. There were four questions on the context of the described experience, including the social context (who else was present), the place (based on the categories used by [26]), the location (as an open field), and the time when the reported experience took place (days, weeks, months, or years ago).

Part 3: The smell

Specific questions on the characteristics and qualities of the smell were asked in Part 3. Participants characterized the smell itself using a list of 72 adjectives (i.e., affective and qualitative terms) derived from the ‘Geneva Emotion and Odor Scale’ (GEOS) [7]. Participants could also add descriptions to characterize the smell in an open feedback box. In addition, they rated the smell with respect to its perceived pleasantness, intensity, and familiarity.

Part 4: Experienced emotions

In Part 4 participants had to describe how they felt about the experience as a whole, using a list of affective terms (101 in total). They could go through the list and tick the words that best described their emotions during the experience. The words were derived from Scherer [27]. Participants could also add their own words in a free-text field.

Part 5: Smell technologies

After the participants had selected, thought about, and described a particular smell episode, Part 5 linked their personal experience to technology. The participants were engaged in an envisioning exercise inspired by work on mental time travel [30]. They were shown six inspirational examples of smell technologies, namely: Olfoto: searching and tagging pictures (CHI, [5]); Smelling screen: ambient displays (IEEE, [22]); Digital smell: sharing smell over the Web (ICST, [24]); Scent dress: interactive fabric with smell stimulation (http://www.smartsecondskin.com/); Mobile smell app: iPhone to detect bad breath and other smells (BusinessInsider 01/2013); and Smell-enhanced cinema: Iron Man 3 smell-enhanced screening (Wired 04/2013). These six technologies cover areas of relevance for HCI (mobile, ambient, wearable, personal, and entertainment computing), give realistic examples of smell technologies from research, and include recent, commercial examples.

We asked the participants to imagine any desirable change that future smell technology might make (or not) with respect to their personal smell experience. We asked them the following questions: (1) How could your experience be enhanced? (2) What technology are you thinking about? (3) Why would such a combination of your experience and the technology be desirable, or why would it not? Finally, the participants could express any other ideas for smell technology in a free-text field.

Part 6: Personal background

At the end of the questionnaire, participants answered

questions on their socio-demographic and cultural

background. The goal was to try to identify any

geographical and cultural influences on smell attitudes (as

found by Seo et al. [29]). The participants were also asked

to assess their own smell sensitivity.

All the questions, except for those on demographics, were

mandatory. On average, the survey took 16 minutes to

complete (SD = 7.57 minutes). Participants received US$

1.50 for completing the questionnaire, corresponding to an

hourly salary of 5.63 dollars.

Collected data and participants

A total of 554 participants began the questionnaire. Of

these, 480 completed the questionnaire and answered three

verification questions at its end. These questions required

participants to describe the purpose of the study without

being able to go back and look at the earlier questions or

guidelines. After data cleaning, 41 stories were excluded.

Fake entries (n = 11) were identified immediately, while

repeated entries (n = 10), incomplete stories (unfinished

sentences; n = 6), and incomprehensible stories (which did

not make sense on their own; n = 14), were excluded

iteratively throughout the coding process. This left us with

439 smell stories.

At the time of the study, all 439 participants (52.8% female)

lived in the US; most had grown up in the US (95%). The

participants’ age ranged from 18 to 67 years (M = 31.5, SD

= 10.0). A majority of participants (84%) indicated being

sensitive to smell (rating 4 or higher on a scale from 1 to 5).

Data analysis

The analysis process followed an open and exploratory

coding approach [25]. Two researchers conducted the

qualitative coding process. After coding an initial 25% of

the stories, two more coding rounds (to reach 33% and then

50% of the data) led to the establishment of an agreed

coding scheme. The coding scheme contained 10 main

categories and 36 sub-categories, and a category entitled

‘not meaningful’ for cases where smell did not seem to

have any relevance in the described experience. Based on

this coding scheme, one researcher coded the remaining

50% of the data, and the second researcher coded a subsample

of 25% of that data, resulting in a good inter-coder

agreement (Cohen’s kappa of κ = .68) [12].
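Inter-coder agreement of this kind can be computed directly from the two coders' category labels. The paper does not state which tool was used; the following is a minimal pure-Python sketch of Cohen's kappa, with hypothetical labels standing in for the actual story codes:

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two coders labeling the same set of items."""
    assert len(labels_a) == len(labels_b)
    n = len(labels_a)
    # Observed agreement: proportion of items both coders labeled identically.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected chance agreement from each coder's marginal label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical labels for four stories coded by two researchers.
coder1 = ["past", "past", "stimulating", "stimulating"]
coder2 = ["past", "past", "stimulating", "past"]
print(cohens_kappa(coder1, coder2))  # 0.5
```

The same function applied to the two coders' labels over the shared 25% subsample would yield the reported κ = .68.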

Follow-up design brainstorming

In addition to the feedback from our participants,

we also explored the design value of the smell stories with experts

in the field. We organized a two-hour design brainstorming

session with three HCI researchers, two working on smell

technologies and one working on advanced interface and

hardware design. None of them were from the same

organization as the authors and none were familiar with the

details of the study before the session.

The brainstorming session aimed to share and interpret the

smell stories and followed four stages [11]: (1) prompting,

(2) sharing, (3) selecting, and (4) committing. We selected

36 stories (one representative story for each sub category)

as brainstorming prompts. All 36 stories were printed on A6

sheets (including the story title, the smell story, context

information, and personal background). Each researcher

was asked to read through the stories individually before

discussing them together. They were asked the same

questions as our participants (e.g., how they might imagine

a connection between the experience and technology). Each

researcher chose the most interesting/inspiring stories to

share with the group, then they generated ideas as a group,

and selected three to four ideas to be developed in more

detail. The outcome of the brainstorming session is

presented in the implication section, after the description of

the findings from the smell stories.

FINDINGS ON EXPERIENCES WITH SMELL

In the following sections we present our findings according

to the 10 identified categories. The 439 smell stories were

organized via their primary category, as agreed by the

coders. This categorization does not define a strict line

between the categories, as they are not wholly independent,

but it does enable us to organize the material and generate a

useful dataset for design.

Below we provide for each category a rich description of the particularities of the

stories, excerpts from example stories, and their associated

smell qualities and emotions. Each category also contains

information about the participants’ own rating of the stories

as positive, negative, or ambivalent.

Category 1: Associating the past with a smell

This category is the largest and contains 157 stories. In

these stories, the participants described a past experience in

which a smell was encountered during a special event in life

(e.g., ‘Wedding Day’, ‘New House’), at a special location

(e.g., ‘The Smells of Paris’, ‘Grandma’s House’), or as part

of a tradition (e.g., ‘The Smell of Thanksgiving’ or

‘Christmas Eve’). In these stories the smell was described

as having a strong association to those particular moments

in the past, with no actual smell stimulus in the present. A

particularity of this category is the distinction between

stories describing personal memorable events versus

personal life events (e.g., ‘Disneyland’ versus ‘When my

mother died’). Smells were also associated with personal

achievement/success (e.g., ‘Scent of Published Book’,

‘New Car Smell’) and other important episodes of change,

such as “‘Fresh Start’: I was taking a job in a new city. …. I

took a plane trip across the country and the moment I took

a step off the plane and took a deep breath will always stick

with me. It felt so clean and the air actually smelled fresh

and new” [#488]. Within this story, the qualities of the

smell were for instance described as fresh, energetic, and

invigorating. Some of the emotions experienced at this

moment were courageousness and excitement. Although

this category is dominated by positive experiences (n =

127), negative experiences were also reported (n = 27),

such as ‘Car Crash’.

Category 2: Remembering through a smell

The 40 stories in this category described a recent

experience of a smell, which reminded participants about

past events, people, locations, or specific times in their life.

In contrast to the previous category (where stories describe

a direct link from the recollected past smell to the present;

e.g., the smell of ‘Grandma’s House’), this category

contains stories that describe an indirect link from the

present experienced smell stimulus

to the past event, person or place (e.g., the smell of chocolate cookies as sudden

reminder about grandma). Most stories in this category

contain reminders of childhood described as ‘sweet’,

‘reassuring’ and ‘nostalgic’ with respect to the qualities of

the smell. A sub-set of stories in this category (n = 10) also

highlight the particular power of smells to take a person

back in time. The description of such a flashback caused by

a sudden smell stimulus was described as: “‘My first love’:

It was the next day, when I was walking through the local

Macy’s that I smelled something that threw me back into

that situation, I could feel and see everything that had

happened the day before when I smelled a perfume in the

store” [#630]. Some of the qualities used to describe the

smell were attractive, erotic, and fresh. The experienced

emotions were described as amorous, aroused, excited,

hopeful, and interested. The stories in this category were

mainly positive (n = 37), except for three.

Category 3: Smell perceived as stimulating

The 62 stories in this category described experiences with a

unique, mostly unknown smell (all stories, except one, were

positive). The smells arose from different sources, such as

perfume, food, and nature. A particularity of this category is

the quality of ‘first time’ encounters with a smell across all

origins. One participant described the first time he was at a

beach: “The smell was very different from anything I had

ever experienced before. At first I was kind of grossed out

by the smell, but I grew to love it” [#921]. Another

participant described the smell of a tornado experienced for

the first time: “It was similar to the smell before rain but

had a certain sharpness to it, as if to warn of the incoming

danger. I felt like I knew this smell but at the same time, it

felt foreign to me. It wasn’t a bad smell, it was just slightly

unfamiliar” [#713]. The smell qualities and experienced

emotions were often described with mixed attributes (e.g.,

heavy, irritating, and stimulating; attentive, serious, and

calm), but still rated as positive experiences by participants.

Most of the other stories in this category reported on the

first experiences with food (e.g., ‘Slice of Heaven’) and

nature (e.g., ‘Grass’), and were described as desirable,

fresh, or pure, and provoked feelings of happiness at the

moment they occurred. Although specific memories were

established, including unique new associations (e.g.,

‘Tornado smell’), the stories in this category did not evoke

the kind of strong connections to the past as described in

Category 1 and 2.

Category 4: Smell creating desire for more

This category contains 48 stories (45 positive). Key to these

stories is that the smell grabbed the persons’ attention

unexpectedly. The smell was either associated with food

(triggering appetite), nature (triggering curiosity), or the

scent of other people (triggering attractiveness), which

motivated one to do or get something. In some stories smell

was described in relation to the sensation of newness (e.g.,

“‘The sweet smell of CPU’: …There was the smell of the

cardboard boxes it all arrived in, the smell of new metal–

perhaps it was a combination of these and other things, but

when the building was complete there was just a singular

smell that was unique to a new computer built by my own

hands” [#685]). The qualities of the smell in this story

included beneficial, heavy, sophisticated, energetic, and

pleasantly surprising. The experienced excitement was

expressed through words such as confident, delighted,

enthusiastic, impressed, or triumphant. This category also

contained one story where the smell at a funeral stimulated

reflection in the moment (e.g., ‘The scent of moving on’).

The story was rated as a positive experience and at the same

time the smell was described as clean, penetrating, and

persistent, and the participant indicated that she was afraid,

anxious, discontented, sad, tired, and uncomfortable.

Despite the negative situation described in this story, the

smell gave hope and a desire to live and move on, looking

into the future in contrast to the stories in Category 1 and 2.

Category 5: Smell allowing identification and detection

This category captures the enabling role of smell in certain

situations, such as allowing one to identify or detect a smell

(e.g., “‘Gas leak’: I was cooking something on a gas stove

and went out for a few minutes. When I came back, the fire

was extinguished but the gas was still on. My roommate

was sat at the table doing schoolwork, completely oblivious

to the poisonous gas that was filling the room. I told him to

get the hell up and open the windows and doors” [#951]).

The qualities used to describe the smell were distinguished,

penetrating, dirty, and light. The emotions related to this

situation were described as anxious, conscientious,

confident, and serious. Although the category is rather

small (n = 11), the lesson to be learnt from the shared

stories was the immediacy of the smell, allowing the

participant to act or prevent something.

Category 6: Overwhelming power of smell

This category includes 37 stories where the smell

overwhelmed the person in a positive way (n = 5; e.g., ‘The

Chocolate Factory’) but predominately in a negative way (n

= 30; e.g., ‘The Smell of Death’). In the latter case, people

described the smell as something disturbing, as something

that hit them suddenly on their way or during an activity. A

subset of the stories was recounted as traumatizing, so that

the person could still vividly remember the particular

moment in the past although years have passed and no

recent similar smell stimulation had occurred, unlike in

Category 2 (e.g., “‘Visit to a local county jail’: My guide

warned me ahead of the time that it was going to be a little

foul in there, but nothing could have prepared me for the

obscenely acrid stench of hundreds of men crammed into

every available space of the jail, right down to windowless

storage rooms converted into more cells. … For days

afterwards, I couldn’t shake the smell…. There weren’t

enough showers to take it away. It’s been several years

since then, and my memory of that smell is just as strong as

ever” [#604]). In this category, the qualities of the smell

were described as heavy, penetrating, dirty, or sickening.

Amongst others, the experienced emotions were described

as alarmed, anxious, distressed, frustrated, or

uncomfortable. In contrast to Category 1 and 2 (where the

smell was associated with an event from the past or

triggered a specific memory), Category 6 is about the smell

as such during the experience and not about the memory

associated with this smell. As opposed to the first two

categories, in most stories forgetting – not remembering –

the smell was the key element.

Category 7: Smell invading private and public spaces

All the stories in this category (n = 32) described an

experience where one could not get rid of the smell. The

smell invaded private and public spaces. In contrast to the

previous category, the smell entered the person’s personal

space (the person did not enter the space where the smell

already existed) and took over the space. The loss of control

over the smell was linked to the lingering quality of the

smell (e.g., “‘Don’t want to smell that twice!’: I woke up

one morning suddenly confused and was hit with an odor so

horrible I couldn’t figure out what it was. … It was not like

the smell you get a whiff of when a skunk stinks up the

outdoors” [#530]). In the story the power of the smell,

causing them to leave the house for several hours, was

described with qualities such as foul, nauseous, penetrating,

and persistent. One of the experienced emotions was

surprisingly ‘amused’, however it was overruled by other

emotions including annoyed, anxious, disgusted, taken

aback, and uncomfortable. Despite the glimpse of humor in

some stories, this category mainly contains negative

experiences and underlines the power of the smell with its

sudden and lingering qualities.

Category 8: Social interaction is affected by the smell

Within this category, smell was related to a person’s own

smell or to the smell of others. Smell negatively affected

the interaction among people and their togetherness (e.g.,

“‘Dragon breath teacher’: Once a teacher yelled at me

during class. She got so close up into my face that I could

smell her bad breath. This made the experience much worse

because I wanted to get up and walk away but she was

grabbing me to keep me focused on her while she was

talking” [#744]). The smell qualities were described as

nauseous, penetrating, and sickening, and caused negative

emotions experienced as bitter, distressed, or insulted.

Despite frequent interactions among people, this category

only contains 11 stories. This set of stories (overall negative

experiences, apart from two) contains interesting elements

with respect to a person’s own awareness of body smell and

the overbearing effect of other peoples’ smell on one’s

comfort.

Category 9: Smell changes mood, attitude and behavior

This category contains 23 stories, which underlined the

power of smells to change a person’s mood, attitude, or

behavior. Stories reported the active regulating effect of

smells with respect to mood, but mostly (n = 14) the change

of behavior due to smells (e.g., ‘Accidental vegetarian’ or

‘Saved by the Smell!’). One story showed the active usage

of smells to change one’s mood. A participant had recently

been divorced and reported on the day her husband had

moved out: “‘White Lilac Sheets’: I made the bed with my

lilac sheets and the atmosphere changed. I still remember

that scent and how I felt on that day. I was going to be

okay. The room didn’t look or feel or smell lonely anymore.

It looked and smelled fresh and clean and lovely and a bit

romantic and it was mine” [#526]. The qualities of the

smell were described as fresh, reassuring, and spring-like,

while the experienced emotions were determined, hopeful,

longing, tense, but also worried. Overall, the stories in this

category were reported as mainly positive (n = 12)

experiences, but also as negative (n = 7) and four stories

were rated ambivalent, neither positive nor negative.

Category 10: Smell builds up and changes expectations

This category shows the potential of smell to build up

expectations and to surprise. In the former case (11 stories)

the smell was building up expectations until the actual

contact with the trigger, such as food or a perfume (e.g.,

“‘The Smell of Hungry Anticipation’: I was trying a new

soup for the first time. When it was brought to the table, the

soft smell of rosemary immediately hit my nostrils. …It

complimented the taste of the soup and built anticipation”

[#585]). The smell was described as mouthwatering,

healthy, and pleasantly surprising, and was further related

to emotions such as conscientious, expectant, and relaxed.

In other stories (n = 7), expectations were exceeded to the

extent that they surprised and diverted anticipations (e.g.,

“‘PomVinegar Surprise’: I could smell the pomegranate

and vinegar from about 10 steps away, and it was a very

pungent (though not unpleasant) odor. I almost felt my

nose becoming runny and took out a tissue. When I tasted

the dish, however, the taste wasn’t nearly as sour as I

expected it to be from the smell” [#542]). The smell in this

story was described as distinguished and penetrating, and

was associated with emotions such as attentive and excited.

Key quantitative facts behind the smell stories

While the above-described categories can be used as an

inspiration and as a starting point for exploring design

opportunities for smell in HCI,

our quantitative data provides additional background information. Below, the

key quantitative information across all the collected smell

stories is summarized. The majority of the 439 collected

stories were positively valenced (n = 296), 112 were

negative, and 31 were ambivalent. On average, negative

stories tended to be slightly longer (M = 90 words) than

positive stories (M = 79 words), but the difference was

not statistically significant (U = 14600, p = .063, r = .09).
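The story-length comparison uses a Mann-Whitney U test. As a minimal pure-Python sketch (the word counts below are hypothetical, and in practice a statistics package would supply the p-value), the U statistic can be computed from average ranks over the pooled samples:

```python
def average_ranks(values):
    """1-based ranks for all values, with tied values given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over any run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j + 2) / 2  # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def mann_whitney_u(x, y):
    """U statistic for sample x versus sample y."""
    ranks = average_ranks(list(x) + list(y))
    rank_sum_x = sum(ranks[: len(x)])
    return rank_sum_x - len(x) * (len(x) + 1) / 2

# Hypothetical word counts of negative vs. positive stories.
negative = [95, 88, 102, 75]
positive = [80, 70, 85, 60]
print(mann_whitney_u(negative, positive))  # 14.0
```

With the full samples (112 negative, 296 positive stories) this yields the reported U = 14600.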

Contextual information

Most stories occurred in a context where one or more familiar persons were present (64.2%)

or where participants were alone (21.6%). The presence of

one or more strangers was reported less frequently (8.7%).

With regard to location, most of the experiences happened

at the participant’s or a friend’s home (38.1%) or in a public

building (20.7%). Quite a few participants reported that

their experience took place in the streets or another public

space (14.4%), in a natural setting (7.3%), or at work

(6.4%). The remaining participants (13.2%) indicated other

places (e.g., stranger’s home). On average the reported

experiences occurred 8.7 years ago (SD = 10.3), ranging

from 1 day to 58 years ago.

The qualities of smell

The most frequent smell qualities

reported in positive stories were pleasant (60%), fresh

(42%), sweet (38%), clean (31%), and mouthwatering

(30%). Smells in negative stories were described as

unpleasant (62%), penetrating (55%), heavy (54%), foul

(53%), and nauseous (51%). In ambivalent stories the smell

was perceived as fresh (39%), pleasant (32%), mouthwatering

(32%), attractive (26%), and penetrating (23%).

Experienced emotions

When asked to describe how they felt during their experience, participants used the affective

terms happy (63%), pleased (53%), joyous (42%), delighted

(41%), and excited (39%) in positive stories and

uncomfortable (55%), disgusted (51%), distressed (43%),

miserable (42%), and taken aback (29%) in negative stories.

Ambivalent experiences were most frequently described as

happy (42%), excited (39%), enthusiastic (35%), joyous

(32%), and serene (29%).

An overview of all 10 categories and 36 sub-categories

including qualitative and quantitative information

(including a full example for each sub-category, used in the

design brainstorming session) is provided as supplementary

material. All smell stories and related qualities of smell,

experienced emotions, and context information, are also

available at www.multisensory.info for further exploration.

IMPLICATIONS FOR TECHNOLOGY

This study focused first on experiences and second on the

implications for technology. This section turns to

technology. Below we summarize the feedback from the

participants on how technology would fit with their

experience, and describe input from a brainstorming session

with HCI researchers working on smell technologies and

advanced interface and hardware design, based on a sub-set

of the smell stories (one from each sub-category).

Ideas for technology from participants

Below we summarize the six areas and ideas for desirable

future smell technologies mentioned by participants in Part

5 of the questionnaire:

(1) To share smells with family/friends: allow one to

participate in a family event through remote smelling; share

smells of special moments such as the smell of a newborn

baby with distant relatives; share smells with people who

they know would appreciate it (such as through social

media); allow capturing and sharing of smells to create a

common understanding where you can’t explain it.

Participants also desired to be able to design and share new

smells from a personal database and create a personal smell

box/bottle.

(2) To support decision making: use smells for a quick

judgment in online shopping (like/dislike can be determined

easily); create smell profiles about holiday places and travel

destinations; smell match maker in dating websites for

allowing a better decision making about going on a date or

not with a person (smell enhanced profiles).

(3) To regulate mood actively/passively: smell to relive

good experiences whenever you want to get in a better

mood; to calm yourself down in stressful moments such as

in a traffic jam or at work; as a reminder of past memories

you would have forgotten otherwise but that can cheer you

up when you feel depressed and life seems too difficult.

(4) To combine with other technologies and activities:

integrate smell into radio; combine smell with music such

as with ‘SoundHound’ or ‘Shazam’ apps; smell-enhanced

advertisement on TV (for food channels); enhance visits of

concerts, theater and performances with smell; allow underwater

smelling when diving.

(5) To combine with everyday objects: enhance wristband

with smell for keeping a preferred perfume lingering; have

special glasses to see and smell the beach; smell-enhanced

jewelry and clothes. One participant imagined her wedding

ring enhanced with the smell of that day.

(6) To make oneself and others aware about body smell: to

avoid embarrassing moments; provide invisible cues to a

person about her/his smell level; quick smell check after

sporting activities.

The first idea matches the experiences in Categories 1 and

2, where particular events/moments in life are associated

with a smell. The desire for capturing and sharing these

experiences enhanced with smell becomes prevalent and

suggests design implications for real-time smell-enhanced

technologies (e.g., mobile phone, photo or video camera).

The second idea can be linked to Category 5, allowing

people to identify and detect a smell. Moreover, smells are

seen as very powerful for supporting quick decision-making

(e.g., smell-enhanced website navigation and searching).

The third idea shows a direct link to Category 9 and the

potential of smell to change mood. Interestingly, participants

whose story was in Category 1 or 2 wished for the

possibility to capture pleasant smells, for instance from

their childhood, and have them released in the present. This

desire for smell-enhanced technologies or products is also

apparent in the fourth and fifth ideas, where technology,

objects, or even activities can be enhanced through smells

from the past, or actual smells sourced through nature (e.g.,

diving in the ocean). Finally, the sixth idea is linked to

Category 8, aiming to avoid embarrassing moments in

social interactions.

Participants also expressed concerns about future smell

technologies. They were concerned about the possible

misuse of smell when sharing it through the Internet or

mobile phones (e.g., teasing people with smells, how to

trust a smell message), and about the potential manipulation

through smell (e.g., TV ads, online shopping). Some

participants were also afraid to get sick, catch an allergy, or

become addicted if they are exposed to chemical

stimulations from technology. Finally, one participant

raised the question of copyright and ownership of smells

(e.g., ‘can I share others’ smells?’).

Ideas for technology from HCI researchers

Below we outline the ideas that emerged from the two-hour

brainstorming session prompted by 36 smell stories (one

from each sub-category). Four groups of design ideas

emerged from this session and are described below:

(1) Smell-enhanced performance regulator: a technology

stimulating smell in order to structure the day, taking

activities and moods into account, and combining different

smells to avoid habituation (training and evaluation phase

needed). Smell as a reminder to take a break or as

motivation to keep going a bit longer to meet the deadline

[inspired by #526 ‘White Lilac Sheets’, Category 9].

(2) Autonomous smell agent: a technology spreading

ambient cues (e.g., a robot) to guide someone to a certain

place, to build up expectations, and motivate action. Smell

trails in the environment can also make hidden information

accessible, for instance, before entering a room (e.g., smell

warning: tense working atmosphere) [inspired by #801

‘Don’t forget to check your gas stove before you leave the

house’, Category 5].

(3) Reminder alert with smell: a technology to remind us

about important events, birthdays, and appointments.

Although we have reminders on mobile phones and

computers, they are often ignored, snoozed or in the worst

case forgotten about. A smell can provide a pleasant

reminder to say ‘it is time to call your mom’ by presenting

the smell of your favorite dish your mom makes for you.

On the other hand, if more critical, bad smells can be very

powerful as a reminder and are not easily ignored [inspired

by #530 ‘Don’t want to smell that twice’, Category 7].

(4) Smell-enhanced storytelling: a technology that

stimulates storytelling around a digitized version of an

incense stick. A stick was imagined with different layers,

representing smells related to a loved one who passed

away. When friends or family members come together, for

instance at an anniversary year, they can add new smells to

be shared in the group and thus trigger new stories about

the deceased. It is as if looking through a photo album,

telling the stories from the past, and using the smells as

anchor points for keeping the memory alive [inspired by

#672 ‘The Scent of Moving On’, Category 4].

We saw that the smell stories, even if they only provided

limited information (story, story title, context, gender, and

age), triggered vivid discussions, created empathy, and

stimulated the sharing of personal smell experiences. The

four ideas described above provide a starting point for

exploring smell in HCI. The categorization along with

additional background information on smell qualities and

experienced emotions (see supplementary material) can

inspire further explorations of smell technologies.

DISCUSSION

Our findings about experiences with smell in combination

with the ideas for technologies just presented show several

design opportunities for smell. Below we do not provide

solutions for smell-enhanced technology designs; rather, we

illustrate where our findings might be relevant to stimulate

novel designs in existing areas of interest within HCI. We

see three anchor points for smell-enhanced technology.

First, the smell stories in Categories 1 and 2 suggest design

opportunities for remembering and recalling the past. Our

findings might enrich ongoing research on the design for

personal memories. Apart from enhancing research

supporting the capturing and sharing of personal

experiences (e.g., in family relationships [18]) through

smell, our findings support research to support people who

are living with memory loss (e.g., patients with dementia

[32]), where smell can play an important role in

remembering the past. An increasing body of research also

explores the potential of digital technologies to support our

memory in everyday tasks (e.g., reminder systems), to

recall past events and experiences (e.g., life-logging tools),

to design end-of-life technologies allowing reminiscence of

passed away people [21], and to record and reproduce

smells [35]. All this research shows the potential of smell to

enrich experiences, for instance by enhancing personal

memories such as photos or videos with smell. Based on

their study of a smell-based photo-tagging system, Brewster

et al. [5] stated that participants asked for personal smells to

be added. The information on how to classify such smells

was still missing; the present analysis allows us to relate

smell qualities to particular types of experiences.

When designing with smell, as for any memory-based

technology, access to such memories has to be considered

carefully to preserve their uniqueness. One participant

wrote: “I could see it being desirable in that it would allow

me to experience the scent whenever I want, but it’s kind of

a two edged sword in that experiencing that scent time and

time again will make it common place” [#513]. The power

of smell might not persist if always available; thus,

participants suggested either restricting the access and

retrieval of smells to special times (e.g., at ‘grandma’s

birthday’) or linking them to a certain social setting (e.g.,

smelling only in company with ‘your sister’). This way the

uniqueness of the smell can be preserved.

Second, the stories in Categories 3 to 8 as well as 10 draw

the attention away from past memories and suggest design

opportunities for the present moment. Designing for in-situ

stimulation, the ability to capture and share smells in the

moment, and the capability to mask and neutralize bad

smells creates a vast space for smell interaction design. One

suggestion made by participants was the combination of

smell and social media, such as “An app that would allow

me to store smells, send smells, or attach smells to a picture

that I could post on social media or Instagram or

something”. This supports existing research on the delivery

of smells through the Internet [24]. We draw attention to

three additional design directions concerned with (1) first

time experiences with smell, (2) the power of smell to build

up expectations, and (3) the potential of designing for bad

smells. User experience designers put a lot of effort into

designing ‘out-of-the-box’ and first time experiences to

create positive experiences [13]. Our categorization not

only provides designers with rich descriptions of such first

time experiences, but also describes the related qualities of

smells in combination with descriptors of the experienced

emotions. This can be used to stimulate positive smell-enhanced

experiences with technology, build up

expectations, and create anticipation as studied within

experience research [33]. Typically this anticipation stage is

influenced by a variety of aspects (e.g., advertisements,

product descriptions, accounts from existing customers).

Smell stories in Category 10 provide evidence for the

power of smell to build up expectations, create surprise by

exceeding anticipated experiences, and enhance momentary

experiences through capturing and sharing pleasant smells.

Categories 6 to 8 contain stories about bad smells, which

participants wished to neutralize or mask in order to change the

experience from something negative to something positive.

While the idea of counterbalancing bad smells seems

desirable, the design brainstorming session stimulated a

discussion on the use of bad smells in design, particularly

as part of the design idea (3) Reminder alert with smell.

Designing for bad smells might not seem appropriate at

first, but through intensity manipulation it can open up an

interesting space for design. Similar to a snooze function,

which slowly increases volume, smell stimulations could be

added to certain events (e.g., reminder for mother’s

birthday). Starting with a pleasant smell, it could turn

slowly into something unpleasant if the user did not act.
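The escalating reminder described above can be sketched as a simple intensity schedule. This is a minimal illustration under our own assumptions: a device with one pleasant and one unpleasant scent channel, each driven by a level in [0, 1]; all names are ours.

```python
def reminder_scent_levels(minutes_unacknowledged, ramp_minutes=30.0):
    """Cross-fade from a pleasant to an unpleasant scent channel the
    longer a reminder goes unacknowledged, analogous to a snooze alarm
    that slowly increases its volume.

    Returns (pleasant_level, unpleasant_level), each in [0, 1].
    """
    # Fraction of the ramp elapsed, clamped to [0, 1].
    t = min(max(minutes_unacknowledged / ramp_minutes, 0.0), 1.0)
    return (1.0 - t, t)

# Fully pleasant at first, an even blend halfway, fully unpleasant
# once the ramp has elapsed.
print(reminder_scent_levels(0))   # -> (1.0, 0.0)
print(reminder_scent_levels(15))  # -> (0.5, 0.5)
print(reminder_scent_levels(45))  # -> (0.0, 1.0)
```

Acknowledging the reminder would simply reset `minutes_unacknowledged` to zero, returning the device to the pleasant scent.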

Category 8 also contained stories recounting social

experiences with smell, where the smell of a person or of

other people caused embarrassment or discomfort. Despite

the importance and frequency of social contact in everyday

life, few such stories were shared. They might not seem

meaningful enough to be memorable or to be shared. Yet,

this set of stories holds potential for personal and social

smell-enhanced awareness systems, as well as for wearable

technologies, and smart fabrics. Technology could, as stated

by a participant, “…make the people in those settings feel

more comfortable if I interact with them… My holding my

nose could be insulting and impede communication.”

Third, the smell stories in Category 9 suggest design

opportunities reaching out to the future through positive

stimulation, with potential relevance for wellbeing and

behavior change research in HCI. The stories shared in this

category were about the power of smell to regulate mood,

change attitudes, and behavior. Designing for smell could

be combined with behavior change research in HCI (e.g.,

tools to support healthy nutrition and diet), and thought of

in relation to positive psychology and research on

wellbeing. Seligman and Csikszentmihalyi [28] suggested

that happiness can be learned and cultivated and that

positive psychology can help change how a person feels.

They point to the power of positive emotions for our health,

happiness and wholeness. We would suggest that our

findings add an understanding of the positive emotional

impact of smells, which might inform a valuable research strategy

in wellbeing research (e.g., for regulating mood).

Smell can have a regulating impact on a person’s mood and, as

explicitly reported in one case, can be deliberately used to that end

(‘White Lilac Sheets’). The participant wrote, “I guess the

experience could have been enhanced by some kind of

mood moderator. Something that would have sensed my

sadness and filled the room or house with comforting

scents” [#526]. The participant pointed out that technology

would not change the situation to something more positive,

as it just was not a happy time at all, but that it could

support the sad moments in this transitional period of life.

Limitations

We would like to acknowledge three limitations of this

work. First, by using Amazon Mechanical Turk for

recruiting and asking participants to describe personal

relevant experiences, we were limited to the US and do not

know to what extent the smell stories are representative of

more general experiences with smell. We are aware of

cultural and geographical differences (as described by Seo

et al. [29]), which require further studies with a more

diverse group of participants. Second, collecting narratives

by means of an online questionnaire has an influence on

how people narrate their experience and deprives us of the

advantages of an interview situation, in which we could engage

in a dialogue with the participant to explore the meaning

behind the shared experience in more depth, as described by

Bruner [6]. We tried to collect information beyond the

initial trigger of the shared smell stories in order to allow

the establishment of meaningful categorizations and the

creation of a basic understanding of experiences with smell

in HCI.

Third, our approach provides an overview of the

emerging field of smell-enhanced technologies. Future

studies will, we hope, lead to in-depth research into

experiences with smell inspired by our identified categories.

 

CONCLUSIONS

Although interactive technologies increasingly disappear

into our environment (in ubiquitous and mobile computing)

and have become essential in everyday life, the senses used to

interact with them remain limited.

We have discussed the opportunities for smell in HCI based on an

analysis of 439 smell stories. We identified 10 primary

categories for stories about experiences with smell, which

help frame the potential implications for technology.

Implications were drawn from feedback from our

participants, who envisaged desired connections between their

own personal experiences and future smell technology. The

implications for designing for smell were further enriched

through ideas from an initial brainstorming session with

HCI researchers. Our findings provide guidance for

smell-enhanced technology design, not only giving a

categorization of the role of smell in personal experiences,

but also extracting the qualities of smell across the smell

stories and the experienced emotions. We argue that this

research enriches existing technology-driven research on

smell in HCI and provides a fruitful starting point when

designing for experiences with smell.

ACKNOWLEDGMENTS

This work is supported by the Marie Curie IEF Action of

the European Union (FP7-PEOPLE-2010-IEF) and the

Swiss National Science Foundation (PBBSP1 144196). We

thank our participants and especially Annika Haas for her

valuable support in designing the supplementary material.

REFERENCES

1. Alaoui-Ismaïli, O., Robin, O., Rada, H., Dittmar, A.,

Vernet-Maury, E. Basic Emotions Evoked by Odorants:

Comparison Between Autonomic Responses and Self-

Evaluation. Physiology & Beh. 62, 4 (1997), 713–720.

2. Bakalar, N. Sensory science: Partners in flavour. Nature

486, 7403 (2012), S4–S5.

3. Baumeister, R.F., Newman, L.S. How Stories Make

Sense of Personal Experiences: Motives that Shape

Autobiographical Narratives. Personality and Social

Psychology Bulletin 20, 6 (1994), 676–690.

4. Bodnar, A., Corbett, R., Nekrasovski, D. AROMA:

ambient awareness through olfaction in a messaging

application. Proc. ICMI, (2004), 183–190.

5. Brewster, S., McGookin, D., Miller, C. Olfoto:

designing a smell-based interaction. Proc. CHI (2006),

653–662.

6. Bruner, J. Life as narrative. Social Research: An

International Quarterly 71, 3 (2004), 691–710.

7. Chrea, C., Grandjean, D., Delplanque, S., Cayeux, I., Le

Calvé, B., Aymard, L., Velazco, MI., Sander, D.,

Scherer, KR. Mapping the Semantic Space for the

Subjective Experience of Emotional Responses to

Odors. Chem. Senses 34, 1 (2009), 49–62.

8. Ehrlichman, H., Bastone, L. The use of odour in the

study of emotion. In S. Van and G.H. Dodd, (Eds.),

Fragrance: The psychology and biology of perfume.

Elsevier, (1992), 143–159.

9. Emsenhuber, B. Scent Marketing: Making Olfactory

Advertising Pervasive. In J. Müller, F. Alt and D.

Michelis (Eds.), Pervasive Advertising. Springer,

(2011), 343–360.

10. Erickson, T. Design as storytelling. Interactions 3, 4

(1996), 30–35.

11. Faste, H., Rachmel, N., Essary, R., Sheehan, E.

Brainstorm, Chainstorm, Cheatstorm, Tweetstorm: new

ideation strategies for distributed HCI design. Proc. CHI

(2013), 1343–1352.

12. Fleiss, J.L., Levin, B., Paik, M.C. Statistical Methods

for Rates and Proportions. John Wiley & Sons, (2013).

13. Ghinea, G., Ademoye, O. The sweet smell of success:

Enhancing multimedia applications with olfaction.

TOMCCAP 8, 1 (2012), 2.

14. Hassenzahl, M. Experience Design: Technology for All

the Right Reasons. Syn. Lect. on HCI 3, 1 (2010), 1–95.

15. Herz, R.S., Engen, T. Odor memory: review and

analysis. Psy. Bulletin & Review. 3, 3 (1996), 300–313.

16. Jonsson, F., Verhagen, H. Senses working overtime: on

sensuous experiences and public computer game play.

Proc. ACE (2011), 56:1–56:8.

17. Kaye, J. “Jofish.” Making Scents: aromatic output for

HCI. Interactions 11, 1 (2004), 48–61.

18. Kazakos, K., Howard, S., Vetere, F. Revisiting the

relationship between reunion and technology-mediated

separation in periodically transitioning families. Proc.

CSCW (2013), 1157–1168.

19. Loumakis, F., Stumpf, S., Grayson, D. This image

smells good: effects of image information scent in

search engine results pages. CIKM (2011), 475–484.

20. Mason, W., Suri, S. Conducting behavioral research on

Amazon’s Mechanical Turk. Behavior Research

Methods 44, 1 (2012), 1–23.

21. Massimi, M., Moncur, W., Odom, W., Banks, R., Kirk,

D. Memento mori: technology design for the end of life.

Proc. CHI EA (2012), 2759–2762.

22. Matsukura, H., Yoneda, T., Ishida, H. Smelling Screen:

Development and Evaluation of an Olfactory Display

System for Presenting a Virtual Odor Source. IEEE

TVCG 19, 4 (2013), 606–615.

23. Petitmengin, C. Describing one’s subjective experience

in the second person: An interview method for the

science of consciousness. Phenomenology and the

Cognitive Sciences 5, 3-4 (2006), 229–269.

24. Ranasinghe, N., Karunanayaka, K., Cheok, A.D.,

Fernando, O.N.N., Nii, H., Gopalakrishnakone, P.

Digital taste and smell communication. Proc. BodyNets,

(2011), 78–84.

25. Saldana, J. The Coding Manual for Qualitative

Researchers. SAGE (2012).

26. Scherer, K.R. Appraisal considered as a process of

multilevel sequential checking. Appraisal processes in

emotion: Theory, methods, research 92, (2001), 120.

27. Scherer, K.R. What are emotions? And how can they be

measured? Social Science Inf. 44, 4 (2005), 695–729.

28. Seligman, M.E.P., Csikszentmihalyi, M. Positive

psychology: An intro. Am. Psy. 55, 1 (2000), 5–14.

29. Seo, H.-S., Guarneros, M., Hudson, R., et al. Attitudes

toward Olfaction: A Cross-regional Study. Chem.

Senses 36, 2 (2011), 177–187.

30. Suddendorf, T., Corballis, M.C. The evolution of

foresight: What is mental time travel, and is it unique to

humans? Beh. & Brain Sciences 30, 3 (2007), 299–313.

31. Tuch, A.N., Trusell, R., Hornbæk, K. Analyzing users’

narratives to understand experience with interactive

products. Proc. CHI (2013), 2079–2088.

32. Wallace, J., Wright, P.C., McCarthy, J., Green, D.P.,

Thomas, J., Olivier, P. A design-led inquiry into

personhood in dementia. Proc. CHI (2013), 2617–2626.

33. Willander, J., Larsson, M. Smell your way back to

childhood: Autobiographical odor memory. Psy.

Bulletin & Review 13, 2 (2006), 240–244.

34. Wright, P., McCarthy, J. Experience-Centered Design:

Designers, Users, and Communities in Dialogue. Syn.

Lect. on HCI 3, 1 (2010), 1–123.

35. Wyszynski, B., Yamanaka, T., Nakamoto, T. Recording

and reproducing citrus flavors using odor recorder.

Sensors & Actuators B: Chemical 106, 1 (2005), 388–393.


Dowell and Long (1989)

 

Towards a Conception for an Engineering Discipline of Human Factors

John Dowell and John Long

Ergonomics Unit, University College London,

26, Bedford Way, London. WC1H 0AP.

abstract

This paper concerns one possible response of Human Factors to the need for better user-interactions of computer-based systems. The paper is in two parts. Part I examines the potential for Human Factors to formulate engineering principles. A basic pre-requisite for realising that potential is a conception of the general design problem addressed by Human Factors. The problem is expressed informally as: ‘to design human interactions with computers for effective working’. A conception would provide the set of related concepts which both expressed the general design problem more formally, and which might be embodied in engineering principles. Part II of the paper proposes such a conception and illustrates its concepts. It is offered as an initial and speculative step towards a conception for an engineering discipline of Human Factors.

In P. Barber and J. Laws (eds.) Special Issue on Cognitive Ergonomics, Ergonomics, 1989, vol. 32, no. 11, pp. 1513-1535.

Part I. Requirement for Human Factors as an Engineering Discipline of Human-Computer Interaction

1.1. Introduction

1.2. Characterisation of the Human Factors Discipline

1.3. State of the Human Factors Art

1.4. Human Factors Engineering Principles

1.5. The Requirement for an Engineering Conception for Human Factors

1.1. Introduction

Advances in computer technology continue to raise expectations for the effectiveness of its applications. No longer is it sufficient for computer-based systems simply ‘to work’, but rather, their contribution to the success of the organisations utilising them is now under scrutiny (Didner, 1988). Consequently, views of organisational effectiveness must be extended to take account of the (often unacceptable) demands made on people interacting with computers to perform work, and the needs of those people. Any technical support for such views must be similarly extended (Cooley, 1980).

With recognition of the importance of ‘human-computer interactions’ as a determinant of effectiveness (Long, Hammond, Barnard, and Morton, 1983), Cognitive Ergonomics is emerging as a new and specialist activity of Ergonomics or Human Factors (HF). Throughout this paper, HF is to be understood as a discipline which includes Cognitive Ergonomics, but only as it addresses human-computer interactions. This usage is contrasted with HF as a discipline which more generally addresses human-machine interactions.

HF seeks to support the development of more effective computer-based systems. However, it has yet to prove itself in this respect, and moreover, the adequacy of the HF response to the need for better human-computer interactions is of concern. For it continues to be the case that interactions result from relatively ad hoc design activities to which may be attributed, at least in part, the frequent ineffectiveness of systems (Thimbleby, 1984).

This paper is concerned to develop one possible response of HF to the need for better human-computer interactions. It is in two parts. Part I examines the potential for HF to formulate HF engineering principles for supporting its better response. Pre-requisite to the realisation of that potential, it concludes, is a conception of the general design problem it addresses. Part II of the paper is a proposal for such a conception.

The structure of the paper is as follows. Part I first presents a characterisation of HF (Section 1.2) with regard to: the general design problem it addresses; its practices providing solutions to that problem; and its knowledge supporting those practices. The characterisation identifies the relations of HF with Software Engineering (SE) and with the super-ordinate discipline of Human-Computer Interaction (HCI). The characterisation supports both the assessment of contemporary HF and the arguments for the requirement of an engineering HF discipline.

Assessment of contemporary HF (Section 1.3.) concludes that its practices are predominantly those of a craft. Shortcomings of those practices are exposed which indict the absence of support from appropriate formal discipline knowledge. This absence prompts the question as to what might be the

formal knowledge which HF could develop, and what might be the process of its formulation. By comparing the HF general design problem with other, better understood, general design problems, and by identifying the formal knowledge possessed by the corresponding disciplines, the potential for HF engineering principles is suggested (Section 1.4.).

However, a pre-requisite for the formulation of any engineering principle is a conception. A conception is a unitary (and consensus) view of a general design problem; its power lies in the coherence and completeness of its definition of the concepts which can express that problem. Engineering principles are articulated in terms of those concepts. Hence, the requirement for a conception for the HF discipline is concluded (Section 1.5.).

If HF is to be a discipline of the superordinate discipline of HCI, then the origin of a ‘conception for HF’ needs to be in a conception for the discipline of HCI itself. A conception (at least in form) as might be assumed by an engineering HCI discipline has been previously proposed (Dowell and Long, 1988a). It supports the conception for HF as an engineering discipline of HCI presented in Part II.

1.2. Characterisation of the Human Factors Discipline

HF seeks to support systems development through the systematic and reasoned design of human-computer interactions. As an endeavour, however, HF is still in its infancy, seeking to establish its identity and its proper contribution to systems development. For example, there is little consensus on how the role of HF in systems development is, or should be, configured with the role of SE (Walsh, Lim, Long, and Carver, 1988). A characterisation of the HF discipline is needed to clarify our understanding of both its current form and any conceivable future form. A framework supporting such a characterisation is summarised below (following Long and Dowell, 1989).

Most definitions of disciplines assume three primary characteristics: a general problem; practices, providing solutions to that problem; and knowledge, supporting those practices. This characterisation presupposes classes of general problem corresponding with types of discipline. For example, one class of general problem is that of the general design problem1 and includes the design of artefacts (of bridges, for example) and the design of ‘states of the world’ (of public administration, for example). Engineering and craft disciplines address general design problems.

Further consideration also suggests that any general problem has the necessary property of a scope, delimiting the province of concern of the associated discipline. Hence disciplines may also be distinguished from each other; for example, the engineering disciplines of Electrical and Mechanical Engineering are distinguished by their respective scopes of electrical and mechanical artefacts. So, knowledge possessed by Electrical Engineering supports its practices solving the general design problem of designing electrical artefacts (for example, Kirchhoff’s Laws would support the analysis of branch currents for a given network design for an amplifier’s power supply).
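As a concrete illustration of the kind of formal supporting knowledge meant here, Kirchhoff's current law turns a resistor network into a solvable linear system. The network and component values below are our own arbitrary example, not one taken from the paper.

```python
# Hypothetical ladder network: a 10 V source feeds node 1 through R1,
# R2 joins node 1 to node 2, and R3 ties node 2 to ground.
V, R1, R2, R3 = 10.0, 100.0, 200.0, 300.0

# Kirchhoff's current law at each node yields G * v = i, where G is the
# conductance matrix and i the currents injected by the source.
g11, g12 = 1/R1 + 1/R2, -1/R2    # KCL at node 1
g21, g22 = -1/R2, 1/R2 + 1/R3    # KCL at node 2
i1, i2 = V/R1, 0.0

# Solve the 2x2 system by Cramer's rule.
det = g11*g22 - g12*g21
v1 = (i1*g22 - g12*i2) / det     # voltage at node 1
v2 = (g11*i2 - i1*g21) / det     # voltage at node 2

branch_current = (v1 - v2) / R2  # current through the R2 branch
print(v1, v2, branch_current)
```

Because R1, R2 and R3 are here in series, the branch current agrees with the hand calculation V / (R1 + R2 + R3) = 10/600 A, the kind of determinate, principled prediction the authors contrast with craft practice.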

Although rudimentary, this framework can be used to provide a characterisation of the HF discipline. It also allows a distinction to be made between the disciplines of HF and SE. First, however, it is required that the super-ordinate discipline of HCI be postulated. Thus, HCI is a discipline addressing a general design problem expressed informally as:

‘to design human-computer interactions for effective working’.

The scope of the HCI general design problem includes: humans, both as individuals, as groups, and as social organisations; computers, both as programmable machines, stand-alone and networked, and as functionally embedded devices within machines; and work, both with regard to individuals and the organisations in which it occurs (Long, 1989). For example, the general design problem of HCI

1 They are to be distinguished from the class of general scientific problem of the explanation and prediction of phenomena.

includes the problems of designing the effective use of navigation systems by aircrew on flight-decks, and the effective use of wordprocessors by secretaries in offices.

The general design problem of HCI can be decomposed into two general design problems, each having a particular scope. Whilst subsumed within the general design problem of HCI, these two general design problems are expressed informally as:

‘to design human interactions with computers for effective working’; and

‘to design computer interactions with humans for effective working’.

Each general design problem can be associated with a different discipline of the superordinate discipline of HCI. HF addresses the former, SE addresses the latter. With different – though complementary – aims, both disciplines address the design of human-computer interactions for effective working. The HF discipline concerns the physical and mental aspects of the human interacting with the computer. The SE discipline concerns the physical and software aspects of the computer interacting with the human.

The practices of HF and SE are the activities providing solutions to their respective general design problems and are supported by their respective discipline knowledge. Figure 1 shows schematically this characterisation of HF as a sub-discipline of HCI (following Long and Dowell, 1989). The following section employs the characterisation to evaluate contemporary HF.

1.3. State of the Human Factors Art

It would be difficult to reject the claim that the contemporary HF discipline has the character of a craft (at times even of a technocratic art). Its practices can justifiably be described as a highly refined form of design by ‘trial and error’ (Long and Dowell, 1989). Characteristic of a craft, the execution and success of its practices in systems development depends principally on the expertise, guided intuition and accumulated experience which the practitioner brings to bear on the design problem1.

It is also claimed that HF will always be a craft: that ultimately only the mind itself has the capability for reasoning about mental states, and for solving the under-specified and complex problem of designing user-interactions (see Carey, 1989); that only the designer’s mind can usefully infer the motivations underlying purposeful human behaviour, or make subjective assessments of the elegance or aesthetics of a computer interface (Bornat and Thimbleby, 1989).

The dogma of HF as necessarily a craft whose knowledge may only be the accrued experience of its practitioners, is nowhere presented rationally. Notions of the indeterminism, or the un-predictability of human behaviour are raised simply as a gesture. Since the dogma has support, it needs to be challenged to establish the extent to which it is correct, or to which it compels a misguided and counter-productive doctrine (see also, Carroll and Campbell, 1986).

Current HF practices exhibit four primary deficiencies which prompt the need to identify alternative forms for HF. First, HF practices are in general poorly integrated into systems development practices, nullifying the influence they might otherwise exert. Developers make implicit and explicit decisions with implications for user-interactions throughout the development process, typically without involving HF specialists. At an early stage of design, HF may offer only advice – advice which may all too easily be ignored and so not implemented. Its main contribution to the development of user-interactive systems is the evaluations it provides. Yet these are too often relegated to the closing stages of development programmes, where they can only suggest minor enhancements to completed designs because of the prohibitive costs of even modest re-implementations (Walsh et al., 1988).

Second, HF practices have a suspect efficacy. Their contribution to improving product quality in any instance remains highly variable. Because there is no guarantee that experience of one development programme is appropriate or complete in its recruitment to another, re-application of that experience cannot be assured of repeated success (Long and Dowell, 1989).

Third, HF practices are inefficient. Each development of a system requires the solving of new problems by implementation then testing. There is no formal structure within which experience accumulated in the successful development of previous systems can be recruited to support solutions to the new problems, except through the memory and intuitions of the designer. These may not be shared by others, except indirectly (for example, through the formulation of heuristics), and so experience may be lost and may have to be re-acquired (Long and Dowell, 1989).

1 The claimed craft status of HF practice remains unaffected by the counterclaim that science and, in particular, psychology, offers guidance to the designer. The guidance may be direct – by the designer’s familiarity with psychological theory and practice, or may be indirect by means of guidelines derived from psychological findings. In both cases, the guidance can offer only advice which must be implemented then tested to assess its effectiveness. Since the general scientific problem is the explanation and prediction of phenomena, and not the design of artefacts, the guidance cannot be directly embodied in design specifications which offer a guarantee with respect to the effectiveness of the implemented design. It is not being claimed here that the application of psychology directly or indirectly cannot contribute to better practice or to better designs, only that a practice supported in such a manner remains a craft, because its practice is by implementation then test, that is, by trial and error (see also Long and Dowell, 1989).

Fourth, there are insufficient signs of systematic and intentional progress which will alleviate the three deficiencies of HF practices cited above. The lack of progress is particularly noticeable when HF is compared with the similarly nascent discipline of SE (Gries, 1981; Morgan, Shorter and Tainsh, 1988).

These four deficiencies are endemic to the craft nature of contemporary HF practice. They indict the tacit HF discipline knowledge consisting of accumulated experience embodied in procedures, even where that experience has been influenced by guidance offered by the science of psychology (see earlier footnote). Because the knowledge is tacit (i.e., implicit or informal), it cannot be operationalised, and hence the role of HF in systems development cannot be planned as would be necessary for the proper integration of the knowledge. Without being operationalised, its knowledge cannot be tested, and so the efficacy of the practices it supports cannot be guaranteed. Without being tested, its knowledge cannot be generalised for new applications and so the practices it can support will be inefficient. Without being operationalised, testable, and general, the knowledge cannot be developed in any structured way as required for supporting the systematic and intentional progress of the HF discipline.

It would be incorrect to assume the current absence of formality of HF knowledge to be a necessary response to the indeterminism of human behaviour. Both tacit discipline knowledge and ‘trial and error’ practices may simply be symptomatic of the early stage of development of the discipline1. The extent to which human behaviour is deterministic for the purposes of designing interactive computer-based systems needs to be independently established. Only then might it be known if HF discipline knowledge could be formal. Section 1.4. considers what form that knowledge might take, and Section 1.5. considers what might be the process of its formulation.

1.4. Human Factors Engineering Principles

HF has been viewed earlier (Section 1.2.) as comparable to other disciplines which address general design problems: for example, Civil Engineering and Health Administration. The nature of the formal knowledge of a future HF discipline might, then, be suggested by examining such disciplines. The general design problems of different disciplines, however, must first be related to their characteristic practices, in order to relate the knowledge supporting those practices. The establishment of this relationship follows.

The ‘design’ disciplines are ranged according to the ‘hardness’ or ‘softness’ of their respective general design problems. ‘Hard’ and ‘soft’ may have various meanings in this context. For example, hard design problems may be understood as those which include criteria for their ‘optimal’ solution (Checkland, 1981). In contrast, soft design problems are those which do not include such criteria. Any solution is assessed as ‘better or worse’ relative to other solutions. Alternatively, the hardness of a problem may be distinguished by its level of description, or the formality of the knowledge available for its specification (Carroll and Campbell, 1986). However, here hard and soft problems will be generally distinguished by their determinism for the purpose, that is, by the need for design solutions to be determinate. In this distinction between problems is implicated: the proliferation of variables expressed in a problem and their relations; the changes of variables and their relations, both with regard to their values and their number; and more generally, complexity, where it includes factors other than those identified. The variables implicated in the HF general design problem are principally those of human behaviours and structures.

A discipline’s practices construct solutions to its general design problem. Consideration of disciplines indicates much variation in their use of specification as a practice in constructing solutions.

1 Such was the history of many disciplines: the origin of modern-day Production Engineering, for example, was a nineteenth century set of craft practices and tacit knowledge.

This variation, however, appears not to be dependent on variations in the hardness of the general design problems. Rather, disciplines appear to differ in the completeness with which they specify solutions to their respective general design problems before implementation occurs. At one extreme, some disciplines specify solutions completely before implementation: their practices may be described as ‘specify then implement’ (an example might be Electrical Engineering). At the other extreme, disciplines appear not to specify their solutions at all before implementing them: their practices may be described as ‘implement and test’ (an example might be Graphic Design). Other disciplines, such as SE, appear characteristically to specify solutions partially before implementing them: their practices may be described as ‘specify and implement’. ‘Specify then implement’, therefore, and ‘implement and test’, would appear to represent the extremes of a dimension by which disciplines may be distinguished by their practices. It is a dimension of the completeness with which they specify design solutions.

 

Taken together, the dimension of problem hardness, characterising general design problems, and the dimension of specification completeness, characterising discipline practices, constitute a classification space for design disciplines such as Electrical Engineering and Graphic Design. The space is shown in Figure 2, including for illustrative purposes, the speculative location of SE.

Two conclusions are prompted by Figure 2. First, a general relation may be apparent between the hardness of a general design problem and the realisable completeness with which its solutions might be specified. In particular, a boundary condition is likely to be present beyond which more complete solutions could not be specified for a problem of given hardness. The shaded area of Figure 2 is intended to indicate this condition, termed the ‘Boundary of Determinism’ – because it derives from the determinism of the phenomena implicated in the general design problem. It suggests that whilst very soft problems may only be solved by ‘implement and test’ practices, hard problems may be solved by ‘specify then implement’ practices.

Second, it is concluded from Figure 2 that the actual completeness with which solutions to a general design problem are specified, and the realisable completeness, might be at variance. Accordingly, there may be different possible forms of the same discipline – each form addressing the same problem but with characteristically different practices. With reference to HF then, the contemporary discipline, a craft, will characteristically solve the HF general design problem mainly by ‘implementation and testing’. If solutions are specified at all, they will be incomplete before being implemented. Yet depending on the hardness of the HF general design problem, the realisable completeness of specified solutions may be greater, and a future form of the discipline, with practices more characteristically those of ‘specify then implement’, may be possible. For illustrative purposes, these different forms of the HF discipline are located speculatively in the figure.

Whilst the realisable completeness with which a discipline may specify design solutions is governed by the hardness of the general design problem, the actual completeness with which it does so is governed by the formality of the knowledge it possesses. Consideration of the traditional engineering disciplines supports this assertion. Their modern-day practices are characteristically those of ‘specify then implement’, yet historically, their antecedents were ‘specify and implement’ practices, and earlier still, ‘implement and test’ practices. For example, the early steam engine preceded formal knowledge of thermodynamics and was constructed by ‘implementation and testing’. Yet designs of thermodynamic machines are now relatively completely specified before being implemented, a practice supported by formal knowledge. Such progress, then, has been marked by the increasing formality of knowledge. It has occurred in spite of the increasing complexity of new technology – an increase which might only have served to make the general design problem more soft, and the boundary of determinism more constraining. The dimension of the formality of a discipline’s knowledge – ranging from experience to principles – is shown in Figure 2 and completes the classification space for design disciplines.

It should be clear from Figure 2 that there exists no pre-ordained relationship between the formality of a discipline’s knowledge and the hardness of its general design problem. In particular, the practices of a (craft) discipline supported by experience – that is, by informal knowledge – may address a hard problem. But also, within the boundary of determinism, that discipline could acquire formal knowledge to support specification as a design practice.

In Section 1.3, four deficiencies of the contemporary HF discipline were identified. The absence of formal discipline knowledge was proposed to account for these deficiencies. The present section has been concerned to examine the potential for HF to develop a more formal discipline knowledge. The potential would appear to be governed by the hardness of the HF general design problem, that is, by the determinism of the human behaviours which it implicates, at least with respect to any solution of that problem. And clearly, human behaviour is, in some respects and to some degree, deterministic. For example, drivers’ behaviour on the roads is determined, at least within the limits required by a particular design solution, by traffic system protocols. A training syllabus determines, within the limits required by a particular solution, the behaviour of the trainees – both in terms of learning strategies and the level of training required. Hence, formal HF knowledge is to some degree attainable. At the very least, it cannot be excluded that the model for that formal knowledge is the knowledge possessed by the established engineering disciplines.

Generally, the established engineering disciplines possess formal knowledge: a corpus of operationalised, tested, and generalised principles. Those principles are prescriptive, enabling the complete specification of design solutions before those designs are implemented (see Dowell and Long, 1988b). This theme of prescription in design is central to the thesis offered here.

Engineering principles can be substantive or methodological (see Checkland, 1981; Pirsig, 1974). Methodological Principles prescribe the methods for solving a general design problem optimally. For example, methodological principles might prescribe the representations of designs specified at a general level of description, and procedures for systematically decomposing those representations until complete specification is possible at a level of description of immediate design implementation (Hubka, Andreasen and Eder, 1988). Methodological principles would assure each lower level of specification as being a complete representation of an immediately higher level.

Substantive Principles prescribe the features and properties of artefacts, or systems, that will constitute an optimal solution to a general design problem. As a simple example, a substantive principle deriving from Kirchhoff’s Laws might be one which would specify the physical structure of a network design (sources, resistances and their nodes, etc.) whose behaviour (e.g., the distribution of current) would constitute an optimal solution to a design problem concerning an amplifier’s power supply.
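As a hedged illustration of such a substantive principle (not from the paper), the sketch below applies Kirchhoff's current law to a simple two-resistor network: the required behaviour (a node voltage and current distribution) follows from the specified physical structure. The function name and component values are invented for the example.

```python
# Illustrative sketch only: Kirchhoff's current law (KCL) applied to a
# two-resistor divider. At the node joining R1 and R2, KCL requires
# (v_source - v) / r1 = v / r2; solving for v gives the node voltage.

def divider_node_voltage(v_source, r1, r2):
    """Node voltage where R1 (from the source) meets R2 (to ground)."""
    return v_source * r2 / (r1 + r2)

v = divider_node_voltage(10.0, 1000.0, 1000.0)  # 10 V source, two 1 kOhm resistors
i = v / 1000.0                                  # current through R2, by Ohm's law
print(v, i)                                     # node voltage 5.0 V, current 0.005 A
```

In the text's sense, the principle runs prescriptively: a required current distribution specifies the structure (sources, resistances and their nodes) that will produce it.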

1.5. The Requirement for an Engineering Conception for Human Factors

The contemporary HF discipline does not possess either methodological or substantive engineering principles. The heuristics it possesses are either ‘rules of thumb’ derived from experience or guidelines derived from psychological theories and findings. Neither guidelines nor rules of thumb offer assurance of their efficacy in any given instance, and particularly with regard to the effectiveness of a design. The methods and models of HF (as opposed to methodological and substantive principles) are similarly without such an assurance. Clearly, any evolution of HF as an engineering discipline in the manner proposed here has yet to begin. There is an immediate need then, for a view of how it might begin, and how formulation of engineering principles might be precipitated.

van Gigch and Pipino (1986) have suggested the process by which scientific (as opposed to engineering) disciplines acquire formal knowledge. They characterise the activities of scientific disciplines at a number of levels, the most general being an epistemological enquiry concerning the nature and origin of discipline knowledge. From such an enquiry a paradigm may evolve. Although a paradigm may be considered to subsume all discipline activities (Long, 1987), it must, at the very least, subsume a coherent and complete definition of the concepts which in this case describe the General (Scientific) Problem of a scientific discipline. Those concepts, and their derivatives, are embodied in the explanatory and predictive theories of science and enable the formulation of research problems. For example, Newton’s Principia commences with an epistemological enquiry, and a paradigm in which the concept of inertia first occurs. The concept of inertia is embodied in scientific theories of mechanics, as for example, in Newton’s Second Law.

Engineering disciplines may be supposed to require an equivalent epistemological enquiry. However, rather than that enquiry producing a paradigm, we may construe its product as a conception. Such a conception is a unitary (and consensus) view of the general design problem of a discipline. Its power lies in the coherence and completeness of its definition of concepts which express that problem. Hence, it enables the formulation of engineering principles which embody and instantiate those concepts. A conception (like a paradigm) is always open to rejection and replacement.

HF currently does not possess a conception of its general design problem. Current views of the issue are ill-formed, fragmentary, or implicit (Shneiderman, 1980; Card, Moran and Newell, 1983; Norman and Draper, 1986). The lack of such a shared view is particularly apparent within the HF research literature, in which concepts are ambiguous and lacking in coherence; those associated with the ‘interface’ (e.g., ‘virtual objects’, ‘human performance’, ‘task semantics’, ‘user error’, etc.) are particular examples of this failure. It is inconceivable that a formulation of HF engineering principles might occur whilst there is no consensus understanding of the concepts which they would embody. Articulation of a conception must then be a pre-requisite for the formulation of engineering principles for HF.

The origin of a conception for the HF discipline must be a conception for the HCI discipline itself, the superordinate discipline incorporating HF. A conception (at least in form) as might be assumed by an engineering HCI discipline has been previously proposed (Dowell and Long, 1988a). It supports the conception for HF as an engineering discipline presented in Part II.

In conclusion, Part I has presented the case for an engineering conception for HF. A proposal for such a conception follows in Part II. The status of the conception, however, should be emphasised. First, the conception at this point in time is speculative. Second, the conception continues to be developed in support of, and supported by, the research of the authors. Third, there is no validation in the conventional sense to be offered for the conception at this time. Validation of the conception for HF will come from its being able to describe the design problems of HF, and from the coherence of its concepts, that is, from the continuity of relations, and agreement, between concepts. Readers may assess these aspects of validity for themselves. Finally, the validity of the conception for HF will also rest in its being a consensus view held by the discipline as a whole and this is currently not the case.

Part II. Conception for an Engineering Discipline of Human Factors

2.1. Conception of the Human Factors General Design Problem

2.2. Conception of Work and the User

2.3. Conception of the Interactive Worksystem and the User

2.4. Conception of Performance of the Interactive Worksystem and the User

2.5. Conclusions and the Prospect for Human Factors Engineering Principles

The potential for HF to become an engineering discipline, and so better to respond to the problem of interactive systems design, was examined in Part I. The possibility of realising this potential through HF engineering principles was suggested – principles which might prescriptively support HF design expressed as ‘specify then implement’. It was concluded that a pre-requisite to the development of HF engineering principles is a conception of the general design problem of HF, which was informally expressed as:

‘to design human interactions with computers for effective working’.

Part II proposes a conception for HF. It attempts to establish the set of related concepts which can express the general design problem of HF more formally. Such concepts would be those embodied in HF engineering principles. As indicated in Section 1.1, the conception for HF is supported by a conception for an engineering discipline of HCI earlier proposed by Dowell and Long (1988a). Space precludes re-iteration of the conception for HCI here, other than as required for the derivation of the conception for HF. Part II first asserts a more formal expression of the HF general design problem which an engineering discipline would address. Part II then continues by elaborating and illustrating the concepts and their relations embodied in that expression.

2.1. Conception of the Human Factors General Design Problem.

The conception for the (super-ordinate) engineering discipline of HCI asserts a fundamental distinction between behavioural systems which perform work, and a world in which work originates, is performed and has its consequences. Specifically conceptualised are interactive worksystems consisting of human and computer behaviours together performing work. It is work evidenced in a world of physical and informational objects disclosed as domains of application. The distinction between worksystems and domains of application is represented schematically in Figure 3.


Effectiveness derives from the relationship of an interactive worksystem with its domain of application – it assimilates both the quality of the work performed by the worksystem, and the costs it incurs. Quality and cost are the primary constituents of the concept of performance through which effectiveness is expressed.

The concern of an engineering HCI discipline would be the design of interactive worksystems for performance. More precisely, its concern would be the design of behaviours constituting a worksystem {S} whose actual performance (PA) conformed with some desired performance (PD). And to design {S} would require the design of human behaviours {U} interacting with computer behaviours {C}. Hence, conception of the general design problem of an engineering discipline of HCI is expressed as:

Specify then implement {U} and {C}, such that

{U} interacting with {C} = {S} as PA = PD

where PD = fn {QD, KD}

QD expresses the desired quality of the products of work within the given domain of application,

KD expresses acceptable (i.e., desired) costs incurred by the worksystem, i.e., by both human and computer.
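The expression above can be sketched in code under one hedged reading, in which actual performance conforms with desired performance when work quality meets QD and worksystem costs do not exceed KD. The class, field names and thresholds are assumptions for illustration, not the paper's notation.

```python
# Minimal sketch, assuming performance P is expressed by a quality term Q
# (variance from the product goal; lower is better) and a cost term K
# (costs incurred by the worksystem; lower is better).

from dataclasses import dataclass

@dataclass
class Performance:
    q: float  # quality: variance of the actual transform from the product goal
    k: float  # costs incurred by both human and computer

def conforms(p_actual: Performance, p_desired: Performance) -> bool:
    """One reading of 'PA conforms with PD': actual quality variance and
    actual costs are no worse than the desired values."""
    return p_actual.q <= p_desired.q and p_actual.k <= p_desired.k

p_desired = Performance(q=0.1, k=100.0)
p_actual = Performance(q=0.05, k=80.0)
print(conforms(p_actual, p_desired))  # the worksystem meets PD
```

The design choice to treat quality and costs as separately bounded terms, rather than a single scalar, mirrors the text's insistence that PD is a function of both QD and KD.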

The problem, when expressed as one of ‘specify then implement’ designs of interactive worksystems, is equivalent to the general design problems characteristic of other engineering disciplines (see Section 1.4.).

The interactive worksystem can be distinguished as two separate, but interacting, sub-systems, that is, a system of human behaviours interacting with a system of computer behaviours. The human behaviours may be treated as a behavioural system in their own right, but one interacting with the system of computer behaviours to perform work. It follows that the general design problem of HCI may be decomposed with regard to its scope (with respect to the human and computer behavioural sub-systems), giving two related problems. Decomposition with regard to the human behaviours gives the general design problem of the HF1 discipline as:

Specify then implement {U} such that

{U} interacting with {C} = {S} as PA = PD

The general design problem of HF then, is one of producing implementable specifications of human behaviours {U} which, interacting with computer behaviours {C}, are constituted within a worksystem {S} whose performance conforms with a desired performance (PD).

The following sections elaborate the conceptualisation of human behaviours (the user, or users) with regard to the work they perform, the interactive worksystem in which they are constituted, and performance.

2.2 . Conception of Work and the User

The conception for HF identifies a world in which work originates, is performed and has its consequences. This section presents the concepts by which work and its relations with the user are expressed.

Objects and their attributes

Work occurs in a world consisting of objects and arises in the intersection of organisations and (computer) technology. Objects may be abstract as well as physical, and are characterised by their attributes. Abstract attributes of objects are attributes of information and knowledge. Physical attributes are attributes of energy and matter. Letters (i.e., correspondence) are objects; their abstract attributes support the communication of messages, etc.; their physical attributes support the visual/verbal representation of information via language.

Attributes and levels of complexity

The different attributes of an object may emerge at different levels within a hierarchy of levels of complexity (see Checkland, 1981). For example, characters and their configuration on a page are physical attributes of the object ‘a letter’ which emerge at one level of complexity; the message of the letter is an abstract attribute which emerges at a higher level of complexity.

Objects are described at different levels of description commensurate with their levels of complexity. However, at a high level of description, separate objects may no longer be differentiated. For example, the object ‘income tax return’ and the object ‘personal letter’ are both ‘correspondence’ objects at a higher level of description. Lower levels of description distinguish their respective attributes of content, intended correspondent etc. In this way, attributes of an object described at one level of description completely re-represent those described at a lower level.
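The levels-of-description point can be sketched as follows, using the paper's own example of the ‘income tax return’ and the ‘personal letter’; the class names and attribute values are illustrative assumptions, not the paper's notation.

```python
# Hedged sketch: at a high level of description, both objects are simply
# 'correspondence' objects and are not differentiated; lower levels of
# description distinguish their respective attributes.

class Correspondence:
    """Higher level of description: separate objects are not differentiated."""

class TaxReturn(Correspondence):
    content = "income tax declaration"
    correspondent = "tax office"

class PersonalLetter(Correspondence):
    content = "personal news"
    correspondent = "an acquaintance"

# Both objects are the same kind of object at the higher level...
print(issubclass(TaxReturn, Correspondence), issubclass(PersonalLetter, Correspondence))
# ...while the lower level distinguishes their attributes of content and correspondent.
print(TaxReturn.correspondent != PersonalLetter.correspondent)
```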

Relations between attributes

Attributes of objects are related in two ways. First, attributes at different levels of complexity are related. As indicated earlier, those at one level are completely subsumed in those at a higher level. In particular, abstract attributes will occur at higher levels of complexity than physical attributes and will subsume those lower-level physical attributes. For example, the abstract attributes of an object ‘message’ concerning the representation of its content by language subsume the lower-level physical attributes, such as the font of the characters expressing the language. As an alternative example, an industrial process, such as a steel rolling process in a foundry, is an object whose abstract attributes will include the process’s efficiency. Efficiency subsumes physical attributes of the process – its power consumption, rate of output, dimensions of the output (the rolled steel), etc. – emerging at a lower level of complexity.

1 The general design problem of SE would be equivalent and be expressed as ‘Specify then implement {C} such that ..’ etc.

Second, attributes of objects are related within levels of complexity. There is a dependency between the attributes of an object emerging within the same level of complexity. For example, the attributes of the industrial process of power consumption and rate of output emerge at the same level and are inter-dependent.

Attribute states and affordance

At any point or event in the history of an object, each of its attributes is conceptualised as having a state. Further, those states may change. For example, the content and characters (attributes) of a letter (object) may change state: the content with respect to meaning and grammar, etc.; its characters with respect to size and font, etc. Objects exhibit an affordance for transformation, engendered by their attributes’ potential for state change (see Gibson, 1977). Affordance is generally pluralistic in the sense that there may be many, or even infinite, transformations of objects, according to the potential changes of state of their attributes.

Attributes’ relations are such that state changes of one attribute may also manifest state changes in related attributes, whether within the same level of complexity, or across different levels of complexity. For example, changing the rate of output of an industrial process (lower level attribute) will change both its power consumption (same level attribute) and its efficiency (higher level attribute).

Organisations, domains (of application), and the requirement for attribute state changes

A domain of application may be conceptualised as: ‘a class of affordance of a class of objects’. Accordingly, an object may be associated with a number of domains of application (‘domains’). The object ‘book’ may be associated with the domain of typesetting (state changes of its layout attributes) and with the domain of authorship (state changes of its textual content). In principle, a domain may have any level of generality, for example, the writing of letters and the writing of a particular sort of letter.

Organisations are conceptualised as having domains as their operational province and of requiring the realisation of the affordance of objects. It is a requirement satisfied through work. Work is evidenced in the state changes of attributes by which an object is intentionally transformed: it produces transforms, that is, objects whose attributes have an intended state. For example, ‘completing a tax return’ and ‘writing to an acquaintance’, each have a ‘letter’ as their transform, where those letters are objects whose attributes (their content, format and status, for example) have an intended state. Further editing of those letters would produce additional state changes, and therein, new transforms.

Goals

Organisations express their requirement for the transformation of objects through specifying goals. A product goal specifies a required transform – a required realisation of the affordance of an object. In expressing the required transformation of an object, a product goal will generally suppose necessary state changes of many attributes. The requirement of each attribute state change can be expressed as a task goal, deriving from the product goal. So for example, the product goal demanding transformation of a letter to make its message more courteous would be expressed by task goals possibly requiring state changes of semantic attributes of the propositional structure of the text, and of syntactic attributes of the grammatical structure. Hence, a product goal can be re-expressed as a task goal structure, a hierarchical structure expressing the relations between task goals, for example, their sequences.

In the case of the computer-controlled steel rolling process, the process is an object whose transformation is required by a foundry organisation and expressed by a product goal. For example, the product goal may specify the elimination of deviations of the process from a desired efficiency. As indicated earlier, efficiency will at least subsume the process’s attributes of power consumption, rate of output, dimensions of the output (the rolled steel), etc. As also indicated earlier, those attributes will be inter-dependent such that state changes of one will produce state changes in the others – for example, changes in rate of output will also change the power consumption and the efficiency of the process. In this way, the product goal (of correcting deviations from the desired efficiency) supposes the related task goals (of setting power consumption, rate of output, dimensions of the output etc). Hence, the product goal can be expressed as a task goal structure and task goals within it will be assigned to the operator monitoring the process.
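The re-expression of a product goal as a task goal structure can be sketched as a small tree walk. The attribute names follow the steel-rolling example above, but the data layout, ordering, and function name are assumptions for illustration.

```python
# Hedged sketch: a product goal (correct deviations from desired
# efficiency) re-expressed as a hierarchical structure of task goals,
# each requiring an attribute state change.

product_goal = {
    "transform": "restore the process to desired efficiency",
    "task_goals": [
        {"attribute": "rate of output", "task_goals": []},
        {"attribute": "power consumption", "task_goals": []},
        {"attribute": "dimensions of the output", "task_goals": []},
    ],
}

def required_state_changes(goal):
    """Depth-first walk of the task goal structure, returning the
    sequence of attribute state changes it requires."""
    changes = []
    for sub in goal["task_goals"]:
        changes.append(sub["attribute"])
        changes.extend(required_state_changes(sub))
    return changes

print(required_state_changes(product_goal))
```

Task goals in such a structure could then be assigned to the operator or to the computer, as the following sections discuss.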

Quality

The transformation of an object demanded by a product goal will generally be of a multiplicity of attribute state changes – both within and across levels of complexity. Consequently, there may be alternative transforms which would satisfy a product goal – letters with different styles, for example – where those different transforms exhibit differing compromises between attribute state changes of the object. By the same measure, there may also be transforms which will be at variance with the product goal. The concept of quality (Q) describes the variance of an actual transform with that specified by a product goal. It enables all possible outcomes of work to be equated and evaluated.
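One hedged way to operationalise the quality concept Q is as the fraction of goal-specified attribute states that an actual transform fails to attain. The attribute names and the scoring rule below are assumptions for illustration, not the paper's definition.

```python
# Illustrative sketch of quality (Q): the variance of an actual transform
# from the transform specified by a product goal, counted here as the
# fraction of specified attribute states not attained.

def quality_variance(specified: dict, actual: dict) -> float:
    """0.0 = no variance (goal met exactly); 1.0 = every attribute misses."""
    misses = sum(1 for attr, state in specified.items()
                 if actual.get(attr) != state)
    return misses / len(specified)

goal = {"content": "courteous", "format": "block", "status": "signed"}
letter = {"content": "courteous", "format": "indented", "status": "signed"}
q = quality_variance(goal, letter)  # one attribute of three at variance
print(q)
```

A scalar of this kind is what allows "all possible outcomes of work to be equated and evaluated", as the text puts it, including alternative transforms that trade off different attribute state changes.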

Work and the user

Conception of the domain then, is of objects, characterised by their attributes, and exhibiting an affordance arising from the potential changes of state of those attributes. By specifying product goals, organisations express their requirement for transforms – objects with specific attribute states. Transforms are produced through work, which occurs only in the conjunction of objects affording transformation and systems capable of producing a transformation.

From product goals derives a structure of related task goals, which can be assigned either to the human or to the computer (or to both) within an associated worksystem. The task goals assigned to the human are those which motivate the human’s behaviours. The actual state changes (and therein transforms) which those behaviours produce may or may not be those specified by task and product goals, a difference expressed by the concept of quality.

Taken together, the concepts presented in this section support the HF conception’s expression of work as relating to the user. The following section presents the concepts expressing the interactive worksystem as relating to the user.

2.3. Conception of the Interactive Worksystem and the User.

The conception for HF identifies interactive worksystems consisting of human and computer behaviours together performing work. This section presents the concepts by which interactive worksystems and the user are expressed.

Interactive worksystems

Humans are able to conceptualise goals and their corresponding behaviours are said to be intentional (or purposeful). Computers, and machines more generally, are designed to achieve goals, and their corresponding behaviours are said to be intended (or purposive1). An interactive worksystem (‘worksystem’) is a behavioural system distinguished by a boundary enclosing all human and computer behaviours whose purpose is to achieve and satisfy a common goal. For example, the behaviours of a secretary and wordprocessor whose purpose is to produce letters constitute a worksystem. Critically, it is only by identifying that common goal that the boundary of the worksystem can be established: entities, and especially humans, may exhibit a range of contiguous behaviours, and only by specifying the goals of concern might the boundary of the worksystem enclosing all relevant behaviours be correctly identified.

1 Human behaviour is teleological; machine behaviour is teleonomic (Checkland, 1981).

Worksystems transform objects by producing state changes in the abstract and physical attributes of those objects (see Section 2.2). The secretary and wordprocessor may transform the object ‘correspondence’ by changing both the attributes of its meaning and the attributes of its layout. More generally, a worksystem may transform an object through state changes produced in related attributes. An operator monitoring a computer-controlled industrial process may change the efficiency of the process through changing its rate of output.

The behaviours of the human and computer are conceptualised as behavioural sub-systems of the worksystem – sub-systems which interact1. The human behavioural sub-system is here more appropriately termed the user. Behaviour may be loosely understood as ‘what the human does’, in contrast with ‘what is done’ (i.e. attribute state changes in a domain). More precisely the user is conceptualised as:

a system of distinct and related human behaviours, identifiable as the sequence of states of a person2 interacting with a computer to perform work, and corresponding with a purposeful (intentional) transformation of objects in a domain3 (see also Ashby, 1956).

Although possible at many levels, the user must at least be expressed at a level commensurate with the level of description of the transformation of objects in the domain. For example, a secretary interacting with an electronic mailing facility is a user whose behaviours include receiving and replying to messages. An operator interacting with a computer-controlled milling machine is a user whose behaviours include planning the tool path to produce a component of specified geometry and tolerance.

The user as a system of mental and physical human behaviours

The behaviours constituting a worksystem are both physical and abstract. Abstract behaviours are generally the acquisition, storage, and transformation of information. They represent and process information concerning at least: domain objects and their attributes, attribute relations and attribute states, and the transformations required by goals. Physical behaviours are related to, and express, abstract behaviours.

Accordingly, the user is conceptualised as a system of both mental (abstract) and overt (physical) behaviours which extend a mutual influence – they are related. In particular, they are related within an assumed hierarchy of behaviour types (and their control) wherein mental behaviours generally determine, and are expressed by, overt behaviours. Mental behaviours may transform (abstract) domain objects represented in cognition, or express through overt behaviour plans for transforming domain objects.

1 The human behaviours and computer behaviours are separate systems ‘coupled’ to form a worksystem (see Ashby, 1956)

2 Behaviours are conceptualised as being supported and enabled by co-extensive structures. The user, however, is a description of a behavioural system and does not describe the corresponding human structures (see later in Section 2.3).

3 This conception of human behaviour differs from that of behaviourist psychology, which generally seeks correlations between observable inputs and outputs of a mental ‘black box’ without reference to any postulated artefacts of the mind or brain.

So for example, the operator working in the control room of the foundry has the product goal of maintaining a desired condition of the computer-controlled steel rolling process. The operator attends to the computer (whose behaviours include the transmission of information about the process). Hence, the operator acquires a representation of the current condition of the process by collating the information displayed by the computer and assessing it by comparison with the condition specified by the product goal. The operator’s acquisition, collation and assessment are each distinct mental behaviours, conceptualised as representing and processing information. The operator reasons about the attribute state changes necessary to eliminate any discrepancy between the current and desired conditions of the process, that is, the set of related changes which will produce the required transformation of the process. That decision is expressed in the set of instructions issued to the computer through overt behaviour – making keystrokes, for example.

The user is conceptualised as having cognitive, conative and affective aspects. The cognitive aspects of the user are those of their knowing, reasoning and remembering, etc; the conative aspects are those of their acting, trying and persevering, etc; and the affective aspects are those of their being patient, caring, and assured, etc. Both mental and overt human behaviours are conceptualised as having these three aspects.

Human-computer interaction

Although the human and computer behaviours may be treated as separable sub-systems of the worksystem, those sub-systems extend a “mutual influence”, or interaction whose configuration principally determines the worksystem (Ashby, 1956).

Interaction is conceptualised as:

the mutual influence of the user (i.e., the human behaviours) and the computer behaviours associated within an interactive worksystem

Hence, the user {U} and computer behaviours {C} constituting a worksystem {S}, were expressed in the general design problem of HF (Section 2.1) as:

{U} interacting with {C} = {S}

Interaction of the human and computer behaviours is the fundamental determinant of the worksystem, rather than their individual behaviours per se. For example, the behaviours of an operator interact with the behaviours of a computer-controlled milling machine. The operator’s behaviours influence the behaviours of the machine, perhaps in the tool path program; the behaviours of the machine, perhaps the run-out of its tool path, influence the selection behaviour of the operator. The configuration of their interaction – the inspection that the machine allows the operator, the tool path control that the operator allows the machine – determines the worksystem that the operator and machine behaviours constitute in their planning and execution of the machining work.
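The expression {U} interacting with {C} = {S} can be illustrated by a minimal simulation – a sketch added here, not part of the original paper – in which each sub-system's next behaviour depends on the other's last behaviour, so that the worksystem's trajectory is a product of the interaction rather than of either sub-system alone. The scenario and all figures are hypothetical:

```python
# Sketch only: a worksystem {S} whose state evolves through the mutual
# influence of user behaviours {U} and computer behaviours {C}.
# The scenario (an operator correcting a sluggish process toward a goal
# condition) and all numbers are assumptions for illustration.

def run_worksystem(goal, process_value, n_cycles):
    for _ in range(n_cycles):
        # User behaviour: influenced by the computer's displayed value.
        correction = goal - process_value
        # Computer behaviour: the process applies only half of each
        # correction per cycle, so repeated interaction is needed.
        process_value += 0.5 * correction
    return process_value

# Neither sub-system alone reaches the goal condition; their
# interaction converges on it over successive cycles.
final = run_worksystem(goal=10.0, process_value=0.0, n_cycles=10)
assert abs(final - 10.0) < 0.1
```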

The assignment of task goals then, to either the human or the computer delimits the user and therein configures the interaction. For example, replacement of a mis-spelled word required in a document is a product goal which can be expressed as a task goal structure of necessary and related attribute state changes. In particular, the text field for the correctly spelled word demands an attribute state change in the text spacing of the document. Specifying that state change may be a task goal assigned to the user, as in interaction with the behaviours of early text editor designs, or it may be a task goal assigned to the computer, as in interaction with the ‘wrap-round’ behaviours of contemporary wordprocessor designs. The assignment of the task goal of specification configures the interaction of the human and computer behaviours in each case; it delimits the user.

On-line and off-line human behaviours

Dowell and Long 18

The user may include both on-line and off-line human behaviours: on-line behaviours are associated with the computer’s representation of the domain; off-line behaviours are associated with non-computer representations of the domain, or with the domain itself.

As an illustration of the distinction, consider the example of an interactive worksystem consisting of behaviours of a secretary and a wordprocessor and required to produce a paper-based copy of a dictated letter stored on audio tape. The product goal of the worksystem here requires the transformation of the physical representation of the letter from one medium to another, that is, from tape to paper. From the product goal derive the task goals relating to required attribute state changes of the letter. Certain of those task goals will be assigned to the secretary. The secretary’s off-line behaviours include listening to, and assimilating, the dictated letter, so acquiring a representation of the domain directly. By contrast, the secretary’s on-line behaviours include specifying the representation by the computer of the transposed content of the letter in a desired visual/verbal format of stored physical symbols.

On-line and off-line human behaviours are a particular case of the ‘internal’ interactions between a human’s behaviours as, for example, when the secretary’s typing interacts with memorisations of successive segments of the dictated letter.

Human structures and the user

Conceptualisation of the user as a system of human behaviours needs to be extended to the structures supporting behaviour.

Whereas human behaviours may be loosely understood as ‘what the human does’, the structures supporting them can be understood as ‘how they are able to do what they do’ (see Marr, 1982; Wilden, 1980). There is a one-to-many mapping between a human’s structures and the behaviours they might support: the structures may support many different behaviours.

In co-extensively enabling behaviours at each level, structures must exist at commensurate levels. The human structural architecture is both physical and mental, providing the capability for a human’s overt and mental behaviours. It provides a representation of domain information as symbols (physical and abstract) and concepts, and the processes available for the transformation of those representations. It provides an abstract structure for expressing information as mental behaviour. It provides a physical structure for expressing information as physical behaviour.

Physical human structure is neural, bio-mechanical and physiological. Mental structure consists of representational schemes and processes. Corresponding with the behaviours it supports and enables, human structure has cognitive, conative and affective aspects. The cognitive aspects of human structures include information and knowledge – that is, symbolic and conceptual representations – of the domain, of the computer and of the person themselves, and include the ability to reason. The conative aspects of human structures motivate the implementation of behaviour and its perseverance in pursuing task goals. The affective aspects of human structures include the personality and temperament which respond to and support behaviour.

To illustrate the conceptualisation of mental structure, consider the example of structure supporting an operator’s behaviours in the foundry control room. Physical structure supports perception of the steel rolling process and executing corrective control actions to the process through the computer input devices. Mental structures support the acquisition, memorisation and transformation of information about the steel rolling process. The knowledge which the operator has of the process and of the computer supports the collation, assessment and reasoning about corrective control actions to be executed.

The limits of human structure determine the limits of the behaviours they might support. Such structural limits include those of: intellectual ability; knowledge of the domain and the computer; memory and attentional capacities; patience; perseverance; dexterity; and visual acuity, etc. The structural limits on behaviour may become particularly apparent when one part of the structure (a channel capacity, perhaps) is required to support concurrent behaviours, perhaps simultaneous visual attending and reasoning behaviours. The user, then, is ‘resource’ limited by the co-extensive human structure.

The behavioural limits of the human determined by structure are not only difficult to define with any kind of completeness, they will also be variable because that structure can change, and in a number of respects. A person may have self-determined changes in response to the domain – as expressed in learning phenomena, acquiring new knowledge of the domain, of the computer, and indeed of themselves, to better support behaviour. Also, human structure degrades with the expenditure of resources in behaviour, as evidenced in the phenomena of mental and physical fatigue. It may also change in response to motivating or de-motivating influences of the organisation which maintains the worksystem.

It must be emphasised that the structure supporting the user is independent of the structure supporting the computer behaviours. Neither structure can make any incursion into the other, and neither can directly support the behaviours of the other. (Indeed this separability of structures is a pre-condition for expressing the worksystem as two interacting behavioural sub-systems.) Although the structures may change in response to each other, they are not, unlike the behaviours they support, interactive; they are not included within the worksystem. The combination of structures of both human and computer supporting their interacting behaviours is conceptualised as the user interface.

Resource costs of the user

Work performed by interactive worksystems always incurs resource costs. Given the separability of the human and the computer behaviours, certain resource costs are associated directly with the user and distinguished as structural human costs and behavioural human costs.

Structural human costs are the costs of the human structures co-extensive with the user. Such costs are incurred in developing and maintaining human skills and knowledge. More specifically, structural human costs are incurred in training and educating people, so developing in them the structures which will enable their behaviours necessary for effective working. Training and educating may augment or modify existing structures, provide the person with entirely novel structures, or perhaps even reduce existing structures. Structural human costs will be incurred in each case and will frequently be borne by the organisation. An example of structural human costs might be the costs of training a secretary in the particular style of layout required for an organisation’s correspondence with its clients, and in the operation of the computer by which that layout style can be created.

Structural human costs may be differentiated as cognitive, conative and affective structural costs of the user. Cognitive structural costs express the costs of developing the knowledge and reasoning abilities of people and their ability for formulating and expressing novel plans in their overt behaviour – as necessary for effective working. Conative structural costs express the costs of developing the activity, stamina and persistence of people as necessary for effective working. Affective structural costs express the costs of developing in people their patience, care and assurance as necessary for effective working.

Behavioural human costs are the resource costs incurred by the user (i.e. by human behaviours) in recruiting human structures to perform work. They are both physical and mental resource costs. Physical behavioural costs are the costs of physical behaviours, for example, the costs of making keystrokes on a keyboard and of attending to a screen display; they may be expressed without differentiation as physical workload. Mental behavioural costs are the costs of mental behaviours, for example, the costs of knowing, reasoning, and deciding; they may be expressed without differentiation as mental workload. Mental behavioural costs are ultimately manifest as physical behavioural costs.

When differentiated, mental and physical behavioural costs are conceptualised as the cognitive, conative and affective behavioural costs of the user. Cognitive behavioural costs relate to both the mental representing and processing of information, and the demands made on the individual's extant knowledge, as well as the physical expression thereof in the formulation and expression of a novel plan. Conative behavioural costs relate to the repeated mental and physical actions and effort required by the formulation and expression of the novel plan. Affective behavioural costs relate to the emotional aspects of the mental and physical behaviours required in the formulation and expression of the novel plan. Behavioural human costs are evidenced in human fatigue, stress and frustration; they are costs borne directly by the individual.

2.4. Conception of Performance of the Interactive Worksystem and the User.

In asserting the general design problem of HF (Section 2.1.), it was reasoned that:

“Effectiveness derives from the relationship of an interactive worksystem with its domain of application – it assimilates both the quality of the work performed by the worksystem, and the costs incurred by it. Quality and cost are the primary constituents of the concept of performance through which effectiveness is expressed.”

This statement followed from the distinction between interactive worksystems performing work, and the work they perform. Subsequent elaboration upon this distinction enables reconsideration of the concept of performance, and examination of its central importance within the conception for HF.

Because the factors which constitute this engineering concept of performance (i.e. the quality and costs of work) are determined by behaviour, a concordance is assumed between the behaviours of worksystems and their performance: behaviour determines performance (see Ashby, 1956; Rouse, 1980). The quality of work performed by interactive worksystems is conceptualised as the actual transformation of objects with regard to their transformation demanded by product goals. The costs of work are conceptualised as the resource costs incurred by the worksystem, and are separately attributed to the human and computer. Specifically, the resource costs incurred by the human are differentiated as: structural human costs – the costs of establishing and maintaining the structure supporting behaviour; and behavioural human costs – the costs of the behaviour recruiting structure to its own support. Structural and behavioural human costs were further differentiated as cognitive, conative and affective costs.

A desired performance of an interactive worksystem may be conceptualised. Such a desired performance might either be absolute, or relative as in a comparative performance to be matched or improved upon. Accordingly, criteria expressing desired performance may either specify categorical gross resource costs and quality, or they may specify critical instances of those factors to be matched or improved upon¹.

Discriminating the user’s performance within the performance of the interactive worksystem would require the separate assimilation of human resource costs and the achievement of the desired attribute state changes demanded by the assigned task goals. Further assertions concerning the user arise from the conceptualisation of worksystem performance. First, the conception of performance is able to distinguish the quality of a transform from the effectiveness of the worksystems which produce it. This distinction is essential as two worksystems might be capable of producing the same transform, yet if one were to incur a greater resource cost than the other, its effectiveness would be the lesser of the two systems.
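The first assertion can be made concrete with a small numerical sketch (added here, not from the paper; the ratio used to assimilate quality and costs, and all the figures, are assumptions introduced for illustration):

```python
# Sketch: effectiveness assimilates both the quality of the work performed
# and the resource costs incurred. The ratio below is one simple (assumed)
# way to express that relation; the paper does not prescribe a formula.

def effectiveness(quality, human_costs, computer_costs):
    return quality / (human_costs + computer_costs)

# Two worksystems producing the same transform (equal quality)...
ws_a = effectiveness(quality=1.0, human_costs=3.0, computer_costs=2.0)
ws_b = effectiveness(quality=1.0, human_costs=6.0, computer_costs=2.0)

# ...are not equally effective: the worksystem incurring the greater
# resource costs is the less effective of the two.
assert ws_a > ws_b
```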

Second, given the concordance of behaviour with performance, optimal human (and equally, computer) behaviours may be conceived as those which incur a minimum of resource costs in producing a given transform. Optimal human behaviour would minimise the resource costs incurred in producing a transform of given quality (Q). However, that optimality may only be categorically determined with regard to worksystem performance, and the best performance of a worksystem may still be at variance with the performance desired of it (PD). To be more specific, it is not sufficient for human behaviours simply to be error-free. Although the elimination of errorful human behaviours may contribute to the best performance possible of a given worksystem, that performance may still be less than desired performance. Conversely, although human behaviours may be errorful, a worksystem may still support a desired performance.

¹See Section 1.4., where the possibility of expressing, by an absolute value, the desired performance of a system or artifact is associated with the hardness of the design problem.

Third, the common measures of human ‘performance’ – errors and time – are related in this conceptualisation of performance. Errors are behaviours which increase the resource costs incurred in producing a given transform, or which reduce the quality of the transform, or both. The duration of human behaviours may (very generally) be associated with increases in behavioural user costs.

Fourth, structural and behavioural human costs may be traded-off in performance. More sophisticated human structures supporting the user, that is, the knowledge and skills of experienced and trained people, will incur high (structural) costs to develop, but enable more efficient behaviours – and therein, reduced behavioural costs.
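This trade-off can be sketched numerically (a sketch added here, not from the paper; the linear cost model and all figures are assumptions introduced for illustration):

```python
# Sketch: a one-off structural cost (e.g. training) buys lower behavioural
# costs on each subsequent task; over enough repeated work the total human
# resource cost of the trained user falls below that of the untrained user.

def total_human_cost(structural_cost, behavioural_cost_per_task, n_tasks):
    return structural_cost + behavioural_cost_per_task * n_tasks

untrained = total_human_cost(structural_cost=0.0, behavioural_cost_per_task=5.0, n_tasks=100)
trained = total_human_cost(structural_cost=120.0, behavioural_cost_per_task=2.0, n_tasks=100)

assert trained < untrained  # 320 < 500: the structural cost is repaid over repeated work
```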

Fifth, resource costs incurred by the human and the computer may be traded-off in performance. A user can sustain a level of performance of the worksystem by optimising behaviours to compensate for the poor behaviours of the computer (and vice versa), i.e., behavioural costs of the user and computer are traded-off. This is of particular concern for HF as the ability of humans to adapt their behaviours to compensate for poor computer-based systems often obscures the low effectiveness of worksystems.

This completes the conception for HF. From the initial assertion of the general design problem of HF, the concepts that were invoked in its formal expression have subsequently been defined and elaborated, and their coherence established.

2.5. Conclusions and the Prospect for Human Factors Engineering Principles

Part I of this paper examined the possibility of HF becoming an engineering discipline and specifically, of formulating HF engineering principles. Engineering principles, by definition prescriptive, were seen to offer the opportunity for a significantly more effective discipline, ameliorating the problems which currently beset HF – problems of poor integration, low efficiency, efficacy without guarantee, and slow development.

A conception for HF is a pre-requisite for the formulation of HF engineering principles. It is the concepts and their relations which express the HF general design problem and which would be embodied in HF engineering principles. The form of a conception for HF was proposed in Part II. Originating in a conception for an engineering discipline of HCI (Dowell and Long, 1988a), the conception for HF is postulated as appropriate for supporting the formulation of HF engineering principles.

The conception for HF is a broad view of the HF general design problem. Instances of the general design problem may include the development of a worksystem, or the utilisation of a worksystem within an organisation. Developing worksystems which are effective, and maintaining the effectiveness of worksystems within a changing organisational environment, are both expressed within the problem. In addition, the conception takes the broad view on the research and development activities necessary to solve the general design problem and its instantiations, respectively. HF engineering research practices would seek solutions, in the form of (methodological and substantive) engineering principles, to the general design problem. HF engineering practices in systems development programmes would seek to apply those principles to solve instances of the general design problem, that is, to the design of specific users within specific interactive worksystems. Collaboration of HF and SE specialists and the integration of their practices is assumed.

Notwithstanding the comprehensive view of determinacy developed in Part I, the intention of specification associated with people might be unwelcome to some. Yet, although the requirement for design and specification of the user is being unequivocally proposed, techniques for implementing those specifications are likely to be more familiar than perhaps expected – and possibly more welcome. Such techniques might include selection tests, aptitude tests, training programmes, manuals and help facilities, or the design of the computer.

A selection test would assess the conformity of a candidate’s behaviours with a specification for the user. An aptitude test would assess the potential for a candidate’s behaviours to conform with a specification for the user. Selection and aptitude tests might assess candidates either directly or indirectly. A direct test would observe candidates’ behaviours in ‘hands on’ trial periods with the ‘real’ computer and domain, or with simulations of the computer and domain. An indirect test would examine the knowledge and skills (i.e., the structures) of candidates, and might be in the form of written examinations. A training programme would develop the knowledge and skills of a candidate as necessary for enabling their behaviours to conform with a specification for the user. Such programmes might take the form of either classroom tuition or ‘hands on’ learning. A manual or on-line help facility would augment the knowledge possessed by a human, enabling their behaviours to conform with a specification for the user. Finally, the design of the computer itself, through the interactions of its behaviours with the user, would enable the implementation of a specification for the user.

To conclude, discussion of the status of the conception for HF must be briefly extended. The contemporary HF discipline was characterised as a craft discipline. Although it may alternatively be claimed as an applied science discipline, such claims must still admit the predominantly craft nature of systems development practices (Long and Dowell, 1989). No instantiations of the HF engineering discipline implied in this paper are visible, and examples of supposed engineering practices may be readily associated with craft or applied science disciplines. There are those, however, who would claim the craft nature of the HF discipline to be dictated by the nature of the problem it addresses. They may maintain that the indeterminism and complexity of the problem of designing human systems (the softness of the problem) precludes the application of formal and prescriptive knowledge. This claim was rejected in Part I on the grounds that it mistakes the current absence of formal discipline knowledge as an essential reflection of the softness of its general design problem. The claim fails to appreciate that this absence may rather be symptomatic of the early stage of the discipline's development. The alternative position taken by this paper is that the softness of the problem needs to be independently established. The general design problem of HF is, to some extent, hard – human behaviour is clearly to some useful degree deterministic – and certainly sufficiently deterministic for the design of certain interactive worksystems. It may accordingly be presumed that HF engineering principles can be formulated to support product quality within a systems development ethos of ‘design for performance’.

The extent to which HF engineering principles might be realisable in practice remains to be seen. It is not supposed that the development of effective systems will never require craft skills in some form, and engineering principles are not seen to be incompatible with craft knowledge, particularly with respect to their instantiation (Long and Dowell, 1989). At a minimum, engineering principles might be expected to augment the craft knowledge of HF professionals. Yet the great potential of HF engineering principles for the effectiveness of the discipline demands serious consideration. However, their development would only be by intention, and would be certain to demand a significant research effort. This paper is intended to contribute towards establishing the conception required for the formulation of HF engineering principles.

References

Ashby W. Ross, (1956), An Introduction to Cybernetics. London: Methuen.

Bornat R. and Thimbleby H., (1989), The Life and Times of ded, Text Display Editor. In J.B. Long and A.D. Whitefield (ed.s), Cognitive Ergonomics and Human Computer Interaction. Cambridge: Cambridge University Press.

Card, S. K., Moran, T., and Newell, A., (1983), The Psychology of Human Computer Interaction, New Jersey: Lawrence Erlbaum Associates.

Carey, T., (1989), Position Paper: The Basic HCI Course For Software Engineers. SIGCHI Bulletin, Vol. 20, no. 3.

Carroll J.M., and Campbell R. L., (1986), Softening up Hard Science: Reply to Newell and Card. Human Computer Interaction, Vol. 2, pp. 227-249.

Checkland P., (1981), Systems Thinking, Systems Practice. Chichester: John Wiley and Sons.

Cooley M.J.E., (1980), Architect or Bee? The Human/Technology Relationship. Slough: Langley Technical Services.

Didner R.S., (1988), A Value Added Approach to Systems Design. Human Factors Society Bulletin, May 1988.

Dowell J., and Long J. B., (1988a), Human-Computer Interaction Engineering. In N. Heaton and M . Sinclair (ed.s), Designing End-User Interfaces. A State of the Art Report. 15:8. Oxford: Pergamon Infotech.

Dowell, J., and Long, J. B., (1988b), A Framework for the Specification of Collaborative Research in Human Computer Interaction. In UK IT 88 Conference Publication 1988, pub. IEE and BCS.

Gibson J.J., (1977), The Theory of Affordances. In R.E. Shaw and J. Branford (ed.s), Perceiving, Acting and Knowing. New Jersey: Erlbaum.

Gries D., (1981), The Science of Programming, New York: Springer Verlag.

Hubka V., Andreason M.M. and Eder W.E., (1988), Practical Studies in Systematic Design, London: Butterworths.

Long J.B., Hammond N., Barnard P. and Morton J., (1983), Introducing the Interactive Computer at Work: the Users’ Views. Behaviour And Information Technology, 2, pp. 39-106.

Long, J., (1987), Cognitive Ergonomics and Human Computer Interaction. In P. Warr (ed.), Psychology at Work. England: Penguin.

Long J.B., (1989), Cognitive Ergonomics and Human Computer Interaction: an Introduction. In J.B. Long and A.D. Whitefield (ed.s), Cognitive Ergonomics and Human Computer Interaction. Cambridge: Cambridge University Press.

Long J.B. and Dowell J., (1989), Conceptions of the Discipline of HCI: Craft, Applied Science, and Engineering. In Sutcliffe A. and Macaulay L. (ed.s), Proceedings of the Fifth Conference of the BCS HCI SG. Cambridge: Cambridge University Press.

Marr D., (1982), Vision. New York: W.H. Freeman and Co.

Morgan D.G., Shorter D.N. and Tainsh M., (1988), Systems Engineering. Improved Design and Construction of Complex IT systems. Available from IED, Kingsgate House, 66-74 Victoria Street, London, SW1.

Norman D.A. and Draper S.W. (eds) (1986): User Centred System Design. Hillsdale, New Jersey: Lawrence Erlbaum;

Pirsig R., (1974), Zen and the Art of Motorcycle Maintenance. London: Bodley Head.

Rouse W. B., (1980), Systems Engineering Models of Human Machine Interaction. New York: Elsevier North Holland.

Shneiderman B. (1980): Software Psychology: Human Factors in Computer and Information Systems. Cambridge, Mass.: Winthrop.

Thimbleby H., (1984), Generative User Engineering Principles for User Interface Design. In B. Shackel (ed.), Proceedings of the First IFIP conference on Human-Computer Interaction. Human-Computer Interaction – INTERACT’84. Amsterdam: Elsevier Science. Vol.2, pp. 102-107.

van Gigch J. P. and Pipino L.L., (1986), In Search of a Paradigm for the Discipline of Information Systems. Future Computing Systems, 1 (1), pp. 71-89.

Walsh P., Lim K.Y., Long J.B., and Carver M.K., (1988), Integrating Human Factors with System Development. In: N. Heaton and M. Sinclair (eds): Designing End-User Interfaces. Oxford: Pergamon Infotech.

Wilden A., (1980), System and Structure: Second Edition. London: Tavistock Publications.

This paper has greatly benefited from discussion with others and from their criticisms. We would like to thank our colleagues at the Ergonomics Unit, University College London and in particular, Andy Whitefield, Andrew Life and Martin Colbert. We would also like to thank the editors of the special issue for their support and two anonymous referees for their helpful comments. Any remaining infelicities – of specification and implementation – are our own.
