next up previous contents
Next: Hypothesis Up: Text Generation Previous: Specifying needed information

Action Descriptions and Instructions

In considering text, the first distinction that must be drawn is between actions, which are things that happen in the world, and action descriptions, which are text strings -- descriptions of actions in a language. The same action can be described by many different text strings, and many text strings can contribute to the description of one action. The description one chooses to give of an action depends on many things, including how much one knows of the action, the purpose of describing the action to a given audience, and the knowledge and skills of that audience.

In considering action descriptions then, a second distinction that must be drawn is between narrative and instructions. Roughly, narrative text describes what has happened (enriched by scene-setting descriptions, descriptions of characters and their motivation for acting, etc.), while instructions describe what should be done to achieve one or more specific aims in a given situation. How much is described explicitly and how much is left implicit, depends in part on beliefs about the audience's knowledge and skills. In the latter case, one relies on the audience's knowledge and skills to fill in what is necessary to behave appropriately [SC88,WD90,WBB92]. For example, consider what the audience for the following (separate) instructions must know to carry out them out correctly:

1. Carry the beakers carefully.
2. Go to the mirror and fix your hair.
3. Get me a cold soda.
In the first example, a hearer must derive from ``carefully'', behavioral constraints that will keep the beakers from breaking and their contents from spilling. In the second example, the hearer must derive from the phrases ``to the mirror'' and ``fix your hair'', that the appropriate location in front of the mirror is one that will enable him to see his hair clearly. In the third example, if a cold soda isn't immediately to hand, the hearer must derive from ``a cold soda'' locations where one might be located.

It is not that equivalent inferences can not be drawn from corresponding declarative sentences, such as one might find in a narrative:

1'. Mary carried the beakers carefully.
2'. Fred went to the mirror and fixed his hair.
3'. Fred got Mary a cold soda.
it is just that to behave appropriately, such inferences must be drawn, if the ``missing'' information is not otherwise provided.

Many different kinds of information are provided in instructions, sometimes integrated into a single text, sometimes separated out and placed in identifiable locations or structures within a text. These types of information include:

As the final example shows, more than one element (here, warning plus explanation) may appear in a single sentence. While the focus of this report is on procedural instructions, we will note requirements for explanations and warnings, as necessary (albeit not sufficient) for ensuring that actions are performed appropriately.

In considering the generation of procedural instructions, there is one final distinction that must be made before noting the elements of a procedural instruction: that is whether instructions are given prior to action or in the context of action. In the latter case, Cohen [Coh84] has observed significant interleaving of fine-grained requests or commands on the one hand, and acknowledgements of understanding or requests for clarification on the other. For example, he notes

In the present study, there were at least two requests used for each assembly step in Telephone mode. Each pair of requests (identification requests followed by assembly requests, or requests to pick up followed by assembly requests) involved at least one common object being manipulated. ([Coh84], p. 110).
Cohen observed that identification requests, where the speaker requests the hearer identify the referent of a noun phrase, dominate the first mention of an object in spoken instruction-giving discourse. These rarely had the form of an explicit request such as ``Find the yellow tube.'' but rather took the form of an existential proposition -- e.g. ``There's a little blue cap.''; a perceptual request or statement -- e.g. ``You'll see three very small pieces of plastic.''; or a sentence fragment -- e.g. ``Now, the smallest of the red pieces''. Subsequent reference to an object was, as in written text, through pronouns and definite noun phrases.

Instructions given prior to action rarely have this form. More often, they are given in the form of steps consisting of one or more utterances. For example,

With door opened, adjust switch until roller contacts cam and continuity is indicated at pins A and B. Verify positive switch contact by tightening bottom nut one additional turn.

While there are no firm guidelines as to what a single instruction step should encompass, often steps are organized around small coherent sub-tasks (such as adjusting the switch, in the example above). A step may specify several actions that need to be performed together (possibly in some partially-specified order) to accomplish a single subtask, or several aspects of a single complex action (e.g. its purpose, manner, things to watch out for, appropriate termination conditions, etc.). The important point is that an agent must develop some degree of understanding of the whole step before starting to act, so that he knows what all to do and what all to attend to.

The issues then that one needs to consider in generating procedural instructions include the following:



next up previous contents
Next: Hypothesis Up: Text Generation Previous: Specifying needed information



DTOG Group