Martin Kay - Contributions

Contributions

1. Martin Kay’s “proper” paper

The ALPAC report of 1966 concluded: "There is no immediate or predictable prospect of useful MT producing useful translation of general scientific texts." As a result, the field of machine translation entered a dark period: for almost ten years, from 1966 to 1976, little research was done. In the 1980s, however, a renaissance began. “The Proper Place of Men and Machines in Language Translation” drew renewed attention to machine translation and offered new thinking about the relationship between machine translation and human translation. At that time, as computers became cheaper and machine translation was applied in more domains, high-quality output was badly needed. Fully Automatic High Quality Translation (FAHQT), however, remained only an ideal for machine translation, and Bar-Hillel had criticized it in his 1960 review of MT progress: “The goal of MT should not be the fully automatic high quality translation (FAHQT) that can replace human translators. Instead, MT should adopt less ambitious goals, e.g. more cost-effective human-machine interaction and aim at enhancement of human translation productivity.” This view raised the usefulness of human translation to a new level. Building on it, Martin Kay proposed a more practical idea about the relationship between human and machine in the translation process, called the “translator’s amanuensis”.

1.1 Two arguments against the usefulness of machine translation

Because this idea involves human and machine at the same time, both computer scientists and linguists bear responsibility for MT. But “they should never be asked to provide an engineering solution to a problem that they only dimly understand.” They need only aim at what can be achieved “by doing only what can be done with absolute surety and reliability …can be virtually guaranteed to all concerned.” The paper presents two related arguments against the plausibility of machine translation as an industrial enterprise, from the points of view of linguistics and computer science.

It also notes that two arguments are commonly made for ad hoc solutions to the problems of machine translation. The first rests on the observation that “Ad hoc solutions tend to be based on case-by-case analyses of what linguists call surface phenomena, essentially strings of words, and on real or imagined statistical properties of particular styles of writing and domains of discourse.” It is a simple statistical claim that can be dismissed. The second argument, only alluded to in connection with understanding a second language by reading text, is called the sorcerer’s-apprentice argument, because “this kind of argument is to the effect that the kind of incomplete theory that linguists and computer scientists have been able to provide is often a worse base on which to build practical devices than no theory at all because the theory does not know when to stop.” “The main problem with the sorcerer’s-apprentice argument is that the decision that a sentence could be translated without analysis can only be made after the fact.” An example sentence shows that a sentence can have more than one interpretation at some level, while further analysis shows that a single translation is compatible with each of them. “In short, the algorithm required to decide when analysis is required would have to use the results of the very analysis it is designed to avoid.”

1.2 The Translator’s Amanuensis and translation memory

This is the main part of the paper. To illustrate what the translator’s amanuensis is, the author discusses three aspects: text editing, translation aids, and machine translation. “Suppose that the translators are provided with a terminal consisting of a keyboard, a screen, and some way of pointing at individual words and letters. The display on the screen is divided into two windows. The text to be translated appears in the upper window and the translation will be composed in the bottom one.” This is the form of the translator’s amanuensis, which is not a real device and never will be. “Both windows behave in the same way. Using the pointing device, the translator can select a letter, word, sentence, line, or paragraph and, by pressing the appropriate key, cause some operation to be visited upon it.”

Two figures in the paper show the translation process from the initial display to selection. The device is not as simple as these two figures suggest; it can offer the translator more specialized services. Under translation aids, the author presents a third figure:

“A relatively trivial addition would be a dictionary. The translator selects a word or sequence of words and gives a command to cause them to be looked up…This new window gives the effect of overlaying some portion of the windows already present. In this case, the new window contains a deceptively simple dictionary entry for the selected word.” The device has many other features as well. For example, the dictionary entry is kept simple, information under Syntax and Semantics is shown when the translator points to the corresponding symbols, and dictionary entries can be modified, including by temporary amendments. All of this makes the device more practical.

Machine translation is explained next. “One of the options that should be offered to a user of the hypothetical system I have been describing, at a fairly early stage, will be a command that will direct the program to translate the currently selected unit. What will happen when this command is given will be different at different stages of the system’s development. But a user of the system will always be empowered to intervene in the translation process to the extent that he himself specifies. If he elects not to intervene at all, a piece of text purporting to translate the current unit will be displayed in the lower window of his screen. He will be able to edit this in any way he likes, just as post-editors have done in the past. Alternatively, he may ask to be consulted whenever the program is confronted with a decision of a specified type, when certain kinds of ambiguities are detected, or whatever. On these occasions, the system will put a question to the human translator. He may, for example, ask to be consulted on questions of pronominal reference.” In this part, the idea of translation memory is presented as a dictionary operation.
“Suppose, for example, that a word is put in the local store – that part of the dictionary that persists only as long as this document is being worked on – if it occurs in the text significantly more frequently than statistics stored in the main dictionary indicate. A phrase will be noted if it occurs two or three times but is not recognized as an idiom or set phrase by the dictionary. By examining the contents of this store before embarking on the translation, a user may hope to get a preview of the difficulties ahead and to make some decisions in advance about how to treat them. These decisions, of course, will be recorded in the store itself. In the course of doing this or, indeed, for any reason whatever, the translator can call for a display of all the units in the text that contain a certain word, phrase, string of characters, or whatever. After all, the most important reference to have when translating a text is the text itself. If the piece of text to be translated next is anything but entirely straightforward, the translator might start by issuing a command causing the system to display anything in the store that might be relevant to it. This will bring to his attention decisions he made before the actual translation started, statistically significant words and phrases, and a record of anything that had attracted attention when it occurred before. Before going on, he can examine past and future fragments of text that contain similar material.”
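
The local store described in this passage can be illustrated with a small Python sketch. It is not Kay's implementation, only one possible reading of the passage under stated assumptions: "significantly more frequently" is approximated by a fixed ratio against the main dictionary's relative frequency, phrases are limited to two-word sequences, and the names build_local_store, units_containing, ratio, unknown_freq and min_count are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def build_local_store(text, main_freq, known_phrases,
                      ratio=5.0, unknown_freq=1e-4, min_count=2):
    """Collect words and phrases that deserve the translator's attention.

    main_freq maps a word to its relative frequency in the main dictionary
    (an assumed data format); known_phrases is the set of idioms and set
    phrases the dictionary already recognizes.
    """
    tokens = tokenize(text)
    total = len(tokens)
    word_counts = Counter(tokens)
    # A word enters the store if it occurs at least min_count times and its
    # relative frequency exceeds `ratio` times the dictionary's figure.
    words = {
        w: c for w, c in word_counts.items()
        if c >= min_count and c / total > ratio * main_freq.get(w, unknown_freq)
    }
    # A two-word phrase is noted if it recurs and is not a known set phrase.
    bigrams = Counter(" ".join(pair) for pair in zip(tokens, tokens[1:]))
    phrases = {
        p: c for p, c in bigrams.items()
        if c >= min_count and p not in known_phrases
    }
    return {"words": words, "phrases": phrases}


def units_containing(text, query):
    """Return every sentence of the text that contains the query string."""
    units = re.split(r"(?<=[.!?])\s+", text.strip())
    return [u for u in units if query.lower() in u.lower()]


sample = ("The amanuensis shows the source text. "
          "The amanuensis stores each decision. "
          "A translator edits the draft.")
store = build_local_store(sample, main_freq={"the": 0.06, "a": 0.02},
                          known_phrases=set())
print(store)
print(units_containing(sample, "amanuensis"))
```

A real system would need a proper statistical test and longer phrases; the sketch only shows how a preview of the difficulties ahead and a display of all units containing a given word could both be derived from the text itself.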

1.3 Expectation of better performance from the translator’s amanuensis

At the end of the paper, Kay gives some reasons to expect better performance from this device. First, the system is in a position to draw its human collaborator’s attention to the matters most likely to need it. Second, the decisions that have to be made in the course of translating a passage are rarely independent. Third, one of the most important facilities in the system is the one that keeps track of words and phrases that are used in some special way in the current text.

«A Life in Language». A speech given in acknowledgement of the Lifetime Achievement Award at the 43rd Annual Meeting of the Association for Computational Linguistics, Ann Arbor, Michigan, 27 June 2005. http://www.stanford.edu/~mjkay/LifeOfLanguage.pdf

String Alignment Using Suffix Trees. A paper about the possible use of suffix trees for aligning texts and their translations. http://www.stanford.edu/~mjkay/CYCLING.pdf

Some unfinished musings on the nature of translation. http://www.stanford.edu/~mjkay/CurrentState.pdf

Some half-baked thoughts on language models in statistical NLP on which I need some help. http://www.stanford.edu/~mjkay/language_models.pdf

His 1994 paper "Regular Models of Phonological Rule Systems", with Ronald Kaplan. Computational Linguistics 20(3):331-378. http://www.stanford.edu/~mjkay/Kaplan%26Kay.pdf