Example-based machine translation

  1. Translation by analogy

  2. History

  3. Example

  4. Phrasal verbs

  5. See also

  6. References

  7. Further reading

  8. External links


Example-based machine translation (EBMT) is a method of machine translation often characterized by its use of a bilingual corpus with parallel texts as its main knowledge base at run-time. It is essentially translation by analogy and can be viewed as an implementation of a case-based reasoning approach to machine learning.

Translation by analogy

At the foundation of example-based machine translation is the idea of translation by analogy. When applied to the process of human translation, the idea that translation takes place by analogy is a rejection of the idea that people translate sentences by doing deep linguistic analysis. Instead, it is founded on the belief that people translate by first decomposing a sentence into phrases, then translating these phrases, and finally composing the translated fragments into one complete sentence. Each phrase is translated by analogy to previous translations. The principle of translation by analogy is encoded in an example-based machine translation system through the example translations that are used to train it.

Other approaches to machine translation, including statistical machine translation, also use bilingual corpora to learn the process of translation.

History

Example-based machine translation was first suggested by Makoto Nagao in 1984.[1] He pointed out that it is especially well suited to translation between two very different languages, such as English and Japanese. In this case, one sentence can be translated into several well-structured sentences in the other language, so the deep linguistic analysis characteristic of rule-based machine translation is of little use.

Example

Example of bilingual corpus

English                          Japanese
How much is that red umbrella?   Ano akai kasa wa ikura desu ka.
How much is that small camera?   Ano chiisai kamera wa ikura desu ka.

Example-based machine translation systems are trained from bilingual parallel corpora containing sentence pairs like those shown in the table above. Each pair consists of a sentence in one language and its translation into another. This particular example is a minimal pair: the two sentences differ in just one element. Such pairs make it simple to learn translations of portions of a sentence. For example, an example-based machine translation system would learn three units of translation from the above example:

  1. How much is that X ? corresponds to Ano X wa ikura desu ka.
  2. red umbrella corresponds to akai kasa
  3. small camera corresponds to chiisai kamera
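
The extraction of these three units can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not the method of any particular EBMT system: it assumes the two sentence pairs differ in exactly one contiguous span on each side, and that the differing source span translates to the differing target span. The function names (tokenize, split_on_difference, learn_units) are hypothetical, and punctuation is treated as a separate token, so it appears space-separated in the resulting template.

```python
import re


def tokenize(text):
    # Split into words and standalone punctuation marks.
    return re.findall(r"\w+|[^\w\s]", text)


def split_on_difference(a, b):
    """Return (shared prefix, middle of a, middle of b, shared suffix)."""
    limit = min(len(a), len(b))
    p = 0
    while p < limit and a[p] == b[p]:
        p += 1
    s = 0
    while s < limit - p and a[-1 - s] == b[-1 - s]:
        s += 1
    return a[:p], a[p:len(a) - s], b[p:len(b) - s], a[len(a) - s:]


def learn_units(pair1, pair2):
    """Derive one template and two fragment pairs from a minimal pair."""
    src = split_on_difference(tokenize(pair1[0]), tokenize(pair2[0]))
    tgt = split_on_difference(tokenize(pair1[1]), tokenize(pair2[1]))
    # The shared parts form the template; X marks the slot that varies.
    template = (
        " ".join(src[0] + ["X"] + src[3]),
        " ".join(tgt[0] + ["X"] + tgt[3]),
    )
    # Assumption: the differing source span aligns with the differing target span.
    fragments = {
        " ".join(src[1]): " ".join(tgt[1]),
        " ".join(src[2]): " ".join(tgt[2]),
    }
    return template, fragments


template, fragments = learn_units(
    ("How much is that red umbrella?", "Ano akai kasa wa ikura desu ka."),
    ("How much is that small camera?", "Ano chiisai kamera wa ikura desu ka."),
)
print(template)
print(fragments)
```

Real corpora rarely consist of such clean minimal pairs, so a practical system would need a proper word or phrase alignment step in place of the prefix and suffix comparison used here.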

These units can later be composed to produce novel translations. For example, suppose the system has been trained on text containing the sentences "President Kennedy was shot dead during the parade." and "The convict escaped on July 15th." It could then translate the sentence "The convict was shot dead during the parade." by substituting the appropriate parts of those sentences.
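
The matching step behind this substitution can be sketched as follows. The code only works on the source-language side: it covers the new sentence with the longest fragments found in the stored examples, which tells the system which example each part should be translated from. The greedy longest-match strategy and the function names (contains, cover) are assumptions made for illustration; the target-language halves of the examples are omitted because the text above does not give them.

```python
def contains(tokens, fragment):
    """True if fragment occurs as a contiguous run inside tokens."""
    n = len(fragment)
    return any(tokens[i:i + n] == fragment for i in range(len(tokens) - n + 1))


def cover(sentence, examples):
    """Greedily cover sentence with the longest fragments seen in examples.

    Returns a list of (fragment, index of the example it came from).
    """
    tokens = sentence.split()
    example_tokens = [e.split() for e in examples]
    pieces = []
    start = 0
    while start < len(tokens):
        # Try the longest candidate fragment first, then shrink it.
        for end in range(len(tokens), start, -1):
            fragment = tokens[start:end]
            source = next(
                (i for i, ex in enumerate(example_tokens) if contains(ex, fragment)),
                None,
            )
            if source is not None:
                pieces.append((" ".join(fragment), source))
                start = end
                break
        else:
            raise ValueError(f"no example covers {tokens[start]!r}")
    return pieces


examples = [
    "President Kennedy was shot dead during the parade.",
    "The convict escaped on July 15th.",
]
pieces = cover("The convict was shot dead during the parade.", examples)
print(pieces)
```

Here "The convict" is taken from the second example and "was shot dead during the parade." from the first. A full system would then look up the stored translation of each fragment and recombine them in the target language.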

Phrasal verbs

Example-based machine translation is best suited for sub-language phenomena like phrasal verbs. Phrasal verbs have highly context-dependent meanings. They are common in English, where they consist of a verb followed by an adverb and/or a preposition, called the particle of the verb. Phrasal verbs produce specialized, context-specific meanings that often cannot be derived from the meanings of their constituents, so word-for-word translation from the source to the target language is almost always ambiguous.

As an example, consider the phrasal verb "put on" and its Urdu/Hindi meaning. It may be used in any of the following ways:

  • Ram put on the lights. (Switched on) (Urdu/Hindi translation: Jalana)
  • Ram put on a cap. (Wore) (Urdu/Hindi translation: Pahenna)
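
An example-based system can resolve this kind of ambiguity by retrieving the stored example most similar to the input and reusing its translation. The sketch below assumes a very crude similarity measure, the number of shared words, chosen only to keep the example short; other measures, such as distance in a thesaurus, are possible. The two stored examples come from the list above, while the function names and the test sentences are hypothetical.

```python
import re

# Stored examples: an English sentence and the Urdu/Hindi verb used for "put on".
EXAMPLES = [
    ("Ram put on the lights.", "jalana"),
    ("Ram put on a cap.", "pahenna"),
]


def words(text):
    # Lowercased set of words, ignoring punctuation.
    return set(re.findall(r"\w+", text.lower()))


def translate_put_on(sentence):
    """Pick the translation from the example sharing the most words with sentence."""
    query = words(sentence)
    # On a tie, max() keeps the first example in EXAMPLES.
    _, translation = max(EXAMPLES, key=lambda ex: len(query & words(ex[0])))
    return translation


print(translate_put_on("Sita put on a cap."))
print(translate_put_on("Please put on the lights."))
```

With only two examples, an unseen object such as "hat" would tie on word overlap and fall back to the first example, which is why a similarity measure that knows "hat" is closer to "cap" than to "lights" would be needed in practice.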

See also

  • Programming by example
  • Translation memory
  • Natural Language Processing

References

1. Makoto Nagao (1984). "A framework of a mechanical translation between Japanese and English by analogy principle". In A. Elithorn and R. Banerji (eds.), Artificial and Human Intelligence. Elsevier Science Publishers. http://www.mt-archive.info/Nagao-1984.pdf

Further reading

  • Carl, Michael; Way, Andy (2003). Recent Advances in Example-Based Machine Translation. Netherlands: Springer. ISBN 978-1-4020-1400-0. doi:10.1007/978-94-010-0181-6. https://www.springer.com/de/book/9781402014000#aboutBook

External links

  • Cunei - an open source platform for data-driven machine translation that grew out of research in EBMT, but also includes recent advances from the SMT field