Machine Translation for Chinese-Spanish
[ANGLÈS] Due to the overwhelming increment of information in multiple languages, Machine Translation has become an essential application in our lives. Proof of this are the continuous investments made by technology companies to develop new and improved translation systems. However, translation between distant language pairs such as Chinese and Spanish, which are commonly used in both business and daily life, have been seldom addressed from a research point of view. This project focuses on online translation between Chinese and Spanish. Initially, we present a brief introduction to the field of Machine Translation, and a quick overview to its history and its main approaches. After that, we introduce Statistical Machine Translation, which is the translation paradigm behind our online system. We explain the mathematical basis and the structure of the models. Likewise, we present the evaluation framework used to evaluate our system. Next, we describe how the online system has been built, which allows for translating either from a web-based interface or from two mobile applications (one for Android and one for iOS). Also, we explain how special methods for inputting Chinese and Latin characters are included in the web-based interface and the application. Then, we present the implementation details regarding our statistical translation system. We cover both the description of the corpora used to train the system and the quality assessment of the resulting translations. Finally, we briefly explore the paradigm of rule-based machine translation (also between Chinese and Spanish). We explain the theory of this translation system and we describe the construction of a toy system in order to illustrate how this kind of systems works. This last task is the basis for an open-source rule-based machine translation system that is being developed within the framework of the Google Summer of Code 2013.