1. Objectives of the Project
One of the most important issues facing Koreans today is how they can play a leading role in this ever-more-competitive age of information and globalization where a country's ability to develop advanced knowledge and technologies is a key factor. One vital asset they need to achieve this is believed to bring in a new level of efficiency in the use of their own language by which they can establish independent information culture. In the situation, this project aims at constructing a large scale national corpus of Korean, thus bringing in new energy and value to this precious language.
Under the project, the Special Data Division is in charge of transcription of colloquial corpus, Korean-English parallel corpus, corpus of Korean used by the North and overseas Koreans, and historical corpus. The corpus of modern Korean colloquial transcription, a result of study on the dynamic changes of language from the point of phonetics, is expected to be efficiently applied to the information processing and other newly emerging industries. The Korean-English parallel corpus is expected to contribute to the development not only of natural language processing but also many areas in which knowledge of linguistic engineering is involved such as machine translation, translation evaluation system, language education, comparative linguistics and education of Korean as a foreign language. Meanwhile, the development of the corpus of language used by North and overseas Koreans can be profitably exploited for getting over the linguistic difference between Koreans across the world, achieving unification of their language life. It will also help make academic and political preparation for a full-scale exchange movement between the South and the North and the post unification era. Finally, the development of the historical corpus will be used for research and education of the Korean language, lexicography, and globalization of Korean language and culture.
|