Wikipedia Bilingual Dictionary
From Wikipedia-Lab
Many bilingual dictionaries have been released on the Web. However, these dictionaries do not sufficiently cover new and domain-specific terminology. We developed a dictionary constructed by analyzing the link structure of Wikipedia, since this large-scale encyclopedia contains a large number of links between articles in different languages.
We analyzed not only these interlanguage links but also redirect pages and anchor text, from which we extracted additional translation candidates. In an experiment, we demonstrated the advantages of our dictionary over manually created dictionaries as well as over bilingual terminology extracted from parallel corpora.
The dictionary currently translates from English to Japanese and vice versa, but we are planning to add other language pairs in the near future.
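To illustrate how interlanguage links, redirect pages, and anchor text can each contribute translation candidates, here is a minimal Python sketch. It is not the implementation behind the dictionary: the function name `translation_candidates`, the four lookup tables, and their contents are hypothetical, hand-made stand-ins for data that would be parsed from Wikipedia dumps, and the sketch does no scoring or filtering of candidates.

```python
# Toy, hand-made tables standing in for parsed Wikipedia data (illustrative only).

# English article title -> Japanese article title (interlanguage link).
INTERLANGUAGE = {"United States": "アメリカ合衆国"}

# Redirect title -> article title, one table per language.
REDIRECTS_EN = {"USA": "United States"}
REDIRECTS_JA = {"米国": "アメリカ合衆国"}

# Japanese article title -> anchor texts of links pointing to that article.
ANCHORS_JA = {"アメリカ合衆国": ["アメリカ合衆国", "アメリカ", "米国"]}


def translation_candidates(term):
    """Return Japanese translation candidates for an English term.

    Order: interlanguage link target, then redirect titles,
    then anchor texts, with duplicates removed.
    """
    # A term may itself be a redirect title; resolve it to its article.
    article = REDIRECTS_EN.get(term, term)

    # Follow the interlanguage link to the Japanese article, if any.
    target = INTERLANGUAGE.get(article)
    if target is None:
        return []

    candidates = [target]
    # Titles of redirect pages that point to the Japanese article.
    candidates += [r for r, dest in REDIRECTS_JA.items() if dest == target]
    # Anchor texts used when other pages link to the Japanese article.
    candidates += ANCHORS_JA.get(target, [])

    # dict.fromkeys keeps first occurrences in order, dropping duplicates.
    return list(dict.fromkeys(candidates))
```

With these toy tables, `translation_candidates("USA")` first resolves the redirect to "United States" and then returns the interlanguage target followed by the redirect title and the remaining anchor text. A term without an interlanguage link yields an empty list.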
References
- M. Erdmann, K. Nakayama, T. Hara, and S. Nishio: Extraction of Bilingual Terminology from a Multilingual Web-based Encyclopedia, The Information Processing Society of Japan (IPSJ) Journal (to appear)
- M. Erdmann, K. Nakayama, T. Hara, and S. Nishio: An Approach for Extracting Bilingual Terminology from Wikipedia, Proc. of International Conference on Database Systems for Advanced Applications (DASFAA) (Mar. 2008)
- M. Erdmann, K. Nakayama, T. Hara, and S. Nishio: A Bilingual Dictionary Extracted from the Wikipedia Link Structure, Proc. of International Conference on Database Systems for Advanced Applications (DASFAA), Demonstration Track (Mar. 2008)
- M. Erdmann, K. Nakayama, T. Hara, and S. Nishio: Wikipedia Link Structure Analysis for Extracting Bilingual Terminology, Proc. of Summer Database Workshop of The Information Processing Society of Japan (DBWS) (Sept. 2007)
