The EFNILEX project has explored the use of machine translation tools for lexicon generation in "medium" European languages (less-resourced official languages of the EU). In 2012, dictionaries were generated from parallel corpora. Since 2014, we have followed the method of Mikolov et al. (2013), in which dictionaries are generated from monolingual corpora plus a seed dictionary. Since 2015, we have targeted disambiguation in two ways: scoring translations inferred in a pivot-based fashion from Wiktionary with wikt2dict, and translating from multi-sense embeddings.
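The seed-dictionary approach mentioned above can be illustrated with a minimal sketch. It assumes the usual reading of Mikolov et al. (2013): a linear map is fitted by least squares from source-language vectors to target-language vectors over the seed pairs, and a word is translated by mapping its vector and taking the nearest target vector by cosine similarity. The function names, the toy words, and the random vectors are hypothetical and stand in for real embeddings; this is not the project's actual code.

```python
import numpy as np


def learn_translation_matrix(src_vecs, tgt_vecs):
    """Fit W minimizing ||src_vecs @ W - tgt_vecs|| over the seed pairs."""
    W, *_ = np.linalg.lstsq(src_vecs, tgt_vecs, rcond=None)
    return W


def translate(src_vec, W, tgt_matrix, tgt_words, k=1):
    """Map a source vector and return the k nearest target words (cosine)."""
    mapped = src_vec @ W
    norms = np.linalg.norm(tgt_matrix, axis=1) * np.linalg.norm(mapped)
    sims = (tgt_matrix @ mapped) / norms
    best = np.argsort(-sims)[:k]
    return [tgt_words[i] for i in best]


# Toy data (hypothetical): six aligned word pairs with random embeddings.
rng = np.random.default_rng(0)
src_words = ["kutya", "macska", "ház", "víz", "könyv", "alma"]
tgt_words = ["dog", "cat", "house", "water", "book", "apple"]
src_matrix = rng.normal(size=(6, 4))   # source space, 4 dimensions
true_map = rng.normal(size=(4, 3))     # pretend the two spaces are linearly related
tgt_matrix = src_matrix @ true_map     # target space, 3 dimensions

# Use the first five pairs as the seed dictionary and hold out the last one.
W = learn_translation_matrix(src_matrix[:5], tgt_matrix[:5])
held_out = translate(src_matrix[5], W, tgt_matrix, tgt_words)
print(src_words[5], "->", held_out[0])
```

In this toy setting the target vectors are an exact linear image of the source vectors, so the held-out word is recovered. With real embeddings the relation is only approximate, which is why several ranked candidates (`k` greater than 1) are typically inspected.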
Any kind of comment is welcome.
questions-words-hu.txt, following Mikolov et al. (2013a, 2013b).