Corpora

  • Textaro: This project seeks to provide a basic research corpus for Esperanto. In the first phase of the project, a corpus of approximately 2 million words was produced, including the major part of Zamenhof’s work and a selection of literary and journalistic texts. The technical specifications for the corpus follow the guidelines of the Text Encoding Initiative (TEI). This project is currently available online and is equipped with a powerful set of search functions. Plans are also underway to expand the corpus significantly and incorporate a wider range of genres and authors.To access the Corpus, click here.
  • WikiTrans – Esperanto offers a corpus of the English Wikipedia machine-translated into Esperanto (~ 3 million articles). The WikiTrans project is an ongoing research and development project carried out by the Danish language technology company GrammarSoft ApS, in cooperation with the University of Southern Denmark. The primary goal of the project is to provide cross-language access between individual-language Wikipedias. WikiTrans translations are high-quality machine translations, using a rule-based system, GramTrans, that combines deep linguistic analysis (Constraint Grammar) with context-based transfer

Dictionaries

  • Online Esperanto Dictionary (in Esperanto), by Wolfram Diestel
  • Plena Ilustrita Vortaro de Esperanto (Full Illustrated Dictionary of Esperanto), a monolingual dictionary of Esperanto. First compiled in 1970 by a large team of Esperanto linguists under the guidance of Gaston Waringhien and published by the Sennacieca Asocio Tutmonda (SAT), it may now be consulted online at Vortaro.net after free registration (if you are a registered user of Lernu.net, you can use the same login to access vortaro.net).

 

Comments are closed