OPUS - an open source parallel corpus

OPUS is a growing collection of translated texts from the web. In the OPUS project we try to convert and align free online data, to add linguistic annotation, and to provide the community with a publicly available parallel corpus. OPUS is based on open source products and the corpus is also delivered as an open content package. We used several tools to compile the current collection. All pre-processing is done automatically. No manual corrections have been carried out.

Data and Resources

Additional Info

Field Value
Source http://opus.lingfil.uu.se/
Author Jörg Tiedemann
Last Updated July 29, 2014, 10:13 (UTC)
Created February 15, 2011, 15:05 (UTC)
comments powered by Disqus
comments powered by Disqus