The project Wortschatz Leipzig or Leipzig Corpora Collection has been making information on many languages and their vocabularies available since the mid-1990s. All of our applications and services are based on language-statistical algorithms and text-mining methods that are used to analyze large amounts of text. This is done on the basis of our own web crawling infrastructure and using self-developed toolchains and quality assurance procedures.
The data used is collected automatically from selected publicly available sources. The example sentences are selected automatically and do not represent an expression of opinion by the project. The authors are solely responsible for the content and opinions contained therein. Even without special labelling, trademarks reproduced in the vocabulary, such as common names, trade names, product designations, etc., are subject to legal provisions. The synonymous use of a trademark does not necessarily describe product-specific characteristics but instead characterises the use of the term in a general linguistic context.