Vocabulary lexicon of 26,730 lemmas from 771,937 tokens across 6,196 Estonian treasure legends (varandusemuistendid). Browse, filter, and explore the linguistic landscape of hidden treasure narratives.
What this is
A searchable lexicon built from 6,196 Estonian treasure legends (varandusemuistendid) — a distinct genre of folk prose about hidden treasures, their guardians, curses, and the people who try to find them. The corpus was lemmatised with EstNLTK 1.7 and enriched with English glosses from the main RunoVerse lexicon where possible.
How to use the table
Letter bar (top) — jump to lemmas starting with a specific letter. A second letter bar below filters by the first letter of the English translation.
Search box — searches Estonian lemma, any wordform variant, or English gloss. Type-ahead suggestions show matches as you type.
POS filter — restrict to nouns, verbs, adjectives, etc.
Type filter — restrict to lemmas that appear in a specific motif type (the 93 motif categories in the treasure legend classification).
Freq min/max — show only lemmas with token frequency in a given range.
Export CSV — downloads the current filtered set, including glosses and frequencies.
Row contents
Lemma — Estonian headword. Click to expand variant wordforms and example sentences.
POS — part-of-speech tag (NOUN, VERB, ADJ, …).
English — gloss(es) if available. Gaps here reflect the lexicon's coverage; many ET-only lemmas are untranslated.
Frequency — token count across the whole 772K-token legend corpus.
Motif types — which of the 93 motif categories the lemma is attested in.