RunoVerse

Lexicon & Dictionary Guide

What is this page?

This is a reference guide to the six dictionary and lexicon tools in RunoVerse. Each section below describes one tool, lists its main features, and links directly to it.

How to use this guide

Scroll through the sections or jump to a tool that interests you. Each section ends with an "Open" link that takes you straight to the tool. If you are new to RunoVerse, start with the Lexicon — it is the main entry point for word-level exploration.

Related explorer pages

Beyond the six tools covered here, several other pages work with word-level data: Dictionary Explorer (browse definitions from five dictionaries), Dictionary Comparison (headword overlap across dictionaries), Cognates (Estonian–Finnish cognate families), and Thesaurus (synonym and semantic clusters).

Other guides

See also: Corpus Guide, Languages Guide, Poetics Guide, and Similarity Guide.

These tools let you search, browse, and analyze the RunoVerse word-level data — 439,746 lemmas drawn from three runosong corpora and enriched with definitions from nine dictionaries and AI-generated translations. Whether you want to look up a single word, compare two entries side by side, or explore how well existing dictionaries cover this historical vocabulary, you will find the right tool below.

Dictionary

The main dictionary is the starting point for exploring the RunoVerse lexicon. Type any word — a lemma, an inflected form, or an English translation — and the dictionary returns matching entries instantly. Each entry opens a detailed view with seven tabs covering word forms, AI analysis, dictionary definitions, similar words, semantic neighbors, poem occurrences, and geographic distribution.

Open Dictionary →

Word Comparison

The comparison tool places any two words side by side so you can examine their differences and similarities at a glance. Select two lemmas and the tool displays their frequencies, part-of-speech tags, dictionary definitions, word forms, and corpus distributions in parallel columns. This is especially useful for comparing near-synonyms, dialectal variants, or Estonian–Finnish cognate pairs.

Open Word Comparison →

Dictionary Coverage

This page analyzes how thoroughly nine published dictionaries cover the vocabulary found in the runosong corpora. It shows that 83.5% of the 1,166,348 unique word forms in the corpus appear in at least one dictionary, and breaks the coverage down by individual dictionary, language, and frequency band. You can search for any word to see which dictionaries include it.

Open Dictionary Coverage →

Category Browser

The category browser lets you explore 65 grammatical and poetic categories that have been assigned to 1.17 million word-form annotations across the corpus. Categories include morphological cases (nominative, genitive, partitive, and others), verb forms (infinitives, participles, imperatives), and stylistic markers specific to folk poetry. Select any category to see its most frequent word forms with example contexts.

Open Category Browser →

Frequency Explorer

The frequency explorer lets you compare how often words appear in the Estonian and Finnish corpora. Enter a word to see its rank and raw count in each language, or browse the top-N most frequent lemmas. The tool also shows BERT semantic neighbors for each word, revealing which other words appear in similar poetic contexts.

Open Frequency Explorer →

Lemma Ambiguity

Many word forms in historical folk poetry can be mapped to more than one lemma. The ambiguity browser lets you explore this phenomenon: search for any word form and see all the candidate lemmas it has been assigned to, along with the frequency of each assignment. This is valuable for understanding the challenges of lemmatizing dialectal and archaic texts, and for assessing how confident a given lemmatization is.

Open Lemma Ambiguity →

← Back to About