RunoVerse / Semantic Explorer

What is the Semantic Explorer?

This tool lets you explore 704K+ annotated wordforms in the Finnic runosong corpus through three semantic tiers, organized by reliability:

1. Thesaurus Domains (25) — Recommended

Translation-based semantic groupings using English seed keywords. High precision for this corpus because the method relies on dictionary translations rather than NLP models. Domains include: Family & Kinship, Animals, Body & Health, Plants & Crops, Water & Sea, and 20 more.

2. Emotion Categories (26) — Reliable

Multi-method emotion vocabulary audit using GoEmotions, SetFit, NRC EmoLex, and EKKD dictionary data. ~90% accurate. Organized into 38 families across 26 domains (e.g., rõõm, viha).

3. NLP Entity Categories (26) — Experimental

Entity categories from NLP models (GLiNER NER, WordNet, thesaurus keyword matching, morphological detection, lemma propagation). These models were trained on modern English/general text and may have lower accuracy on archaic dialectal runosong vocabulary. Results are preliminary — use with caution.

Tabs

Confidence Levels

Categories by Count

Annotation Methods

Top Co-occurrences

Select a category from the sidebar to explore its annotated wordforms.
Top Places
Estonian
Finnish
Loading semantic data...