RunoVerse

Regional Vocabulary Explorer

Loading...

Regional Vocabulary Explorer

About

Every poem in the runosong corpus was collected from a specific place (parish or region). This page maps the vocabulary to those 803 collection places, revealing regional vocabulary patterns across Estonian and Finnish traditions. Distinctive words are computed using TF-IDF scoring: words that appear frequently in poems from one place but rarely across all places get a high score.

Keyboard Shortcuts

/Focus search box 1Places mode 2Word Search mode 3Compare mode 4Map mode 5Toponyms mode 6Semantic mode SShare current URL XExport CSV RRandom place ?Toggle this help EscClose this overlay

Modes

Places — Browse all 803 collection places in a sortable table. Click a place to see its distinctive vocabulary, song types, and linked pages.

Word Search — Type a wordform to see its geographic distribution across collection places as a horizontal bar chart.

Compare — Select two places and compare their distinctive vocabulary. See shared words, unique words, and Jaccard overlap score.

Map — Interactive choropleth map with 11 color modes: statistics, word frequency, translation frequency, collector, emotion vocabulary, time period, verse search, verse cluster, similar poems, poem uniqueness, and custom CSV overlay.

Toponyms — Place names found in verse texts. Cross-references collection places with mentions in poetic content, including mythological, biblical, and foreign place names.

Semantic — Semantic category profile per place. Shows dominant thematic category (kinship, animals, emotions, nature, etc.) with top-10 bar chart detail. Filter by category to find places with specific themes. Also available as a map color mode.

Map Color Modes

Statistics — Poem count, word count, verse count, unique wordforms, avg words/line, collectors, alliteration rate, parallelism score.

Word frequency — Search for specific wordforms and see their geographic distribution.

Translation — Search by English translation to find all matching wordforms geographically.

Collector — See where a specific collector gathered poems (top 10 places per collector).

Emotion — Emotion vocabulary density by 11 unified categories with positive/negative presets.

Time period — Filter poems by collection decade range.

Uniqueness — How distinctive each place's poems are (avg/median/percentile).

Data

803 collection places across 292,092 poems from four corpora: ERAB (Estonian), SKVR and JR (Finnish), and KR (Literary Finnish). Distinctive words computed using TF-IDF scoring against the full corpus.

URL copied to clipboard
Press ? for shortcuts
Loading place data...