Every poem in the runosong corpus was collected from a specific place (parish or region). This page maps the vocabulary to those 803 collection places, revealing regional vocabulary patterns across Estonian and Finnish traditions. Distinctive words are computed using TF-IDF scoring: words that appear frequently in poems from one place but rarely across all places get a high score.
Keyboard Shortcuts
/Focus search box1Places mode2Word Search mode3Compare mode4Map mode5Toponyms mode6Semantic modeSShare current URLXExport CSVRRandom place?Toggle this helpEscClose this overlay
Modes
Places — Browse all 803 collection places in a sortable table. Click a place to see its distinctive vocabulary, song types, and linked pages.
Word Search — Type a wordform to see its geographic distribution across collection places as a horizontal bar chart.
Compare — Select two places and compare their distinctive vocabulary. See shared words, unique words, and Jaccard overlap score.
Map — Interactive choropleth map with 11 color modes: statistics, word frequency, translation frequency, collector, emotion vocabulary, time period, verse search, verse cluster, similar poems, poem uniqueness, and custom CSV overlay.
Toponyms — Place names found in verse texts. Cross-references collection places with mentions in poetic content, including mythological, biblical, and foreign place names.
Semantic — Semantic category profile per place. Shows dominant thematic category (kinship, animals, emotions, nature, etc.) with top-10 bar chart detail. Filter by category to find places with specific themes. Also available as a map color mode.
Word frequency — Search for specific wordforms and see their geographic distribution.
Translation — Search by English translation to find all matching wordforms geographically.
Collector — See where a specific collector gathered poems (top 10 places per collector).
Emotion — Emotion vocabulary density by 11 unified categories with positive/negative presets.
Time period — Filter poems by collection decade range.
Uniqueness — How distinctive each place's poems are (avg/median/percentile).
Data
803 collection places across 292,092 poems from four corpora: ERAB (Estonian), SKVR and JR (Finnish), and KR (Literary Finnish). Distinctive words computed using TF-IDF scoring against the full corpus.