Corpus Statistics Explorer

Corpus overview

Charts explained

Additional panels

Frequency Distribution (Zipf's Law)

Part-of-Speech Distribution

POS by Language (ET vs FI)

Language & Source Distribution

Vocabulary Overlap (Venn)

Collection Distribution

Vocabulary Richness

Collection Profiles

Annotation Confidence Distribution

Top Lemmas by Frequency

Morphological Complexity

Grammatical Categories

Frequency Distribution

7 bands, hapax, coverage, POS

Verse Opening Patterns

4.3M verses, ET + FI