A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to athletes in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
athletes (0) - 8 freq
athlete (1) - 5 freq
athlete's (1) - 1 freq
athletics (2) - 1 freq
athletic (2) - 7 freq
themes (3) - 20 freq
thees (3) - 4 freq
applees (3) - 3 freq
athene (3) - 1 freq
staetes (3) - 1 freq
thieves (3) - 24 freq
bathites (3) - 1 freq
theets (3) - 1 freq
ashets (3) - 14 freq
theres (3) - 108 freq
achieves (3) - 1 freq
athooten (3) - 1 freq
atheen (3) - 1 freq
athritis (3) - 1 freq
threes (3) - 11 freq
machetes (3) - 1 freq
theses (3) - 4 freq
athens (3) - 17 freq
acolytes (3) - 1 freq
theeter (3) - 1 freq
athletes (0) - 8 freq
athlete's (2) - 1 freq
athlete (2) - 5 freq
athletic (3) - 7 freq
athletics (3) - 1 freq
athritis (4) - 1 freq
theets (4) - 1 freq
tilts (5) - 8 freq
tholes (5) - 17 freq
toilets (5) - 30 freq
threats (5) - 17 freq
thurties (5) - 1 freq
outlets (5) - 1 freq
theats (5) - 2 freq
thirties (5) - 15 freq
ootlets (5) - 6 freq
threits (5) - 4 freq
thits (5) - 1 freq
chalets (5) - 1 freq
thyftis (5) - 1 freq
thraets (5) - 1 freq
thauts (5) - 2 freq
houlets (5) - 1 freq
hoolets (5) - 5 freq
therties (5) - 1 freq
SoundEx code - A343
athletes - 8 freq
adult - 63 freq
adultery - 5 freq
adults - 49 freq
addle-dub - 1 freq
athletic - 7 freq
athlete - 5 freq
aidled - 1 freq
athlete's - 1 freq
adultèrie - 6 freq
adultèress - 1 freq
adultérous - 1 freq
adultérie - 2 freq
adulterous - 1 freq
adulthood - 5 freq
addled - 3 freq
adulterie - 1 freq
adulatioun - 1 freq
adultress - 1 freq
addlet - 1 freq
adult-heid - 1 freq
athletics - 1 freq
MetaPhone code - A0LTS
athletes - 8 freq
athlete's - 1 freq
ATHLETES
Time to execute Levenshtein function - 0.205387 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.422736 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.036707 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044421 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001301 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.