A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to choosing in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein distance
choosing (0) - 3 freq
choosin (1) - 13 freq
hoosing (1) - 2 freq
crossing (2) - 5 freq
housing (2) - 4 freq
chopping (2) - 1 freq
cooling (2) - 1 freq
hoosin (2) - 19 freq
crooning (2) - 2 freq
choosan (2) - 2 freq
choking (2) - 4 freq
closing (2) - 8 freq
hoofing (2) - 8 freq
hoosin' (2) - 2 freq
cooking (2) - 15 freq
chasing (2) - 5 freq
hosing (2) - 1 freq
hooting (2) - 1 freq
schooling (2) - 1 freq
choonin (2) - 1 freq
thoosint (2) - 2 freq
choosie (2) - 1 freq
shooting (2) - 7 freq
thoosin (2) - 1 freq
hoopin (3) - 5 freq
Double Levenshtein distance
choosing (0) - 3 freq
choosin (2) - 13 freq
hoosing (2) - 2 freq
chasing (2) - 5 freq
choking (3) - 4 freq
choosan (3) - 2 freq
hosing (3) - 1 freq
closing (3) - 8 freq
housing (3) - 4 freq
chuisin (4) - 4 freq
cheering (4) - 2 freq
phasing (4) - 1 freq
cheating (4) - 1 freq
cruising (4) - 3 freq
chosen (4) - 48 freq
cursing (4) - 3 freq
chasin' (4) - 1 freq
causing (4) - 8 freq
ching (4) - 14 freq
chusin (4) - 4 freq
chasin (4) - 40 freq
casing (4) - 1 freq
cosying (4) - 1 freq
echoing (4) - 2 freq
hoosin' (4) - 2 freq
SoundEx code - C252
chickens - 30 freq
cousins - 46 freq
chuckneys - 2 freq
chuckie-hens - 4 freq
cousin's - 15 freq
cushions - 11 freq
casing - 1 freq
coggins - 1 freq
cessnock - 8 freq
cooking - 15 freq
ceacencw - 1 freq
chacking - 1 freq
chukin's - 1 freq
chasing - 5 freq
casinos - 1 freq
choking - 4 freq
cosmic - 11 freq
cake-makin - 2 freq
cognoscenti - 1 freq
chucknies - 1 freq
cosmos - 3 freq
coughing - 2 freq
chuckens - 5 freq
co-conspirator - 1 freq
causing - 8 freq
“chukkens - 1 freq
chukkens - 3 freq
cognosce - 1 freq
cashing - 1 freq
cuisins - 1 freq
cockneys - 1 freq
chucking - 2 freq
checking - 4 freq
cocoons - 1 freq
cowgang - 1 freq
chicken-cocks - 1 freq
cosying - 1 freq
cojones - 1 freq
coo-scones - 3 freq
casxnk - 1 freq
choosing - 3 freq
chugging - 1 freq
coaching - 13 freq
chickmckenna - 3 freq
chasmically - 2 freq
cousin’s - 1 freq
casino's - 1 freq
cockenzie - 1 freq
cizzens - 1 freq
MetaPhone code - XSNK
chasing - 5 freq
choosing - 3 freq
Manually curated - CHOOSING
Time to execute Levenshtein function - 0.199964 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
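As a rough illustration of the distance described above, here is a minimal Python sketch using the standard dynamic-programming approach. It is not the site's own code; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("choosing", "choosin"))   # 1
print(levenshtein("choosing", "crossing"))  # 2
print(levenshtein("choosing", "hoopin"))    # 3
```

The three printed values agree with the distances shown in parentheses in the Levenshtein list for choosin, crossing and hoopin.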
Time to execute Double Levenshtein function - 0.376816 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
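The double-weighting idea can be sketched in Python as below. This is an illustration under assumptions, not the site's implementation: the vowel set (`aeiou`) and the helper names `strip_vowels` and `double_levenshtein` are hypothetical, and the page does not say which letters it treats as vowels.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("choosing", "choosin"))  # 2
print(double_levenshtein("choosing", "chasing"))  # 2
print(double_levenshtein("choosing", "choking"))  # 3
print(double_levenshtein("choosing", "chosen"))   # 4
```

Because chasing differs from choosing only in its vowels, its consonant skeleton is identical and the second pass adds nothing, which is why it ranks alongside choosin in the Double Levenshtein list.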
Time to execute SoundEx function - 0.027157 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
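For readers who want to see how such a code is built, here is a minimal Python sketch of the commonly described American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent equal digits, pad to four characters). It is an illustration only; the site's own function may differ in edge cases, and the name `soundex` and the handling of non-ASCII letters are assumptions.

```python
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    # Assumption: punctuation and non-ASCII characters are simply ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        prev = digit  # a vowel resets prev, so a repeated digit may follow it
    return (code + "000")[:4]  # pad with zeros, then cut to four characters


print(soundex("choosing"))  # C252
print(soundex("cousins"))   # C252
print(soundex("chickens"))  # C252
```

All three words collapse to C252, the code reported for choosing above, which is why such unrelated words appear together in the SoundEx list.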
Time to execute MetaPhone function - 0.037959 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001057 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
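A lookup of this kind might be sketched in Python as a plain dictionary. The entries below are hypothetical examples made up for illustration; they are not taken from the site's actual lexicon, and the names `LEXICON` and `lookup_lemma` are assumptions.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEXICON = {
    "choosing": "CHOOSING",
    "choosin": "CHOOSING",
    "choosin'": "CHOOSING",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Choosin"))  # CHOOSING
print(lookup_lemma("ahint"))    # None
```

A dictionary lookup does no string comparison across the corpus, which fits with this being the fastest of the five timings listed, at the cost of only covering words someone has entered by hand.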