A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to faye in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
faye (0) - 7 freq
fae (1) - 8937 freq
aye (1) - 6376 freq
waye (1) - 1 freq
faee (1) - 1 freq
fame (1) - 47 freq
fare (1) - 62 freq
-aye (1) - 3 freq
fane (1) - 1 freq
fay (1) - 11 freq
fake (1) - 57 freq
gaye (1) - 2 freq
face (1) - 1675 freq
fate (1) - 103 freq
fave (1) - 14 freq
faze (1) - 2 freq
fade (1) - 41 freq
kaye (1) - 4 freq
fayer (1) - 3 freq
fayr (1) - 1 freq
'aye (1) - 304 freq
fayre (1) - 8 freq
fayce (1) - 2 freq
fage (1) - 1 freq
rays (2) - 29 freq
faye (0) - 7 freq
fay (1) - 11 freq
fae (1) - 8937 freq
faee (1) - 1 freq
fyi (2) - 1 freq
fue (2) - 1 freq
aafe (2) - 1 freq
fa (2) - 748 freq
fuyi (2) - 2 freq
fee (2) - 39 freq
fiy (2) - 1 freq
faa (2) - 280 freq
fy (2) - 7 freq
foie (2) - 4 freq
fao (2) - 20 freq
fe (2) - 10 freq
fai (2) - 2 freq
foy (2) - 35 freq
fey (2) - 14 freq
fage (2) - 1 freq
fau (2) - 1 freq
foe (2) - 9 freq
ifye (2) - 1 freq
fie (2) - 3 freq
face (2) - 1675 freq
SoundEx code - F000
fae - 8937 freq
fey - 14 freq
fou - 317 freq
faw - 144 freq
few - 815 freq
fu - 377 freq
foo - 751 freq
faa - 280 freq
fa - 748 freq
ff - 7 freq
f'ae - 1 freq
f'ah - 1 freq
fue - 1 freq
'foo - 11 freq
'fa - 7 freq
fyow - 50 freq
fyowe - 28 freq
fuyi - 2 freq
fo - 16 freq
fu' - 46 freq
fa' - 15 freq
fe - 10 freq
fi - 29 freq
'fae - 4 freq
f - 187 freq
fee - 39 freq
feow - 2 freq
fiew - 1 freq
fie - 3 freq
foo' - 3 freq
fay - 11 freq
fi' - 1 freq
'fee - 3 freq
ffi - 1 freq
'f' - 4 freq
fea - 4 freq
fow - 2 freq
faee - 1 freq
fah - 9 freq
foy - 35 freq
fy - 7 freq
ïf - 93 freq
'faa - 3 freq
'fu - 1 freq
fiy - 1 freq
foe - 9 freq
faa' - 1 freq
fae' - 1 freq
fvow - 1 freq
fiu - 1 freq
fbi - 1 freq
fæ - 13 freq
faye - 7 freq
fiow - 1 freq
”f - 4 freq
'fou - 1 freq
f'ou - 1 freq
fu-aye - 1 freq
€˜foo - 1 freq
°f - 1 freq
€˜fa - 2 freq
fyew - 8 freq
€œfa - 2 freq
€œfoo - 13 freq
€˜f - 2 freq
€˜fae - 1 freq
'fae' - 1 freq
fyou - 1 freq
€™fy - 1 freq
fau - 1 freq
€œfaa - 1 freq
fb - 27 freq
fvp - 1 freq
fhe - 1 freq
fai - 2 freq
fpu - 1 freq
fvy - 1 freq
fii - 1 freq
foie - 4 freq
fw - 3 freq
fpf - 1 freq
fp - 4 freq
fbpe - 2 freq
fh - 4 freq
fv - 3 freq
feuy - 1 freq
fwe - 1 freq
fao - 20 freq
ffa - 2 freq
fyi - 1 freq
f' - 1 freq
MetaPhone code - FY
fyow - 50 freq
fuyi - 2 freq
vyee - 8 freq
vyow - 6 freq
faye - 7 freq
fu-aye - 1 freq
fyew - 8 freq
fyou - 1 freq
fyi - 1 freq
FAYE
Time to execute Levenshtein function - 0.184615 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.310670 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027267 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040285 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.