A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to fa in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
fa (0) - 751 freq
fao (1) - 20 freq
ua (1) - 3 freq
ta (1) - 2534 freq
ƒa (1) - 1 freq
fas (1) - 17 freq
fw (1) - 3 freq
sfa (1) - 28 freq
²a (1) - 2 freq
pa (1) - 27 freq
qa (1) - 2 freq
-a (1) - 2 freq
va (1) - 5 freq
fat (1) - 297 freq
fe (1) - 10 freq
fad (1) - 1 freq
ft (1) - 19 freq
fay (1) - 11 freq
fau (1) - 1 freq
wa (1) - 148 freq
fag (1) - 118 freq
ff (1) - 7 freq
aa (1) - 7151 freq
ya (1) - 481 freq
ga (1) - 29 freq
Double Levenshtein - distance in brackets
fa (0) - 751 freq
fo (1) - 16 freq
fai (1) - 2 freq
ifa (1) - 4 freq
fay (1) - 11 freq
fau (1) - 1 freq
fae (1) - 9131 freq
fy (1) - 7 freq
f (1) - 187 freq
faa (1) - 281 freq
fea (1) - 4 freq
afa (1) - 12 freq
fu (1) - 379 freq
fe (1) - 10 freq
fi (1) - 29 freq
fao (1) - 20 freq
ha (2) - 181 freq
of (2) - 4117 freq
fv (2) - 3 freq
fp (2) - 4 freq
far (2) - 1549 freq
xa (2) - 3 freq
na (2) - 788 freq
ifi (2) - 1 freq
fx (2) - 4 freq
SoundEx code - F000
fae - 9131 freq
fey - 14 freq
fou - 318 freq
faw - 148 freq
few - 844 freq
fu - 379 freq
foo - 753 freq
faa - 281 freq
fa - 751 freq
ff - 7 freq
f'ae - 1 freq
f'ah - 1 freq
fue - 1 freq
'foo - 11 freq
'fa - 7 freq
fyow - 50 freq
fyowe - 28 freq
fuyi - 2 freq
fo - 16 freq
fu' - 47 freq
fa' - 17 freq
fe - 10 freq
fi - 29 freq
'fae - 4 freq
f - 187 freq
fee - 41 freq
feow - 2 freq
fiew - 1 freq
fie - 3 freq
foo' - 3 freq
fay - 11 freq
fi' - 1 freq
fffffffffff - 1 freq
fffffffffffffff-fh - 1 freq
fiu - 2 freq
'fee - 3 freq
ffi - 1 freq
'f' - 4 freq
fea - 4 freq
fow - 2 freq
faee - 1 freq
fah - 9 freq
foy - 35 freq
fy - 7 freq
ïf - 93 freq
'faa - 3 freq
'fu - 1 freq
fiy - 1 freq
foe - 9 freq
faa' - 1 freq
fae' - 1 freq
fvow - 1 freq
fbi - 1 freq
fæ - 13 freq
faye - 7 freq
fiow - 1 freq
”f - 4 freq
'fou - 1 freq
f'ou - 1 freq
fu-aye - 1 freq
‘foo - 1 freq
°f - 1 freq
‘fa - 2 freq
fyew - 8 freq
“fa - 2 freq
“foo - 13 freq
‘f - 2 freq
‘fae - 1 freq
'fae' - 1 freq
fyou - 1 freq
’fy - 1 freq
fau - 1 freq
“faa - 1 freq
fb - 27 freq
fvp - 1 freq
fhe - 1 freq
fai - 2 freq
fpu - 1 freq
fvy - 1 freq
fii - 1 freq
foie - 4 freq
fw - 3 freq
fpf - 1 freq
fp - 4 freq
fbpe - 2 freq
fh - 4 freq
fv - 3 freq
feuy - 1 freq
fwe - 1 freq
fao - 20 freq
ffa - 2 freq
fyi - 1 freq
f' - 1 freq
MetaPhone code - F
fae - 9131 freq
fey - 14 freq
fou - 318 freq
view - 246 freq
faw - 148 freq
few - 844 freq
fu - 379 freq
vo - 8 freq
foo - 753 freq
faa - 281 freq
fa - 751 freq
ff - 7 freq
v - 266 freq
vi - 84 freq
f'ae - 1 freq
f'ah - 1 freq
vii - 16 freq
viii - 13 freq
ve - 41 freq
fue - 1 freq
vou - 5 freq
'foo - 11 freq
'fa - 7 freq
fo - 16 freq
fu' - 47 freq
fa' - 17 freq
fe - 10 freq
fi - 29 freq
'fae - 4 freq
f - 187 freq
phew - 12 freq
via - 80 freq
fee - 41 freq
've - 12 freq
vow - 22 freq
vie - 4 freq
feow - 2 freq
fiew - 1 freq
fie - 3 freq
wyfe - 19 freq
voe - 26 freq
foo' - 3 freq
fay - 11 freq
wv' - 1 freq
fi' - 1 freq
v' - 1 freq
fffffffffff - 1 freq
phhhew - 1 freq
wyve - 8 freq
fiu - 2 freq
'fee - 3 freq
''ve - 1 freq
ffi - 1 freq
hfe - 2 freq
h've - 1 freq
'f' - 4 freq
fea - 4 freq
fow - 2 freq
faee - 1 freq
fah - 9 freq
foy - 35 freq
va - 5 freq
fy - 7 freq
vu - 4 freq
ïf - 93 freq
'faa - 3 freq
'fu - 1 freq
fiy - 1 freq
foe - 9 freq
faa' - 1 freq
'phew - 1 freq
fae' - 1 freq
voiee - 1 freq
'gh' - 1 freq
'vi - 1 freq
hygh - 1 freq
fæ - 13 freq
wyf - 5 freq
höve - 1 freq
voo - 3 freq
vuo' - 1 freq
fiow - 1 freq
vv - 1 freq
”f - 4 freq
'fou - 1 freq
f'ou - 1 freq
ph - 53 freq
gh - 5 freq
wüve - 1 freq
‘foo - 1 freq
°f - 1 freq
’ve - 825 freq
’v - 67 freq
‘fa - 2 freq
wyfie - 2 freq
“fa - 2 freq
“foo - 13 freq
wwf - 1 freq
‘f - 2 freq
‘fae - 1 freq
'fae' - 1 freq
’fy - 1 freq
fau - 1 freq
“w'uv - 1 freq
“faa - 1 freq
wfi - 1 freq
hyv - 1 freq
‘-gh - 1 freq
‘v - 1 freq
vy - 3 freq
vh - 4 freq
wf - 4 freq
yf - 4 freq
wv - 3 freq
fai - 2 freq
vw - 3 freq
ywv - 1 freq
hv - 4 freq
fii - 1 freq
y’iv - 1 freq
hfi - 1 freq
hf - 2 freq
foie - 4 freq
yfe - 1 freq
fw - 3 freq
vuh - 1 freq
vaa - 1 freq
yvh - 1 freq
yvio - 1 freq
fh - 4 freq
wfh - 1 freq
hvhh - 1 freq
feuy - 1 freq
yff - 1 freq
fao - 20 freq
'view - 1 freq
ywf - 1 freq
vaw - 1 freq
vvh - 1 freq
wgh - 1 freq
ffa - 2 freq
hvw - 1 freq
yv - 1 freq
f' - 1 freq
Manually curated - FA
fa - 751 freq
fall - 62 freq
falling - 10 freq
fallin - 48 freq
falls - 15 freq
fell - 719 freq
who - 1084 freq
whom - 17 freq
foo - 753 freq
fa - 751 freq
fa's - 80 freq
who's - 77 freq
whae - 450 freq
Time to execute Levenshtein function - 0.241148 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
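As a rough illustration of the definition above, here is a minimal Python sketch of the distance and of a nearest-neighbour lookup. It is not the code behind this page: the function names, the `max_distance` parameter and the tiny frequency table (a few words and counts copied from the results above) are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for insert, delete and replace."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word: str, frequencies: dict, max_distance: int = 1) -> list:
    """Return (candidate, distance, freq) tuples, closest and most frequent first."""
    hits = []
    for candidate, freq in frequencies.items():
        distance = levenshtein(word, candidate)
        if distance <= max_distance:
            hits.append((candidate, distance, freq))
    return sorted(hits, key=lambda hit: (hit[1], -hit[2]))


# Illustrative sample only: a handful of words and counts from the listing.
SAMPLE = {"fae": 9131, "fat": 297, "far": 1549, "of": 4117}

if __name__ == "__main__":
    for candidate, distance, freq in nearest("fa", SAMPLE):
        print(f"{candidate} ({distance}) - {freq} freq")
```

Sorting by distance first and frequency second is only one possible ordering; the page itself may rank differently.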
Time to execute Double Levenshtein function - 0.453482 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
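The idea can be sketched in a few lines of Python. This is an interpretation of the description, not the site's implementation: the helper names are made up for the example, and the vowel set (a, e, i, o, u, y) is an assumption. Treating y as a vowel appears consistent with scores such as fy (1) and far (2) in the results, but the page does not state it.

```python
VOWELS = set("aeiouy")  # assumption: the page does not say which letters count


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,
                current[j - 1] + 1,
                previous[j - 1] + (ch_a != ch_b),
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ("fae", "ha", "far", "of"):
        print(other, double_levenshtein("fa", other))
```

A vowel change such as fa to fae costs 1 because the consonant skeletons are identical, while a consonant change such as fa to ha costs 2 because it is counted in both passes.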
Time to execute SoundEx function - 0.065889 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
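For readers who want to see how a word ends up with a code such as F000, the following is a minimal Python sketch of the standard American Soundex rules. It is an illustration only and may differ in edge cases from whatever function this page calls; in particular, dropping non-ASCII letters and punctuation before encoding is an assumption of the sketch.

```python
# Consonant classes of American Soundex; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    # Assumption: only ASCII letters take part; quotes, hyphens etc. are ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    previous = CODES.get(first, "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        if ch not in "hw":  # h and w do not separate letters with the same code
            previous = code
    # Keep the first letter, pad with zeros, and cut to four characters.
    return (first.upper() + "".join(digits) + "000")[:4]


if __name__ == "__main__":
    for word in ("fa", "fae", "few", "fbi", "Robert"):
        print(word, soundex(word))
```

Because only consonants after the first letter contribute digits, short words that start with f and otherwise contain only vowels, h, w, y or further labials (b, f, p, v) all collapse to F000, which is why that group is so large.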
Time to execute MetaPhone function - 0.072084 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000917 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
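A lookup of this kind can be sketched with a plain dictionary. Everything in the example is hypothetical: the table layout, the function name and the handful of entries, which are loosely modelled on the FA results above and are not taken from the real lexicon.

```python
# Hypothetical hand-made lexicon: surface form -> lemmas it may belong to.
# The entries are illustrative only, not the corpus's actual table.
LEXICON = {
    "fa": ["fall", "who"],
    "fall": ["fall"],
    "fallin": ["fall"],
    "falls": ["fall"],
    "fell": ["fall"],
    "who": ["who"],
    "whom": ["who"],
    "whae": ["who"],
}


def related_forms(word: str) -> list:
    """Return every form in the lexicon that shares a lemma with `word`."""
    lemmas = set(LEXICON.get(word.lower(), ()))
    return sorted(
        form for form, form_lemmas in LEXICON.items()
        if lemmas & set(form_lemmas)
    )


if __name__ == "__main__":
    print(related_forms("fa"))
    print(related_forms("ahint"))  # not in the sample table, so nothing is returned
```

A dictionary lookup needs no string comparison at query time, which fits with this method being the fastest of the five timings reported above, at the cost of returning nothing for words nobody has entered.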