A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to a-z in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
a-z (0) - 13 freq
az (1) - 1 freq
ayz (1) - 1 freq
a-a (1) - 1 freq
a- (1) - 4 freq
abz (1) - 17 freq
arz (1) - 1 freq
adz (1) - 1 freq
rzz (2) - 1 freq
yaz (2) - 1 freq
qjz (2) - 1 freq
baez (2) - 1 freq
aiy (2) - 4 freq
vcz (2) - 1 freq
-c (2) - 2 freq
amd (2) - 2 freq
ayw (2) - 1 freq
viz (2) - 6 freq
ag (2) - 12 freq
azl (2) - 1 freq
apg (2) - 1 freq
aug (2) - 6 freq
a'a (2) - 77 freq
baaz (2) - 1 freq
ard (2) - 4 freq
a-z (0) - 13 freq
abz (2) - 17 freq
arz (2) - 1 freq
a- (2) - 4 freq
adz (2) - 1 freq
a-a (2) - 1 freq
az (2) - 1 freq
ayz (2) - 1 freq
ahaz (3) - 4 freq
z (3) - 119 freq
uz (3) - 50 freq
kiz (3) - 18 freq
o'z (3) - 6 freq
iz (3) - 403 freq
fz (3) - 12 freq
ayxz (3) - 1 freq
emz (3) - 4 freq
otz (3) - 1 freq
evz (3) - 1 freq
lyz (3) - 1 freq
goz (3) - 1 freq
siz (3) - 3 freq
xz (3) - 4 freq
eez (3) - 30 freq
moz (3) - 1 freq
SoundEx code - A200
as - 17482 freq
ach - 306 freq
'as - 15 freq
aiks - 3 freq
ajee - 21 freq
ash - 31 freq
ask - 518 freq
ago - 388 freq
aes - 4 freq
ayewis - 156 freq
ack - 42 freq
aige - 11 freq
age - 409 freq
ak - 12 freq
ayeweys - 87 freq
aw's - 3 freq
awauk - 11 freq
ache - 14 freq
aik - 55 freq
aggie - 49 freq
ayweys - 25 freq
ags - 1 freq
ask-' - 1 freq
ashe - 1 freq
asks - 204 freq
awake - 29 freq
aix - 42 freq
aisy - 27 freq
aigs - 1 freq
ax - 76 freq
assay - 5 freq
'ach - 27 freq
azzy - 25 freq
ahs - 1 freq
'azzy - 3 freq
ac - 24 freq
'awk - 1 freq
ask' - 1 freq
aywiss - 7 freq
'awchhh - 1 freq
aa-weys - 1 freq
axe - 22 freq
aaweys - 5 freq
aik's - 2 freq
acks - 11 freq
aywis - 14 freq
ayewiz - 44 freq
aywiz - 1 freq
ayewes - 10 freq
aqua - 12 freq
ace - 19 freq
ahoj - 2 freq
ag - 12 freq
auche - 1 freq
awoke - 10 freq
aish - 2 freq
awash - 2 freq
aise - 14 freq
aisie - 11 freq
a''ways - 1 freq
a'ways - 3 freq
ass - 26 freq
ayweis - 2 freq
ask's - 2 freq
ayeweis - 4 freq
aij - 2 freq
auch - 3 freq
ayeways - 24 freq
askew - 2 freq
aks - 18 freq
'ask - 1 freq
aas - 6 freq
akh - 1 freq
aussie - 1 freq
asia - 12 freq
aa's - 4 freq
ayewyes - 9 freq
agai - 2 freq
'ach' - 1 freq
agi - 1 freq
ahice - 1 freq
ahk - 1 freq
ahhce - 1 freq
aka - 12 freq
aye-wiz - 20 freq
aese - 8 freq
agh - 11 freq
aiwyes - 1 freq
'age - 4 freq
'agh - 2 freq
ayes - 2 freq
ah'g - 1 freq
a's - 2 freq
ahj - 1 freq
asa - 5 freq
ahaz - 4 freq
aq - 2 freq
ax' - 1 freq
ais - 18 freq
agae - 2 freq
aga - 8 freq
a-z - 13 freq
'ayewis - 3 freq
assie - 1 freq
aiggs - 4 freq
¬‚ask - 1 freq
aesy - 57 freq
aigg - 6 freq
'aussie - 1 freq
ayewss - 1 freq
auga - 1 freq
as-she - 1 freq
ayoch - 1 freq
a'ess - 1 freq
acce - 1 freq
aisk - 4 freq
ayesha - 1 freq
age' - 1 freq
aich - 2 freq
asky - 2 freq
akse - 1 freq
aiss - 3 freq
aaagh - 1 freq
ase - 1 freq
aisse - 4 freq
aus - 4 freq
agg - 6 freq
agg's - 1 freq
aach - 1 freq
€œach - 16 freq
€˜ach - 3 freq
€˜as - 8 freq
aagh - 1 freq
€œas - 14 freq
aze - 2 freq
ake - 7 freq
aywise - 1 freq
aywyes - 1 freq
aawyes - 1 freq
ajie - 3 freq
awyes - 3 freq
ayways - 2 freq
aikey - 4 freq
aw-weys - 1 freq
aesie - 2 freq
awk - 1 freq
awak - 1 freq
aug - 6 freq
aeyways - 1 freq
ay'ways - 1 freq
aeways - 3 freq
auss - 2 freq
€™as - 1 freq
aawise - 1 freq
awis - 3 freq
€˜ago - 2 freq
aig - 1 freq
ayewys - 1 freq
agee - 1 freq
awch - 2 freq
aj - 6 freq
akio - 1 freq
aiko - 1 freq
akxjz - 1 freq
aÂ’s - 1 freq
az - 1 freq
ayj - 1 freq
aezzc - 1 freq
aocau - 1 freq
aecc - 5 freq
ahg - 1 freq
asjjx - 1 freq
ayz - 1 freq
awayego - 1 freq
assq - 1 freq
ajya - 1 freq
aaywiz - 1 freq
aghw - 1 freq
axae - 1 freq
axj - 1 freq
aggggh - 1 freq
agaaaaaa - 1 freq
aÂ’wik - 1 freq
agcc - 5 freq
ajay - 2 freq
a'yese - 1 freq
ahgxo - 1 freq
aaach - 1 freq
aws - 1 freq
aghhhhhhhhh - 1 freq
aghhhhhhhhhh - 1 freq
“as - 1 freq
aqs - 1 freq
ayÂ’s - 1 freq
aikawa - 1 freq
aox - 1 freq
aoxckse - 1 freq
aic - 2 freq
aoagaw - 1 freq
aways - 58 freq
ayechihuahua - 1 freq
aussie' - 1 freq
auk - 1 freq
aye's - 1 freq
a'sae - 1 freq
awsae - 3 freq
aaaaagh - 1 freq
aok - 1 freq
ayxz - 1 freq
awjh - 1 freq
aqsc - 1 freq
akk - 1 freq
agyw - 1 freq
agj - 1 freq
awys - 1 freq
aque - 1 freq
ayq - 1 freq
auqi - 1 freq
aaways - 1 freq
'ack - 2 freq
MetaPhone code - AS
as - 17482 freq
'as - 15 freq
aw's - 3 freq
aisy - 27 freq
assay - 5 freq
azzy - 25 freq
ahs - 1 freq
'azzy - 3 freq
ace - 19 freq
aise - 14 freq
aisie - 11 freq
ass - 26 freq
aas - 6 freq
aussie - 1 freq
aa's - 4 freq
ahhce - 1 freq
a's - 2 freq
asa - 5 freq
ais - 18 freq
a-z - 13 freq
assie - 1 freq
'aussie - 1 freq
a'ess - 1 freq
aiss - 3 freq
ase - 1 freq
aisse - 4 freq
aus - 4 freq
€˜as - 8 freq
€œas - 14 freq
aze - 2 freq
auss - 2 freq
€™as - 1 freq
aÂ’s - 1 freq
az - 1 freq
ayz - 1 freq
aws - 1 freq
“as - 1 freq
ayÂ’s - 1 freq
aussie' - 1 freq
a'sae - 1 freq
awsae - 3 freq
awys - 1 freq
A-Z
Time to execute Levenshtein function - 0.499639 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.726196 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.074169 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.080454 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001073 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.