A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to zns in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
zns (0) - 1 freq
zs (1) - 5 freq
zvs (1) - 1 freq
zis (1) - 2 freq
zn (1) - 1 freq
jns (1) - 1 freq
ans (1) - 2 freq
ins (1) - 14 freq
znn (1) - 1 freq
ens (1) - 16 freq
ons (1) - 1 freq
zss (1) - 1 freq
ns (1) - 15 freq
rns (1) - 5 freq
znv (1) - 1 freq
zys (1) - 1 freq
nms (2) - 2 freq
snx (2) - 1 freq
insy (2) - 4 freq
'us (2) - 3 freq
gos (2) - 1 freq
pms (2) - 7 freq
bs (2) - 6 freq
any (2) - 719 freq
one (2) - 766 freq
zns (0) - 1 freq
ns (2) - 15 freq
zss (2) - 1 freq
znv (2) - 1 freq
ons (2) - 1 freq
zys (2) - 1 freq
zynes (2) - 3 freq
zones (2) - 3 freq
ens (2) - 16 freq
rns (2) - 5 freq
zs (2) - 5 freq
znn (2) - 1 freq
zis (2) - 2 freq
zvs (2) - 1 freq
zn (2) - 1 freq
ans (2) - 2 freq
jns (2) - 1 freq
ins (2) - 14 freq
zeus (3) - 12 freq
zinc (3) - 5 freq
zeno (3) - 1 freq
sons (3) - 92 freq
enns (3) - 11 freq
zyne (3) - 4 freq
aans (3) - 1 freq
SoundEx code - Z520
zoink - 16 freq
zoink's - 2 freq
zinc - 5 freq
zing - 3 freq
zones - 3 freq
zink - 1 freq
zynes - 3 freq
zheng - 1 freq
zonks - 1 freq
znzq - 1 freq
zmigi - 1 freq
zunc - 1 freq
zns - 1 freq
zymsk - 1 freq
MetaPhone code - SNS
suns - 15 freq
since - 586 freq
sense - 527 freq
soons - 31 freq
sons - 92 freq
sneeze - 24 freq
sins - 59 freq
seein's - 7 freq
snooze - 19 freq
sonsie - 54 freq
souns - 52 freq
sinse - 20 freq
snaws - 16 freq
séance - 1 freq
science - 76 freq
sun's - 21 freq
sonsy - 10 freq
'seein's - 1 freq
'seence - 1 freq
suin's - 3 freq
scenes - 46 freq
sens - 6 freq
sains - 4 freq
wysins - 5 freq
synes - 1 freq
sauns - 3 freq
seen's - 2 freq
seyn's - 2 freq
saan's - 1 freq
syne's - 1 freq
sannies - 9 freq
son's - 5 freq
'since - 1 freq
soon's - 1 freq
scene's - 1 freq
siine's - 1 freq
seance - 2 freq
seeins - 2 freq
sawney's - 3 freq
cence - 3 freq
sciencey - 1 freq
sïns - 12 freq
sinns - 4 freq
senns - 2 freq
sen's - 2 freq
saunnies - 3 freq
senzie - 1 freq
sans - 7 freq
sunny's - 9 freq
soun's - 1 freq
sannis - 1 freq
suins - 3 freq
sin's - 1 freq
zones - 3 freq
sinews - 2 freq
sonsi - 1 freq
wycins - 3 freq
saains - 1 freq
sanns - 3 freq
sune's - 1 freq
sonnis - 1 freq
'science - 1 freq
zynes - 3 freq
snhs - 3 freq
€˜since - 4 freq
€œsince - 1 freq
sonse - 5 freq
so-an-so - 1 freq
sean's - 1 freq
'suenos' - 1 freq
snoozey - 1 freq
sonÂ’s - 2 freq
sence - 1 freq
sines - 1 freq
sony's - 1 freq
saunies - 1 freq
zns - 1 freq
snow's - 1 freq
souness - 1 freq
ZNS
Time to execute Levenshtein function - 0.214361 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.398835 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034816 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040761 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000987 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.