A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to xauipy in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
xauipy (0) - 1 freq
daiy (3) - 1 freq
guiy (3) - 1 freq
parity (3) - 2 freq
easily (3) - 60 freq
equip (3) - 1 freq
amity (3) - 2 freq
equity (3) - 7 freq
juicy (3) - 15 freq
rainy (3) - 16 freq
buiy (3) - 5 freq
wauy (3) - 1 freq
nappy (3) - 15 freq
wuiy (3) - 15 freq
baiy (3) - 1 freq
awuiy (3) - 1 freq
aiy (3) - 4 freq
aup (3) - 1 freq
tuip (3) - 4 freq
wuidy (3) - 1 freq
tappy (3) - 1 freq
aisy (3) - 27 freq
causey (3) - 25 freq
hazily (3) - 1 freq
xjuiu (3) - 1 freq
xauipy (0) - 1 freq
jaup (4) - 3 freq
xp (4) - 3 freq
appy (4) - 1 freq
quip (4) - 2 freq
gaip (4) - 1 freq
xavi (4) - 1 freq
caup (4) - 11 freq
raip (4) - 32 freq
kaip (4) - 2 freq
gaup (4) - 4 freq
caip (4) - 3 freq
aip (4) - 10 freq
paiy (4) - 8 freq
maup (4) - 1 freq
yubpy (4) - 4 freq
soupy (4) - 1 freq
yappy (4) - 1 freq
haip (4) - 7 freq
saip (4) - 11 freq
apy (4) - 1 freq
paipe (4) - 1 freq
xihy (4) - 1 freq
xjuiu (4) - 1 freq
uvpy (4) - 1 freq
SoundEx code - X100
xiv - 4 freq
xv - 6 freq
xvi - 3 freq
xvii - 1 freq
xviii - 1 freq
xxiv - 1 freq
xxv - 2 freq
xxvi - 1 freq
xxxv - 1 freq
xcf - 1 freq
xjb - 1 freq
xp - 3 freq
xvo - 1 freq
xfbv - 2 freq
xf - 6 freq
xwpa - 1 freq
xhb - 1 freq
xb - 3 freq
xqp - 1 freq
xzv - 1 freq
xavi - 1 freq
xbo - 1 freq
xfu - 1 freq
xauipy - 1 freq
xcb - 1 freq
xfaaa - 1 freq
xeb - 1 freq
xgqbp - 1 freq
MetaPhone code - SP
sepo - 34 freq
soup - 243 freq
s'up - 1 freq
soup' - 6 freq
soo-oop - 29 freq
sype - 4 freq
soop - 17 freq
sappy - 19 freq
spy - 35 freq
zip - 10 freq
sip - 28 freq
suppie - 24 freq
sup - 53 freq
sappie - 12 freq
soapy - 7 freq
spew - 21 freq
seep - 5 freq
soap - 45 freq
spie - 3 freq
sap - 10 freq
sowp - 3 freq
soopaa - 2 freq
spae - 7 freq
spee - 1 freq
zippy - 3 freq
spa - 6 freq
sp - 8 freq
sepia - 2 freq
soep - 1 freq
speh - 1 freq
spey - 4 freq
sope - 3 freq
sop - 4 freq
cepp - 1 freq
sappho - 8 freq
saip - 11 freq
cep - 1 freq
soo--oop - 1 freq
sopp - 1 freq
supp - 2 freq
€˜spey - 1 freq
sapie - 1 freq
'sup - 1 freq
suppy - 1 freq
swype - 4 freq
spaee - 1 freq
soupy - 1 freq
ssp - 8 freq
€œsup - 1 freq
zap - 1 freq
xp - 3 freq
xwpa - 1 freq
spewy - 1 freq
spÂ’y - 1 freq
zp - 6 freq
ssoap - 1 freq
xauipy - 1 freq
spo - 1 freq
spi - 1 freq
zppow - 2 freq
cyp - 1 freq
spu - 1 freq
spow - 1 freq
supw - 1 freq
zippo - 1 freq
XAUIPY
Time to execute Levenshtein function - 0.183315 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.340138 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027799 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036977 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000895 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.