A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to top in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
top (0) - 313 freq
tp (1) - 5 freq
rop (1) - 6 freq
toe (1) - 21 freq
dop (1) - 1 freq
to' (1) - 2 freq
tot (1) - 4 freq
stop (1) - 512 freq
nop (1) - 1 freq
tod (1) - 275 freq
toc (1) - 2 freq
toy (1) - 44 freq
tos (1) - 3 freq
toh (1) - 4 freq
op (1) - 7 freq
mop (1) - 16 freq
sop (1) - 4 freq
tops (1) - 21 freq
tup (1) - 6 freq
tom (1) - 134 freq
toq (1) - 1 freq
tcp (1) - 5 freq
toi (1) - 34 freq
kop (1) - 2 freq
toa (1) - 2 freq
top (0) - 313 freq
tap (1) - 757 freq
toap (1) - 37 freq
tip (1) - 78 freq
tup (1) - 6 freq
atop (1) - 5 freq
tp (1) - 5 freq
tow (2) - 44 freq
ton (2) - 14 freq
pop (2) - 80 freq
tot (2) - 4 freq
to (2) - 4049 freq
tok (2) - 2 freq
hop (2) - 13 freq
toe (2) - 21 freq
rop (2) - 6 freq
tor (2) - 4 freq
ytp (2) - 5 freq
teip (2) - 2 freq
typo (2) - 3 freq
tape (2) - 29 freq
tuip (2) - 4 freq
atap (2) - 23 freq
type (2) - 93 freq
oop (2) - 2 freq
SoundEx code - T100
tap - 757 freq
thief - 66 freq
tapiwa - 19 freq
tap-ee-wah - 1 freq
'thief - 2 freq
'they've - 7 freq
they've - 203 freq
tyaave - 2 freq
tae've - 4 freq
toffee - 26 freq
toap - 37 freq
th've - 1 freq
tf - 9 freq
tv - 207 freq
tie-up - 1 freq
tip - 78 freq
type - 93 freq
tuip - 4 freq
top - 313 freq
they'veee - 1 freq
toffey - 1 freq
tippy - 2 freq
tyauve - 20 freq
tube - 46 freq
toff - 12 freq
t've - 4 freq
tawpie - 1 freq
theif - 4 freq
tub - 22 freq
tubby - 1 freq
taffy - 1 freq
thieve - 3 freq
thay've - 2 freq
'tap' - 1 freq
tup - 6 freq
tubie - 1 freq
tibbie - 10 freq
thiv - 19 freq
tippa - 4 freq
taboo - 4 freq
they'v - 6 freq
tape - 29 freq
thev - 3 freq
t'v - 1 freq
thi've - 3 freq
tup- - 1 freq
tippie - 3 freq
tofu - 2 freq
thuv - 5 freq
tovey - 2 freq
tibby - 16 freq
toff' - 1 freq
tif - 1 freq
thay'v - 4 freq
the've - 1 freq
't've - 1 freq
tab - 6 freq
tiff - 1 freq
tb - 14 freq
toby - 7 freq
tabby - 15 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
tabbie - 1 freq
tuba - 1 freq
tove - 3 freq
t'bie - 1 freq
they-ye've - 1 freq
toffy - 2 freq
'tap- - 1 freq
tv' - 1 freq
tiefe - 1 freq
thai've - 2 freq
teeff - 1 freq
thof - 4 freq
tib - 10 freq
toffie - 1 freq
teip - 2 freq
€™tap - 1 freq
taffee - 1 freq
tbe - 1 freq
€˜tip - 1 freq
€¦tap - 11 freq
typhoo - 1 freq
ttip - 2 freq
toyboy - 1 freq
€œthief - 1 freq
€˜tabby - 1 freq
€œtap - 1 freq
teepee - 5 freq
tappy - 1 freq
tubey - 1 freq
toaffay - 1 freq
tdup - 1 freq
tupee - 1 freq
toooooopay - 1 freq
tp - 5 freq
tbh - 35 freq
tfi - 2 freq
tipp - 1 freq
theyÂ’ve - 5 freq
tbw - 1 freq
tpea - 1 freq
tuff - 6 freq
typo - 3 freq
teevee - 1 freq
tvh - 1 freq
taf - 1 freq
tbf - 3 freq
thu've - 4 freq
tyauv - 1 freq
toof - 1 freq
tawbu - 1 freq
'tabu' - 1 freq
tpu - 1 freq
theyve - 4 freq
tuffy - 1 freq
theve - 1 freq
to've - 1 freq
tvah - 1 freq
MetaPhone code - TP
tap - 757 freq
deep - 562 freq
dip - 21 freq
toap - 37 freq
tie-up - 1 freq
tip - 78 freq
type - 93 freq
dowp - 53 freq
tuip - 4 freq
ytp - 5 freq
dope - 11 freq
top - 313 freq
tippy - 2 freq
tawpie - 1 freq
deip - 14 freq
doup - 18 freq
'tap' - 1 freq
tup - 6 freq
dap - 1 freq
tippa - 4 freq
tape - 29 freq
dwp - 2 freq
http - 482 freq
tup- - 1 freq
tippie - 3 freq
dopey - 1 freq
'deep' - 1 freq
daep - 1 freq
dee-eep - 2 freq
diep - 3 freq
'tap- - 1 freq
'dowp - 1 freq
depe - 1 freq
teip - 2 freq
€™tap - 1 freq
depp - 1 freq
€˜tip - 1 freq
€¦tap - 11 freq
dupe - 1 freq
ttip - 2 freq
€œtap - 1 freq
teepee - 5 freq
tappy - 1 freq
€œdeep - 1 freq
dup - 5 freq
tupee - 1 freq
toooooopay - 1 freq
dp - 3 freq
tp - 5 freq
htp - 1 freq
tipp - 1 freq
ydp - 1 freq
‘deep - 1 freq
tpea - 1 freq
typo - 3 freq
dpe - 2 freq
dop - 1 freq
dep - 2 freq
tpu - 1 freq
TOP
tap - 757 freq
top - 313 freq
taps - 59 freq
tops - 21 freq
tappin - 20 freq
Time to execute Levenshtein function - 0.200335 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.349312 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027735 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036680 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000849 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.