A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to type in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
type (0) - 93 freq
tyde (1) - 24 freq
typo (1) - 3 freq
tyle (1) - 1 freq
tyme (1) - 219 freq
tyse (1) - 1 freq
pype (1) - 9 freq
tyke (1) - 27 freq
tyne (1) - 30 freq
tyre (1) - 17 freq
sype (1) - 4 freq
types (1) - 27 freq
tape (1) - 29 freq
tye (1) - 8 freq
gype (1) - 47 freq
typed (1) - 10 freq
rype (1) - 5 freq
hype (1) - 6 freq
lyke (2) - 124 freq
jape (2) - 6 freq
tre (2) - 5 freq
thyme (2) - 4 freq
twyse (2) - 1 freq
tyed (2) - 2 freq
yle (2) - 2 freq
type (0) - 93 freq
typo (1) - 3 freq
tape (1) - 29 freq
typed (2) - 10 freq
hype (2) - 6 freq
rype (2) - 5 freq
tup (2) - 6 freq
tap (2) - 757 freq
tpea (2) - 1 freq
tpu (2) - 1 freq
tip (2) - 78 freq
top (2) - 313 freq
gype (2) - 47 freq
tp (2) - 5 freq
tupee (2) - 1 freq
tyse (2) - 1 freq
tyne (2) - 30 freq
tyke (2) - 27 freq
tye (2) - 8 freq
pype (2) - 9 freq
tyre (2) - 17 freq
tyme (2) - 219 freq
tyle (2) - 1 freq
sype (2) - 4 freq
types (2) - 27 freq
SoundEx code - T100
tap - 757 freq
thief - 66 freq
tapiwa - 19 freq
tap-ee-wah - 1 freq
'thief - 2 freq
'they've - 7 freq
they've - 203 freq
tyaave - 2 freq
tae've - 4 freq
toffee - 26 freq
toap - 37 freq
th've - 1 freq
tf - 9 freq
tv - 207 freq
tie-up - 1 freq
tip - 78 freq
type - 93 freq
tuip - 4 freq
top - 313 freq
they'veee - 1 freq
toffey - 1 freq
tippy - 2 freq
tyauve - 20 freq
tube - 46 freq
toff - 12 freq
t've - 4 freq
tawpie - 1 freq
theif - 4 freq
tub - 22 freq
tubby - 1 freq
taffy - 1 freq
thieve - 3 freq
thay've - 2 freq
'tap' - 1 freq
tup - 6 freq
tubie - 1 freq
tibbie - 10 freq
thiv - 19 freq
tippa - 4 freq
taboo - 4 freq
they'v - 6 freq
tape - 29 freq
thev - 3 freq
t'v - 1 freq
thi've - 3 freq
tup- - 1 freq
tippie - 3 freq
tofu - 2 freq
thuv - 5 freq
tovey - 2 freq
tibby - 16 freq
toff' - 1 freq
tif - 1 freq
thay'v - 4 freq
the've - 1 freq
't've - 1 freq
tab - 6 freq
tiff - 1 freq
tb - 14 freq
toby - 7 freq
tabby - 15 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
tabbie - 1 freq
tuba - 1 freq
tove - 3 freq
t'bie - 1 freq
they-ye've - 1 freq
toffy - 2 freq
'tap- - 1 freq
tv' - 1 freq
tiefe - 1 freq
thai've - 2 freq
teeff - 1 freq
thof - 4 freq
tib - 10 freq
toffie - 1 freq
teip - 2 freq
€™tap - 1 freq
taffee - 1 freq
tbe - 1 freq
€˜tip - 1 freq
€¦tap - 11 freq
typhoo - 1 freq
ttip - 2 freq
toyboy - 1 freq
€œthief - 1 freq
€˜tabby - 1 freq
€œtap - 1 freq
teepee - 5 freq
tappy - 1 freq
tubey - 1 freq
toaffay - 1 freq
tdup - 1 freq
tupee - 1 freq
toooooopay - 1 freq
tp - 5 freq
tbh - 35 freq
tfi - 2 freq
tipp - 1 freq
theyÂ’ve - 5 freq
tbw - 1 freq
tpea - 1 freq
tuff - 6 freq
typo - 3 freq
teevee - 1 freq
tvh - 1 freq
taf - 1 freq
tbf - 3 freq
thu've - 4 freq
tyauv - 1 freq
toof - 1 freq
tawbu - 1 freq
'tabu' - 1 freq
tpu - 1 freq
theyve - 4 freq
tuffy - 1 freq
theve - 1 freq
to've - 1 freq
tvah - 1 freq
MetaPhone code - TP
tap - 757 freq
deep - 562 freq
dip - 21 freq
toap - 37 freq
tie-up - 1 freq
tip - 78 freq
type - 93 freq
dowp - 53 freq
tuip - 4 freq
ytp - 5 freq
dope - 11 freq
top - 313 freq
tippy - 2 freq
tawpie - 1 freq
deip - 14 freq
doup - 18 freq
'tap' - 1 freq
tup - 6 freq
dap - 1 freq
tippa - 4 freq
tape - 29 freq
dwp - 2 freq
http - 482 freq
tup- - 1 freq
tippie - 3 freq
dopey - 1 freq
'deep' - 1 freq
daep - 1 freq
dee-eep - 2 freq
diep - 3 freq
'tap- - 1 freq
'dowp - 1 freq
depe - 1 freq
teip - 2 freq
€™tap - 1 freq
depp - 1 freq
€˜tip - 1 freq
€¦tap - 11 freq
dupe - 1 freq
ttip - 2 freq
€œtap - 1 freq
teepee - 5 freq
tappy - 1 freq
€œdeep - 1 freq
dup - 5 freq
tupee - 1 freq
toooooopay - 1 freq
dp - 3 freq
tp - 5 freq
htp - 1 freq
tipp - 1 freq
ydp - 1 freq
‘deep - 1 freq
tpea - 1 freq
typo - 3 freq
dpe - 2 freq
dop - 1 freq
dep - 2 freq
tpu - 1 freq
TYPE
Time to execute Levenshtein function - 0.398479 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.785885 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.087746 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.097292 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.