A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to thiv in Corpus

Levenshtein
thiv (0) - 19 freq
thig (1) - 3 freq
thin (1) - 317 freq
'hiv (1) - 5 freq
thie (1) - 8 freq
thi (1) - 2576 freq
thiz (1) - 1 freq
thik (1) - 3 freq
thim (1) - 193 freq
thir (1) - 1508 freq
shiv (1) - 3 freq
thit (1) - 566 freq
this (1) - 11106 freq
thil (1) - 2 freq
thid (1) - 1 freq
thev (1) - 3 freq
hiv (1) - 1171 freq
thuv (1) - 5 freq
thou (2) - 95 freq
tuin (2) - 27 freq
hin (2) - 35 freq
thor (2) - 20 freq
tin (2) - 180 freq
tik (2) - 4 freq
trim (2) - 15 freq
Double Levenshtein
thiv (0) - 19 freq
thuv (1) - 5 freq
thev (1) - 3 freq
'hiv (2) - 5 freq
thid (2) - 1 freq
thil (2) - 2 freq
thin (2) - 317 freq
theve (2) - 1 freq
thieve (2) - 3 freq
thig (2) - 3 freq
this (2) - 11106 freq
hiv (2) - 1171 freq
thi (2) - 2576 freq
thit (2) - 566 freq
thiz (2) - 1 freq
thie (2) - 8 freq
thik (2) - 3 freq
thir (2) - 1508 freq
shiv (2) - 3 freq
thim (2) - 193 freq
hav (3) - 9 freq
theyve (3) - 4 freq
thd (3) - 1 freq
thief (3) - 66 freq
tulv (3) - 1 freq
SoundEx code - T100
tap - 757 freq
thief - 66 freq
tapiwa - 19 freq
tap-ee-wah - 1 freq
'thief - 2 freq
'they've - 7 freq
they've - 203 freq
tyaave - 2 freq
tae've - 4 freq
toffee - 26 freq
toap - 37 freq
th've - 1 freq
tf - 9 freq
tv - 207 freq
tie-up - 1 freq
tip - 78 freq
type - 93 freq
tuip - 4 freq
top - 313 freq
they'veee - 1 freq
toffey - 1 freq
tippy - 2 freq
tyauve - 20 freq
tube - 46 freq
toff - 12 freq
t've - 4 freq
tawpie - 1 freq
theif - 4 freq
tub - 22 freq
tubby - 1 freq
taffy - 1 freq
thieve - 3 freq
thay've - 2 freq
'tap' - 1 freq
tup - 6 freq
tubie - 1 freq
tibbie - 10 freq
thiv - 19 freq
tippa - 4 freq
taboo - 4 freq
they'v - 6 freq
tape - 29 freq
thev - 3 freq
t'v - 1 freq
thi've - 3 freq
tup- - 1 freq
tippie - 3 freq
tofu - 2 freq
thuv - 5 freq
tovey - 2 freq
tibby - 16 freq
toff' - 1 freq
tif - 1 freq
thay'v - 4 freq
the've - 1 freq
't've - 1 freq
tab - 6 freq
tiff - 1 freq
tb - 14 freq
toby - 7 freq
tabby - 15 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
tabbie - 1 freq
tuba - 1 freq
tove - 3 freq
t'bie - 1 freq
they-ye've - 1 freq
toffy - 2 freq
'tap- - 1 freq
tv' - 1 freq
tiefe - 1 freq
thai've - 2 freq
teeff - 1 freq
thof - 4 freq
tib - 10 freq
toffie - 1 freq
teip - 2 freq
’tap - 1 freq
taffee - 1 freq
tbe - 1 freq
‘tip - 1 freq
…tap - 11 freq
typhoo - 1 freq
ttip - 2 freq
toyboy - 1 freq
“thief - 1 freq
‘tabby - 1 freq
“tap - 1 freq
teepee - 5 freq
tappy - 1 freq
tubey - 1 freq
toaffay - 1 freq
tdup - 1 freq
tupee - 1 freq
toooooopay - 1 freq
tp - 5 freq
tbh - 35 freq
tfi - 2 freq
tipp - 1 freq
they’ve - 5 freq
tbw - 1 freq
tpea - 1 freq
tuff - 6 freq
typo - 3 freq
teevee - 1 freq
tvh - 1 freq
taf - 1 freq
tbf - 3 freq
thu've - 4 freq
tyauv - 1 freq
toof - 1 freq
tawbu - 1 freq
'tabu' - 1 freq
tpu - 1 freq
theyve - 4 freq
tuffy - 1 freq
theve - 1 freq
to've - 1 freq
tvah - 1 freq
MetaPhone code - 0F
thief - 66 freq
'thief - 2 freq
'they've - 7 freq
they've - 203 freq
th've - 1 freq
they'veee - 1 freq
theif - 4 freq
thieve - 3 freq
thay've - 2 freq
thiv - 19 freq
thigh - 16 freq
they'v - 6 freq
thev - 3 freq
thi've - 3 freq
thuv - 5 freq
thay'v - 4 freq
the've - 1 freq
thai've - 2 freq
thof - 4 freq
“thief - 1 freq
they’ve - 5 freq
thu've - 4 freq
theyve - 4 freq
theve - 1 freq
Manually curated
THIV
Time to execute Levenshtein function - 0.194321 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
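As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here purely for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the prefix of a handled
    # so far and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("thiv", "thuv"))  # 1
print(levenshtein("thiv", "hiv"))   # 1
print(levenshtein("thiv", "thou"))  # 2
```

The three values printed agree with the distances shown in brackets in the Levenshtein list for thiv.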
Time to execute Double Levenshtein function - 0.355252 milliseconds
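In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, and adds the two distances together, giving double weight to consonants. For example, thuv scores 1 + 0 = 1, while thin scores 1 + 1 = 2.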
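The same idea can be sketched in Python as follows. This is an assumption-laden illustration rather than the site's own code: the names `levenshtein`, `strip_vowels` and `double_levenshtein` are invented for the example, and the vowel set is a guess. Treating y as a vowel happens to reproduce the distance of 3 listed for theyve, but the real vowel set is not documented on this page.

```python
VOWELS = set("aeiouy")  # assumed vowel set; not confirmed by the source


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and punctuation in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("thiv", "thuv"))    # 1
print(double_levenshtein("thiv", "thin"))    # 2
print(double_levenshtein("thiv", "theyve"))  # 3
```

A vowel-only change such as thiv to thuv costs 1, whereas a consonant change such as thiv to thin is counted in both passes and costs 2.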
Time to execute SoundEx function - 0.026944 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
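To show how very different spellings end up sharing the code T100, here is a minimal Python sketch of the commonly described American Soundex rules. It is an assumption that the site follows these rules exactly; the function name `soundex` and the table `_CODES` are made up for this example.

```python
# Consonant groups and their digits; vowels, h, w and y carry no digit.
_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        _CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    # Ignore apostrophes, hyphens and anything else outside a-z.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        digit = _CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets this, so a repeat after it counts
    return (code + "000")[:4]  # pad with zeros, keep four characters


for w in ("thiv", "tap", "they've", "toffee", "tdup"):
    print(w, soundex(w))  # each line ends in T100
```

Because only the first letter and one consonant group survive, the T100 bucket is far broader than either Levenshtein list.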
Time to execute MetaPhone function - 0.036683 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000825 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.