A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to theve in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
theve (0) - 1 freq
these (1) - 1129 freq
thev (1) - 3 freq
thee (1) - 234 freq
there (1) - 6822 freq
th've (1) - 1 freq
theme (1) - 39 freq
the've (1) - 1 freq
thieve (1) - 3 freq
theyve (1) - 4 freq
shuve (2) - 2 freq
theng (2) - 2 freq
eve (2) - 80 freq
theyre (2) - 6 freq
hever (2) - 2 freq
whee (2) - 1 freq
theo (2) - 1 freq
their (2) - 5115 freq
thern (2) - 1 freq
thre (2) - 2 freq
trev (2) - 73 freq
there' (2) - 6 freq
they (2) - 11452 freq
heven (2) - 1 freq
haeve (2) - 5 freq
Double Levenshtein
theve (0) - 1 freq
thieve (1) - 3 freq
theyve (1) - 4 freq
thev (1) - 3 freq
the've (2) - 1 freq
thuv (2) - 5 freq
thiv (2) - 19 freq
theme (2) - 39 freq
thee (2) - 234 freq
these (2) - 1129 freq
th've (2) - 1 freq
there (2) - 6822 freq
thate (3) - 1 freq
thi've (3) - 3 freq
then (3) - 4541 freq
theit (3) - 2 freq
trove (3) - 3 freq
thine (3) - 15 freq
thane (3) - 3 freq
them (3) - 5422 freq
theat (3) - 1 freq
hive (3) - 29 freq
thare (3) - 505 freq
three (3) - 1523 freq
thake (3) - 1 freq
SoundEx code - T100
tap - 773 freq
thief - 66 freq
tapiwa - 19 freq
tap-ee-wah - 1 freq
'thief - 2 freq
'they've - 7 freq
they've - 217 freq
tyaave - 2 freq
tae've - 5 freq
toffee - 28 freq
toap - 44 freq
th've - 1 freq
tf - 9 freq
tv - 208 freq
tie-up - 1 freq
tip - 79 freq
type - 100 freq
tuip - 4 freq
top - 315 freq
they'veee - 1 freq
toffey - 1 freq
tippy - 2 freq
tyauve - 20 freq
tube - 47 freq
toff - 12 freq
t've - 5 freq
tawpie - 1 freq
theif - 4 freq
tub - 22 freq
tubby - 1 freq
taffy - 1 freq
thieve - 3 freq
thay've - 2 freq
'tap' - 1 freq
tup - 6 freq
tubie - 1 freq
tibbie - 10 freq
thiv - 19 freq
tippa - 4 freq
taboo - 4 freq
they'v - 6 freq
tape - 30 freq
thev - 3 freq
tab - 7 freq
taobh - 1 freq
tabie - 1 freq
t'v - 1 freq
thi've - 3 freq
tup- - 1 freq
tippie - 3 freq
tofu - 2 freq
thuv - 5 freq
tovey - 2 freq
tibby - 16 freq
toff' - 1 freq
tif - 1 freq
thay'v - 4 freq
the've - 1 freq
't've - 1 freq
tiff - 1 freq
tb - 14 freq
toby - 7 freq
tabby - 15 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
tabbie - 1 freq
tuba - 1 freq
tove - 3 freq
t'bie - 1 freq
they-ye've - 1 freq
toffy - 2 freq
'tap- - 1 freq
tv' - 1 freq
tiefe - 1 freq
thai've - 2 freq
teeff - 1 freq
thof - 4 freq
tib - 10 freq
toffie - 1 freq
teip - 2 freq
’tap - 1 freq
taffee - 1 freq
tbe - 1 freq
‘tip - 1 freq
…tap - 11 freq
typhoo - 1 freq
ttip - 2 freq
toyboy - 1 freq
“thief - 1 freq
‘tabby - 1 freq
“tap - 1 freq
teepee - 5 freq
tappy - 1 freq
tubey - 1 freq
toaffay - 1 freq
tdup - 1 freq
tupee - 1 freq
toooooopay - 1 freq
tp - 5 freq
tbh - 35 freq
tfi - 2 freq
tipp - 1 freq
they’ve - 5 freq
tbw - 1 freq
tpea - 1 freq
tuff - 6 freq
typo - 3 freq
teevee - 1 freq
tvh - 1 freq
taf - 1 freq
tbf - 3 freq
thu've - 4 freq
tyauv - 1 freq
toof - 1 freq
tawbu - 1 freq
'tabu' - 1 freq
tpu - 1 freq
theyve - 4 freq
tuffy - 1 freq
theve - 1 freq
to've - 1 freq
tvah - 1 freq
MetaPhone code - 0F
thief - 66 freq
'thief - 2 freq
'they've - 7 freq
they've - 217 freq
th've - 1 freq
they'veee - 1 freq
theif - 4 freq
thieve - 3 freq
thay've - 2 freq
thiv - 19 freq
thigh - 17 freq
they'v - 6 freq
thev - 3 freq
thi've - 3 freq
thuv - 5 freq
thay'v - 4 freq
the've - 1 freq
thai've - 2 freq
thof - 4 freq
“thief - 1 freq
they’ve - 5 freq
thu've - 4 freq
theyve - 4 freq
theve - 1 freq
Manually curated
THEVE
Time to execute Levenshtein function - 0.217308 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
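As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions or
    deletions needed to turn string a into string b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("theve", "these"))  # 1: replace v with s
print(levenshtein("theve", "they"))   # 2: replace v with y, delete the final e
```

These two values agree with the distances shown for "these" and "they" in the Levenshtein list.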
Time to execute Double Levenshtein function - 0.371675 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
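A rough Python sketch of this idea follows. It is an assumption-laden illustration rather than the site's own code: the names are invented for the example, and the vowel set (a, e, i, o, u and y) is inferred from the distances listed on this page, where "theyve" scores 1.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel, inferred from the listed scores


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("theve", "thieve"))  # 1 + 0 = 1
print(double_levenshtein("theve", "these"))   # 1 + 1 = 2
```

A vowel-only difference such as "thieve" costs nothing in the second pass, while a consonant change such as "these" is counted twice, which is why the two words swap rank between the Levenshtein and Double Levenshtein lists.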
Time to execute SoundEx function - 0.033777 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
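To make the encoding concrete, here is a small Python sketch of the classic four-character Soundex scheme (first letter plus three digits). It is a sketch only; the function this site calls may differ in edge cases, and dropping non-letter characters before encoding is an assumption made here.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: apostrophes, hyphens and other non-ASCII-letter characters are ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a repeated digit can follow
        if len(code) == 4:
            break
    return code.ljust(4, "0")  # pad short codes with zeros


print(soundex("theve"))    # T100
print(soundex("they've"))  # T100
print(soundex("tap"))      # T100
```

Because only the first letter and the consonant classes survive, words as different as "tap", "tv" and "they've" all share the code T100, which explains the length of the SoundEx list.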
Time to execute MetaPhone function - 0.038623 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000907 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
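In code terms, a lookup of this kind is a dictionary access, which is consistent with it being the fastest of the five functions timed above. The Python sketch below is purely illustrative: the entries and the name `LEXICON` are hypothetical and are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical hand-built entries mapping a spelling variant or typo to a lemma.
LEXICON = {
    "theyve": "they've",
    "thay've": "they've",
    "theif": "thief",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("Theif"))   # thief
print(curated_lemma("ahint"))   # None: not in this toy lexicon
```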