A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to thoomb in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
thoomb (0) - 6 freq
thooms (1) - 8 freq
thoombs (1) - 1 freq
thoom (1) - 12 freq
hoomp (2) - 1 freq
thumb (2) - 25 freq
thom (2) - 11 freq
thoomit (2) - 1 freq
shoom (2) - 1 freq
thoo'd (2) - 2 freq
thoor (2) - 11 freq
thoomed (2) - 3 freq
throb (2) - 5 freq
thoomin (2) - 2 freq
thoum (2) - 13 freq
thool (2) - 2 freq
thoo' (2) - 1 freq
thoums (2) - 2 freq
thooht (2) - 1 freq
choob (2) - 4 freq
toom (2) - 35 freq
tomb (2) - 31 freq
thoo (2) - 277 freq
thomp (2) - 1 freq
toons (3) - 80 freq
Double Levenshtein (distance in parentheses)
thoomb (0) - 6 freq
thoom (2) - 12 freq
thumb (2) - 25 freq
thoombs (2) - 1 freq
thooms (2) - 8 freq
thoum (3) - 13 freq
throb (3) - 5 freq
thoums (3) - 2 freq
thomp (3) - 1 freq
thoomed (3) - 3 freq
tomb (3) - 31 freq
thoomin (3) - 2 freq
thom (3) - 11 freq
thoomit (3) - 1 freq
thumbs (4) - 11 freq
tham (4) - 43 freq
thems (4) - 1 freq
thim- (4) - 2 freq
them' (4) - 5 freq
thomas (4) - 81 freq
thoumed (4) - 1 freq
them (4) - 5422 freq
thaim' (4) - 1 freq
thame (4) - 27 freq
thayme (4) - 1 freq
SoundEx code - T510
thump - 33 freq
thumb - 25 freq
tomb - 31 freq
tomboy - 1 freq
tump - 1 freq
tempo - 4 freq
temp - 734 freq
twa-an-a-hauf - 2 freq
thomp - 1 freq
thoomb - 6 freq
tumb - 1 freq
timpo - 1 freq
temp- - 7 freq
tmv - 1 freq
tnv - 1 freq
tomb” - 1 freq
tmmf - 1 freq
timehop - 1 freq
tenv - 1 freq
tywanb - 1 freq
tonyip - 1 freq
MetaPhone code - 0M
them - 5422 freq
thaim - 2522 freq
them- - 2 freq
them-aw - 1 freq
them-' - 2 freq
thumb - 25 freq
theme - 39 freq
thum - 463 freq
thyme - 4 freq
thoum - 13 freq
thaim- - 1 freq
thaim-aa - 1 freq
theem - 7 freq
thoom - 12 freq
tham - 43 freq
thim - 193 freq
theim - 12 freq
them' - 5 freq
they'm - 20 freq
thuma - 1 freq
thaem - 3 freq
thum- - 1 freq
thum-aw - 1 freq
thim- - 2 freq
them'ii - 1 freq
thame - 27 freq
thoomb - 6 freq
thaum - 1 freq
thom - 11 freq
thayme - 1 freq
thaim' - 1 freq
‘them - 4 freq
‘thaim - 2 freq
“them - 2 freq
theym - 27 freq
’them - 1 freq
'thom - 1 freq
“thaim - 4 freq
thoumie - 1 freq
Manually curated
THOOMB
Time to execute Levenshtein function - 0.304833 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
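As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming edit distance. It is not the code behind this page; the function name `levenshtein` is chosen for the example, and every replacement, insertion and deletion is assumed to cost 1.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where replace, insert and delete each cost 1."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("thoomb", "thoom"))  # 1
print(levenshtein("thoomb", "thumb"))  # 2
```

The two printed values agree with the distances listed for thoom and thumb in the Levenshtein results.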
Time to execute Double Levenshtein function - 0.584808 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
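The doubling can be sketched in Python as follows. This is a guess at the approach rather than the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and the page does not say which letters count as vowels. The set `aeiouy` is an assumption, chosen because it reproduces the score of 4 listed for thayme.

```python
VOWELS = set("aeiouy")  # assumption: the page does not define its vowel set


def levenshtein(a: str, b: str) -> int:
    """Edit distance where replace, insert and delete each cost 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("thoomb", "thumb"))  # 2
print(double_levenshtein("thoomb", "them"))   # 4
```

For thumb the consonant skeletons are identical (thmb), so only the vowel differences count. For them, the missing b is counted in both passes.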
Time to execute SoundEx function - 0.030054 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
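To show how such a code is built, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip repeated digits, and pad to four characters. The function name `soundex` and the decision to ignore non-ASCII characters and punctuation are assumptions for the example. The site's own implementation may differ in edge cases.

```python
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no letters."""
    # Assumption: hyphens, apostrophes and other non a-z characters are ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit can be coded again
    return code.ljust(4, "0")


print(soundex("thoomb"))  # T510
print(soundex("thump"))   # T510
```

Both words reduce to T, then 5 for m, then 1 for b or p, which is why thump appears among the SoundEx matches for thoomb.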
Time to execute MetaPhone function - 0.084456 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001252 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
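A lookup of this kind can be sketched in Python as a plain dictionary. The table contents and the names `LEMMA_LOOKUP` and `curated_lemma` are hypothetical: the only pairing taken from this page is thoomb with THOOMB, read from the result shown above, and the other entry is made up to show how a variant might be listed.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping word forms to lemmas.
LEMMA_LOOKUP = {
    "thoomb": "THOOMB",   # read from the result shown on this page
    "thoombs": "THOOMB",  # made-up entry, for illustration only
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMA_LOOKUP.get(word.lower())


print(curated_lemma("thoomb"))  # THOOMB
print(curated_lemma("ahint"))   # None
```

A dictionary lookup does no computation on the word itself, which fits the very short execution time reported for this method, and returning `None` reflects that not all words are covered.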