A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to botherin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
botherin (0) - 13 freq
bothering (1) - 2 freq
bothern (1) - 4 freq
batherin (1) - 5 freq
butterin (2) - 2 freq
bothert- (2) - 1 freq
watherin (2) - 1 freq
botherd (2) - 1 freq
totterin (2) - 1 freq
potterin (2) - 3 freq
mitherin (2) - 7 freq
getherin (2) - 34 freq
tocherin (2) - 1 freq
bletherin (2) - 81 freq
sowtherin (2) - 2 freq
githerin (2) - 2 freq
bother't (2) - 1 freq
dother-in (2) - 1 freq
gatherin (2) - 9 freq
bethern (2) - 1 freq
batterin (2) - 28 freq
bovverin (2) - 1 freq
hotterin (2) - 15 freq
bother (2) - 329 freq
sotterin (2) - 2 freq
botherin (0) - 13 freq
bothern (1) - 4 freq
batherin (1) - 5 freq
bethern (2) - 1 freq
bothering (2) - 2 freq
betterin (3) - 3 freq
bother (3) - 329 freq
sootherin (3) - 7 freq
batterin (3) - 28 freq
bothers (3) - 9 freq
butherin' (3) - 1 freq
witherin (3) - 4 freq
gatherin (3) - 9 freq
nitherin (3) - 3 freq
bothert (3) - 37 freq
bothered (3) - 59 freq
mitherin (3) - 7 freq
botherd (3) - 1 freq
butterin (3) - 2 freq
getherin (3) - 34 freq
watherin (3) - 1 freq
githerin (3) - 2 freq
bletherin (3) - 81 freq
thern (4) - 1 freq
gathern (4) - 1 freq
SoundEx code - B365
bathroom - 44 freq
bedroom - 154 freq
batherin - 5 freq
baudrans - 1 freq
better'n - 2 freq
bitter-an-an - 1 freq
batterin - 28 freq
bawdrons - 3 freq
butterin - 2 freq
bettermaist - 5 freq
botherin - 13 freq
bedrooms - 7 freq
bitterness - 9 freq
bedruim - 5 freq
batteren - 1 freq
batterins - 1 freq
badderin - 1 freq
bothering - 2 freq
bothern - 4 freq
bawdrins - 2 freq
bathroom's - 1 freq
budderin - 6 freq
betterin - 3 freq
butherin' - 1 freq
butter-nut - 1 freq
bedroom's - 2 freq
buttermilk - 3 freq
baudrons - 38 freq
betrayan - 1 freq
bettherment - 3 freq
betterment - 6 freq
baudrons' - 2 freq
baudrons-in-buits - 2 freq
bitteran'-an - 1 freq
bitternis - 1 freq
bodhran - 2 freq
better-maist - 1 freq
bethern - 1 freq
betrump - 1 freq
bathrooms - 1 freq
bawdrin - 2 freq
baudran - 2 freq
bathruim - 1 freq
bathroom-o - 1 freq
bittorrentking - 1 freq
butternuts - 1 freq
battering - 2 freq
bettering - 1 freq
MetaPhone code - B0RN
batherin - 5 freq
botherin - 13 freq
bothern - 4 freq
butherin' - 1 freq
bethern - 1 freq
BOTHERIN
bother - 329 freq
bothered - 59 freq
botherin - 13 freq
bothering - 2 freq
bothers - 9 freq
Time to execute Levenshtein function - 0.197247 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.347480 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027427 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037358 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000959 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.