A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to botherin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
botherin (0) - 13 freq
bothern (1) - 4 freq
batherin (1) - 5 freq
bothering (1) - 2 freq
bothert- (2) - 1 freq
gatherin (2) - 6 freq
sotterin (2) - 2 freq
coherin (2) - 1 freq
borderin (2) - 2 freq
botherd (2) - 1 freq
nitherin (2) - 3 freq
watherin (2) - 1 freq
smotherin (2) - 2 freq
betterin (2) - 3 freq
hotterin (2) - 15 freq
bothers (2) - 9 freq
sootherin (2) - 7 freq
sowtherin (2) - 2 freq
butterin (2) - 2 freq
batterin (2) - 26 freq
mitherin (2) - 7 freq
bothert (2) - 37 freq
potterin (2) - 3 freq
getherin (2) - 34 freq
bletherin (2) - 80 freq
botherin (0) - 13 freq
batherin (1) - 5 freq
bothern (1) - 4 freq
bethern (2) - 1 freq
bothering (2) - 2 freq
bothert (3) - 37 freq
mitherin (3) - 7 freq
butterin (3) - 2 freq
getherin (3) - 34 freq
batterin (3) - 26 freq
bother (3) - 322 freq
butherin' (3) - 1 freq
sootherin (3) - 7 freq
witherin (3) - 4 freq
githerin (3) - 2 freq
bothered (3) - 58 freq
bletherin (3) - 80 freq
botherd (3) - 1 freq
watherin (3) - 1 freq
gatherin (3) - 6 freq
betterin (3) - 3 freq
bothers (3) - 9 freq
nitherin (3) - 3 freq
gethern (4) - 1 freq
featherin (4) - 2 freq
SoundEx code - B365
bathroom - 43 freq
bedroom - 151 freq
batherin - 5 freq
baudrans - 1 freq
better'n - 2 freq
bitter-an-an - 1 freq
batterin - 26 freq
bawdrons - 3 freq
butterin - 2 freq
bettermaist - 5 freq
botherin - 13 freq
bedrooms - 7 freq
bitterness - 6 freq
bedruim - 5 freq
batteren - 1 freq
badderin - 1 freq
bothering - 2 freq
bothern - 4 freq
bawdrins - 2 freq
bathroom's - 1 freq
budderin - 6 freq
betterin - 3 freq
butherin' - 1 freq
butter-nut - 1 freq
bedroom's - 2 freq
buttermilk - 3 freq
baudrons - 38 freq
betrayan - 1 freq
bettherment - 3 freq
betterment - 6 freq
baudrons' - 2 freq
baudrons-in-buits - 2 freq
bitteran'-an - 1 freq
bitternis - 1 freq
bodhran - 2 freq
better-maist - 1 freq
bethern - 1 freq
betrump - 1 freq
bathrooms - 1 freq
bawdrin - 2 freq
baudran - 2 freq
bathruim - 1 freq
bathroom-o - 1 freq
bittorrentking - 1 freq
butternuts - 1 freq
battering - 2 freq
bettering - 1 freq
MetaPhone code - B0RN
batherin - 5 freq
botherin - 13 freq
bothern - 4 freq
butherin' - 1 freq
bethern - 1 freq
BOTHERIN
bother - 322 freq
bothered - 58 freq
botherin - 13 freq
bothering - 2 freq
bothers - 9 freq
Time to execute Levenshtein function - 0.225167 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.365650 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027394 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037279 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000840 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.