A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to budderin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
budderin (0) - 6 freq
rudderin (1) - 1 freq
ludderin (1) - 1 freq
badderin (1) - 1 freq
bulderin (1) - 2 freq
bufferin (2) - 1 freq
didderin (2) - 1 freq
budder (2) - 47 freq
butterin (2) - 2 freq
bullerin (2) - 1 freq
shudderin (2) - 1 freq
budgetin (2) - 1 freq
murderin (2) - 14 freq
dodderin (2) - 1 freq
bulderit (2) - 1 freq
buddered (2) - 7 freq
budders (2) - 2 freq
gadderin (2) - 14 freq
blunderin (2) - 1 freq
gulderin (2) - 9 freq
suddern (2) - 1 freq
whudderin (2) - 1 freq
tedderin (2) - 1 freq
sunderin (2) - 1 freq
dunderin (2) - 4 freq
budderin (0) - 6 freq
badderin (1) - 1 freq
rudderin (2) - 1 freq
bulderin (2) - 2 freq
ludderin (2) - 1 freq
tedderin (3) - 1 freq
dodderin (3) - 1 freq
buddered (3) - 7 freq
gedderin (3) - 1 freq
gadderin (3) - 14 freq
budders (3) - 2 freq
buddin (3) - 11 freq
didderin (3) - 1 freq
budder (3) - 47 freq
widderin (3) - 1 freq
suddern (3) - 1 freq
borderin (3) - 2 freq
hudderie (4) - 7 freq
badder (4) - 5 freq
bidder (4) - 1 freq
suddren (4) - 3 freq
beddin (4) - 16 freq
buddan (4) - 1 freq
biddin (4) - 30 freq
bawdrin (4) - 2 freq
SoundEx code - B365
bathroom - 44 freq
bedroom - 154 freq
batherin - 5 freq
baudrans - 1 freq
better'n - 2 freq
bitter-an-an - 1 freq
batterin - 28 freq
bawdrons - 3 freq
butterin - 2 freq
bettermaist - 5 freq
botherin - 13 freq
bedrooms - 7 freq
bitterness - 9 freq
bedruim - 5 freq
batteren - 1 freq
batterins - 1 freq
badderin - 1 freq
bothering - 2 freq
bothern - 4 freq
bawdrins - 2 freq
bathroom's - 1 freq
budderin - 6 freq
betterin - 3 freq
butherin' - 1 freq
butter-nut - 1 freq
bedroom's - 2 freq
buttermilk - 3 freq
baudrons - 38 freq
betrayan - 1 freq
bettherment - 3 freq
betterment - 6 freq
baudrons' - 2 freq
baudrons-in-buits - 2 freq
bitteran'-an - 1 freq
bitternis - 1 freq
bodhran - 2 freq
better-maist - 1 freq
bethern - 1 freq
betrump - 1 freq
bathrooms - 1 freq
bawdrin - 2 freq
baudran - 2 freq
bathruim - 1 freq
bathroom-o - 1 freq
bittorrentking - 1 freq
butternuts - 1 freq
battering - 2 freq
bettering - 1 freq
MetaPhone code - BTRN
better'n - 2 freq
batterin - 28 freq
butterin - 2 freq
batteren - 1 freq
badderin - 1 freq
budderin - 6 freq
betterin - 3 freq
bodhran - 2 freq
bawdrin - 2 freq
baudran - 2 freq
BUDDERIN
Time to execute Levenshtein function - 0.204822 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361080 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027701 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038576 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000874 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.