A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Words similar to betterin in the corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in brackets)
betterin (0) - 3 freq
etterin (1) - 1 freq
butterin (1) - 2 freq
letterin (1) - 6 freq
better'n (1) - 2 freq
bettering (1) - 1 freq
batterin (1) - 28 freq
gutterin (2) - 5 freq
bettert (2) - 2 freq
utterin (2) - 3 freq
yatterin (2) - 4 freq
teeterin (2) - 6 freq
lettern (2) - 1 freq
lettering (2) - 2 freq
sotterin (2) - 2 freq
batteren (2) - 1 freq
potterin (2) - 3 freq
better' (2) - 3 freq
bletherin (2) - 81 freq
nytterin (2) - 1 freq
cletterin (2) - 1 freq
enterin (2) - 20 freq
banterin (2) - 1 freq
merterin (2) - 1 freq
watterin (2) - 12 freq
Double Levenshtein (distance in brackets)
betterin (0) - 3 freq
butterin (1) - 2 freq
batterin (1) - 28 freq
etterin (2) - 1 freq
letterin (2) - 6 freq
bettering (2) - 1 freq
better'n (2) - 2 freq
batteren (2) - 1 freq
bettered (3) - 4 freq
butterig (3) - 1 freq
bethern (3) - 1 freq
hotterin (3) - 15 freq
bettin (3) - 12 freq
betters (3) - 6 freq
litterin (3) - 1 freq
better (3) - 1704 freq
mutterin (3) - 40 freq
totterin (3) - 1 freq
natterin (3) - 4 freq
botherin (3) - 13 freq
bleeterin (3) - 1 freq
blatterin (3) - 5 freq
batterit (3) - 1 freq
matterin (3) - 1 freq
witterin (3) - 3 freq
SoundEx code - B365
bathroom - 44 freq
bedroom - 154 freq
batherin - 5 freq
baudrans - 1 freq
better'n - 2 freq
bitter-an-an - 1 freq
batterin - 28 freq
bawdrons - 3 freq
butterin - 2 freq
bettermaist - 5 freq
botherin - 13 freq
bedrooms - 7 freq
bitterness - 9 freq
bedruim - 5 freq
batteren - 1 freq
batterins - 1 freq
badderin - 1 freq
bothering - 2 freq
bothern - 4 freq
bawdrins - 2 freq
bathroom's - 1 freq
budderin - 6 freq
betterin - 3 freq
butherin' - 1 freq
butter-nut - 1 freq
bedroom's - 2 freq
buttermilk - 3 freq
baudrons - 38 freq
betrayan - 1 freq
bettherment - 3 freq
betterment - 6 freq
baudrons' - 2 freq
baudrons-in-buits - 2 freq
bitteran'-an - 1 freq
bitternis - 1 freq
bodhran - 2 freq
better-maist - 1 freq
bethern - 1 freq
betrump - 1 freq
bathrooms - 1 freq
bawdrin - 2 freq
baudran - 2 freq
bathruim - 1 freq
bathroom-o - 1 freq
bittorrentking - 1 freq
butternuts - 1 freq
battering - 2 freq
bettering - 1 freq
MetaPhone code - BTRN
better'n - 2 freq
batterin - 28 freq
butterin - 2 freq
batteren - 1 freq
badderin - 1 freq
budderin - 6 freq
betterin - 3 freq
bodhran - 2 freq
bawdrin - 2 freq
baudran - 2 freq
Manually curated - BETTERIN
Time to execute Levenshtein function - 0.207174 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
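The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not necessarily the code this site runs; the function name levenshtein is an assumption made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


# Words taken from the result list on this page.
for word in ["betterin", "butterin", "bettering", "gutterin", "bletherin"]:
    print(word, levenshtein("betterin", word))
```

Run against the words listed on this page, the sketch gives the same distances as the bracketed figures, for example 1 for butterin and 2 for bletherin.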
Time to execute Double Levenshtein function - 0.384264 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
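A rough sketch of that idea in Python follows. It assumes that "vowels" means the letters a, e, i, o and u, which the page does not state, and the names strip_vowels and double_levenshtein are made up for the example. The levenshtein helper is repeated so the block runs on its own.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (dynamic programming, row by row)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# Words taken from the result list on this page.
for word in ["butterin", "etterin", "batteren", "better"]:
    print(word, double_levenshtein("betterin", word))
```

Under these assumptions, butterin scores 1 (one vowel change, identical consonants) while etterin scores 2, because the missing b is counted in both passes. That matches the ordering shown in the Double Levenshtein list.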
Time to execute SoundEx function - 0.028275 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
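As an illustration, here is a minimal Python sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The function name soundex and the handling of apostrophes and hyphens (they are simply skipped) are assumptions, and the implementation behind this page may differ in such details.

```python
# Digit groups of the standard American Soundex scheme.
CODES = {}
for digit, letters in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                       "4": "l", "5": "mn", "6": "r"}.items():
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as B365."""
    letters = [ch for ch in word.lower() if ch.isalpha()]  # skip ' and -
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code:
            if code != last_code:
                result += code  # a new consonant sound
            last_code = code
        elif ch not in "hw":
            # Vowels separate repeated codes; h and w do not.
            last_code = ""
        if len(result) == 4:
            break
    return result.ljust(4, "0")


# Words taken from the SoundEx list on this page.
for word in ["betterin", "bathroom", "bodhran"]:
    print(word, soundex(word))
```

All three example words come out as B365, the code shown for betterin on this page. This also shows why the SoundEx list is so broad: only the first letter and the first three consonant sounds count.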
Time to execute MetaPhone function - 0.037655 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
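A lookup of this kind can be sketched as a plain dictionary. The table below is purely illustrative: only the betterin entry reflects the result shown on this page, the second entry is a hypothetical example, and the names LEXICON and lookup_lemma are assumptions.

```python
from typing import Optional

# Illustrative lexicon: surface form -> lemma.
LEXICON = {
    "betterin": "BETTERIN",   # result shown on this page
    "bettering": "BETTERIN",  # hypothetical spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("betterin"))  # BETTERIN
print(lookup_lemma("sonsie"))    # None: not in this toy table
```

A dictionary lookup involves no computation over the word itself, which is consistent with this method having the shortest execution time reported above.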