A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to doughball in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
doughball (0) - 1 freq
doughbaw (2) - 1 freq
doughbaws (2) - 2 freq
dougal (3) - 2 freq
domnhall (3) - 1 freq
dough-baw (3) - 1 freq
ddougall (3) - 1 freq
doughboys (3) - 2 freq
highball (3) - 1 freq
downhall (3) - 2 freq
dounhill (3) - 1 freq
doorbell (4) - 15 freq
dodgebaa (4) - 1 freq
junkball (4) - 2 freq
dougal's (4) - 5 freq
dugdale (4) - 7 freq
doubil (4) - 3 freq
roughly (4) - 16 freq
joshhall (4) - 1 freq
loughran (4) - 1 freq
doublt (4) - 2 freq
dingwall (4) - 6 freq
fooball (4) - 1 freq
doubly (4) - 4 freq
douglass (4) - 4 freq
doughball (0) - 1 freq
highball (4) - 1 freq
doughbaw (4) - 1 freq
doughbaws (4) - 2 freq
doughboys (5) - 2 freq
dounhill (5) - 1 freq
dunghill (6) - 1 freq
ddougall (6) - 1 freq
beachball (6) - 1 freq
oddball (6) - 1 freq
doonhill (6) - 6 freq
deuk-bill (6) - 1 freq
dougal (6) - 2 freq
domnhall (6) - 1 freq
doorbell (6) - 15 freq
dough-baw (6) - 1 freq
door-bell (6) - 1 freq
downhall (6) - 2 freq
southwell (7) - 1 freq
truchbull (7) - 1 freq
dugald (7) - 13 freq
doughty (7) - 3 freq
doughnut (7) - 7 freq
vauxhall (7) - 2 freq
dought (7) - 1 freq
SoundEx code - D214
disbelief - 23 freq
discipline - 10 freq
displays - 12 freq
displayed - 7 freq
display - 50 freq
dispel - 4 freq
disabled - 7 freq
displeisur - 1 freq
disciple - 6 freq
decibels - 1 freq
disciples - 55 freq
dishevelled - 5 freq
decibel - 1 freq
despiled - 1 freq
deciplin - 1 freq
doughball - 1 freq
disabilities - 5 freq
despoilation - 1 freq
disability - 5 freq
displayin - 2 freq
dispels - 1 freq
disbelievin - 1 freq
disciple's - 1 freq
displease - 1 freq
dispelled - 2 freq
disciplines - 1 freq
displeasure - 1 freq
'discipline - 1 freq
disablit - 1 freq
deiscipline - 2 freq
dogsflourish - 1 freq
discipleine - 1 freq
deisciplines - 1 freq
disbelieve - 1 freq
deuk-bill - 1 freq
disabeilities - 1 freq
displaeced - 1 freq
disbelievers - 1 freq
displacing - 1 freq
disable - 1 freq
disabeelity - 1 freq
displeased - 1 freq
displaying - 1 freq
disipleenaree - 1 freq
doygeflns - 1 freq
disabalist - 2 freq
dishevel - 1 freq
displaced - 1 freq
disciplined - 1 freq
decvalts - 1 freq
dgplacenames - 38 freq
dsfl - 1 freq
dasgefalltmir - 1 freq
digbylj - 1 freq
disciplinary - 1 freq
MetaPhone code - TBL
table - 690 freq
double - 129 freq
dooble - 26 freq
doubly - 4 freq
'double - 3 freq
tebel - 1 freq
dubble - 2 freq
taible - 2 freq
doughball - 1 freq
dibble - 1 freq
tibble - 18 freq
dabble - 2 freq
doubil - 3 freq
teeble - 6 freq
taibil - 3 freq
taeble - 3 freq
t'boyl - 1 freq
€˜double - 2 freq
€œdouble - 1 freq
table' - 1 freq
deeable - 1 freq
tbl - 1 freq
DOUGHBALL
Time to execute Levenshtein function - 0.279489 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.438659 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030724 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041542 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000916 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.