A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to delta in Corpus

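Results are listed by method in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Where a number appears in brackets, it is the distance from delta.

Levenshtein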
delta (0) - 1 freq
delt (1) - 3 freq
dela (1) - 1 freq
deet (2) - 24 freq
telt (2) - 1538 freq
kepta (2) - 1 freq
eesta (2) - 1 freq
netta (2) - 6 freq
eltz (2) - 1 freq
eela (2) - 3 freq
malta (2) - 2 freq
death (2) - 168 freq
helja (2) - 1 freq
delyte (2) - 4 freq
vesta (2) - 2 freq
beta (2) - 3 freq
derts (2) - 2 freq
geta (2) - 1 freq
deena (2) - 1 freq
kelty (2) - 37 freq
dental (2) - 7 freq
det (2) - 3 freq
delft (2) - 2 freq
deft (2) - 4 freq
debt (2) - 44 freq
Double Levenshtein
delta (0) - 1 freq
delt (1) - 3 freq
dalt (2) - 1 freq
delite (2) - 4 freq
dlt (2) - 1 freq
delyte (2) - 4 freq
dealt (2) - 33 freq
dolt (2) - 1 freq
delyt (2) - 3 freq
dela (2) - 1 freq
delete (2) - 10 freq
daelt (2) - 2 freq
delhi (3) - 13 freq
belt (3) - 155 freq
delyts (3) - 1 freq
data (3) - 68 freq
deity (3) - 2 freq
dent (3) - 4 freq
deli (3) - 3 freq
adela (3) - 4 freq
dellt (3) - 3 freq
delay (3) - 17 freq
pelt (3) - 11 freq
dept (3) - 7 freq
adult (3) - 63 freq
SoundEx code - D430
daily-day - 8 freq
dwalt - 4 freq
dealt - 33 freq
daled - 5 freq
delete - 10 freq
delt - 3 freq
dollt - 1 freq
dwallt - 4 freq
dwallit - 2 freq
dulled - 1 freq
dolled - 6 freq
dialled - 2 freq
doled - 3 freq
deludey - 1 freq
dellt - 3 freq
delite - 4 freq
dilled - 3 freq
delyte - 4 freq
day-auld - 1 freq
delyt - 3 freq
delude - 1 freq
dillt - 1 freq
doilt - 1 freq
delled - 2 freq
dalt - 1 freq
daelt - 2 freq
dolt - 1 freq
delta - 1 freq
dailiday - 1 freq
dailt - 1 freq
dallied - 1 freq
duality - 1 freq
delayed - 4 freq
dewalt - 1 freq
delayed… - 2 freq
dlt - 1 freq
'dildo - 1 freq
MetaPhone code - TLT
tellt - 505 freq
telt - 1538 freq
till't - 13 freq
tell't - 3 freq
til't - 40 freq
daily-day - 8 freq
told - 133 freq
toilet - 91 freq
til'it - 1 freq
tauld - 11 freq
dealt - 33 freq
tilt - 30 freq
'telt - 1 freq
daled - 5 freq
delete - 10 freq
delt - 3 freq
tailed - 7 freq
dollt - 1 freq
tallied - 1 freq
tilled - 3 freq
telit - 1 freq
dulled - 1 freq
dolled - 6 freq
tyle't - 1 freq
tull't - 4 freq
dialled - 2 freq
doled - 3 freq
tiled - 2 freq
tolt - 138 freq
deludey - 1 freq
tould - 2 freq
tolled - 1 freq
dellt - 3 freq
delite - 4 freq
dilled - 3 freq
telled - 2 freq
toiled - 2 freq
delyte - 4 freq
day-auld - 1 freq
töllied - 1 freq
delyt - 3 freq
delude - 1 freq
tolta - 1 freq
toalt - 1 freq
towld - 5 freq
told' - 2 freq
dillt - 1 freq
doilt - 1 freq
toledo - 1 freq
delled - 2 freq
dalt - 1 freq
daelt - 2 freq
teelt - 1 freq
dolt - 1 freq
‘telt - 1 freq
delta - 1 freq
tooled - 3 freq
“telt - 1 freq
“toilet - 1 freq
dailiday - 1 freq
dailt - 1 freq
dallied - 1 freq
tel't - 2 freq
duality - 1 freq
toult - 1 freq
teld - 2 freq
tlod - 1 freq
dlt - 1 freq
telttt - 1 freq
'dildo - 1 freq
tltu - 4 freq
Manually curated
DELTA
Time to execute Levenshtein function - 0.219780 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
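The definition above can be sketched with the standard dynamic-programming approach. This is a minimal illustration, not the function this site runs; the name `levenshtein` and the sample words (taken from the results list) are chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty string costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


for word in ["delta", "delt", "telt", "death"]:
    print(word, levenshtein("delta", word))
```

For these four words the sketch gives 0, 1, 2 and 2, matching the distances shown in the Levenshtein list.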
Time to execute Double Levenshtein function - 0.400548 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
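A rough sketch of that idea follows. The helper names and the vowel set are assumptions made for this example, not the site's code. Treating y as a vowel is a guess, although it is consistent with the scores listed (delyte scores 2, which requires y to be stripped).

```python
VOWELS = set("aeiouy")  # assumption: y is stripped along with a, e, i, o, u


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insertions, deletions, replacements)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace or keep
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["delt", "dela", "delyte", "delhi"]:
    print(word, double_levenshtein("delta", word))
```

Under these assumptions delt scores 1, dela and delyte score 2, and delhi scores 3. A word that differs only in its vowels (delyte) is penalised once, while a changed consonant (delhi) is counted in both passes.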
Time to execute SoundEx function - 0.027247 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
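As an illustration, here is a minimal sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It may differ in edge cases from whatever library function this site calls, and the function name is chosen here for the example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no ASCII letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last_code:
            result += code
            if len(result) == 4:
                break
        last_code = code  # a vowel resets this, so a repeated digit can follow
    return result.ljust(4, "0")


for word in ["delta", "dealt", "daily-day", "telt"]:
    print(word, soundex(word))
```

This sketch maps delta, dealt and daily-day to D430, while telt maps to T430. Because Soundex keeps the first letter unchanged, words beginning with t never share a code with delta.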
Time to execute MetaPhone function - 0.036478 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000849 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
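A lookup of this kind can be sketched as a plain dictionary. The entries and the names `LEXICON` and `lookup_lemma` below are hypothetical, invented for this example; they are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEXICON = {
    "telt": "tell",
    "tellt": "tell",
    "tauld": "tell",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("Telt"))   # a covered variant
print(lookup_lemma("delyt"))  # not in this toy lexicon, so None
```

A dictionary lookup needs no per-word computation, which fits the very small execution time reported for this method. The trade-off is coverage, since every variant has to be entered by hand.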