A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eidmubarak in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eidmubarak (0) - 1 freq
embark (4) - 2 freq
eidgubnryq (4) - 2 freq
tidemark (5) - 2 freq
inbrak (5) - 2 freq
eidart (5) - 1 freq
widd-wark (5) - 1 freq
yirdquaak (5) - 2 freq
simbata (5) - 1 freq
embra (5) - 96 freq
embarras (5) - 1 freq
widdwark (5) - 1 freq
edouard (5) - 1 freq
widbank (5) - 1 freq
embrae (5) - 4 freq
doublan (6) - 2 freq
idaia (6) - 27 freq
reidware (6) - 1 freq
exemplary (6) - 2 freq
daybreak (6) - 1 freq
fag-brak (6) - 1 freq
samurai (6) - 7 freq
dark (6) - 378 freq
feedback (6) - 20 freq
brak (6) - 181 freq
eidmubarak (0) - 1 freq
embark (5) - 2 freq
edimbro (7) - 2 freq
daybreak (7) - 1 freq
embrae (7) - 4 freq
inbrak (7) - 2 freq
eidgubnryq (7) - 2 freq
tidemark (7) - 2 freq
embra (7) - 96 freq
cumback (8) - 1 freq
daybrakk (8) - 1 freq
draabaak (8) - 1 freq
barbarik (8) - 2 freq
winbreak (8) - 1 freq
admiral (8) - 3 freq
hamewark (8) - 5 freq
edinburry (8) - 1 freq
embers (8) - 7 freq
embro (8) - 86 freq
timber (8) - 4 freq
edinburra (8) - 5 freq
dmjack (8) - 1 freq
edinburg (8) - 1 freq
laebrak (8) - 1 freq
embarked (8) - 1 freq
SoundEx code - E351
edinburgh - 343 freq
edinbruh - 1 freq
edinbro - 21 freq
edinburry - 1 freq
edinburrie - 3 freq
edinburgh's - 10 freq
edimbro - 2 freq
edumification - 1 freq
edinboro - 2 freq
edinburrae - 2 freq
edinb - 1 freq
edenburgh - 3 freq
edinburgov - 1 freq
edinburra - 5 freq
edinbugh - 1 freq
edinburghpaper - 78 freq
edinvalefarm - 2 freq
edinburgb - 1 freq
edinburghtvfest - 1 freq
edinburg’s - 1 freq
edinburg - 1 freq
edinburghuni - 2 freq
eidmubarak - 1 freq
edinburghy - 1 freq
edinburghcityfc - 1 freq
edinburghrecord - 1 freq
edinburghlive - 83 freq
edinburgheastfc - 7 freq
edinburghsport - 3 freq
edinburghcity - 1 freq
edinburghup - 1 freq
MetaPhone code - ETMBRK
eidmubarak - 1 freq
EIDMUBARAK
Time to execute Levenshtein function - 0.246260 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.456067 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032668 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044647 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001135 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.