A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to mustafa in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (edit distance in parentheses)
mustafa (0) - 1 freq
mustna (2) - 1 freq
musta (2) - 47 freq
mustae (2) - 6 freq
mustard (2) - 18 freq
mustart (2) - 8 freq
ustae (3) - 2 freq
mutant (3) - 3 freq
mustert (3) - 1 freq
masala (3) - 1 freq
mustet (3) - 1 freq
must- (3) - 1 freq
mustuv (3) - 1 freq
mistaen (3) - 7 freq
custart (3) - 4 freq
muntain (3) - 25 freq
gustfu (3) - 1 freq
musty (3) - 2 freq
bustan (3) - 1 freq
austria (3) - 8 freq
mistake (3) - 73 freq
staffa (3) - 2 freq
mestalla (3) - 2 freq
mistaks (3) - 13 freq
must-uv (3) - 1 freq
Double Levenshtein (combined distance in parentheses)
mustafa (0) - 1 freq
mustae (3) - 6 freq
musta (3) - 47 freq
mustna (3) - 1 freq
mistake (4) - 73 freq
muster (4) - 7 freq
gustfu (4) - 1 freq
musto (4) - 1 freq
musty (4) - 2 freq
justify (4) - 14 freq
mistaek (4) - 2 freq
must (4) - 659 freq
mustnae (4) - 1 freq
mistak (4) - 41 freq
mustn (4) - 2 freq
mustart (4) - 8 freq
mustard (4) - 18 freq
must- (4) - 1 freq
mustet (4) - 1 freq
mistaen (4) - 7 freq
mustuv (4) - 1 freq
misty (5) - 15 freq
most (5) - 237 freq
mast (5) - 20 freq
fistfu (5) - 1 freq
SoundEx code - M231
might've - 2 freq
must've - 26 freq
most-definitely - 1 freq
micht've - 4 freq
macduff - 18 freq
mustafa - 1 freq
misadventure - 1 freq
must-uv - 1 freq
mashed-up - 2 freq
mixed-planet - 1 freq
mixed-up - 1 freq
mcduff's - 1 freq
moist-pinted - 1 freq
misty-blue - 1 freq
mystified - 2 freq
micht-hae-been - 1 freq
mystification - 1 freq
musket-bursts - 1 freq
mactavish - 15 freq
mactavish's - 6 freq
mctavish - 7 freq
mis-stepped - 1 freq
misstep - 1 freq
mystifeed - 1 freq
mystifin - 1 freq
mustuv - 1 freq
mak-it-up-as-ye-gang-alang - 1 freq
makkit-up - 1 freq
messed-aboot-wi - 1 freq
macduffer - 1 freq
misskatieprice - 2 freq
muhgtv - 1 freq
mhsdyfyodi - 1 freq
must'v - 1 freq
mikeydfc - 16 freq
mysideoflife - 1 freq
macduffaquarium - 1 freq
MetaPhone code - MSTF
must've - 26 freq
mustafa - 1 freq
must-uv - 1 freq
mustuv - 1 freq
must'v - 1 freq
Manually curated - MUSTAFA
Time to execute Levenshtein function - 0.215924 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
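As a minimal sketch of this definition, the Python function below computes the distance with the standard two-row dynamic-programming table. The function name `levenshtein` is illustrative only; it is not taken from the site's own code, which is not shown here.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                      # delete char_a
                current[j - 1] + 1,                   # insert char_b
                previous[j - 1] + substitution_cost,  # replace or keep
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("mustafa", "musta", "mustard", "mistake"):
        print(word, levenshtein("mustafa", word))
```

Run against the words listed above, this reproduces the distances shown in parentheses in the Levenshtein results, for example 2 for mustard and 3 for mistake.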
Time to execute Double Levenshtein function - 0.371383 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with their vowels removed, and adds the two distances together, giving double weight to consonants.
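The sketch below illustrates this double pass in Python. It is an assumption-laden reconstruction, not the site's code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating y as a vowel is a guess made because it reproduces the listed score of 4 for justify.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance using a two-row dynamic-programming table."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("mustae", "mistake", "must", "justify", "most"):
        print(word, double_levenshtein("mustafa", word))
```

Under these assumptions mustae scores 2 + 1 = 3 and mistake scores 3 + 1 = 4, matching the Double Levenshtein results above.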
Time to execute SoundEx function - 0.034459 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
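For illustration, here is a small Python sketch of the classic four-character Soundex encoding. It follows the commonly described American Soundex rules (h and w do not separate letters with the same code, vowels do) and ignores any character that is not an ASCII letter; whether the site's implementation handles every edge case the same way is an assumption.

```python
# Map each coded consonant to its Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous_code = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped without resetting the previous code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != previous_code:
            result += code
            if len(result) == 4:
                break
        previous_code = code
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("mustafa", "must've", "macduff", "mactavish"):
        print(word, soundex(word))
```

Each of the four sample words encodes to M231, which is why they appear together in the SoundEx results above.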
Time to execute MetaPhone function - 0.040545 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000844 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
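A lookup of this kind can be sketched in Python as a plain dictionary. The table contents here are placeholders: only the pairing of mustafa with MUSTAFA comes from the results above (assuming the upper-case form is the lemma), the second entry is invented purely for illustration, and the names `LEXICON` and `lemma_for` are hypothetical.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping surface forms to lemmas.
LEXICON = {
    "mustafa": "MUSTAFA",   # pairing taken from the results above
    "mustaffa": "MUSTAFA",  # invented spelling variant, for illustration only
}


def lemma_for(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return lexicon.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("Mustafa"))  # a covered word
    print(lemma_for("ahint"))    # not in this toy table, so None
```

A dictionary lookup involves no distance computation at all, which is consistent with this method having the shortest execution time reported above.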