A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to indyonskye in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
indyonskye (0) - 1 freq
indylassie (4) - 5 freq
indysoosie (4) - 1 freq
in-hoose (5) - 2 freq
indylive (5) - 3 freq
indyhibby (5) - 1 freq
donside (5) - 4 freq
doonside (5) - 4 freq
indy’s (5) - 1 freq
donsie (5) - 2 freq
ingyne (5) - 8 freq
indignance (5) - 2 freq
invoke (5) - 1 freq
indoors (5) - 9 freq
instinse (5) - 1 freq
indiane (5) - 1 freq
insense (5) - 7 freq
endorse (5) - 3 freq
doonsize (5) - 1 freq
norske (5) - 1 freq
anyone (5) - 67 freq
indignity (5) - 1 freq
yonkee (5) - 1 freq
ivryone (5) - 2 freq
indians (5) - 11 freq
indyonskye (0) - 1 freq
indians (6) - 11 freq
indysoosie (6) - 1 freq
indylassie (6) - 5 freq
intensive (7) - 7 freq
endonym (7) - 2 freq
incense (7) - 11 freq
dunskey (7) - 5 freq
norske (7) - 1 freq
norskie (7) - 1 freq
inions (7) - 1 freq
endeens (7) - 4 freq
dansk (7) - 1 freq
endins (7) - 13 freq
undone (7) - 4 freq
intense (7) - 25 freq
doonsize (7) - 1 freq
unsonsie (7) - 1 freq
nijinski (7) - 1 freq
endorse (7) - 3 freq
donsie (7) - 2 freq
indoors (7) - 9 freq
insense (7) - 7 freq
indiane (7) - 1 freq
donside (7) - 4 freq
SoundEx code - I535
in-atween - 2 freq
intent - 37 freq
intimidatin - 2 freq
intendit - 25 freq
indiana - 2 freq
intention - 24 freq
intense - 25 freq
indian - 34 freq
indian's - 1 freq
immediant - 1 freq
intend - 10 freq
intensive - 7 freq
intimmers - 25 freq
inhaudin - 1 freq
intended - 14 freq
intimations - 1 freq
indiaman - 1 freq
intonit - 1 freq
inhauden - 2 freq
intentions - 17 freq
inatween - 7 freq
intensity - 7 freq
intentional - 1 freq
intimidated - 3 freq
intently - 10 freq
intimidate - 2 freq
intentioned - 1 freq
indomitable - 1 freq
indians - 11 freq
intantly - 1 freq
intentionally - 1 freq
intimate - 10 freq
intmint - 1 freq
intimmers' - 1 freq
intments - 1 freq
indentured - 1 freq
intenshuns - 1 freq
intin - 2 freq
inhauddin - 1 freq
immediantlie - 1 freq
indiane - 1 freq
inten - 2 freq
intennin - 1 freq
indentation - 1 freq
intonation - 11 freq
intimacy - 1 freq
intendant - 1 freq
immedanthe - 1 freq
immedantlie - 4 freq
intimatit - 5 freq
intimately - 1 freq
intensitie - 1 freq
intangible - 3 freq
intiimers - 1 freq
intimeitit - 1 freq
intoned - 1 freq
intensifyin - 1 freq
intensely - 4 freq
indentify - 1 freq
intimidation - 2 freq
intemperate - 1 freq
intensified - 1 freq
inaathentic - 1 freq
intonaetion - 1 freq
intents - 2 freq
intimidatit - 1 freq
intendin - 3 freq
intented - 1 freq
intimbers - 1 freq
intimmer - 1 freq
inteemasee - 1 freq
imdoium - 1 freq
imodium - 1 freq
intensifier - 2 freq
indynursebrian - 1 freq
intmastclmc - 1 freq
indieandluna - 1 freq
iamnotanornithologist - 1 freq
indyonskye - 1 freq
intamzq - 1 freq
intensifiers - 2 freq
intensifer - 1 freq
indynowsnp - 1 freq
iandunt - 1 freq
MetaPhone code - INTYNSKY
indyonskye - 1 freq
INDYONSKYE
Time to execute Levenshtein function - 0.242513 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.397112 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028157 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038176 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000913 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.