A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to indian in Corpus

Levenshtein (distance in brackets)
indian (0) - 34 freq
indians (1) - 11 freq
indiane (1) - 1 freq
indiana (1) - 2 freq
india (1) - 21 freq
india' (1) - 1 freq
injin (2) - 33 freq
ninian (2) - 5 freq
indigo (2) - 2 freq
indite (2) - 1 freq
intin (2) - 2 freq
inklan (2) - 1 freq
binnian (2) - 2 freq
indict (2) - 1 freq
inlan (2) - 1 freq
indrawn (2) - 1 freq
inginan (2) - 1 freq
innin (2) - 34 freq
indie (2) - 1 freq
undoan (2) - 2 freq
mindan (2) - 7 freq
chindian (2) - 1 freq
inion (2) - 1 freq
inkin (2) - 1 freq
indian's (2) - 1 freq
Double Levenshtein (distance in brackets)
indian (0) - 34 freq
indiana (1) - 2 freq
indiane (1) - 1 freq
andean (2) - 1 freq
endin (2) - 43 freq
undoan (2) - 2 freq
endan (2) - 2 freq
india (2) - 21 freq
indians (2) - 11 freq
india' (2) - 1 freq
findin (3) - 71 freq
ninian (3) - 5 freq
ingan (3) - 11 freq
indyah (3) - 1 freq
inda (3) - 1 freq
ingin (3) - 18 freq
unduin (3) - 1 freq
midian (3) - 1 freq
bindin (3) - 6 freq
undain (3) - 1 freq
undon (3) - 1 freq
ndna (3) - 8 freq
endeen (3) - 3 freq
undaen (3) - 1 freq
windin (3) - 28 freq
SoundEx code - I535
in-atween - 2 freq
intent - 37 freq
intimidatin - 2 freq
intendit - 25 freq
indiana - 2 freq
intention - 24 freq
intense - 25 freq
indian - 34 freq
indian's - 1 freq
immediant - 1 freq
intend - 10 freq
intensive - 7 freq
intimmers - 25 freq
inhaudin - 1 freq
intended - 14 freq
intimations - 1 freq
indiaman - 1 freq
intonit - 1 freq
inhauden - 2 freq
intentions - 17 freq
inatween - 7 freq
intensity - 7 freq
intentional - 1 freq
intimidated - 3 freq
intently - 10 freq
intimidate - 2 freq
intentioned - 1 freq
indomitable - 1 freq
indians - 11 freq
intantly - 1 freq
intentionally - 1 freq
intimate - 10 freq
intmint - 1 freq
intimmers' - 1 freq
intments - 1 freq
indentured - 1 freq
intenshuns - 1 freq
intin - 2 freq
inhauddin - 1 freq
immediantlie - 1 freq
indiane - 1 freq
inten - 2 freq
intennin - 1 freq
indentation - 1 freq
intonation - 11 freq
intimacy - 1 freq
intendant - 1 freq
immedanthe - 1 freq
immedantlie - 4 freq
intimatit - 5 freq
intimately - 1 freq
intensitie - 1 freq
intangible - 3 freq
intiimers - 1 freq
intimeitit - 1 freq
intoned - 1 freq
intensifyin - 1 freq
intensely - 4 freq
indentify - 1 freq
intimidation - 2 freq
intemperate - 1 freq
intensified - 1 freq
inaathentic - 1 freq
intonaetion - 1 freq
intents - 2 freq
intimidatit - 1 freq
intendin - 3 freq
intented - 1 freq
intimbers - 1 freq
intimmer - 1 freq
inteemasee - 1 freq
imdoium - 1 freq
imodium - 1 freq
intensifier - 2 freq
indynursebrian - 1 freq
intmastclmc - 1 freq
indieandluna - 1 freq
iamnotanornithologist - 1 freq
indyonskye - 1 freq
intamzq - 1 freq
intensifiers - 2 freq
intensifer - 1 freq
indynowsnp - 1 freq
iandunt - 1 freq
MetaPhone code - INTN
indiana - 2 freq
indian - 34 freq
intin - 2 freq
indiane - 1 freq
inten - 2 freq
Manually curated - INDIAN
Time to execute Levenshtein function - 0.205847 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
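As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


for word in ["indian", "indians", "india", "injin", "ninian"]:
    print(word, levenshtein("indian", word))
```

For these five words the sketch gives 0, 1, 1, 2 and 2, matching the bracketed distances in the Levenshtein list.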
Time to execute Double Levenshtein function - 0.365957 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
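The idea can be sketched in Python as follows. This is a reconstruction from the description, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and the vowel set `aeiou` is an assumption, since the page does not say which letters are treated as vowels.

```python
VOWELS = set("aeiou")  # assumption: the exact vowel set is not stated on the page


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["indiana", "india", "andean", "findin"]:
    print(word, double_levenshtein("indian", word))
```

Under these assumptions, indiana scores 1 (its consonant skeleton ndn is unchanged), india and andean score 2, and findin scores 3, which agrees with the Double Levenshtein list.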
Time to execute SoundEx function - 0.028748 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
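To show how words as different as indian, intent and imodium end up sharing a code, here is a minimal Python sketch of the standard American Soundex rules. It is an illustration only; the implementation used by this page may differ in edge cases, and the function name `soundex` is chosen here for the example.

```python
# Letter groups of standard American Soundex; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()      # the first letter is kept as-is
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue                 # h and w do not separate equal codes
        code = CODES.get(ch, "")
        if code and code != last:    # skip runs of the same digit
            result += code
            if len(result) == 4:
                break
        last = code                  # a vowel resets the run
    return result.ljust(4, "0")      # pad short codes with zeros


for word in ["indian", "intent", "imodium", "in-atween"]:
    print(word, soundex(word))
```

All four words print I535, the code shown in the SoundEx list. Only the first letter and the next three consonant classes count, so long words collapse onto short ones.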
Time to execute MetaPhone function - 0.040193 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000922 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
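A lookup of this kind could be as simple as a dictionary, which would also explain why it is by far the fastest of the five methods. The Python sketch below is illustrative only. The page shows just the result for indian (INDIAN); the other entries, the `LEXICON` name and the `lemma_for` helper are hypothetical.

```python
# Hypothetical excerpt: the real lexicon's contents are not shown on the page.
LEXICON = {
    "indian": "INDIAN",    # the one mapping visible in the results above
    "indians": "INDIAN",   # assumed entry, for illustration
    "indian's": "INDIAN",  # assumed entry, for illustration
}


def lemma_for(word: str):
    """Return the curated lemma for word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


print(lemma_for("Indian"))
print(lemma_for("sonsie"))
```

Here `lemma_for("Indian")` returns INDIAN, while a word missing from the table, such as sonsie in this excerpt, returns `None`, reflecting the note that not all words are covered.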