A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to initiallie in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
initiallie (0) - 2 freq
initially (2) - 4 freq
initials (3) - 9 freq
installit (3) - 1 freq
initial (3) - 18 freq
naitrallie (3) - 1 freq
initiative (3) - 15 freq
install (4) - 1 freq
totallie (4) - 1 freq
natalie (4) - 7 freq
intiative (4) - 1 freq
intilla (4) - 1 freq
mentallie (4) - 1 freq
finallie (4) - 1 freq
installed (4) - 2 freq
intill't (4) - 1 freq
instilled (4) - 1 freq
mutuallie (4) - 1 freq
initiatives (4) - 27 freq
actuallie (4) - 3 freq
tillie (4) - 8 freq
instillt (4) - 1 freq
tentillie (4) - 2 freq
indwallin (4) - 1 freq
officiallie (4) - 1 freq
Double Levenshtein - combined distance in brackets
initiallie (0) - 2 freq
initially (2) - 4 freq
intilla (4) - 1 freq
initial (4) - 18 freq
naitrallie (4) - 1 freq
initials (4) - 9 freq
intill (4) - 125 freq
naitralie (5) - 1 freq
mutuallie (5) - 1 freq
actuallie (5) - 3 freq
tentillie (5) - 2 freq
naiturallie (5) - 2 freq
entirelie (5) - 1 freq
intull (5) - 5 freq
tillie (5) - 8 freq
mentallie (5) - 1 freq
install (5) - 1 freq
initiative (5) - 15 freq
installit (5) - 1 freq
totallie (5) - 1 freq
natalie (5) - 7 freq
cantillie (5) - 1 freq
stallie (5) - 1 freq
ontill (5) - 3 freq
litill (6) - 1 freq
SoundEx code - I534
indelible - 2 freq
intil - 554 freq
intil't - 41 freq
initials - 9 freq
intellect - 12 freq
indwaller - 1 freq
intelleck - 3 freq
indwallers - 9 freq
intult - 1 freq
indwallin - 1 freq
intill't - 1 freq
intelligence - 20 freq
intul - 1 freq
intill - 125 freq
inthelead - 1 freq
indulge - 4 freq
indulged - 3 freq
indulgit - 1 freq
intl - 1 freq
indulgent - 5 freq
intellectual - 18 freq
intellectuals - 5 freq
intolerable - 3 freq
initial - 18 freq
intolerant - 2 freq
initiallie - 2 freq
intilt - 11 freq
initially - 4 freq
indulgin - 1 freq
intelligence' - 1 freq
in-til - 1 freq
intelligent - 12 freq
intelligint - 1 freq
intull - 5 freq
i'middle - 3 freq
intelligibeility - 5 freq
intilla - 1 freq
ineetial - 1 freq
intelligible - 5 freq
intelligibility - 2 freq
indelibly - 1 freq
intolerance - 4 freq
indwellers - 2 freq
indulgence - 4 freq
indelicate - 2 freq
intelligentsia - 5 freq
‘intae-lecher-all - 1 freq
indwell - 1 freq
intellek - 2 freq
indolence - 1 freq
intellectually - 1 freq
‘intellectual - 1 freq
indylive - 3 freq
indylassie - 5 freq
ihndls - 1 freq
intulectual - 1 freq
indylanp - 1 freq
intellect'll - 1 freq
indywildycat - 4 freq
MetaPhone code - INXL
initial - 18 freq
initiallie - 2 freq
initially - 4 freq
ineetial - 1 freq
Manually curated - INITIALLIE
Time to execute Levenshtein function - 0.377769 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
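As a rough illustration of the distance described above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("initiallie", "initially"))  # 2
print(levenshtein("initiallie", "initial"))    # 3
```

The two printed values agree with the distances shown in brackets in the Levenshtein results for initially and initial.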
Time to execute Double Levenshtein function - 0.782605 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
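The idea can be sketched in Python as follows. This is a guess at the approach, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating y as a vowel is an assumption, made only because it reproduces the listed score of 2 for initially.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is counted as a vowel (inferred from the results, not documented).
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("initiallie", "initially"))   # 2
print(double_levenshtein("initiallie", "initiative"))  # 5
```

Because a consonant change is counted in both passes and a vowel change in only one, initiative moves from distance 3 to 5 while initially stays at 2.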
Time to execute SoundEx function - 0.028378 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
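For illustration, here is a small Python sketch of the classic four-character Soundex encoding. It is not the function this site calls; the name `soundex` and the decision to ignore anything that is not an ASCII letter (apostrophes, hyphens, stray quote marks) are assumptions for the example.

```python
def soundex(word: str) -> str:
    """Return the four-character Soundex code of word ('' if it has no letters)."""
    codes = {}
    for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in group:
            codes[ch] = digit

    # Assumption: punctuation and non-ASCII characters are simply skipped.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # vowels reset this, so a repeated code can recur
        if len(result) == 4:
            break
    return result.ljust(4, "0")


print(soundex("initiallie"))  # I534
print(soundex("intellect"))   # I534
```

Only the first letter and the next three consonant codes survive, which is why such different words as indelible and intelligence share the code I534.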
Time to execute MetaPhone function - 0.089744 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000982 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
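A lookup of this kind can be sketched with a plain dictionary. The table below is hypothetical and holds a single entry taken from the result shown on this page; reading INITIALLIE as the lemma for initiallie is an assumption, and the names `LEXICON` and `lookup_lemma` are invented for the example.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon: surface form -> lemma.
LEXICON = {
    "initiallie": "INITIALLIE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("initiallie"))  # INITIALLIE
print(lookup_lemma("zzzz"))        # None
```

A dictionary lookup does no distance computation at all, which fits with this being the fastest of the five methods timed above.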