A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to innocence in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
innocence (0) - 18 freq
indolence (2) - 1 freq
insolence (2) - 4 freq
innocent (2) - 35 freq
innocents (2) - 2 freq
annoyance (3) - 12 freq
ignorance (3) - 32 freq
incense (3) - 10 freq
insolence' (3) - 1 freq
announce (3) - 7 freq
innocently (3) - 2 freq
influence (3) - 76 freq
impotence (3) - 1 freq
annoonce (3) - 4 freq
annoyince (3) - 2 freq
inference (3) - 2 freq
anoonce (4) - 1 freq
influenced (4) - 16 freq
invokerie (4) - 1 freq
conscience (4) - 43 freq
florence (4) - 29 freq
announced (4) - 31 freq
innoyer (4) - 1 freq
jflorence (4) - 1 freq
impidence (4) - 8 freq
Double Levenshtein
innocence (0) - 18 freq
innocent (3) - 35 freq
announce (4) - 7 freq
annoyince (4) - 2 freq
annoonce (4) - 4 freq
annoyance (4) - 12 freq
insolence (4) - 4 freq
innocents (4) - 2 freq
indolence (4) - 1 freq
ninepence (5) - 1 freq
nonsence (5) - 1 freq
concience (5) - 1 freq
inference (5) - 2 freq
innocently (5) - 2 freq
incense (5) - 10 freq
ignorance (5) - 32 freq
influence (5) - 76 freq
annuncee (6) - 1 freq
annooncet (6) - 1 freq
announces (6) - 11 freq
indecent (6) - 1 freq
annone (6) - 1 freq
annoonced (6) - 9 freq
instance (6) - 158 freq
licence (6) - 29 freq
SoundEx code - I525
inchin - 2 freq
imagine - 231 freq
innocent - 35 freq
imaginin - 14 freq
incomers - 13 freq
ingin - 18 freq
innocent-like - 2 freq
insnorlin - 1 freq
incomer - 9 freq
imagined - 35 freq
ingans - 9 freq
ingan - 11 freq
insense - 7 freq
ingine - 33 freq
ingine-ile - 1 freq
income - 15 freq
ingang - 21 freq
imagination - 56 freq
'imagine - 1 freq
inconvenienced - 1 freq
injunction - 1 freq
insensed - 4 freq
imkin - 1 freq
ingins - 14 freq
injins - 1 freq
injin - 33 freq
ingineer - 2 freq
ingens - 1 freq
innocence - 18 freq
incendiaries - 1 freq
insane - 8 freq
imaginins - 2 freq
insinuations - 1 freq
imaigination - 7 freq
insensin - 2 freq
ingines - 4 freq
insnorlit - 1 freq
innocents - 2 freq
incandescent - 1 freq
incompleteness - 1 freq
imaigined - 4 freq
imaigine - 9 freq
injuns - 4 freq
inchmaholm - 1 freq
imaginary - 17 freq
incensin - 1 freq
incaains - 1 freq
incense - 10 freq
injine - 7 freq
injin's - 6 freq
injine's - 3 freq
inginan - 1 freq
insensitive - 3 freq
incomplete - 4 freq
insincerity - 1 freq
incomins - 2 freq
'imagination' - 1 freq
insanity - 4 freq
imagines - 1 freq
insomnia - 3 freq
inconsequential - 3 freq
inconvenience - 4 freq
inchantment - 1 freq
'ingyne' - 1 freq
inchantit - 2 freq
innocently - 2 freq
imaagine - 3 freq
imaaginary - 1 freq
injoyin - 2 freq
incompetence - 3 freq
inkin - 1 freq
incomprehensible - 3 freq
inconsiderable - 1 freq
incumbent - 2 freq
inchinnan - 1 freq
insinuate - 1 freq
incinerator - 2 freq
ingine-hoose - 1 freq
inchin' - 1 freq
incinerate - 2 freq
inconsistent - 3 freq
inconvenient - 2 freq
imagin - 5 freq
incomin - 6 freq
imaginations - 3 freq
inconsolable - 2 freq
innismurray - 1 freq
incummers - 4 freq
ingaun - 2 freq
ingangs - 5 freq
imaginautiouns - 1 freq
incummin - 1 freq
incoman - 1 freq
injines - 2 freq
injineer - 1 freq
imajin - 2 freq
ingenious - 2 freq
incentive - 6 freq
imaagination - 1 freq
incontinent - 2 freq
incontinence - 1 freq
ink-smudged - 1 freq
imaginan - 4 freq
imagean - 1 freq
i'ingin - 3 freq
inconsistency - 1 freq
incum - 1 freq
inconstant - 1 freq
ingyne - 8 freq
ingines- - 1 freq
incomparable - 1 freq
ingineert - 1 freq
incensed - 3 freq
ins-an-oots - 5 freq
ingenuity - 2 freq
incoonter - 1 freq
ingenuitie - 1 freq
insensitivity - 1 freq
imaigin - 2 freq
inchmakenneth - 1 freq
-inghame - 2 freq
injum - 1 freq
“ingan - 1 freq
incam - 3 freq
imaiginins - 1 freq
inginerein - 1 freq
ingenerit - 1 freq
insnorled - 1 freq
inginerin - 1 freq
imaiginautioun - 1 freq
ingineerin - 1 freq
imaginative - 4 freq
incantation - 1 freq
incommin - 1 freq
‘incantations - 1 freq
imagining - 3 freq
incenses - 1 freq
insnorlt - 1 freq
imaiginable - 1 freq
incoming - 1 freq
incontrovertible - 1 freq
inconspicuous - 1 freq
insenses - 1 freq
imaginaetion - 3 freq
incompatible - 2 freq
imaginatively - 1 freq
insinuatin - 2 freq
“imagine - 1 freq
inconsistencies - 1 freq
ingyin - 2 freq
ingaen - 1 freq
insnorl - 1 freq
insentients - 1 freq
insaemuckle - 1 freq
incongruously - 1 freq
imagint - 2 freq
incompatibeelity - 1 freq
incongruous - 1 freq
incompetent - 2 freq
in-comers - 1 freq
in-comin - 1 freq
inchna - 2 freq
incomes - 1 freq
innocent-kythin - 1 freq
inkomirs - 1 freq
insinseer - 1 freq
imachin - 1 freq
inconspikuos - 1 freq
inching - 1 freq
incinerators - 1 freq
iamacant - 14 freq
ianjamesparsley - 15 freq
ianssmart - 1 freq
insomniac - 1 freq
imcmillan - 2 freq
iainkingsport - 8 freq
iansummer - 1 freq
in-gang'n - 1 freq
imaginable - 1 freq
imaginery - 1 freq
ianswansonen - 1 freq
iancumnock - 1 freq
inginanaw - 1 freq
incongru - 1 freq
ingenuitynasa - 1 freq
iansmudger - 1 freq
MetaPhone code - INSNS
insense - 7 freq
innocence - 18 freq
incense - 10 freq
Manually curated - INNOCENCE
Time to execute Levenshtein function - 0.205622 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
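The distance described above can be sketched in a few lines of Python. This is a minimal dynamic-programming version, not necessarily the function this page runs; the name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn a into b."""
    # Keep the shorter word in the inner loop to use less memory.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j chars of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty prefix costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("innocent", "indolence", "annoyance"):
        print(word, levenshtein("innocence", word))
```

Run against words from the list above, this gives 2 for innocent and indolence and 3 for annoyance, matching the bracketed scores.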
Time to execute Double Levenshtein function - 0.363166 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full word and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
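A rough sketch of that two-pass idea follows. The vowel set is an assumption: treating `y` as a vowel alongside `aeiou` reproduces the scores listed above (for example innocently at 5), but the page does not say which letters it strips. The function names are illustrative.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is not stated on the page.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("innocent", "announce", "innocently"):
        print(word, double_levenshtein("innocence", word))
```

Because consonant edits are counted in both passes and vowel edits only in the first, a spelling that differs mainly in its vowels (annoonce, annoyince) stays closer than one that swaps consonants.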
Time to execute SoundEx function - 0.027793 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
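As an illustration, here is a sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or truncate to four characters. It is assumed, not confirmed, that the page uses these same rules; the function name `soundex` is chosen here.

```python
def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus three digits."""
    codes = {}
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6")):
        for ch in letters:
            codes[ch] = digit

    # Ignore quotes, hyphens and other non-letters.
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        digit = codes.get(ch, "")  # vowels (and any unmapped letter) give ""
        if digit and digit != previous:
            result += digit
            if len(result) == 4:
                break
        previous = digit  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("innocence", "imagine", "incense", "'imagine"):
        print(word, soundex(word))
```

All four words above come out as I525, which is why such different-looking entries share one bucket in the SoundEx list.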
Time to execute MetaPhone function - 0.037613 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000797 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.
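A lookup of this kind could be sketched as a plain dictionary. Everything here is hypothetical apart from the innocence to INNOCENCE pairing shown in the results: the table name `LEXICON`, the function `lemma_for`, and the extra entries are invented for illustration and are not taken from the real lexicon.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping surface forms to lemmas.
# Only the "innocence" entry mirrors the result shown on this page;
# the other entries are made up to show typos and variants sharing a lemma.
LEXICON = {
    "innocence": "INNOCENCE",
    "innocense": "INNOCENCE",  # hypothetical typo
    "inocence": "INNOCENCE",   # hypothetical spelling variant
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


if __name__ == "__main__":
    print(lemma_for("innocence"))
    print(lemma_for("ahint"))  # not in this toy table, so None
```

A single dictionary lookup does no per-word computation, which fits this method having the shortest execution time of the five. Returning `None` for uncovered words reflects the note that not all words are covered.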