A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to income in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
income (0) - 16 freq
incomer (1) - 9 freq
incomes (1) - 1 freq
oncome (1) - 40 freq
incore (1) - 2 freq
incoman (2) - 1 freq
winsome (2) - 5 freq
incase (2) - 11 freq
insole (2) - 1 freq
inlove (2) - 1 freq
incomers (2) - 13 freq
incum (2) - 1 freq
invoke (2) - 1 freq
encore (2) - 5 freq
upcome (2) - 2 freq
come (2) - 3162 freq
€œcome (2) - 32 freq
on-come (2) - 1 freq
become (2) - 197 freq
inco (2) - 1 freq
dinsome (2) - 3 freq
€˜come (2) - 8 freq
incomin (2) - 7 freq
'come (2) - 73 freq
gnome (2) - 2 freq
income (0) - 16 freq
oncome (1) - 40 freq
incum (2) - 1 freq
incam (2) - 3 freq
incore (2) - 2 freq
incomer (2) - 9 freq
incomes (2) - 1 freq
become (3) - 197 freq
on-come (3) - 1 freq
inco (3) - 1 freq
incite (3) - 1 freq
oncoms (3) - 1 freq
come (3) - 3162 freq
'come (3) - 73 freq
incomin (3) - 7 freq
upcome (3) - 2 freq
incoman (3) - 1 freq
incase (3) - 11 freq
oncum (3) - 50 freq
encore (3) - 5 freq
coma (4) - 13 freq
unco (4) - 328 freq
como (4) - 1 freq
nom (4) - 2 freq
uncomfy (4) - 1 freq
SoundEx code - I525
inchin - 2 freq
imagine - 241 freq
innocent - 37 freq
imaginin - 14 freq
incomers - 13 freq
ingin - 18 freq
innocent-like - 2 freq
insnorlin - 1 freq
incomer - 9 freq
imagined - 38 freq
ingans - 9 freq
ingan - 11 freq
insense - 7 freq
ingine - 33 freq
ingine-ile - 1 freq
income - 16 freq
ingang - 21 freq
imagination - 60 freq
'imagine - 1 freq
inconvenienced - 1 freq
injunction - 1 freq
insensed - 4 freq
imkin - 1 freq
ingins - 15 freq
injins - 1 freq
injin - 33 freq
ingineer - 2 freq
ingens - 1 freq
innocence - 18 freq
incendiaries - 1 freq
insane - 8 freq
imaginins - 2 freq
insinuations - 1 freq
imaigination - 7 freq
insensin - 2 freq
ingines - 4 freq
insnorlit - 1 freq
innocents - 2 freq
incandescent - 1 freq
incompleteness - 1 freq
imaigined - 4 freq
imaigine - 9 freq
injuns - 4 freq
inchmaholm - 1 freq
imaginary - 19 freq
incensin - 1 freq
incaains - 1 freq
incense - 11 freq
injine - 7 freq
injin's - 6 freq
injine's - 3 freq
inginan - 1 freq
insensitive - 3 freq
incomplete - 4 freq
insincerity - 1 freq
imagining - 4 freq
incongruously - 2 freq
insantly - 1 freq
incomin - 7 freq
ingaun - 3 freq
imaginations - 4 freq
incomins - 2 freq
'imagination' - 1 freq
insanity - 4 freq
imagines - 1 freq
insomnia - 3 freq
inconsequential - 3 freq
inconvenience - 4 freq
inchantment - 1 freq
'ingyne' - 1 freq
inchantit - 2 freq
innocently - 2 freq
imaagine - 3 freq
imaaginary - 1 freq
injoyin - 2 freq
incompetence - 3 freq
inkin - 1 freq
incomprehensible - 3 freq
inconsiderable - 1 freq
incumbent - 2 freq
inchinnan - 1 freq
insinuate - 1 freq
incinerator - 2 freq
ingine-hoose - 1 freq
inchin' - 1 freq
incinerate - 2 freq
inconsistent - 3 freq
inconvenient - 2 freq
imagin - 5 freq
inconsolable - 2 freq
innismurray - 1 freq
incummers - 4 freq
ingangs - 5 freq
imaginautiouns - 1 freq
incummin - 1 freq
incoman - 1 freq
injines - 2 freq
injineer - 1 freq
imajin - 2 freq
ingenious - 2 freq
incentive - 6 freq
imaagination - 1 freq
incontinent - 2 freq
incontinence - 1 freq
ink-smudged - 1 freq
imaginan - 4 freq
imagean - 1 freq
i'ingin - 3 freq
inconsistency - 1 freq
incum - 1 freq
inconstant - 1 freq
ingyne - 8 freq
ingines- - 1 freq
incomparable - 1 freq
ingineert - 1 freq
incensed - 3 freq
ins-an-oots - 5 freq
ingenuity - 2 freq
incoonter - 1 freq
ingenuitie - 1 freq
insensitivity - 1 freq
imaigin - 2 freq
inchmakenneth - 1 freq
-inghame - 2 freq
injum - 1 freq
€œingan - 1 freq
incam - 3 freq
imaiginins - 1 freq
inginerein - 1 freq
ingenerit - 1 freq
insnorled - 1 freq
inginerin - 1 freq
imaiginautioun - 1 freq
ingineerin - 1 freq
imaginative - 4 freq
incantation - 1 freq
incommin - 1 freq
€˜incantations - 1 freq
incenses - 1 freq
insnorlt - 1 freq
imaiginable - 1 freq
incoming - 1 freq
incontrovertible - 1 freq
inconspicuous - 1 freq
insenses - 1 freq
imaginaetion - 3 freq
incompatible - 2 freq
imaginatively - 1 freq
insinuatin - 2 freq
€œimagine - 1 freq
inconsistencies - 1 freq
ingyin - 2 freq
ingaen - 1 freq
insnorl - 1 freq
insentients - 1 freq
insaemuckle - 1 freq
imagint - 2 freq
incompatibeelity - 1 freq
incongruous - 1 freq
incompetent - 2 freq
in-comers - 1 freq
in-comin - 1 freq
inchna - 2 freq
incomes - 1 freq
innocent-kythin - 1 freq
inkomirs - 1 freq
insinseer - 1 freq
imachin - 1 freq
inconspikuos - 1 freq
inching - 1 freq
incinerators - 1 freq
iamacant - 14 freq
ianjamesparsley - 15 freq
ianssmart - 1 freq
insomniac - 1 freq
imcmillan - 2 freq
iainkingsport - 8 freq
iansummer - 1 freq
in-gang'n - 1 freq
imaginable - 1 freq
imaginery - 1 freq
ianswansonen - 1 freq
iancumnock - 1 freq
inginanaw - 1 freq
incongru - 1 freq
ingenuitynasa - 1 freq
iansmudger - 1 freq
MetaPhone code - INKM
income - 16 freq
incum - 1 freq
incam - 3 freq
INCOME
Time to execute Levenshtein function - 0.600361 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.991055 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028976 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.097572 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000881 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.