A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to industry in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
industry (0) - 55 freq
ministry (3) - 4 freq
industries (3) - 15 freq
nddusty (3) - 1 freq
intry (3) - 3 freq
infantry (3) - 2 freq
injust (3) - 1 freq
ancestry (3) - 6 freq
injury (3) - 16 freq
inquiry (3) - 14 freq
inducted (3) - 1 freq
unjustly (3) - 2 freq
industrial (3) - 32 freq
dusty (3) - 18 freq
dusted (4) - 6 freq
dusts (4) - 2 freq
inkster (4) - 1 freq
i'dustbin (4) - 1 freq
'nesty (4) - 1 freq
i'dundy (4) - 1 freq
infur (4) - 1 freq
industrious (4) - 2 freq
insistan (4) - 1 freq
fidgetry (4) - 1 freq
circuitry (4) - 1 freq
industry (0) - 55 freq
ancestry (4) - 6 freq
industries (4) - 15 freq
industrial (4) - 32 freq
inkster (5) - 1 freq
dusty (5) - 18 freq
nostre (5) - 1 freq
duster (5) - 12 freq
destroy (5) - 16 freq
unjustly (5) - 2 freq
industrious (5) - 2 freq
dustir (5) - 1 freq
intry (5) - 3 freq
injust (5) - 1 freq
infantry (5) - 2 freq
inducted (5) - 1 freq
nddusty (5) - 1 freq
ministry (5) - 4 freq
windsor (6) - 8 freq
dust (6) - 89 freq
pedester (6) - 1 freq
histry (6) - 14 freq
naisty (6) - 10 freq
intro (6) - 6 freq
artistry (6) - 1 freq
SoundEx code - I532
industrial - 32 freq
indignantly - 10 freq
intoxicatin - 1 freq
indies - 6 freq
index - 17 freq
intact - 14 freq
industries - 15 freq
integrity - 13 freq
industry - 55 freq
indigestible - 1 freq
indicatin - 8 freq
indignance - 2 freq
intestate - 1 freq
indicated - 4 freq
intestines - 1 freq
intak - 2 freq
in'ts - 2 freq
intac - 2 freq
indicater - 1 freq
indicates - 13 freq
indicate - 9 freq
indicating - 1 freq
indicator - 2 freq
inducees - 1 freq
indigenous - 36 freq
induce - 1 freq
inducin - 3 freq
indication - 5 freq
immodest - 1 freq
induced - 5 freq
inducted - 1 freq
indigestion - 1 freq
ints - 1 freq
induction - 2 freq
'index-linked' - 2 freq
inte's - 13 freq
indignint - 2 freq
industries' - 1 freq
indoctrination - 2 freq
inuits - 2 freq
induces - 1 freq
indignant - 8 freq
inmates - 1 freq
inadequate - 4 freq
indignity - 1 freq
indiscernible - 2 freq
inducements - 1 freq
indicatit - 4 freq
indisgestible - 1 freq
integral - 10 freq
indigo - 2 freq
inidcative - 1 freq
indicative - 2 freq
integratin - 2 freq
indispensable - 2 freq
intake - 4 freq
indignation - 9 freq
industrious - 2 freq
integrated - 3 freq
indications - 2 freq
indescrievable - 1 freq
indo-chinois - 1 freq
indestructible - 2 freq
inhauds - 3 freq
'industrial - 1 freq
in-twist- - 1 freq
indiginous - 2 freq
intaks - 1 freq
integration - 5 freq
intakkin - 1 freq
indistinctly - 1 freq
industrialisation - 1 freq
integrate - 5 freq
integratit - 3 freq
indecipherable - 1 freq
industrialised - 2 freq
intoxicating' - 1 freq
indeigenous - 1 freq
indict - 1 freq
intestine - 1 freq
€œintegrity - 1 freq
intack - 1 freq
indecent - 1 freq
indisputable - 1 freq
intoxicated - 1 freq
indiscretions - 1 freq
indyscotwales - 7 freq
inthisweather - 1 freq
‘industrial’ - 1 freq
indescretions - 1 freq
indyÂ’s - 1 freq
indykaila - 1 freq
iantakto - 1 freq
industrialvid - 1 freq
indyscotparty - 1 freq
indyscotland - 1 freq
inthechoir - 1 freq
iaindoesjokes - 2 freq
iainwhytesnp - 1 freq
'indigenous' - 1 freq
indycamp - 3 freq
indstatehapp - 2 freq
indigodreamspub - 1 freq
indoctrinated - 2 freq
indysoosie - 1 freq
imwatson - 1 freq
iantsaoir - 1 freq
indigofast - 1 freq
intoxicating - 1 freq
indoctrinators - 1 freq
iaindgordon - 1 freq
indigenousyouth - 1 freq
indigenouspeoples - 1 freq
indigenouscommunities - 1 freq
indyscotnews - 1 freq
ineedquiet - 1 freq
MetaPhone code - INTSTR
industry - 55 freq
INDUSTRY
Time to execute Levenshtein function - 0.182262 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.344719 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027751 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037047 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000936 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.