A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to imwatson in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (edit distance in brackets)
imwatson (0) - 1 freq
watson (2) - 46 freq
imprison (3) - 1 freq
watsons (3) - 3 freq
mwilson (3) - 1 freq
hewitson (3) - 1 freq
'maison (3) - 3 freq
maison (3) - 4 freq
jimwaterson (3) - 11 freq
iewilson (3) - 1 freq
ilawson (3) - 8 freq
imitation (3) - 4 freq
masson (3) - 3 freq
manson (3) - 14 freq
matron (3) - 2 freq
mason (3) - 11 freq
impsan (4) - 1 freq
adamson (4) - 1 freq
wappon (4) - 4 freq
duration (4) - 5 freq
downton (4) - 1 freq
massin (4) - 2 freq
animation (4) - 6 freq
watshod (4) - 2 freq
teason (4) - 2 freq

Double Levenshtein (combined distance in brackets)
imwatson (0) - 1 freq
watson (3) - 46 freq
hewitson (4) - 1 freq
mwilson (4) - 1 freq
imitation (5) - 4 freq
masson (5) - 3 freq
manson (5) - 14 freq
matron (5) - 2 freq
ilawson (5) - 8 freq
mason (5) - 11 freq
iewilson (5) - 1 freq
watsons (5) - 3 freq
imprison (5) - 1 freq
jimwaterson (5) - 11 freq
maison (5) - 4 freq
morison (6) - 1 freq
matsuo (6) - 2 freq
wulson (6) - 8 freq
bertson (6) - 1 freq
wattin (6) - 2 freq
featsin (6) - 2 freq
mowten (6) - 1 freq
maxton (6) - 1 freq
milton (6) - 4 freq
mansion (6) - 25 freq

SoundEx code - I532
industrial - 32 freq
indignantly - 10 freq
intoxicatin - 1 freq
indies - 6 freq
index - 16 freq
intact - 14 freq
industries - 14 freq
integrity - 13 freq
industry - 54 freq
indigestible - 1 freq
indicatin - 8 freq
indignance - 2 freq
intestate - 1 freq
indicated - 3 freq
intestines - 1 freq
intak - 2 freq
in'ts - 2 freq
intac - 2 freq
indicater - 1 freq
indicates - 13 freq
indicate - 9 freq
induced - 5 freq
inducted - 1 freq
indigestion - 1 freq
ints - 1 freq
indication - 4 freq
induction - 2 freq
'index-linked' - 2 freq
inte's - 13 freq
indignint - 2 freq
industries' - 1 freq
indigenous - 35 freq
indoctrination - 2 freq
inuits - 2 freq
induces - 1 freq
indignant - 8 freq
inmates - 1 freq
inadequate - 4 freq
indignity - 1 freq
indiscernible - 2 freq
inducements - 1 freq
indicatit - 4 freq
indisgestible - 1 freq
integral - 10 freq
indigo - 2 freq
inidcative - 1 freq
indicative - 2 freq
integratin - 2 freq
indispensable - 2 freq
intake - 4 freq
indignation - 9 freq
industrious - 2 freq
integrated - 3 freq
indications - 2 freq
indescrievable - 1 freq
indo-chinois - 1 freq
indestructible - 2 freq
inhauds - 3 freq
'industrial - 1 freq
in-twist- - 1 freq
indiginous - 2 freq
intaks - 1 freq
integration - 5 freq
intakkin - 1 freq
indicator - 1 freq
indistinctly - 1 freq
industrialisation - 1 freq
integrate - 5 freq
integratit - 3 freq
indecipherable - 1 freq
industrialised - 2 freq
intoxicating' - 1 freq
indeigenous - 1 freq
indict - 1 freq
intestine - 1 freq
inducin - 2 freq
“integrity - 1 freq
intack - 1 freq
indecent - 1 freq
indisputable - 1 freq
intoxicated - 1 freq
indiscretions - 1 freq
indyscotwales - 7 freq
inthisweather - 1 freq
‘industrial’ - 1 freq
indescretions - 1 freq
indy’s - 1 freq
indykaila - 1 freq
iantakto - 1 freq
industrialvid - 1 freq
indyscotparty - 1 freq
indyscotland - 1 freq
inthechoir - 1 freq
iaindoesjokes - 2 freq
iainwhytesnp - 1 freq
'indigenous' - 1 freq
indycamp - 3 freq
indstatehapp - 2 freq
indigodreamspub - 1 freq
indoctrinated - 2 freq
indysoosie - 1 freq
imwatson - 1 freq
iantsaoir - 1 freq
indigofast - 1 freq
intoxicating - 1 freq
indoctrinators - 1 freq
iaindgordon - 1 freq
indigenousyouth - 1 freq
indigenouspeoples - 1 freq
indigenouscommunities - 1 freq
indyscotnews - 1 freq
ineedquiet - 1 freq

MetaPhone code - IMWTSN
imwatson - 1 freq

Manually curated
IMWATSON
Time to execute Levenshtein function - 0.252952 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
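As an illustration of the definition above, here is a minimal sketch of the distance and of ranking a word list by it. It is not the site's own code; the function names `levenshtein` and `nearest` and the small word list are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    replacements needed to turn a into b (row-by-row dynamic programming)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word: str, vocabulary: list[str], limit: int = 5) -> list[str]:
    """Hypothetical helper: the vocabulary sorted by distance to word."""
    return sorted(vocabulary, key=lambda w: (levenshtein(word, w), w))[:limit]


# Illustrative word list taken from the results on this page.
words = ["mason", "watson", "imwatson", "adamson"]
print(nearest("imwatson", words, limit=3))
```

The distances this sketch gives for watson (2) and imprison (3) agree with the figures listed above.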
Time to execute Double Levenshtein function - 0.469629 milliseconds
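In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants. For example, watson scores 2 on the full words plus 1 with the vowels removed (wtsn against mwtsn), for a total of 3.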
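The combined score can be sketched as follows. This is a guess at the approach, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o and u as vowels is an assumption (it does reproduce the figures listed above).

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: insertions, deletions and replacements cost 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for candidate in ("watson", "hewitson", "imitation"):
    print(candidate, double_levenshtein("imwatson", candidate))
```

Because a consonant change shows up in both passes while a vowel change shows up only in the first, words that differ mainly in their vowels rank closer than words that differ in their consonants.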
Time to execute SoundEx function - 0.031082 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
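To show why so many unrelated words share the code I532, here is a minimal sketch of the standard American Soundex rules. It is an illustration rather than the implementation used here; the function name `soundex` and the choice to ignore non-ASCII characters and punctuation are assumptions.

```python
# Letter groups that share a digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """First letter plus three digits, padded with zeros."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so repeated digits can recur
    return result.ljust(4, "0")


for word in ("imwatson", "industrial", "inmates", "ineedquiet"):
    print(word, soundex(word))
```

Only the first letter and the next three coded consonants matter, so long words that start the same way collapse to one code, which is why this list is much longer than the Levenshtein ones.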
Time to execute MetaPhone function - 0.042786 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000862 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
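A lookup of this kind can be sketched as a plain dictionary. The entries and the name `lookup_lemma` below are hypothetical and exist only to illustrate the idea; the real lexicon and its lemma conventions are not shown on this page.

```python
from typing import Optional

# Hypothetical entries for illustration only, not taken from the real lexicon.
LEXICON = {
    "hoose": "hoose",
    "hooses": "hoose",  # inflected form
    "hoos": "hoose",    # treated here as a spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("Hooses"))    # covered in this toy table
print(lookup_lemma("imwatson"))  # not covered in this toy table
```

A dictionary lookup does no searching or comparison, which fits the very small execution time reported for this method.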