A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to intact in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
intact (0) - 14 freq
intac (1) - 2 freq
intack (1) - 1 freq
inact (1) - 1 freq
ontac (2) - 2 freq
intak (2) - 2 freq
intake (2) - 4 freq
infant (2) - 20 freq
intert (2) - 1 freq
inti't (2) - 1 freq
ingait (2) - 8 freq
inject (2) - 4 freq
inyect (2) - 1 freq
infect (2) - 1 freq
tact (2) - 8 freq
intaed (2) - 1 freq
entac (2) - 1 freq
intay (2) - 3 freq
yin-act (2) - 2 freq
intae't (2) - 1 freq
intae (2) - 4586 freq
intaks (2) - 1 freq
intent (2) - 37 freq
intaea (2) - 1 freq
contact (2) - 83 freq
intact (0) - 14 freq
intack (2) - 1 freq
inact (2) - 1 freq
intac (2) - 2 freq
nact (3) - 1 freq
instict (3) - 1 freq
contact (3) - 83 freq
intae't (3) - 1 freq
intult (3) - 1 freq
intent (3) - 37 freq
interact (3) - 9 freq
notict (3) - 1 freq
insect (3) - 3 freq
yin-act (3) - 2 freq
indict (3) - 1 freq
intilt (3) - 11 freq
intit (3) - 11 freq
intert (3) - 1 freq
inyect (3) - 1 freq
inject (3) - 4 freq
inti't (3) - 1 freq
tact (3) - 8 freq
infect (3) - 1 freq
ontac (3) - 2 freq
entac (3) - 1 freq
SoundEx code - I532
industrial - 32 freq
indignantly - 10 freq
intoxicatin - 1 freq
indies - 6 freq
index - 16 freq
intact - 14 freq
industries - 14 freq
integrity - 13 freq
industry - 54 freq
indigestible - 1 freq
indicatin - 8 freq
indignance - 2 freq
intestate - 1 freq
indicated - 3 freq
intestines - 1 freq
intak - 2 freq
in'ts - 2 freq
intac - 2 freq
indicater - 1 freq
indicates - 13 freq
indicate - 9 freq
induced - 5 freq
inducted - 1 freq
indigestion - 1 freq
ints - 1 freq
indication - 4 freq
induction - 2 freq
'index-linked' - 2 freq
inte's - 13 freq
indignint - 2 freq
industries' - 1 freq
indigenous - 35 freq
indoctrination - 2 freq
inuits - 2 freq
induces - 1 freq
indignant - 8 freq
inmates - 1 freq
inadequate - 4 freq
indignity - 1 freq
indiscernible - 2 freq
inducements - 1 freq
indicatit - 4 freq
indisgestible - 1 freq
integral - 10 freq
indigo - 2 freq
inidcative - 1 freq
indicative - 2 freq
integratin - 2 freq
indispensable - 2 freq
intake - 4 freq
indignation - 9 freq
industrious - 2 freq
integrated - 3 freq
indications - 2 freq
indescrievable - 1 freq
indo-chinois - 1 freq
indestructible - 2 freq
inhauds - 3 freq
'industrial - 1 freq
in-twist- - 1 freq
indiginous - 2 freq
intaks - 1 freq
integration - 5 freq
intakkin - 1 freq
indicator - 1 freq
indistinctly - 1 freq
industrialisation - 1 freq
integrate - 5 freq
integratit - 3 freq
indecipherable - 1 freq
industrialised - 2 freq
intoxicating' - 1 freq
indeigenous - 1 freq
indict - 1 freq
intestine - 1 freq
inducin - 2 freq
€œintegrity - 1 freq
intack - 1 freq
indecent - 1 freq
indisputable - 1 freq
intoxicated - 1 freq
indiscretions - 1 freq
indyscotwales - 7 freq
inthisweather - 1 freq
‘industrial’ - 1 freq
indescretions - 1 freq
indyÂ’s - 1 freq
indykaila - 1 freq
iantakto - 1 freq
industrialvid - 1 freq
indyscotparty - 1 freq
indyscotland - 1 freq
inthechoir - 1 freq
iaindoesjokes - 2 freq
iainwhytesnp - 1 freq
'indigenous' - 1 freq
indycamp - 3 freq
indstatehapp - 2 freq
indigodreamspub - 1 freq
indoctrinated - 2 freq
indysoosie - 1 freq
imwatson - 1 freq
iantsaoir - 1 freq
indigofast - 1 freq
intoxicating - 1 freq
indoctrinators - 1 freq
iaindgordon - 1 freq
indigenousyouth - 1 freq
indigenouspeoples - 1 freq
indigenouscommunities - 1 freq
indyscotnews - 1 freq
ineedquiet - 1 freq
MetaPhone code - INTKT
intact - 14 freq
indicate - 9 freq
inadequate - 4 freq
indict - 1 freq
iantakto - 1 freq
ineedquiet - 1 freq
INTACT
Time to execute Levenshtein function - 0.270508 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.445383 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034437 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045420 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001107 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.