A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to unkit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
unkist (1) - 2 freq
unfit (1) - 1 freq
gunkit (1) - 1 freq
unrit (1) - 1 freq
unkil (1) - 4 freq
yunkit (1) - 1 freq
junkit (1) - 1 freq
hunkit (1) - 2 freq
unit (1) - 41 freq
unkin (1) - 4 freq
inpit (2) - 49 freq
muntit (2) - 3 freq
rankit (2) - 1 freq
lukkit (2) - 6 freq
stunkit (2) - 2 freq
bikit (2) - 1 freq
nait (2) - 1 freq
luikit (2) - 132 freq
prunkit (2) - 1 freq
unkept (2) - 2 freq
unmet (2) - 3 freq
makit (2) - 41 freq
hunkie (2) - 20 freq
spunkit (2) - 1 freq
lykit (2) - 5 freq
yunkit (1) - 1 freq
unkin (2) - 4 freq
unkist (2) - 2 freq
nekit (2) - 2 freq
nkt (2) - 1 freq
nakit (2) - 30 freq
unit (2) - 41 freq
yankit (2) - 1 freq
gunkit (2) - 1 freq
unfit (2) - 1 freq
hunkit (2) - 2 freq
unkil (2) - 4 freq
unrit (2) - 1 freq
junkit (2) - 1 freq
jinkit (3) - 7 freq
unkent (3) - 50 freq
neukit (3) - 1 freq
skit (3) - 3 freq
intit (3) - 11 freq
gunkt (3) - 2 freq
snakit (3) - 3 freq
winkit (3) - 11 freq
inkin (3) - 1 freq
linkit (3) - 15 freq
sinkit (3) - 1 freq
SoundEx code - U523
unsettl't - 2 freq
unquateness - 1 freq
uncut - 1 freq
unsteady - 3 freq
unsettled - 3 freq
uncouthie - 3 freq
unstringin - 1 freq
unstoppable - 5 freq
unsettlin - 3 freq
unsettle - 1 freq
un-yeesed - 1 freq
unsaed - 1 freq
ungoadly - 1 freq
unskaithd - 1 freq
unsettles - 1 freq
unheuked - 1 freq
unsaid - 5 freq
unsteik - 1 freq
unsteikit - 1 freq
unsheathit - 1 freq
'unsteik - 1 freq
unstappit - 2 freq
unsticking - 1 freq
unkit's - 1 freq
unhooked - 1 freq
unasked-for - 1 freq
uncouth - 5 freq
unmistakeable - 3 freq
unsatisfied - 1 freq
unsuitable - 2 freq
unwashit - 1 freq
unsteeked - 1 freq
unstuck - 1 freq
unstick - 1 freq
unctioneer - 2 freq
unstaundart - 2 freq
unwashed - 3 freq
unst - 12 freq
unsettling - 2 freq
unstitute - 1 freq
unsteek - 3 freq
ungat - 1 freq
unstapt - 1 freq
unsheddied - 1 freq
unshadowed - 1 freq
unstable - 1 freq
unsatisfactorie - 1 freq
unwasht - 1 freq
unsteekit - 2 freq
unwaashed - 1 freq
unstappable - 2 freq
unstressed - 27 freq
unstessed - 1 freq
unquait - 1 freq
unction - 1 freq
unwaged - 1 freq
unsattled - 1 freq
unstintin - 1 freq
unsturdy - 1 freq
ungodly - 2 freq
unsatisfactor - 1 freq
uncuddomt - 1 freq
unstappin - 1 freq
unstickan - 1 freq
unmistakible - 1 freq
unhowkit - 1 freq
unused - 2 freq
unstintit - 1 freq
unctuous - 1 freq
unsaturatit - 1 freq
unweshed - 1 freq
unmistakable - 2 freq
unsteeks - 1 freq
unscathed - 2 freq
unsatisfaiän - 1 freq
ungwdrj - 1 freq
unstfest - 1 freq
unstagram - 1 freq
unmasked - 1 freq
unistrathclyde - 1 freq
unsteddy - 1 freq
unstlass - 3 freq
MetaPhone code - UNKT
uncut - 1 freq
ungat - 1 freq
unquait - 1 freq
UNKIT
Time to execute Levenshtein function - 0.611085 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.007554 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.087363 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.098245 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000875 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.