A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to uncut in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
uncut (0) - 1 freq
“cut (2) - 1 freq
cut (2) - 455 freq
nout (2) - 2 freq
duncht (2) - 1 freq
unhurt (2) - 1 freq
uncos (2) - 3 freq
‘cut (2) - 1 freq
unca (2) - 12 freq
ungat (2) - 1 freq
cnut (2) - 1 freq
unum (2) - 1 freq
uncast (2) - 1 freq
undur (2) - 2 freq
untul (2) - 1 freq
unyt (2) - 7 freq
unjust (2) - 4 freq
uncle (2) - 274 freq
uncrc (2) - 1 freq
unul (2) - 1 freq
unit (2) - 41 freq
uncan (2) - 57 freq
ancus (2) - 2 freq
input (2) - 4 freq
unst (2) - 12 freq
Double Levenshtein
uncut (0) - 1 freq
nact (3) - 1 freq
incum (3) - 1 freq
unce (3) - 7 freq
incite (3) - 1 freq
unt (3) - 1 freq
ancus (3) - 2 freq
input (3) - 4 freq
unst (3) - 12 freq
nut (3) - 127 freq
incur (3) - 2 freq
unco (3) - 325 freq
oncum (3) - 50 freq
inact (3) - 1 freq
ncht (3) - 5 freq
unrit (3) - 1 freq
uncan (3) - 57 freq
'cut (3) - 5 freq
uncouth (3) - 5 freq
unmet (3) - 3 freq
unfit (3) - 1 freq
nout (3) - 2 freq
cut (3) - 455 freq
uncast (3) - 1 freq
uncos (3) - 3 freq
SoundEx code - U523
unsettl't - 2 freq
unquateness - 1 freq
uncut - 1 freq
unsteady - 3 freq
unsettled - 3 freq
uncouthie - 3 freq
unstringin - 1 freq
unstoppable - 5 freq
unsettlin - 3 freq
unsettle - 1 freq
un-yeesed - 1 freq
unsaed - 1 freq
ungoadly - 1 freq
unskaithd - 1 freq
unsettles - 1 freq
unheuked - 1 freq
unsaid - 5 freq
unsteik - 1 freq
unsteikit - 1 freq
unsheathit - 1 freq
'unsteik - 1 freq
unstappit - 2 freq
unsticking - 1 freq
unkit's - 1 freq
unhooked - 1 freq
unasked-for - 1 freq
uncouth - 5 freq
unmistakeable - 3 freq
unsatisfied - 1 freq
unsuitable - 2 freq
unwashit - 1 freq
unsteeked - 1 freq
unstuck - 1 freq
unstick - 1 freq
unctioneer - 2 freq
unstaundart - 2 freq
unwashed - 3 freq
unst - 12 freq
unsettling - 2 freq
unstitute - 1 freq
unsteek - 3 freq
ungat - 1 freq
unstapt - 1 freq
unsheddied - 1 freq
unshadowed - 1 freq
unstable - 1 freq
unsatisfactorie - 1 freq
unwasht - 1 freq
unsteekit - 2 freq
unwaashed - 1 freq
unstappable - 2 freq
unstressed - 27 freq
unstessed - 1 freq
unquait - 1 freq
unction - 1 freq
unwaged - 1 freq
unsattled - 1 freq
unstintin - 1 freq
unsturdy - 1 freq
ungodly - 2 freq
unsatisfactor - 1 freq
uncuddomt - 1 freq
unstappin - 1 freq
unstickan - 1 freq
unmistakible - 1 freq
unhowkit - 1 freq
unused - 2 freq
unstintit - 1 freq
unctuous - 1 freq
unsaturatit - 1 freq
unweshed - 1 freq
unmistakable - 2 freq
unsteeks - 1 freq
unscathed - 2 freq
unsatisfaiän - 1 freq
ungwdrj - 1 freq
unstfest - 1 freq
unstagram - 1 freq
unmasked - 1 freq
unistrathclyde - 1 freq
unsteddy - 1 freq
unstlass - 3 freq
MetaPhone code - UNKT
uncut - 1 freq
ungat - 1 freq
unquait - 1 freq
Manually curated
UNCUT
Time to execute Levenshtein function - 0.203091 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
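As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("uncut", "cut"))   # 2, matching "cut (2)" in the list
print(levenshtein("uncut", "unca"))  # 2, matching "unca (2)"
```

Only two rows of the table are kept at a time, which is enough when the distance itself is all that is needed.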
Time to execute Double Levenshtein function - 0.330472 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
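The doubled calculation can be sketched in Python as below. This is an illustration, not the code behind this page. It assumes that "vowels" means the letters a, e, i, o and u only, which is a guess, although it does reproduce the scores listed for uncut. The names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("uncut", "cut"))     # 2 + 1 = 3
print(double_levenshtein("uncut", "incite"))  # 3 + 0 = 3
```

A vowel-only difference such as uncut and incite costs nothing in the second pass, while a missing consonant is counted in both passes.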
Time to execute SoundEx function - 0.027448 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
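To show how a code such as U523 arises, here is a small Python sketch of the commonly described American Soundex rules. It is not necessarily the exact variant this page runs, and the handling of hyphens and non-ASCII letters (they are simply dropped) is an assumption made for the example.

```python
def soundex(word: str) -> str:
    """Return a four-character Soundex code, e.g. 'U523' for 'uncut'."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: anything that is not a plain a-z letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""

    result = letters[0].upper()          # the first letter is kept as is
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue                     # h and w do not separate equal codes
        digit = codes.get(ch, "")        # vowels and y have no code
        if digit and digit != previous:
            result += digit
        previous = digit                 # a vowel resets the previous code
        if len(result) == 4:
            break
    return result.ljust(4, "0")          # pad short codes with zeros


print(soundex("uncut"))     # U523
print(soundex("unwashed"))  # U523
```

Only the first three coded consonants after the initial letter count, which is why long words such as unmistakeable end up beside uncut.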
Time to execute MetaPhone function - 0.036593 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000844 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
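A lookup of this kind can be sketched in Python with a dictionary. The table below is hypothetical: apart from uncut, which the results above pair with UNCUT, the entries and lemma spellings are made up for illustration and are not taken from the real lexicon. The names `LEMMA_TABLE` and `lookup_lemma` are likewise invented.

```python
from typing import Optional

# Hypothetical hand-made lexicon: spelling variant -> lemma.
LEMMA_TABLE = {
    "uncut": "UNCUT",
    "unsteik": "UNSTEEK",    # illustrative entry only
    "unsteek": "UNSTEEK",    # illustrative entry only
    "unsteikit": "UNSTEEK",  # illustrative entry only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEMMA_TABLE.get(word.lower())


print(lookup_lemma("uncut"))   # UNCUT
print(lookup_lemma("unquait")) # None, because this sketch has no entry for it
```

A single dictionary lookup involves no distance calculation at all, which fits with this method being the fastest of the five timings reported.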