A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to unst in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein matches (distance in brackets)
unst (0) - 12 freq
unit (1) - 41 freq
unyt (1) - 7 freq
unt (1) - 1 freq
uist (1) - 9 freq
wist (2) - 3 freq
uppt (2) - 1 freq
usb (2) - 2 freq
upset (2) - 65 freq
uks (2) - 1 freq
un (2) - 31 freq
ast (2) - 8 freq
munt (2) - 14 freq
urns (2) - 2 freq
uesw (2) - 1 freq
on't (2) - 39 freq
usc (2) - 1 freq
ns (2) - 15 freq
puns (2) - 4 freq
unce (2) - 7 freq
nut (2) - 127 freq
in't (2) - 42 freq
unum (2) - 1 freq
buns (2) - 22 freq
test (2) - 145 freq
Double Levenshtein matches (combined distance in brackets)
unst (0) - 12 freq
onset (2) - 7 freq
ainst (2) - 2 freq
inset (2) - 2 freq
nest (2) - 81 freq
unit (2) - 41 freq
yinst (2) - 77 freq
unt (2) - 1 freq
unyt (2) - 7 freq
insta (2) - 2 freq
uist (2) - 9 freq
buist (3) - 5 freq
rist (3) - 15 freq
jyst (3) - 7 freq
uncut (3) - 1 freq
ngt (3) - 1 freq
ungat (3) - 1 freq
canst (3) - 2 freq
west (3) - 211 freq
lest (3) - 215 freq
quest (3) - 18 freq
kost (3) - 1 freq
aist (3) - 21 freq
nt (3) - 34 freq
nsh (3) - 1 freq
SoundEx code - U523
unsettl't - 2 freq
unquateness - 1 freq
uncut - 1 freq
unsteady - 3 freq
unsettled - 3 freq
uncouthie - 3 freq
unstringin - 1 freq
unstoppable - 5 freq
unsettlin - 3 freq
unsettle - 1 freq
un-yeesed - 1 freq
unsaed - 1 freq
ungoadly - 1 freq
unskaithd - 1 freq
unsettles - 1 freq
unheuked - 1 freq
unsaid - 5 freq
unsteik - 1 freq
unsteikit - 1 freq
unsheathit - 1 freq
'unsteik - 1 freq
unstappit - 2 freq
unsticking - 1 freq
unkit's - 1 freq
unhooked - 1 freq
unasked-for - 1 freq
uncouth - 5 freq
unmistakeable - 3 freq
unsatisfied - 1 freq
unsuitable - 2 freq
unwashit - 1 freq
unsteeked - 1 freq
unstuck - 1 freq
unstick - 1 freq
unctioneer - 2 freq
unstaundart - 2 freq
unwashed - 3 freq
unst - 12 freq
unsettling - 2 freq
unstitute - 1 freq
unsteek - 3 freq
ungat - 1 freq
unstapt - 1 freq
unsheddied - 1 freq
unshadowed - 1 freq
unstable - 1 freq
unsatisfactorie - 1 freq
unwasht - 1 freq
unsteekit - 2 freq
unwaashed - 1 freq
unstappable - 2 freq
unstressed - 27 freq
unstessed - 1 freq
unquait - 1 freq
unction - 1 freq
unwaged - 1 freq
unsattled - 1 freq
unstintin - 1 freq
unsturdy - 1 freq
ungodly - 2 freq
unsatisfactor - 1 freq
uncuddomt - 1 freq
unstappin - 1 freq
unstickan - 1 freq
unmistakible - 1 freq
unhowkit - 1 freq
unused - 2 freq
unstintit - 1 freq
unctuous - 1 freq
unsaturatit - 1 freq
unweshed - 1 freq
unmistakable - 2 freq
unsteeks - 1 freq
unscathed - 2 freq
unsatisfaiän - 1 freq
ungwdrj - 1 freq
unstfest - 1 freq
unstagram - 1 freq
unmasked - 1 freq
unistrathclyde - 1 freq
unsteddy - 1 freq
unstlass - 3 freq
MetaPhone code - UNST
unsaed - 1 freq
unsaid - 5 freq
unst - 12 freq
unused - 2 freq
Manually curated - UNST
Time to execute Levenshtein function - 0.176592 milliseconds
The Levenshtein distance is the minimum number of single-character substitutions, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
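As a rough illustration of that definition, the distance can be computed with the standard dynamic-programming approach. This is a minimal sketch, not the site's own code; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the edit distance between a and b (unit cost per edit)."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if char_a == char_b else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("unst", "unit", "upset", "nut"):
        print(word, levenshtein("unst", word))
```

Run against the query word, this reproduces the bracketed values in the Levenshtein list, for example 1 for unit and 2 for upset.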
Time to execute Double Levenshtein function - 0.319477 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
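The two-pass idea can be sketched as follows. This is an illustration under stated assumptions rather than the site's implementation: the names `levenshtein`, `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set (a, e, i, o, u and y) is a guess. The page does not say which letters are stripped, although listed scores such as yinst (2) and jyst (3) are consistent with y being counted as a vowel.

```python
VOWELS = set("aeiouy")  # assumption: the page does not list the stripped letters


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit cost per substitution, insertion, deletion."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, keeping the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("onset", "unit", "yinst", "west"):
        print(word, double_levenshtein("unst", word))
```

Because consonant differences are counted in both passes, onset (same skeleton, nst) scores 2 while west (skeleton wst) scores 3, matching the listed results.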
Time to execute SoundEx function - 0.027803 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
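For illustration, here is a minimal sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The function name `soundex` and the decision to ignore non-ASCII letters and punctuation are assumptions made here; the site's own routine may differ in edge cases.

```python
# Digit classes used by standard American Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if word has no ASCII letters."""
    # Assumption: apostrophes, hyphens and non-ASCII letters are simply skipped.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("unst", "unsteik", "unwashed", "unmistakeable"):
        print(word, soundex(word))
```

All four sample words encode to U523, which is why they appear together in the SoundEx list even though their spellings differ widely.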
Time to execute MetaPhone function - 0.037154 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000799 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
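A lookup of this kind amounts to a dictionary query that may find nothing. The sketch below is hypothetical: the names `LEXICON` and `curated_lookup` are invented, the case folding is an assumption, and the single entry simply mirrors the result shown above for unst rather than the real table.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon; the real table is not shown.
LEXICON = {
    "unst": "UNST",  # mirrors the "Manually curated" result on this page
}


def curated_lookup(word: str) -> Optional[str]:
    """Return the curated entry for word, or None when the word is not covered."""
    # Assumption: lookups ignore case.
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(curated_lookup("unst"))     # covered word
    print(curated_lookup("unsteik"))  # not in this toy table, so None
```

Returning `None` for uncovered words keeps the "not all words are covered" case explicit for the caller.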