A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to "unused" in Corpus

Levenshtein | Double Levenshtein | SoundEx | MetaPhone | Manually curated

Levenshtein (distance in brackets)
unused (0) - 2 freq
doused (2) - 2 freq
onuised (2) - 1 freq
loused (2) - 5 freq
snushed (2) - 3 freq
used (2) - 682 freq
knuse (2) - 1 freq
‘used (2) - 1 freq
hunsed (2) - 1 freq
unusal (2) - 1 freq
unnser (2) - 8 freq
hunksed (2) - 1 freq
snuved (2) - 3 freq
unwed (2) - 1 freq
unser (2) - 3 freq
unsee (2) - 1 freq
roused (2) - 14 freq
amused (2) - 18 freq
uised (2) - 281 freq
fused (2) - 6 freq
unisex (2) - 1 freq
infused (2) - 4 freq
abused (2) - 6 freq
paused (2) - 74 freq
ansed (2) - 1 freq
Double Levenshtein (combined distance in brackets)
unused (0) - 2 freq
nosed (2) - 6 freq
unsaed (2) - 1 freq
ansed (2) - 1 freq
onuised (2) - 1 freq
caused (3) - 78 freq
abused (3) - 6 freq
mused (3) - 4 freq
uissed (3) - 4 freq
haused (3) - 3 freq
paused (3) - 74 freq
nursed (3) - 5 freq
housed (3) - 1 freq
ensued (3) - 3 freq
aniseed (3) - 3 freq
untied (3) - 7 freq
united (3) - 75 freq
infused (3) - 4 freq
eused (3) - 1 freq
soused (3) - 2 freq
unsaid (3) - 6 freq
loused (3) - 5 freq
unwed (3) - 1 freq
unusal (3) - 1 freq
hunsed (3) - 1 freq
SoundEx code - U523
unsettl't - 2 freq
unquateness - 1 freq
uncut - 1 freq
unsteady - 3 freq
unsettled - 3 freq
uncouthie - 3 freq
unstringin - 1 freq
unstoppable - 5 freq
unsettlin - 3 freq
unsettle - 1 freq
un-yeesed - 1 freq
unsaed - 1 freq
ungoadly - 1 freq
unskaithd - 1 freq
unsettles - 1 freq
unheuked - 1 freq
unsaid - 6 freq
unsteik - 1 freq
unsteikit - 1 freq
unsheathit - 1 freq
'unsteik - 1 freq
unstappit - 2 freq
unsticking - 1 freq
unsettling - 3 freq
unmistakable - 3 freq
unwashed - 5 freq
unkit's - 1 freq
unhooked - 1 freq
unasked-for - 1 freq
uncouth - 5 freq
unmistakeable - 3 freq
unsatisfied - 1 freq
unsuitable - 2 freq
unwashit - 1 freq
unsteeked - 1 freq
unstuck - 1 freq
unstick - 1 freq
unctioneer - 2 freq
unstaundart - 2 freq
unst - 12 freq
unstitute - 1 freq
unsteek - 3 freq
ungat - 1 freq
unstapt - 1 freq
unsheddied - 1 freq
unshadowed - 1 freq
unstable - 1 freq
unsatisfactorie - 1 freq
unwasht - 1 freq
unsteekit - 2 freq
unwaashed - 1 freq
unstappable - 2 freq
unstressed - 27 freq
unstessed - 1 freq
unquait - 1 freq
unction - 1 freq
unwaged - 1 freq
unsattled - 1 freq
unstintin - 1 freq
unsturdy - 1 freq
ungodly - 2 freq
unsatisfactor - 1 freq
uncuddomt - 1 freq
unstappin - 1 freq
unstickan - 1 freq
unmistakible - 1 freq
unhowkit - 1 freq
unused - 2 freq
unstintit - 1 freq
unctuous - 1 freq
unsaturatit - 1 freq
unweshed - 1 freq
unsteeks - 1 freq
unscathed - 2 freq
unsatisfaiän - 1 freq
ungwdrj - 1 freq
unstfest - 1 freq
unstagram - 1 freq
unmasked - 1 freq
unistrathclyde - 1 freq
unsteddy - 1 freq
unstlass - 3 freq
MetaPhone code - UNST
unsaed - 1 freq
unsaid - 6 freq
unst - 12 freq
unused - 2 freq
Manually curated
UNUSED
Time to execute Levenshtein function - 0.555019 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
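The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs. The helper `nearest` and its `max_distance` parameter are hypothetical names chosen for the example, and the sample vocabulary is a handful of words taken from this page.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions or
    deletions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace (or keep) the character
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=2):
    """Hypothetical helper: (distance, word) pairs within max_distance,
    closest first."""
    scored = ((levenshtein(word, candidate), candidate) for candidate in vocabulary)
    return sorted(pair for pair in scored if pair[0] <= max_distance)


sample = ["unused", "doused", "used", "caused", "ablow"]
print(nearest("unused", sample))
```

Keeping only two rows of the table at a time keeps memory proportional to the shorter word, which matters little for single words but is cheap to do.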
Time to execute Double Levenshtein function - 0.909260 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
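A rough sketch of that idea in Python follows. It assumes the vowels removed are a, e, i, o and u, which is a guess that happens to reproduce the distances listed on this page. The function names are illustrative and the site's actual implementation may differ.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# "ansed" differs from "unused" only in vowels, so the second pass adds nothing;
# "caused" also swaps a consonant (n -> c), so it is pushed further away.
print(double_levenshtein("unused", "ansed"))
print(double_levenshtein("unused", "caused"))
```

Under this scheme a consonant change is counted in both passes while a vowel change is counted once, which is why spelling variants that differ only in vowels rank closer.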
Time to execute SoundEx function - 0.088163 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
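To make the encoding concrete, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent identical digits, and pad or cut to four characters. It is an illustration under those assumptions rather than the function this site calls, and it simply skips any character outside A to Z.

```python
SOUNDEX_DIGITS = {}
for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                     ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in group:
        SOUNDEX_DIGITS[letter] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, or "" if the word has no A-Z letters."""
    letters = [ch for ch in word.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    code = letters[0]
    last = SOUNDEX_DIGITS.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same digit
        digit = SOUNDEX_DIGITS.get(ch, "")  # vowels and Y map to ""
        if digit and digit != last:
            code += digit
            if len(code) == 4:
                break
        last = digit  # a vowel resets this, so a repeated digit counts again
    return code.ljust(4, "0")


for word in ("unused", "unsaid", "unst", "unmistakable"):
    print(word, soundex(word))
```

All four sample words come out as U523, matching the code shown for "unused" above, which illustrates how coarse the grouping is: only the first letter and the next three consonant classes are kept.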
Time to execute MetaPhone function - 0.103755 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000880 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
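A lookup of this kind can be sketched as a plain dictionary in Python. Only the pairing of "unused" with UNUSED comes from this page. The second entry, the name `LEXICON`, the function `lemma_for` and the case-insensitive matching are all hypothetical, added to show the shape of the table.

```python
from typing import Optional

# Hand-built table from surface form to lemma.
LEXICON = {
    "unused": "UNUSED",  # pairing shown on this page
    "unusd": "UNUSED",   # hypothetical typo entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lemma_for("unused"))
print(lemma_for("ablow"))  # not in this toy table, so None
```

A dictionary lookup does no computation on the word itself, which is consistent with this method having the shortest execution time of the five reported here.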