A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to alaska in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
alaska (0) - 2 freq
alanna (2) - 2 freq
flasks (2) - 9 freq
lanka (2) - 7 freq
plasma (2) - 5 freq
alaek (2) - 2 freq
flask (2) - 22 freq
glassa (2) - 1 freq
alsua (2) - 1 freq
eliska (2) - 1 freq
alsa (2) - 1 freq
alana (2) - 12 freq
alack (2) - 1 freq
alas (2) - 42 freq
laik (3) - 62 freq
blast (3) - 70 freq
alaang (3) - 1 freq
gask (3) - 2 freq
alairm (3) - 7 freq
clank (3) - 2 freq
plaiks (3) - 1 freq
clak (3) - 2 freq
glasgw (3) - 1 freq
masks (3) - 27 freq
'las (3) - 4 freq
alaska (0) - 2 freq
eliska (2) - 1 freq
alsa (3) - 1 freq
alas (3) - 42 freq
alsua (3) - 1 freq
alack (3) - 1 freq
laesk (3) - 4 freq
lanka (3) - 7 freq
alaek (3) - 2 freq
flask (3) - 22 freq
ulrika (4) - 2 freq
lark (4) - 23 freq
lika (4) - 1 freq
lak (4) - 10 freq
ask (4) - 518 freq
ziska (4) - 1 freq
als (4) - 27 freq
laste (4) - 2 freq
alsae (4) - 9 freq
alek (4) - 3 freq
alson (4) - 1 freq
lanky (4) - 11 freq
hask (4) - 4 freq
ilkka (4) - 1 freq
alset (4) - 1 freq
SoundEx code - A420
always - 408 freq
alex - 62 freq
alice - 1584 freq
ailsa - 32 freq
also - 320 freq
alkie - 3 freq
alas - 42 freq
'always - 3 freq
alike - 26 freq
alky - 10 freq
awhyles - 1 freq
alec - 112 freq
ales' - 1 freq
alloos - 38 freq
'allie's - 1 freq
ails - 9 freq
alicia - 1 freq
aalweys - 2 freq
alwis - 23 freq
algae - 3 freq
alleys - 3 freq
aloes - 1 freq
alikie - 1 freq
aleckie - 6 freq
aleck - 2 freq
alck - 1 freq
alek - 3 freq
alcckie - 1 freq
ayewels - 1 freq
ayewils - 1 freq
ahlways - 1 freq
als - 27 freq
alweys - 44 freq
allows - 4 freq
alsae - 9 freq
al's - 1 freq
alec's - 8 freq
allous - 27 freq
ahlice - 380 freq
allies - 8 freq
alcoho - 1 freq
alexa - 14 freq
alice' - 1 freq
allus - 4 freq
al'wiys - 1 freq
al'ways - 2 freq
al'wis - 2 freq
al'wiz - 4 freq
alwiz' - 1 freq
alwyes - 9 freq
ailwyes - 2 freq
aeolus - 2 freq
aless - 26 freq
alce - 1 freq
'aless - 1 freq
alloys - 1 freq
alecs - 1 freq
allege - 1 freq
aalways - 29 freq
aloos - 4 freq
alse - 8 freq
alki - 1 freq
aalwis - 6 freq
alaska - 2 freq
alwayis - 10 freq
alleyways - 1 freq
alhce - 1 freq
alaek - 2 freq
-alees - 1 freq
ailice - 295 freq
'ailice' - 1 freq
aiulice - 1 freq
alick - 1 freq
alwys - 1 freq
alwais - 1 freq
'alec - 1 freq
allays - 1 freq
alehoose - 1 freq
allooes - 2 freq
€™allais - 1 freq
€˜alex - 1 freq
alsua - 1 freq
alwaaays - 1 freq
alok - 1 freq
alessia - 2 freq
alack - 1 freq
€œalexa - 9 freq
€˜alexa - 1 freq
alex' - 1 freq
'alas - 1 freq
alsa - 1 freq
alæk - 1 freq
€˜also - 1 freq
€˜always - 2 freq
ali-a's - 1 freq
€œalias - 1 freq
ales - 1 freq
aliÂ’s - 1 freq
allyÂ’s - 2 freq
all's - 5 freq
aalegs - 1 freq
alexx - 9 freq
alles - 1 freq
alzzz - 1 freq
alwaes - 1 freq
alwgq - 1 freq
a-holes - 1 freq
MetaPhone code - ALSK
alaska - 2 freq
ALASKA
Time to execute Levenshtein function - 0.456034 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.903801 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028779 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.095713 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000861 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.