A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to light in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
light (0) - 300 freq
wight (1) - 1 freq
lyght (1) - 2 freq
liht (1) - 2 freq
right (1) - 1436 freq
night (1) - 955 freq
slight (1) - 11 freq
hight (1) - 2 freq
flight (1) - 45 freq
plight (1) - 11 freq
tight (1) - 75 freq
eight (1) - 69 freq
ight (1) - 1 freq
fight (1) - 95 freq
might (1) - 421 freq
sight (1) - 134 freq
blight (1) - 5 freq
gight (1) - 3 freq
dight (1) - 3 freq
licht (1) - 909 freq
lights (1) - 104 freq
alight (1) - 3 freq
laigh (2) - 15 freq
lit (2) - 283 freq
signt (2) - 7 freq
Double Levenshtein (distance in brackets)
light (0) - 300 freq
lyght (1) - 2 freq
alight (1) - 3 freq
licht (2) - 909 freq
gight (2) - 3 freq
blight (2) - 5 freq
sight (2) - 134 freq
lights (2) - 104 freq
liht (2) - 2 freq
laught (2) - 1 freq
lightee (2) - 1 freq
wight (2) - 1 freq
laaght (2) - 1 freq
might (2) - 421 freq
dight (2) - 3 freq
slight (2) - 11 freq
right (2) - 1436 freq
flight (2) - 45 freq
fight (2) - 95 freq
plight (2) - 11 freq
tight (2) - 75 freq
ight (2) - 1 freq
eight (2) - 69 freq
night (2) - 955 freq
hight (2) - 2 freq
SoundEx code - L230
looked - 839 freq
licht - 909 freq
least - 520 freq
last - 1873 freq
luiked - 75 freq
list - 193 freq
lashed - 11 freq
lauched - 75 freq
leesed - 1 freq
lost - 493 freq
lowsed - 72 freq
liked - 189 freq
laacht - 6 freq
light - 300 freq
laist - 110 freq
lest - 218 freq
leukit - 305 freq
loackt - 2 freq
lookt - 14 freq
loast - 142 freq
laughed - 97 freq
lassie'd - 2 freq
locked - 48 freq
'looked - 1 freq
liquid - 30 freq
laest - 48 freq
likit - 158 freq
lowst - 3 freq
loshtie - 3 freq
leukt - 52 freq
lookit - 338 freq
'least - 1 freq
licet - 1 freq
lukked - 56 freq
'last - 3 freq
luikt - 49 freq
luikit - 133 freq
lockit - 33 freq
luikd - 1 freq
lauchd - 1 freq
laucht - 22 freq
lust - 20 freq
lukkit - 6 freq
liggit - 22 freq
lochhead - 24 freq
loused - 5 freq
liket - 4 freq
lykit - 5 freq
logged - 2 freq
lickit - 19 freq
lecht - 3 freq
lsd - 2 freq
lusty - 5 freq
licked - 10 freq
locate - 2 freq
'leist - 1 freq
lowsit - 15 freq
lustie - 1 freq
luist - 4 freq
lauchit - 3 freq
leist - 3 freq
legged - 10 freq
lyket - 7 freq
leest - 12 freq
laggit - 1 freq
lokket - 3 freq
lced - 1 freq
looket - 36 freq
leekit - 1 freq
lauch't - 1 freq
legit - 4 freq
looke't - 1 freq
lees't - 1 freq
leeket - 1 freq
lookoot' - 1 freq
leggid - 1 freq
liqwid - 2 freq
laced - 6 freq
lacht - 1 freq
lochaidh - 1 freq
lowest - 10 freq
loacked - 4 freq
lcid - 1 freq
leuked - 23 freq
leased - 2 freq
look-oot - 5 freq
lukt - 69 freq
lichtie - 2 freq
legat - 1 freq
lik'd - 4 freq
leukid - 2 freq
lockt - 5 freq
likt - 9 freq
laught - 1 freq
laughit - 1 freq
lookoot - 1 freq
lyke-white - 1 freq
luk-oot - 1 freq
luked - 14 freq
luk'ed - 1 freq
'lusty - 1 freq
licht' - 4 freq
loost - 8 freq
liecht - 1 freq
leicht - 2 freq
laaght - 1 freq
lucid - 3 freq
laekit - 18 freq
liicht - 1 freq
'light' - 1 freq
lukk'd - 22 freq
look't - 1 freq
lugged - 5 freq
lyked - 3 freq
leised - 9 freq
lowssit - 1 freq
laast - 8 freq
lockid - 1 freq
lizzie'd - 1 freq
laste - 2 freq
laached - 22 freq
luggit - 8 freq
lyght - 2 freq
læk'it - 1 freq
lass'at - 1 freq
lached - 5 freq
leiquit - 1 freq
loest - 1 freq
loessit - 1 freq
lukkt - 1 freq
lachatdee - 1 freq
lack-a-day - 2 freq
lochheid - 1 freq
locket - 5 freq
last' - 1 freq
lecked - 6 freq
lekd - 1 freq
liquit - 1 freq
lugshot - 1 freq
lyeukit - 10 freq
luckit - 1 freq
loasst - 1 freq
'lest - 1 freq
lykt - 1 freq
liquidy - 1 freq
lycht - 3 freq
lows't - 1 freq
leggit - 1 freq
leuched - 1 freq
leuchit - 4 freq
likkit - 2 freq
laichit - 1 freq
lawest - 1 freq
“last - 2 freq
loggit - 2 freq
ligged - 1 freq
luckt - 2 freq
laked - 1 freq
loosed - 1 freq
lukkid - 1 freq
lasswade - 2 freq
lichty - 1 freq
‘last - 2 freq
lochead - 3 freq
leashed - 1 freq
loaked - 1 freq
lueked - 2 freq
luc't - 6 freq
luct - 1 freq
lacked - 1 freq
leeched - 1 freq
lackit - 1 freq
licht- - 1 freq
“loshty - 3 freq
leaked - 3 freq
lucked - 2 freq
leukked - 4 freq
luekit - 2 freq
‘licht - 1 freq
luckked - 1 freq
likeet - 2 freq
lookeet - 1 freq
lestie - 1 freq
'lost - 1 freq
“loused” - 1 freq
lcxdi - 1 freq
lst - 1 freq
lswdd - 1 freq
leaskyht - 4 freq
least' - 1 freq
lqd - 1 freq
ljt - 1 freq
lightee - 1 freq
lukewatt - 1 freq
likid - 1 freq
MetaPhone code - LFT
left - 1591 freq
luved - 70 freq
loved - 190 freq
light - 300 freq
laughed - 97 freq
lived - 150 freq
lift - 496 freq
leived - 11 freq
lft - 2 freq
leeft - 11 freq
loft - 13 freq
leeved - 55 freq
luft - 2 freq
life'd - 1 freq
laft - 27 freq
leave't - 2 freq
luvit - 1 freq
leviet - 1 freq
leevit - 4 freq
leyft - 1 freq
laift - 1 freq
lyft - 2 freq
lofty - 4 freq
livit - 2 freq
'lift - 2 freq
lufit - 1 freq
laught - 1 freq
laughit - 1 freq
luvved - 4 freq
levite - 2 freq
lïft - 32 freq
livid - 4 freq
laaght - 1 freq
levied - 1 freq
luved- - 1 freq
left' - 2 freq
'light' - 1 freq
levit - 2 freq
lifit - 2 freq
lyght - 2 freq
luift - 3 freq
leftie - 1 freq
leivit - 3 freq
leift - 1 freq
lovit - 2 freq
luiffed - 2 freq
left’ - 1 freq
lefty - 2 freq
livet - 1 freq
lightee - 1 freq
Manually curated - lemma LIGHT
licht - 909 freq
light - 300 freq
lights - 104 freq
lighting - 2 freq
lightin - 11 freq
lichts - 180 freq
lighted - 3 freq
lichted - 14 freq
lichter - 8 freq
lichten - 8 freq
lit - 283 freq
Time to execute Levenshtein function - 0.247733 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
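The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming method with unit costs, not the code behind this page; the function name `levenshtein` is chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit cost for replace, insert and delete."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the first i-1 chars of a and first j of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("light", "licht", "lit", "laigh"):
        print(word, levenshtein("light", word))
```

Run against the query word, this gives 0 for light, 1 for licht and 2 for lit and laigh, matching the bracketed figures in the Levenshtein list.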
Time to execute Double Levenshtein function - 0.367908 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
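One way to read that description is sketched below: compute the distance on the full words, compute it again on the words with vowels stripped, and sum the two. The helper names are hypothetical, and the vowel set is an assumption. Here y is counted as a vowel because that reproduces the listed score of 1 for lyght; the page does not say which letters it strips.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance with unit costs (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete
                current[j - 1] + 1,            # insert
                previous[j - 1] + (ca != cb),  # replace or keep
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus consonant-skeleton distance."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("lyght", "alight", "licht", "wight", "laught"):
        print(word, double_levenshtein("light", word))
```

Under these assumptions lyght and alight score 1, because only vowels differ, while licht and wight score 2, because the changed consonant is counted in both passes.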
Time to execute SoundEx function - 0.028103 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
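To show why such different spellings share the code L230, here is a minimal sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an illustration only and may differ in edge cases from whatever implementation the page calls; the names `CODES` and `soundex` are chosen for the example.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if there are no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped and do not separate equal digits
        digit = CODES.get(ch, "")  # vowels and y map to ""
        if digit and digit != prev:
            code += digit
            if len(code) == 4:
                break
        prev = digit  # a vowel resets prev, so a repeated digit can recur
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("light", "licht", "looked", "last", "lochhead"):
        print(word, soundex(word))
```

All five sample words come out as L230: the vowels and h drop out, leaving L, one letter from the 2 group (c, g, k, s) and one from the 3 group (d, t).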
Time to execute MetaPhone function - 0.036997 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000922 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
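The lookup described above amounts to a dictionary from word form to lemma. The sketch below uses the forms listed under LIGHT on this page. The table layout and the names `LEMMA_OF` and `variants_of` are assumptions for illustration, not the site's actual data structure.

```python
# Hand-curated table: word form -> lemma (only the LIGHT entry is shown here).
LEMMA_OF = {
    form: "LIGHT"
    for form in ("licht", "light", "lights", "lighting", "lightin", "lichts",
                 "lighted", "lichted", "lichter", "lichten", "lit")
}


def variants_of(word: str) -> list[str]:
    """Return every form sharing the word's lemma, or [] if the word is not covered."""
    lemma = LEMMA_OF.get(word.lower())
    if lemma is None:
        return []  # not all words are covered by the lexicon
    return [form for form, target in LEMMA_OF.items() if target == lemma]


if __name__ == "__main__":
    print(variants_of("lit"))
    print(variants_of("sonsie"))
```

A single dictionary lookup needs no string comparison at all, which is consistent with this being the fastest of the five timings reported above.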