A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to leprechaun in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
leprechaun (0) - 3 freq
leprechauns (1) - 1 freq
'leprechaun (1) - 2 freq
aforehaun (4) - 7 freq
nearer-haun (4) - 1 freq
nerr-haun (4) - 1 freq
approchan (4) - 1 freq
depressan (4) - 1 freq
meechan (4) - 2 freq
meerschaum (4) - 4 freq
beforehaun (4) - 1 freq
merchan (4) - 3 freq
pechan (4) - 2 freq
afore-haun (4) - 1 freq
lenchan (4) - 1 freq
leechin (4) - 1 freq
efterhaun (4) - 22 freq
sprechin (4) - 1 freq
freehaun (4) - 1 freq
left-haun (4) - 9 freq
sprechen (4) - 1 freq
near-haun (4) - 5 freq
nearhaun (4) - 69 freq
kerrycann (5) - 1 freq
sprechs (5) - 2 freq
leprechaun (0) - 3 freq
'leprechaun (2) - 2 freq
leprechauns (2) - 1 freq
approchan (5) - 1 freq
sprechin (5) - 1 freq
sprechen (5) - 1 freq
lenchan (6) - 1 freq
leechin (6) - 1 freq
preichin (6) - 1 freq
pechan (6) - 2 freq
merchan (6) - 3 freq
preachin (6) - 52 freq
lurchin (6) - 1 freq
a-pechin (7) - 1 freq
mairchan (7) - 3 freq
greachan (7) - 5 freq
spreched (7) - 1 freq
seyrchen (7) - 1 freq
screchin (7) - 1 freq
praichin (7) - 12 freq
approachin (7) - 21 freq
reachan (7) - 3 freq
raechan (7) - 1 freq
punchan (7) - 1 freq
enrichin (7) - 1 freq
SoundEx code - L162
luvers - 12 freq
lippers - 9 freq
lipperous - 1 freq
laboryus - 1 freq
lea-perk - 2 freq
labours - 5 freq
laverocks - 5 freq
lappers - 1 freq
liver's - 2 freq
lovers - 20 freq
liberace's - 2 freq
leverage - 4 freq
leverock - 1 freq
levers - 6 freq
leavers - 1 freq
leprosie - 8 freq
lepers - 4 freq
laverock's - 2 freq
lipprosy - 1 freq
liprosy - 2 freq
lavroos - 1 freq
luver's - 2 freq
leprosy - 1 freq
'leprechaun - 2 freq
leprechauns - 1 freq
leprechaun - 3 freq
laverock - 14 freq
laveroks - 1 freq
liver-spotted - 1 freq
laebrack - 1 freq
lapraik - 1 freq
lawburrows - 14 freq
laevrick - 2 freq
laborious - 2 freq
lawborrowis - 6 freq
lawborrouis - 1 freq
law-borowis - 1 freq
law-borrowis - 1 freq
lawbawrous - 1 freq
lawbarrowis - 1 freq
lauborris - 1 freq
lawborch - 1 freq
laebrak - 1 freq
laverick - 1 freq
lawbors - 1 freq
laabours - 1 freq
laphroaig - 2 freq
low-prestige - 2 freq
luve-arras - 1 freq
luvvers - 1 freq
louvres - 2 freq
livers - 1 freq
€œleverage - 1 freq
leipers - 2 freq
liveries - 1 freq
loversaberdeen - 1 freq
law-breaking - 1 freq
labourrichard - 2 freq
lavrock's - 1 freq
MetaPhone code - LPRXN
'leprechaun - 2 freq
leprechaun - 3 freq
LEPRECHAUN
Time to execute Levenshtein function - 0.241383 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.462005 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037248 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038166 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000952 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.