A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example: ahint

Similar words to ilka-day in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
ilka-day (0) - 20 freq
ilkae-day (1) - 1 freq
ilkaday (1) - 37 freq
ilkabody (3) - 2 freq
ilkaday's (3) - 1 freq
ilk-day's (3) - 2 freq
landan (4) - 2 freq
laddy (4) - 1 freq
the-day (4) - 32 freq
-day (4) - 2 freq
blue-ray (4) - 1 freq
ilka (4) - 887 freq
leadan (4) - 5 freq
iliad (4) - 3 freq
one-day (4) - 1 freq
polka-dot (4) - 1 freq
holy-day (4) - 1 freq
ilkane's (4) - 1 freq
yalday (4) - 1 freq
iskander (4) - 70 freq
sanday (4) - 13 freq
d-day (4) - 3 freq
b-day (4) - 1 freq
a'day (4) - 7 freq
ilkae (4) - 4 freq
Double Levenshtein (distance in parentheses)
ilka-day (0) - 20 freq
ilkae-day (1) - 1 freq
ilkaday (2) - 37 freq
ilkabody (4) - 2 freq
ilk-day's (5) - 2 freq
ilk-ane (5) - 1 freq
ilky (6) - 61 freq
wik-days (6) - 1 freq
ilkka (6) - 1 freq
ilkane (6) - 51 freq
ae-day (6) - 1 freq
ilkley (6) - 1 freq
lady (6) - 163 freq
wikkday (6) - 1 freq
loady (6) - 1 freq
ildtdai (6) - 1 freq
loadda (6) - 1 freq
landy (6) - 9 freq
alkahoal (6) - 1 freq
loada (6) - 3 freq
to-day (6) - 1 freq
ill-dain (6) - 1 freq
low-pay (6) - 1 freq
daily-day (6) - 8 freq
laada (6) - 2 freq
SoundEx code - I423
ilk-ither - 1 freq
illicit - 5 freq
ilka-day - 20 freq
ill-luckit - 4 freq
ilkither - 6 freq
ilkaday - 37 freq
ill-set - 4 freq
ill-got - 1 freq
ill-staured - 1 freq
ill-gotten - 1 freq
illustrator - 3 freq
ill-gaits - 1 freq
illustrate - 14 freq
illustrations - 8 freq
illustration - 6 freq
illustrative - 2 freq
illustrated - 10 freq
ill-likit - 1 freq
ill-gattit - 1 freq
ill-uised - 1 freq
illluckit - 1 freq
ilk-day's - 2 freq
illustratours - 1 freq
ielektrisitie - 1 freq
i'licht - 3 freq
ill-lichts - 1 freq
ill-setten - 3 freq
ill-used - 1 freq
illustrates - 4 freq
illustrious - 1 freq
illgettins - 1 freq
ilkae-day - 1 freq
ill-gaited - 1 freq
illustratit - 8 freq
illustrater - 1 freq
ill-yokit - 1 freq
illustratin - 1 freq
ill-yaised - 1 freq
illegitimacy - 1 freq
ilikeduggee - 1 freq
ilkaday's - 1 freq
MetaPhone code - ILKT
ilka-day - 20 freq
ilkaday - 37 freq
ill-got - 1 freq
illluckit - 1 freq
ilkae-day - 1 freq
Manually curated - ILKA-DAY
ilka - 887 freq
ilk - 317 freq
ilkie - 92 freq
ilky - 61 freq
ilkane - 51 freq
ilkaday - 37 freq
ilka-day - 20 freq
ilk'ane - 6 freq
ilkae - 4 freq
ilkanither - 3 freq
ilke - 2 freq
ilkabody - 2 freq
ilk-day's - 2 freq
Time to execute Levenshtein function - 0.177728 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
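The definition above can be sketched in a few lines of Python. This is a textbook dynamic-programming version, not the code this site runs; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character inserts, deletes and replacements
    needed to turn word a into word b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletes
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("ilka-day", "ilkaday"))   # 1: drop the hyphen
print(levenshtein("ilka-day", "ilkabody"))  # 3: "-da" becomes "bod"
```

Both values agree with the distances listed for ilkaday and ilkabody in the Levenshtein results above.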
Time to execute Double Levenshtein function - 0.397440 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
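A minimal sketch of that idea follows. It assumes the vowels are a, e, i, o and u (so y counts as a consonant) and that the first pass uses the unmodified words; neither detail is confirmed by the page, and the names `strip_vowels` and `double_levenshtein` are illustrative.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace), row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove vowels, keeping consonants, hyphens and apostrophes."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("ilka-day", "ilkae-day"))  # 1 + 0 = 1
print(double_levenshtein("ilka-day", "ilkaday"))    # 1 + 1 = 2
```

Under these assumptions a vowel-only variant such as ilkae-day keeps its plain distance, while a consonant or hyphen change is counted twice, which is consistent with the Double Levenshtein figures listed above.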
Time to execute SoundEx function - 0.031452 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
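To make the encoding concrete, here is a sketch of classic American Soundex. It is an assumption that the site uses this variant; the sketch ignores non-letters such as hyphens and apostrophes, and the function name `soundex` is illustrative.

```python
# Consonant groups that share a digit in American Soundex.
GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
CODES = {letter: digit for digit, letters in GROUPS.items() for letter in letters}


def soundex(word: str) -> str:
    """Return the 4-character Soundex code: first letter plus three digits."""
    letters = [c for c in word.lower() if c.isalpha()]  # drop hyphens etc.
    if not letters:
        return ""
    encoded = letters[0].upper()
    last = CODES.get(letters[0], "")  # code of the previous letter
    for c in letters[1:]:
        code = CODES.get(c, "")       # vowels, h, w and y have no code
        if code and code != last:
            encoded += code           # new consonant sound
        if c not in "hw":
            last = code               # h and w do not separate equal codes
        if len(encoded) == 4:
            break
    return encoded.ljust(4, "0")      # pad short codes with zeros


print(soundex("ilka-day"))  # I423
print(soundex("illicit"))   # I423
```

Because only the first three consonant sounds after the initial letter are kept, unrelated words such as illicit and illustrate share the code I423 with ilka-day, which explains the long SoundEx list above.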
Time to execute MetaPhone function - 0.037311 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000967 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
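A lookup of this kind might be sketched as a plain dictionary. The table below is a hypothetical four-entry sample built from words in the curated list above; the real lexicon, its storage format, and the names `LEXICON` and `curated_neighbours` are assumptions.

```python
from collections import defaultdict

# Hypothetical sample: each spelling points at its hand-assigned headword.
LEXICON = {
    "ilka-day": "ILKA-DAY",
    "ilkaday": "ILKA-DAY",
    "ilk-day's": "ILKA-DAY",
    "ilkabody": "ILKA-DAY",
}

# Reverse index: headword -> every spelling filed under it.
BY_LEMMA = defaultdict(list)
for spelling, lemma in LEXICON.items():
    BY_LEMMA[lemma].append(spelling)


def curated_neighbours(word: str) -> list[str]:
    """Return all spellings sharing the word's headword, or [] if uncovered."""
    lemma = LEXICON.get(word.lower())
    if lemma is None:
        return []  # not every word is in the hand-made table
    return list(BY_LEMMA[lemma])


print(curated_neighbours("ilkaday"))
print(curated_neighbours("ahint"))  # [] because it is not in this sample
```

A dictionary lookup needs no string comparison at all, which fits this method having by far the shortest execution time reported above.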