A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to tarlan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
tarlan (0) - 4 freq
tartan (1) - 92 freq
tirlan (1) - 5 freq
tarland (1) - 4 freq
tarzan (1) - 4 freq
sailan (2) - 5 freq
tyrian (2) - 1 freq
farland (2) - 1 freq
fallan (2) - 4 freq
carlaw (2) - 12 freq
talkan (2) - 18 freq
baelan (2) - 2 freq
warran (2) - 4 freq
harlaw (2) - 4 freq
tailen (2) - 1 freq
tryan (2) - 32 freq
warnan (2) - 1 freq
barkan (2) - 2 freq
tampan (2) - 1 freq
turban (2) - 4 freq
taxman (2) - 2 freq
'tartan (2) - 1 freq
callan (2) - 3 freq
aran (2) - 5 freq
staran (2) - 6 freq
tarlan (0) - 4 freq
tirlan (1) - 5 freq
tirlin (2) - 13 freq
tarzan (2) - 4 freq
tartan (2) - 92 freq
tarland (2) - 4 freq
traalin (3) - 1 freq
harlin (3) - 3 freq
turnan (3) - 20 freq
trailin (3) - 27 freq
airlan (3) - 174 freq
tyran (3) - 1 freq
toilan (3) - 1 freq
mairlan (3) - 1 freq
carlin (3) - 6 freq
hurlan (3) - 4 freq
trailen (3) - 1 freq
birlan (3) - 18 freq
tarlair (3) - 7 freq
norlan (3) - 21 freq
marlon (3) - 1 freq
tarrin (3) - 1 freq
tarin (3) - 1 freq
traan (3) - 1 freq
curlan (3) - 2 freq
SoundEx code - T645
trailin - 27 freq
trilingual - 8 freq
truelins - 4 freq
thirlin-tae - 1 freq
trailing - 6 freq
tirlin - 13 freq
twirlin' - 1 freq
trailen - 1 freq
tirlan - 5 freq
traalin - 1 freq
thairlane - 4 freq
treelined - 1 freq
thirlins - 4 freq
thirlin - 7 freq
trewlins - 3 freq
twirlin - 5 freq
trellan - 2 freq
trawlin - 12 freq
tri-lingual - 1 freq
trillin - 1 freq
tree-lined - 2 freq
tarland - 4 freq
tarlan - 4 freq
trillions - 1 freq
trilingualism - 1 freq
trillion - 1 freq
twirling - 1 freq
trailin' - 1 freq
trolling - 3 freq
trollin - 1 freq
thrilling - 1 freq
therealmac - 1 freq
MetaPhone code - TRLN
trailin - 27 freq
drawlin - 7 freq
darlin - 54 freq
dirlin - 45 freq
droolin' - 1 freq
dreelin - 1 freq
tirlin - 13 freq
trailen - 1 freq
droolin - 3 freq
tirlan - 5 freq
'darlin - 1 freq
traalin - 1 freq
derlin - 1 freq
drillin - 13 freq
'darlin' - 1 freq
draalin - 2 freq
dirlan - 3 freq
trellan - 2 freq
trawlin - 12 freq
drawlan-e - 1 freq
drawlan - 1 freq
trillin - 1 freq
tarlan - 4 freq
dreillin - 1 freq
trillion - 1 freq
trailin' - 1 freq
trollin - 1 freq
TARLAN
Time to execute Levenshtein function - 0.188419 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.328250 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027826 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036894 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000809 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.