A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to ija in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated

Levenshtein
ija (0) - 1 freq
ita (1) - 10 freq
ja (1) - 7 freq
ijc (1) - 2 freq
ij (1) - 2 freq
fja (1) - 1 freq
ira (1) - 7 freq
isa (1) - 26 freq
ifa (1) - 4 freq
ina (1) - 99 freq
ia (1) - 5 freq
ida (1) - 127 freq
ipa (1) - 1 freq
'ja (1) - 1 freq
iya (1) - 1 freq
i-a (1) - 1 freq
xja (1) - 1 freq
ijz (1) - 1 freq
iwa (1) - 3 freq
iva (1) - 1 freq
ika (1) - 3 freq
ima (1) - 1 freq
iii (2) - 34 freq
a (2) - 1 freq
a (2) - 185 freq
Double Levenshtein
ija (0) - 1 freq
ja (1) - 7 freq
ij (1) - 2 freq
jae (2) - 2 freq
uj (2) - 3 freq
jay (2) - 7 freq
ajay (2) - 2 freq
aj (2) - 6 freq
ejo (2) - 1 freq
jo (2) - 29 freq
ita (2) - 10 freq
jaa (2) - 31 freq
je (2) - 10 freq
ojo (2) - 3 freq
oj (2) - 2 freq
ajya (2) - 1 freq
ej (2) - 1 freq
ju (2) - 6 freq
jai (2) - 1 freq
jy (2) - 2 freq
ji (2) - 7 freq
aij (2) - 2 freq
j (2) - 186 freq
ouija (2) - 7 freq
ida (2) - 127 freq
SoundEx code - I200
is - 18321 freq
'is - 151 freq
icy - 62 freq
ice - 254 freq
ika - 3 freq
iz - 434 freq
issue - 100 freq
ix - 15 freq
is-'oh - 5 freq
is- - 6 freq
iss - 480 freq
i's - 6 freq
isaiah - 20 freq
isa - 26 freq
'ic - 1 freq
is-oh - 3 freq
is-' - 1 freq
ike - 3 freq
'is' - 7 freq
ikea - 8 freq
ihese - 1 freq
ihis - 6 freq
iak - 5 freq
iki - 1 freq
ici - 2 freq
i'se - 27 freq
isie - 79 freq
ic's - 1 freq
'iwas - 1 freq
issy - 1 freq
iis - 1 freq
ik - 8 freq
iways - 17 freq
icey - 3 freq
ig - 16 freq
icu - 3 freq
''is - 1 freq
is-aw - 1 freq
iece - 1 freq
iik - 1 freq
is' - 2 freq
iiky - 1 freq
ious - 1 freq
ic - 6 freq
iika - 1 freq
'iss - 1 freq
isg - 1 freq
iss' - 1 freq
igg - 1 freq
'iz' - 3 freq
i'zoo - 3 freq
i'sea - 5 freq
i'hoose - 1 freq
i'high - 1 freq
i'queue - 1 freq
i'sky - 4 freq
iz' - 2 freq
ie's - 1 freq
iys - 2 freq
is-'e - 1 freq
ich - 4 freq
-ik - 1 freq
ise - 2 freq
'i'se - 1 freq
iss- - 1 freq
ii's - 1 freq
iii's - 1 freq
is - 22 freq
ichy - 1 freq
ies - 1 freq
iq - 4 freq
is - 27 freq
'isie - 1 freq
isiah - 1 freq
ikie - 1 freq
ish - 10 freq
ischew - 1 freq
is - 8 freq
igc - 1 freq
iws - 1 freq
ish - 2 freq
ies - 1 freq
'ice - 1 freq
i-chi - 2 freq
ick - 56 freq
iss - 2 freq
is - 1 freq
iyways - 1 freq
-is - 1 freq
is's - 2 freq
iwis - 1 freq
iss - 1 freq
iicq - 1 freq
isss - 1 freq
‘is - 12 freq
ize - 1 freq
iqk - 1 freq
iyq - 1 freq
ijz - 1 freq
iwq - 1 freq
ijc - 2 freq
iss… - 1 freq
iwise - 9 freq
ixy - 1 freq
iyjs - 1 freq
ikg - 1 freq
ij - 2 freq
iwxwu - 1 freq
izi - 2 freq
icw - 1 freq
’is - 1 freq
ija - 1 freq
izzy - 1 freq
“is - 2 freq
ikzxg - 1 freq
iugw - 1 freq
ios - 2 freq
is… - 1 freq
iqczc - 1 freq
iqz - 1 freq
iccy - 1 freq
iko - 1 freq
izoiw - 1 freq
izx - 1 freq
ixcz - 1 freq
iyz - 1 freq
igs - 1 freq
iek - 1 freq
ixsc - 1 freq
ixgc - 1 freq
MetaPhone code - IJ
ij - 2 freq
ija - 1 freq
Manually curated - IJA
Time to execute Levenshtein function - 0.199173 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
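The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not necessarily the function this site runs; the name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions
    or substitutions needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute ca with cb (or keep)
            ))
        previous = current
    return previous[-1]


for word in ("ija", "ita", "ja", "iii", "ouija"):
    print(word, levenshtein("ija", word))
```

The distances printed for these words match the values shown in brackets in the Levenshtein results for ija.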
Time to execute Double Levenshtein function - 0.355732 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
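A rough sketch of that idea follows. The helper names are hypothetical, and the vowel set is an assumption: treating y as a vowel alongside a, e, i, o, u reproduces the distances listed for ija (for example jay at 2), but the page does not state which letters it strips.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiouy") -> str:
    """Drop vowels; the default vowel set is an assumption."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("ita", "ja", "ouija", "jay"):
    print(word, double_levenshtein("ija", word))
```

A vowel-only change such as ija to ja costs 1, while a consonant change such as ija to ita costs 2, because the consonant difference is counted in both passes.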
Time to execute SoundEx function - 0.033270 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
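For illustration, here is a minimal sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent equal digits, and pad to four characters. It is an assumption that the site follows these rules exactly, and the function name `soundex` is chosen here for illustration.

```python
# Consonant groups that share a Soundex digit.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Four-character Soundex code; non-letters are ignored."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with equal codes
        digit = _CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets last, so a repeated code counts again
        if len(code) == 4:
            break
    return code.ljust(4, "0")


for word in ("ija", "is", "isaiah", "ikzxg"):
    print(word, soundex(word))
```

All four words encode to I200, which is why such different spellings appear together in the SoundEx results.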
Time to execute MetaPhone function - 0.038202 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000967 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
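A lookup of this kind might be sketched as a plain dictionary. The entries and the names `LEXICON` and `lookup_lemma` below are hypothetical placeholders, not taken from the site's actual lexicon.

```python
from typing import Dict, Optional

# Hypothetical entries for illustration only: variant spelling -> lemma.
LEXICON: Dict[str, str] = {
    "ablow": "ablow",
    "ablo": "ablow",
    "abloe": "ablow",
}


def lookup_lemma(word: str, lexicon: Dict[str, str] = LEXICON) -> Optional[str]:
    """Return the curated lemma for word, or None if it is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("Ablo"))  # a covered variant
print(lookup_lemma("ija"))   # not in this toy lexicon, so None
```

A single dictionary lookup does no string comparison against the rest of the corpus, which fits with this method having the shortest execution time reported above.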