A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ia in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ia (0) - 5 freq
dia (1) - 2 freq
il (1) - 39 freq
ba (1) - 139 freq
ira (1) - 7 freq
la (1) - 116 freq
va (1) - 5 freq
ita (1) - 10 freq
is (1) - 18023 freq
'a (1) - 282 freq
ipa (1) - 1 freq
ra (1) - 12 freq
mia (1) - 8 freq
ix (1) - 15 freq
ga (1) - 29 freq
pa (1) - 27 freq
ja (1) - 7 freq
a (1) - 1 freq
pia (1) - 3 freq
iu (1) - 2 freq
tia (1) - 2 freq
nia (1) - 1 freq
iw (1) - 2 freq
na (1) - 788 freq
ic (1) - 6 freq
ia (0) - 5 freq
iya (1) - 1 freq
a (1) - 91162 freq
oa (1) - 14 freq
ya (1) - 473 freq
aa (1) - 7091 freq
iy (1) - 3 freq
i (1) - 18446 freq
ie (1) - 40 freq
ii (1) - 69 freq
ua (1) - 3 freq
iu (1) - 2 freq
io (1) - 7 freq
ea (1) - 4 freq
iae (1) - 5 freq
iq (2) - 4 freq
ai (2) - 29 freq
yo (2) - 15 freq
ig (2) - 16 freq
yu (2) - 40 freq
iak (2) - 5 freq
eo (2) - 6 freq
yea (2) - 129 freq
ye (2) - 20449 freq
wa (2) - 145 freq
SoundEx code - I000
i - 18446 freq
'i - 531 freq
ii - 69 freq
ih' - 1 freq
iii - 31 freq
i' - 347 freq
ioo - 1 freq
ihe - 2 freq
-i - 1 freq
i- - 1 freq
-'i - 1 freq
io - 7 freq
iae - 5 freq
i'we - 1 freq
ie - 40 freq
iho - 1 freq
ih - 81 freq
iwa - 3 freq
«i - 1 freq
ia - 5 freq
- 1 freq
iowa - 3 freq
i-a - 1 freq
éi - 1 freq
'ie - 1 freq
iwo - 1 freq
'i' - 3 freq
i'wee - 1 freq
i'wa - 1 freq
i'wie - 10 freq
i'waa - 1 freq
iy - 3 freq
i - 1 freq
-ie - 8 freq
þið - 1 freq
i - 99 freq
i - 191 freq
iye - 10 freq
i - 1 freq
i - 1 freq
i - 1 freq
i - 12 freq
- 1 freq
“i - 6 freq
‘i - 1 freq
iya - 1 freq
“i” - 1 freq
”i - 1 freq
ihy - 1 freq
ieh - 1 freq
iaw - 1 freq
iu - 2 freq
iio - 1 freq
iyuyh - 1 freq
iw - 2 freq
MetaPhone code - I
i - 18446 freq
'i - 531 freq
ii - 69 freq
ih' - 1 freq
iii - 31 freq
i' - 347 freq
ioo - 1 freq
-i - 1 freq
i- - 1 freq
-'i - 1 freq
io - 7 freq
iae - 5 freq
ie - 40 freq
ih - 81 freq
«i - 1 freq
ia - 5 freq
- 1 freq
i-a - 1 freq
éi - 1 freq
'ie - 1 freq
'i' - 3 freq
iy - 3 freq
i - 1 freq
-ie - 8 freq
þið - 1 freq
i - 99 freq
i - 191 freq
i - 1 freq
i - 1 freq
i - 1 freq
i - 12 freq
- 1 freq
“i - 6 freq
‘i - 1 freq
“i” - 1 freq
”i - 1 freq
ihy - 1 freq
ieh - 1 freq
iaw - 1 freq
iu - 2 freq
iio - 1 freq
iw - 2 freq
IA
Time to execute Levenshtein function - 0.179799 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.328439 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034130 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.045195 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000841 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.