A Corpus of 21st Century Scots Texts

Levenshtein Distance

Similar words to ir in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (edit distance in brackets)
ir (0) - 1540 freq
il (1) - 39 freq
iz (1) - 403 freq
“r (1) - 1 freq
'r (1) - 1 freq
ar (1) - 208 freq
mr (1) - 1243 freq
ior (1) - 1 freq
tir (1) - 4 freq
im (1) - 370 freq
hir (1) - 1279 freq
irn (1) - 35 freq
bir (1) - 2 freq
lr (1) - 5 freq
wir (1) - 2162 freq
kr (1) - 4 freq
dr (1) - 199 freq
iq (1) - 4 freq
ira (1) - 7 freq
ic (1) - 6 freq
eir (1) - 18 freq
ij (1) - 2 freq
vr (1) - 5 freq
pr (1) - 21 freq
ird (1) - 1 freq
Double Levenshtein (summed distance in brackets)
ir (0) - 1540 freq
air (1) - 970 freq
ur (1) - 541 freq
iry (1) - 1 freq
r (1) - 446 freq
eir (1) - 18 freq
ier (1) - 1 freq
yir (1) - 1337 freq
ire (1) - 9 freq
or (1) - 9206 freq
er (1) - 627 freq
iar (1) - 2 freq
ira (1) - 7 freq
iri (1) - 1 freq
yr (1) - 16 freq
ar (1) - 208 freq
ior (1) - 1 freq
oer (2) - 238 freq
yer (2) - 7938 freq
kr (2) - 4 freq
ora (2) - 1 freq
ure (2) - 5 freq
yur (2) - 221 freq
ry (2) - 7 freq
aer (2) - 67 freq
SoundEx code - I600
ir - 1540 freq
irae - 1 freq
'ir - 11 freq
ire - 9 freq
iry - 1 freq
ihere - 6 freq
ihor - 1 freq
ira - 7 freq
ir' - 4 freq
iwer - 4 freq
æir - 1 freq
i'air - 7 freq
i'war - 1 freq
Ÿiyor - 1 freq
ihre - 1 freq
’ir - 5 freq
Éire - 2 freq
“ir - 2 freq
‘irie - 1 freq
‘ir - 1 freq
irr - 2 freq
irw - 1 freq
ier - 1 freq
iar - 2 freq
iri - 1 freq
irh - 1 freq
ior - 1 freq
MetaPhone code - IR
ir - 1540 freq
irae - 1 freq
'ir - 11 freq
ire - 9 freq
iry - 1 freq
ira - 7 freq
ir' - 4 freq
æir - 1 freq
i'air - 7 freq
ihre - 1 freq
’ir - 5 freq
Éire - 2 freq
“ir - 2 freq
‘irie - 1 freq
‘ir - 1 freq
irr - 2 freq
irw - 1 freq
ier - 1 freq
iar - 2 freq
iri - 1 freq
irh - 1 freq
ior - 1 freq
Manually curated - IR
am - 1527 freq
be - 14795 freq
wis - 27947 freq
is - 18023 freq
been - 5087 freq
bön - 23 freq
are - 5053 freq
were - 4054 freq
wir - 2162 freq
was - 3361 freq
wus - 1426 freq
wur - 1596 freq
bein - 1744 freq
being - 296 freq
wiz - 1272 freq
wes - 1816 freq
war - 1438 freq
bin - 954 freq
bes - 315 freq
ir - 1540 freq
re - 48 freq
isnae - 544 freq
wisnae - 999 freq
wisna - 1037 freq
ur - 541 freq
isna - 390 freq
Time to execute Levenshtein function - 0.172283 milliseconds
The Levenshtein distance is the number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
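The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not necessarily the function this site runs; the name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


# Words taken from the results list for "ir".
for word in ("ir", "wir", "il", "r"):
    print(word, levenshtein("ir", word))
```

Each of wir, il and r is one edit away from ir (one insertion, one replacement and one deletion respectively), which agrees with the bracketed values in the Levenshtein results.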
Time to execute Double Levenshtein function - 0.295402 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
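A rough sketch of that idea in Python follows. The helper names and the vowel set are assumptions made for this example; in particular, counting y as a vowel is a guess, chosen because it reproduces the distance of 1 listed for yir in the results.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("air", "yir", "kr", "yer"):
    print(word, double_levenshtein("ir", word))
```

Under these assumptions a vowel change such as ir to air costs 1, while a consonant change such as ir to kr costs 2, because the consonant difference is counted in both passes.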
Time to execute SoundEx function - 0.027573 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
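As an illustration, here is a minimal Python sketch of the classic American Soundex rules (first letter plus three digits). It is not necessarily the exact routine used to build the list above; skipping everything outside A-Z is an assumption made here so that tokens with quotes or accented letters still receive a code.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if there are no A-Z letters."""
    # Assumption: characters outside A-Z (quotes, accented letters) are ignored.
    letters = [ch for ch in word.upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


for word in ("ir", "ihere", "iwer", "Éire"):
    print(word, soundex(word))
```

With these rules ir, ihere, iwer and Éire all map to I600, the code shown in the SoundEx results.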
Time to execute MetaPhone function - 0.037169 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001012 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
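A lookup of this kind might be sketched in Python as below. The table layout, the `LEXICON` and `variants` names, and the handful of entries are hypothetical, with the entries borrowed from the IR list on this page; the real lexicon is much larger and may be stored differently.

```python
# Hypothetical excerpt: each spelling points at its lemma.
LEXICON = {
    "ir": "IR",
    "ur": "IR",
    "wir": "IR",
    "wis": "IR",
    "isnae": "IR",
}


def variants(word: str, lexicon: dict) -> list:
    """Return every spelling that shares a lemma with word, or [] if uncovered."""
    lemma = lexicon.get(word.lower())
    if lemma is None:
        return []  # not all words are covered by the hand-made table
    return sorted(w for w, l in lexicon.items() if l == lemma)


print(variants("ir", LEXICON))
print(variants("ablow", LEXICON))
```

A dictionary lookup involves no per-word distance computation, which fits with this method reporting the shortest execution time of the five.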