A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to marry in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
marry (0) - 11 freq
arry (1) - 1 freq
barry (1) - 33 freq
marra (1) - 19 freq
harry (1) - 186 freq
marr (1) - 5 freq
merry (1) - 69 freq
mearry (1) - 1 freq
garry (1) - 10 freq
darry (1) - 1 freq
tarry (1) - 20 freq
marr' (1) - 1 freq
carry (1) - 45 freq
larry (1) - 105 freq
mairy (1) - 7 freq
sarry (1) - 2 freq
mary (1) - 763 freq
mirry (1) - 8 freq
marky (1) - 1 freq
parry (1) - 1 freq
mairry (1) - 43 freq
sairy (2) - 5 freq
matty (2) - 54 freq
haary (2) - 3 freq
marrow (2) - 32 freq
marry (0) - 11 freq
merry (1) - 69 freq
mearry (1) - 1 freq
mairry (1) - 43 freq
marr (1) - 5 freq
mirry (1) - 8 freq
marra (1) - 19 freq
morra (2) - 76 freq
parry (2) - 1 freq
marrae (2) - 8 freq
moarra (2) - 1 freq
merro (2) - 1 freq
murray (2) - 96 freq
morro (2) - 18 freq
murr (2) - 1 freq
marky (2) - 1 freq
merr (2) - 6 freq
mirr (2) - 3 freq
mirra (2) - 3 freq
tarry (2) - 20 freq
mary (2) - 763 freq
carry (2) - 45 freq
darry (2) - 1 freq
garry (2) - 10 freq
arry (2) - 1 freq
SoundEx code - M600
mair - 6074 freq
mere - 24 freq
muir - 77 freq
more - 461 freq
mary - 763 freq
mr - 1243 freq
marrae - 8 freq
merry - 69 freq
morra - 76 freq
mour - 4 freq
mar - 16 freq
mare - 190 freq
maire - 146 freq
mora - 8 freq
mer - 8 freq
moor - 29 freq
mera - 2 freq
'mere - 1 freq
more-' - 1 freq
'mr - 16 freq
maria - 8 freq
maireh - 3 freq
marrow - 32 freq
maori - 2 freq
marie - 212 freq
moira - 88 freq
merrie - 7 freq
mire - 14 freq
'mair - 7 freq
marra - 19 freq
mearry - 1 freq
mairy - 7 freq
mirrae - 1 freq
meir - 7 freq
moray - 80 freq
mairrie - 7 freq
murray - 96 freq
myrrh - 10 freq
myrh - 1 freq
mairry - 43 freq
mario - 10 freq
meer - 9 freq
mear - 3 freq
morrae - 2 freq
mower - 15 freq
mair' - 4 freq
morra- - 1 freq
marr - 5 freq
merro - 1 freq
mair-a - 1 freq
mareh - 3 freq
meyer - 5 freq
'muir - 1 freq
moir - 1 freq
maria' - 2 freq
mhòr' - 1 freq
morrow - 11 freq
more' - 1 freq
mirry - 8 freq
maer - 3 freq
merr - 6 freq
'mary' - 1 freq
'merry - 1 freq
maiur - 1 freq
marry - 11 freq
miair - 1 freq
'marie - 2 freq
mohair - 2 freq
moorie - 5 freq
moory - 3 freq
mor - 15 freq
mirr - 3 freq
meroo - 1 freq
mayor - 3 freq
mairi - 4 freq
mireio - 1 freq
mirrie - 7 freq
Ÿmur - 1 freq
màiri - 2 freq
mir - 5 freq
mowrie - 1 freq
marr' - 1 freq
'mehr - 1 freq
€™mere - 9 freq
€˜more - 2 freq
mearð - 1 freq
€œmair - 4 freq
moore - 2 freq
mairry' - 1 freq
myra - 14 freq
myre - 2 freq
mairie - 1 freq
mòr - 2 freq
marye - 1 freq
'meerie - 6 freq
muwer - 1 freq
€œmary - 2 freq
€˜mair - 6 freq
€˜mere - 1 freq
€˜mr - 7 freq
€˜mare - 2 freq
€œmr - 5 freq
morra' - 1 freq
maara - 1 freq
€œmoira - 1 freq
mirra - 3 freq
mere' - 1 freq
moar - 3 freq
mrrow - 1 freq
moarra - 1 freq
€™mr - 1 freq
mhairi - 6 freq
mayer - 1 freq
mrei - 1 freq
morro - 18 freq
murr - 1 freq
maree - 1 freq
mehr - 1 freq
mri - 1 freq
‘mary - 1 freq
MetaPhone code - MR
mair - 6074 freq
mere - 24 freq
muir - 77 freq
more - 461 freq
mary - 763 freq
mr - 1243 freq
marrae - 8 freq
merry - 69 freq
morra - 76 freq
mour - 4 freq
mar - 16 freq
mare - 190 freq
maire - 146 freq
mora - 8 freq
mer - 8 freq
moor - 29 freq
mera - 2 freq
'mere - 1 freq
more-' - 1 freq
'mr - 16 freq
maria - 8 freq
maireh - 3 freq
marrow - 32 freq
maori - 2 freq
marie - 212 freq
moira - 88 freq
merrie - 7 freq
mire - 14 freq
'mair - 7 freq
marra - 19 freq
mearry - 1 freq
mairy - 7 freq
mirrae - 1 freq
meir - 7 freq
moray - 80 freq
mairrie - 7 freq
ymir - 3 freq
murray - 96 freq
myrrh - 10 freq
myrh - 1 freq
mairry - 43 freq
mario - 10 freq
meer - 9 freq
mear - 3 freq
morrae - 2 freq
mair' - 4 freq
morra- - 1 freq
marr - 5 freq
merro - 1 freq
mair-a - 1 freq
mareh - 3 freq
'muir - 1 freq
moir - 1 freq
maria' - 2 freq
mhòr' - 1 freq
morrow - 11 freq
more' - 1 freq
mirry - 8 freq
maer - 3 freq
merr - 6 freq
'mary' - 1 freq
'merry - 1 freq
maiur - 1 freq
marry - 11 freq
miair - 1 freq
'marie - 2 freq
moorie - 5 freq
moory - 3 freq
mor - 15 freq
mirr - 3 freq
meroo - 1 freq
mairi - 4 freq
mireio - 1 freq
mirrie - 7 freq
Ÿmur - 1 freq
màiri - 2 freq
mir - 5 freq
mowrie - 1 freq
marr' - 1 freq
'mehr - 1 freq
€™mere - 9 freq
€˜more - 2 freq
mearð - 1 freq
€œmair - 4 freq
moore - 2 freq
mairry' - 1 freq
myra - 14 freq
myre - 2 freq
mairie - 1 freq
mòr - 2 freq
'meerie - 6 freq
€œmary - 2 freq
€˜mair - 6 freq
€˜mere - 1 freq
€˜mr - 7 freq
€˜mare - 2 freq
€œmr - 5 freq
morra' - 1 freq
maara - 1 freq
€œmoira - 1 freq
mirra - 3 freq
mere' - 1 freq
moar - 3 freq
mrrow - 1 freq
moarra - 1 freq
€™mr - 1 freq
ymrh - 1 freq
mrei - 1 freq
morro - 18 freq
murr - 1 freq
maree - 1 freq
mehr - 1 freq
mri - 1 freq
‘mary - 1 freq
MARRY
Time to execute Levenshtein function - 0.174533 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.330005 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028077 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037006 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000818 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.