A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to reggaeheaven in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
reggaeheaven (0) - 1 freq
remaineen (6) - 1 freq
newheiven (6) - 1 freq
reggae (6) - 1 freq
whitehaven (6) - 1 freq
eleeven (6) - 35 freq
reheatin (6) - 1 freq
reeseeved (6) - 1 freq
buchanhaven (6) - 1 freq
breatheen (6) - 1 freq
'heaven (6) - 1 freq
recleeved (6) - 1 freq
reachan (6) - 3 freq
releeased (6) - 1 freq
re-leevin (6) - 1 freq
reappeared (6) - 1 freq
regalate (6) - 1 freq
regressive (6) - 2 freq
stonehaven (6) - 11 freq
teacheen (6) - 1 freq
raedeen (6) - 1 freq
eeleeven (6) - 1 freq
receever (6) - 1 freq
riggeen (6) - 1 freq
eggheads (6) - 1 freq
reggaeheaven (0) - 1 freq
longhaven (8) - 2 freq
riggeen (8) - 1 freq
stonehaven (9) - 11 freq
re-leevin (9) - 1 freq
gaethereen (9) - 2 freq
belhaven (9) - 2 freq
ruthven (9) - 6 freq
rgohev (9) - 1 freq
riggy-bane (9) - 2 freq
reachan (9) - 3 freq
heaven (9) - 69 freq
eggheads (9) - 1 freq
reheatin (9) - 1 freq
'heaven (9) - 1 freq
newheiven (9) - 1 freq
whitehaven (9) - 1 freq
reggae (9) - 1 freq
orgreave (10) - 2 freq
buckhaven (10) - 1 freq
regretin (10) - 1 freq
misbehavin (10) - 1 freq
reguerdon (10) - 1 freq
aggrieved (10) - 4 freq
relievin (10) - 3 freq
SoundEx code - R215
responsibility - 44 freq
raspin - 5 freq
responsible - 36 freq
response - 47 freq
rigbane' - 1 freq
rigbane - 7 freq
responsal - 6 freq
receivin - 7 freq
respin - 1 freq
rig-bane - 2 freq
respondit - 6 freq
respon - 1 freq
reshapin - 1 freq
responded - 4 freq
respondin - 1 freq
respond - 24 freq
responds - 6 freq
responsibilitie - 1 freq
responses - 12 freq
'rejuvenate - 1 freq
rock-boond - 2 freq
responsibeelity - 3 freq
raspan - 1 freq
responsibilities - 9 freq
recipients - 3 freq
receiving - 4 freq
rigbanes - 3 freq
recipient - 5 freq
riggy-bane - 2 freq
responsorial - 1 freq
respondents - 3 freq
responsibeility - 1 freq
responsibeilities - 1 freq
responsibeilitie - 1 freq
responsibly - 1 freq
responsibeelitie - 2 freq
rig-banes - 1 freq
rispin - 1 freq
responsinle - 1 freq
respondent - 2 freq
respone - 1 freq
responsiebility - 1 freq
response' - 1 freq
rejuvenation - 2 freq
respons - 1 freq
rockupmaspikkersmin - 1 freq
reggaeheaven - 1 freq
rexchapman - 1 freq
responding - 1 freq
riseupmelbourne - 1 freq
MetaPhone code - RKHFN
wrack-hofn - 2 freq
reggaeheaven - 1 freq
REGGAEHEAVEN
Time to execute Levenshtein function - 0.297043 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.398857 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027206 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038325 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000870 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.