A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to region in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
region (0) - 28 freq
legion (1) - 26 freq
regions (1) - 20 freq
regional (2) - 106 freq
begin (2) - 104 freq
regina (2) - 1 freq
regime (2) - 10 freq
rein (2) - 14 freq
regiar (2) - 1 freq
reason (2) - 324 freq
ration (2) - 5 freq
remin (2) - 1 freq
redin (2) - 2 freq
regan (2) - 1 freq
rig-on (2) - 1 freq
regain (2) - 10 freq
aegin (2) - 1 freq
recon (2) - 1 freq
regi (2) - 24 freq
reign (2) - 15 freq
religion (2) - 56 freq
reckon (2) - 69 freq
legions (2) - 9 freq
regimen (2) - 3 freq
ragin (2) - 67 freq
region (0) - 28 freq
regan (2) - 1 freq
regain (2) - 10 freq
regina (2) - 1 freq
reign (2) - 15 freq
legion (2) - 26 freq
ragin (2) - 67 freq
regions (2) - 20 freq
repon (3) - 8 freq
reunion (3) - 6 freq
egon (3) - 1 freq
resin (3) - 6 freq
reggio (3) - 1 freq
origin (3) - 24 freq
orgon (3) - 4 freq
oreegin (3) - 4 freq
ragein (3) - 1 freq
ragan (3) - 1 freq
oreigin (3) - 1 freq
regimen (3) - 3 freq
rougin (3) - 1 freq
raign (3) - 3 freq
resign (3) - 2 freq
regiar (3) - 1 freq
reason (3) - 324 freq
SoundEx code - R250
risin - 137 freq
reason - 324 freq
rizzon - 8 freq
rackin - 8 freq
roseanna - 2 freq
reachin - 32 freq
raisin - 33 freq
raikin - 14 freq
risen - 17 freq
ruggin - 15 freq
resume - 6 freq
recaain - 1 freq
raison - 51 freq
ragin - 67 freq
reckon - 69 freq
riggin - 20 freq
russian - 37 freq
raxin - 61 freq
raeson - 28 freq
reekin - 45 freq
racin - 43 freq
rakin - 27 freq
reign - 15 freq
rooshian - 1 freq
region - 28 freq
reikin - 2 freq
rockin - 18 freq
riskin - 5 freq
racin' - 3 freq
reekin' - 2 freq
ragin' - 3 freq
raiken - 1 freq
reesen - 3 freq
regime - 10 freq
regain - 10 freq
ruggin' - 1 freq
roisin - 2 freq
rushin - 17 freq
reuken - 1 freq
risin' - 3 freq
reachin' - 1 freq
rijn - 1 freq
rashan - 2 freq
'rogan - 1 freq
raxxin - 13 freq
rousin - 5 freq
rage-in - 2 freq
recon - 1 freq
risan - 7 freq
ragein - 1 freq
rïggin - 1 freq
raisin' - 2 freq
rushin' - 1 freq
rikkin - 3 freq
row-chowin - 1 freq
rizzin - 1 freq
reesin - 1 freq
reckin - 5 freq
rockeen - 1 freq
rochian - 1 freq
rasin - 1 freq
raxan - 3 freq
rescuin - 1 freq
rekkin - 2 freq
riseen - 2 freq
raechan - 1 freq
raesan - 1 freq
raceen - 1 freq
rejoin - 3 freq
riggeen - 1 freq
raisan - 3 freq
requiem - 1 freq
resin - 6 freq
reasin - 2 freq
reachan - 3 freq
roussian - 1 freq
reukin - 1 freq
raign - 3 freq
ruisan - 1 freq
rauchan - 1 freq
ruisin - 2 freq
reezin - 1 freq
regína - 2 freq
regan - 1 freq
rösin - 1 freq
rejyne - 2 freq
rauchen - 1 freq
racine - 3 freq
ragan - 1 freq
rasmie - 10 freq
rig-on - 1 freq
rayson - 1 freq
rougin - 1 freq
roakin - 1 freq
€˜regime - 1 freq
roushian - 1 freq
regina - 1 freq
reakin' - 1 freq
rockin' - 1 freq
roqccym - 1 freq
raecomm - 2 freq
reken - 1 freq
reckn - 1 freq
rai-con - 1 freq
roughin - 1 freq
rgm - 1 freq
rajamie - 1 freq
rzm - 1 freq
rowson - 2 freq
rhhyjuan - 1 freq
ryzm - 1 freq
rycammy - 13 freq
reekan - 1 freq
MetaPhone code - RJN
ragin - 67 freq
region - 28 freq
ragin' - 3 freq
rijn - 1 freq
rage-in - 2 freq
ragein - 1 freq
rejoin - 3 freq
rejyne - 2 freq
rougin - 1 freq
regina - 1 freq
rhhyjuan - 1 freq
REGION
Time to execute Levenshtein function - 0.190288 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.329701 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028111 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036635 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000902 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.