A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to remit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
remit (0) - 11 freq
remin (1) - 1 freq
emit (1) - 4 freq
demit (1) - 2 freq
redit (1) - 2 freq
dremit (1) - 1 freq
fremit (1) - 13 freq
semit (1) - 1 freq
resit (1) - 1 freq
remix (1) - 1 freq
reikit (2) - 1 freq
rem (2) - 2 freq
reipit (2) - 2 freq
hempt (2) - 1 freq
nemmit (2) - 1 freq
demi (2) - 1 freq
reckt (2) - 1 freq
re-fit (2) - 1 freq
sumit (2) - 1 freq
recite (2) - 9 freq
ruit (2) - 19 freq
relait (2) - 1 freq
nemyt (2) - 1 freq
grevit (2) - 1 freq
naemit (2) - 1 freq
remit (0) - 11 freq
remix (2) - 1 freq
roamit (2) - 1 freq
armit (2) - 4 freq
semit (2) - 1 freq
remote (2) - 27 freq
resit (2) - 1 freq
demit (2) - 2 freq
emit (2) - 4 freq
remin (2) - 1 freq
fremit (2) - 13 freq
redit (2) - 2 freq
dremit (2) - 1 freq
react (3) - 17 freq
removit (3) - 1 freq
fremyt (3) - 2 freq
remak (3) - 1 freq
risit (3) - 1 freq
leimit (3) - 1 freq
requit (3) - 1 freq
reet (3) - 8 freq
reset (3) - 1 freq
kermit (3) - 2 freq
roit (3) - 1 freq
rivit (3) - 2 freq
SoundEx code - R530
rent - 48 freq
round - 122 freq
rained - 13 freq
roond - 955 freq
runt - 7 freq
ruint - 7 freq
rammed - 9 freq
rend - 2 freq
rant - 26 freq
rhymed - 8 freq
rand - 2 freq
remote - 27 freq
ruined - 29 freq
remead - 2 freq
randy - 15 freq
remedy - 4 freq
randie - 8 freq
remeid - 7 freq
rooint - 2 freq
reunite - 1 freq
rewind - 3 freq
renewed - 12 freq
roamed - 3 freq
rehaimed - 1 freq
re-haimed - 1 freq
rwintie - 1 freq
roamit - 1 freq
'roond - 1 freq
rroned - 1 freq
rint - 4 freq
remade - 1 freq
rimed - 1 freq
reeined - 1 freq
runty - 1 freq
rewynit - 1 freq
remit - 11 freq
reined - 2 freq
remede - 1 freq
'round' - 1 freq
ranit - 2 freq
roun't - 1 freq
rounit - 1 freq
remote' - 1 freq
rund - 2 freq
rynyt - 1 freq
'remit' - 1 freq
rhind - 1 freq
€˜roond- - 1 freq
rennet - 1 freq
rehomed - 1 freq
reamed - 1 freq
ramed - 2 freq
€˜round - 1 freq
rahnd - 1 freq
rimmed - 1 freq
rooond - 1 freq
rintaeye - 1 freq
rnnd - 1 freq
roundÂ’ - 1 freq
ronda - 1 freq
'remote - 1 freq
rwanda - 2 freq
'remote' - 2 freq
MetaPhone code - RMT
rammed - 9 freq
rhymed - 8 freq
remote - 27 freq
rimbaud - 1 freq
remead - 2 freq
remedy - 4 freq
remeid - 7 freq
roamed - 3 freq
roamit - 1 freq
remade - 1 freq
rimed - 1 freq
remit - 11 freq
remede - 1 freq
remote' - 1 freq
'remit' - 1 freq
reamed - 1 freq
ramed - 2 freq
rimmed - 1 freq
'remote - 1 freq
'remote' - 2 freq
REMIT
Time to execute Levenshtein function - 0.178532 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.347187 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028009 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.055187 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000865 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.