A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cement in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cement (0) - 27 freq
wemen (2) - 16 freq
comeat (2) - 1 freq
mament (2) - 14 freq
ceest (2) - 1 freq
tenent (2) - 2 freq
client (2) - 2 freq
event (2) - 103 freq
ceen (2) - 2 freq
peent (2) - 1 freq
waement (2) - 1 freq
meent (2) - 22 freq
remeent (2) - 2 freq
peyment (2) - 6 freq
segment (2) - 1 freq
cumen (2) - 1 freq
lemen (2) - 1 freq
cent (2) - 20 freq
relent (2) - 5 freq
recent (2) - 72 freq
foment (2) - 2 freq
ferment (2) - 4 freq
wament (2) - 1 freq
element (2) - 45 freq
moment (2) - 231 freq
cement (0) - 27 freq
remant (3) - 1 freq
comment (3) - 117 freq
cementin (3) - 1 freq
lament (3) - 22 freq
moment (3) - 231 freq
wament (3) - 1 freq
element (3) - 45 freq
cogent (3) - 1 freq
ciemen (3) - 1 freq
casement (3) - 3 freq
cemented (3) - 2 freq
wycement (3) - 1 freq
comena (3) - 2 freq
ment (3) - 18 freq
comet (3) - 12 freq
foment (3) - 2 freq
memento (3) - 1 freq
meent (3) - 22 freq
remeent (3) - 2 freq
client (3) - 2 freq
comeat (3) - 1 freq
mament (3) - 14 freq
peyment (3) - 6 freq
waement (3) - 1 freq
SoundEx code - C553
commentator - 9 freq
commaund - 5 freq
comment - 117 freq
community - 254 freq
commaunds - 11 freq
command - 36 freq
commanded - 6 freq
comin-douns - 1 freq
communities - 64 freq
'community - 2 freq
commandments - 9 freq
commandment - 3 freq
commander - 10 freq
commanders - 1 freq
community's - 1 freq
cement - 27 freq
commaundit - 3 freq
commandit - 5 freq
commands - 4 freq
commentate - 1 freq
comments - 65 freq
conundrum - 4 freq
comin't - 1 freq
commmunitarian - 1 freq
communitie - 21 freq
commentators - 11 freq
commander-in-chief - 1 freq
commented - 6 freq
commination - 2 freq
commentin - 4 freq
commentary - 13 freq
community' - 2 freq
canaanite - 1 freq
commendin - 1 freq
commaand - 4 freq
commonties - 18 freq
commandin - 2 freq
commonty - 22 freq
commonties' - 2 freq
commontyis - 1 freq
comin-doon - 1 freq
commandar - 1 freq
commontie - 2 freq
commends - 2 freq
conundrums - 1 freq
commonity - 177 freq
commonities - 57 freq
'commonity - 1 freq
commandis - 1 freq
comaands - 1 freq
commaands - 6 freq
commandos - 1 freq
commentarie - 1 freq
commmunitie - 1 freq
€œcommunity - 1 freq
ceenematic - 1 freq
comunatez - 1 freq
cheinie-metall - 1 freq
commend - 2 freq
comin-oot - 1 freq
€˜community - 1 freq
commando - 1 freq
cementing - 1 freq
cementin - 1 freq
commonty' - 1 freq
cinematic - 2 freq
commending - 1 freq
commentatin - 1 freq
canonade - 1 freq
community-run - 1 freq
community-driven - 1 freq
communed - 1 freq
commendit - 4 freq
commentit - 3 freq
communities-ni - 1 freq
communitiesni - 3 freq
commentars - 1 freq
commander-in-heid - 1 freq
communityworldservice - 1 freq
communitarian - 1 freq
commentariat - 4 freq
cemented - 2 freq
commendable - 1 freq
c'monthedean - 1 freq
‘community’ - 1 freq
community-wide - 1 freq
community-minded - 1 freq
cmonthebears - 1 freq
cmoondl - 1 freq
cmondave - 1 freq
MetaPhone code - SMNT
summoned - 4 freq
cement - 27 freq
summonit - 2 freq
summont - 2 freq
summond - 1 freq
soumand - 1 freq
wycement - 1 freq
someone'd - 1 freq
CEMENT
Time to execute Levenshtein function - 0.175401 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.328002 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.035924 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044065 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001163 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.