A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ceegarette in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
ceegarette (0) - 1 freq
ceegarettes (1) - 7 freq
cígarette (2) - 1 freq
cigarette (2) - 21 freq
ceigarettes (2) - 1 freq
cigarettes (3) - 9 freq
clearest (4) - 4 freq
garotte (4) - 1 freq
ungaretti (4) - 1 freq
margarete (4) - 1 freq
ceegar (4) - 1 freq
regretted (4) - 4 freq
bernadette (4) - 2 freq
chefgareths (4) - 3 freq
gazette (4) - 3 freq
denwette (5) - 1 freq
beeriett (5) - 2 freq
dearest (5) - 19 freq
cheritie (5) - 3 freq
celebrate (5) - 86 freq
cremate (5) - 1 freq
teegars' (5) - 1 freq
grete (5) - 2 freq
coogate (5) - 2 freq
baguette (5) - 2 freq
ceegarette (0) - 1 freq
ceegarettes (2) - 7 freq
cigarette (2) - 21 freq
ceigarettes (3) - 1 freq
cigarettes (4) - 9 freq
cígarette (4) - 1 freq
ungaretti (5) - 1 freq
garotte (5) - 1 freq
gazette (6) - 3 freq
regretted (6) - 4 freq
grett (6) - 34 freq
clearest (6) - 4 freq
ceegar (6) - 1 freq
certie (7) - 4 freq
clartie (7) - 4 freq
lazaretto (7) - 1 freq
majorette (7) - 2 freq
cassette (7) - 3 freq
segregate (7) - 1 freq
carte (7) - 4 freq
begrutten (7) - 6 freq
cheatet (7) - 1 freq
clearit (7) - 2 freq
courgette (7) - 1 freq
regrettit (7) - 3 freq
SoundEx code - C263
cigarette - 21 freq
cigarettes - 9 freq
cowk-wirthy - 1 freq
choochert - 1 freq
chequered - 1 freq
checquered - 1 freq
ceegarettes - 7 freq
ceegarette - 1 freq
ceigarettes - 1 freq
MetaPhone code - SKRT
secret - 196 freq
scared - 44 freq
screed - 31 freq
skreid - 18 freq
scairt - 6 freq
skirt - 56 freq
skairt - 1 freq
saicret - 37 freq
skeert - 1 freq
cigarette - 21 freq
security - 54 freq
scart - 23 freq
sigurd - 40 freq
skyrit - 1 freq
sacred - 24 freq
scurrit - 2 freq
scarred - 15 freq
sugart - 1 freq
squared - 6 freq
scrat - 31 freq
scoured - 5 freq
saucrit - 6 freq
scaured - 4 freq
scourit - 2 freq
squarrt - 1 freq
sukkert - 1 freq
squirt - 4 freq
scoort - 2 freq
so-cried - 1 freq
scratty - 4 freq
scoored - 3 freq
security' - 1 freq
scaredy - 1 freq
scurried - 5 freq
scored - 41 freq
scort - 1 freq
socried - 1 freq
sacret - 1 freq
secured - 4 freq
saicred - 2 freq
skrit - 6 freq
skurt - 4 freq
scrit - 6 freq
sae-cried - 3 freq
scar'd - 1 freq
skreed - 1 freq
skord - 1 freq
skart - 2 freq
scaird - 1 freq
securit - 2 freq
skurried - 1 freq
screid - 8 freq
seecrit - 1 freq
scaurt - 2 freq
sacrit - 3 freq
securitie - 3 freq
skared - 1 freq
skaired - 6 freq
saecret - 1 freq
ceegarette - 1 freq
sugared - 1 freq
seicret - 3 freq
sikkart - 1 freq
scurred - 2 freq
skyred - 1 freq
scored- - 1 freq
scrote - 1 freq
zqqrd - 1 freq
CEEGARETTE
Time to execute Levenshtein function - 0.200029 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.382222 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028584 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039056 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.