A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to citin in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
citin (0) - 1 freq
fitin (1) - 4 freq
sitin (1) - 1 freq
hitin (1) - 1 freq
cimin (1) - 1 freq
gitin (1) - 2 freq
aitin (1) - 21 freq
witin (1) - 1 freq
pitin (1) - 1 freq
citie (1) - 10 freq
citit (1) - 2 freq
bitin (1) - 27 freq
dizin (2) - 3 freq
cumin (2) - 70 freq
wirtin (2) - 2 freq
coftin (2) - 1 freq
cin (2) - 306 freq
cits (2) - 1 freq
celin (2) - 1 freq
bikin (2) - 7 freq
hikin (2) - 1 freq
teitin (2) - 1 freq
acktin (2) - 1 freq
cuisin (2) - 1 freq
ceilin (2) - 65 freq
citin (0) - 1 freq
citit (2) - 2 freq
citie (2) - 10 freq
bitin (2) - 27 freq
pitin (2) - 1 freq
coatin (2) - 1 freq
actin (2) - 58 freq
ectin (2) - 1 freq
sitin (2) - 1 freq
witin (2) - 1 freq
fitin (2) - 4 freq
hitin (2) - 1 freq
gitin (2) - 2 freq
cimin (2) - 1 freq
aitin (2) - 21 freq
carin (3) - 45 freq
hatin (3) - 2 freq
cain (3) - 24 freq
haitin (3) - 7 freq
cavin (3) - 1 freq
itan (3) - 1 freq
evitin (3) - 4 freq
cantin (3) - 4 freq
cretin (3) - 3 freq
cairtin (3) - 7 freq
SoundEx code - C350
cuttin - 74 freq
cut-doon - 2 freq
cuidnae - 135 freq
caution - 10 freq
coudna - 47 freq
cotton - 25 freq
cuidna - 101 freq
cudnae - 144 freq
cudna - 165 freq
chattin - 22 freq
cheatin - 4 freq
cotton-woo - 2 freq
cidna - 2 freq
cidnae - 2 freq
cwidna - 47 freq
chidin - 1 freq
cuttin' - 1 freq
coodna - 71 freq
coudnae - 20 freq
coddin - 2 freq
chaitin - 3 freq
cud'nae - 1 freq
'cudna - 1 freq
cuttan - 4 freq
chaetin - 1 freq
cheatan - 1 freq
coudno - 3 freq
cadona - 6 freq
coodnae - 52 freq
cuddie-an - 1 freq
chatham - 1 freq
cydonia - 2 freq
cweedna - 2 freq
cidni - 1 freq
cottown - 1 freq
chattan - 1 freq
€œcudna - 1 freq
coatin - 1 freq
citin - 1 freq
chutney - 4 freq
cowden - 7 freq
codeine - 1 freq
ctyem - 1 freq
cudnea - 2 freq
MetaPhone code - STN
stane - 421 freq
sittin - 737 freq
staun - 224 freq
stany - 5 freq
sudden - 213 freq
steen - 114 freq
staan - 13 freq
settin - 144 freq
hsten - 1 freq
saitin - 3 freq
settan - 14 freq
sydney - 7 freq
stone - 85 freq
suttin - 81 freq
seatoun - 1 freq
setten - 22 freq
sitten - 12 freq
sittin' - 21 freq
sidden - 16 freq
cidna - 2 freq
wystin - 2 freq
stan - 150 freq
cidnae - 2 freq
sodden - 11 freq
stoon - 9 freq
stein - 22 freq
staney - 6 freq
sidn - 1 freq
steenie - 25 freq
stan¢ - 1 freq
satan - 24 freq
seton - 18 freq
settin' - 13 freq
seitten - 1 freq
suddin - 1 freq
syden - 1 freq
staine - 2 freq
sa'tin' - 1 freq
sit'n - 2 freq
sitt'n - 1 freq
stony - 4 freq
stan' - 8 freq
sedn - 1 freq
stine - 1 freq
stain - 26 freq
seatin - 4 freq
satin - 12 freq
sutten - 8 freq
steeny - 7 freq
stawn - 4 freq
sidney - 23 freq
sidon - 11 freq
sïttin - 8 freq
stane' - 2 freq
sautin - 1 freq
'stan - 2 freq
side-on - 1 freq
sit-in - 2 freq
steen' - 2 freq
sea-aeten - 1 freq
sittan - 32 freq
soodna - 7 freq
stown - 7 freq
ston - 9 freq
stun - 2 freq
soddan - 1 freq
staen - 2 freq
situn - 1 freq
steyn - 8 freq
suden - 1 freq
setteen - 2 freq
stoun - 2 freq
seteen - 1 freq
sudna - 13 freq
stonn - 1 freq
sateen - 1 freq
said-na - 1 freq
soudna - 2 freq
seedin - 1 freq
sautan - 1 freq
saidna - 1 freq
cydonia - 2 freq
cidni - 1 freq
suitin - 1 freq
swytin - 1 freq
'staun - 1 freq
€˜staun - 1 freq
suidna - 3 freq
sawtan - 1 freq
stanie - 2 freq
staun' - 2 freq
stane- - 1 freq
soddin' - 1 freq
suiden - 1 freq
€œstaan - 1 freq
sudn - 1 freq
siden - 2 freq
€˜siden - 1 freq
citin - 1 freq
stenn - 1 freq
suidnae - 1 freq
sutton - 6 freq
sittn - 1 freq
sitin - 1 freq
set-in - 1 freq
stoney - 3 freq
seaton - 3 freq
setn - 1 freq
stane” - 1 freq
settn - 1 freq
sudan - 1 freq
CITIN
Time to execute Levenshtein function - 0.239825 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.490827 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034957 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.064540 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.