A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to spittan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
spittan (0) - 5 freq
pittan (1) - 13 freq
spitten (1) - 1 freq
spittin (1) - 34 freq
sittan (1) - 32 freq
spittal (1) - 1 freq
skittran (2) - 1 freq
spitting (2) - 4 freq
smitten (2) - 21 freq
shiftan (2) - 4 freq
shuttan (2) - 3 freq
spirkan (2) - 1 freq
gittan (2) - 30 freq
sitten (2) - 12 freq
sittand (2) - 1 freq
pittin (2) - 392 freq
sittin (2) - 721 freq
sittn (2) - 1 freq
pitten (2) - 206 freq
spikkan (2) - 4 freq
spurtan (2) - 1 freq
spottin (2) - 4 freq
smittal (2) - 6 freq
spittoon (2) - 1 freq
fittan (2) - 1 freq
spittan (0) - 5 freq
spittin (1) - 34 freq
spitten (1) - 1 freq
spottin (2) - 4 freq
spittoon (2) - 1 freq
pittan (2) - 13 freq
spittal (2) - 1 freq
sittan (2) - 32 freq
sprettan (3) - 1 freq
settan (3) - 14 freq
splittin (3) - 12 freq
spittit (3) - 2 freq
spitter (3) - 1 freq
slitten (3) - 1 freq
spittins (3) - 1 freq
smittin (3) - 2 freq
spittle (3) - 8 freq
puttan (3) - 3 freq
spoattin (3) - 1 freq
seitten (3) - 1 freq
sprittin (3) - 1 freq
stittin (3) - 1 freq
sportan (3) - 3 freq
shittin (3) - 4 freq
spitin (3) - 2 freq
SoundEx code - S135
saften - 7 freq
shiftin - 32 freq
saftness - 4 freq
subduing - 1 freq
saftent - 3 freq
september - 124 freq
spitten - 1 freq
spittin - 34 freq
softener - 1 freq
speidin - 1 freq
spoutin - 7 freq
saft-hingin - 1 freq
siftin - 2 freq
speedin - 4 freq
speedometer - 4 freq
saftened - 4 freq
septemmer - 9 freq
saftnin - 2 freq
spottin - 4 freq
saftens - 3 freq
spootin - 5 freq
sub-heidings - 3 freq
softened - 1 freq
spittan - 5 freq
shiftan - 4 freq
soften - 2 freq
spittin't - 1 freq
siftan - 1 freq
scabbitness - 1 freq
sputnik - 1 freq
saaften - 1 freq
spitin - 2 freq
spitting - 4 freq
sub-headin - 1 freq
swiftian - 1 freq
september's - 2 freq
spittoon - 1 freq
spouting - 3 freq
speed-mad - 1 freq
siptaimbur - 1 freq
sputum - 1 freq
shifting - 1 freq
spittins - 1 freq
softens - 1 freq
sub-heidin - 1 freq
softening - 1 freq
spoattin - 1 freq
showboating - 1 freq
speeding - 1 freq
swfdnvkrdu - 1 freq
spittingimage - 1 freq
svwdnuceov - 1 freq
skiptomyloulou - 3 freq
szvptdnlay - 1 freq
MetaPhone code - SPTN
spitten - 1 freq
spittin - 34 freq
ceptin - 6 freq
speidin - 1 freq
spoutin - 7 freq
speedin - 4 freq
spottin - 4 freq
spootin - 5 freq
spittan - 5 freq
ceptna - 2 freq
spitin - 2 freq
spittoon - 1 freq
spoattin - 1 freq
SPITTAN
Time to execute Levenshtein function - 0.256905 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.414966 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.038310 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038333 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000846 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.