A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to spitten in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
spitten (0) - 1 freq
spittin (1) - 35 freq
pitten (1) - 207 freq
spitter (1) - 1 freq
seitten (1) - 1 freq
sitten (1) - 12 freq
smitten (1) - 21 freq
slitten (1) - 1 freq
spittan (1) - 5 freq
snitter (2) - 1 freq
petten (2) - 7 freq
whitten (2) - 11 freq
written (2) - 283 freq
witten (2) - 1 freq
inpitten (2) - 2 freq
spittal (2) - 1 freq
mitten (2) - 15 freq
litten (2) - 2 freq
spite (2) - 73 freq
spirken (2) - 1 freq
ritten (2) - 1 freq
shiten (2) - 1 freq
spikken (2) - 15 freq
spittled (2) - 1 freq
spittle (2) - 8 freq
spitten (0) - 1 freq
spittan (1) - 5 freq
spittin (1) - 35 freq
slitten (2) - 1 freq
spittoon (2) - 1 freq
smitten (2) - 21 freq
spottin (2) - 4 freq
sitten (2) - 12 freq
pitten (2) - 207 freq
spitter (2) - 1 freq
seitten (2) - 1 freq
sittn (3) - 1 freq
splittin (3) - 12 freq
shuitten (3) - 2 freq
spittins (3) - 1 freq
setten (3) - 22 freq
smittin (3) - 2 freq
sprittin (3) - 1 freq
spitting (3) - 4 freq
pittin (3) - 392 freq
spittit (3) - 2 freq
shotten (3) - 4 freq
spotted (3) - 41 freq
shutten (3) - 2 freq
sutten (3) - 8 freq
SoundEx code - S135
saften - 7 freq
shiftin - 33 freq
saftness - 4 freq
subduing - 1 freq
saftent - 3 freq
september - 126 freq
spitten - 1 freq
spittin - 35 freq
softener - 1 freq
speidin - 1 freq
spoutin - 7 freq
saft-hingin - 1 freq
siftin - 2 freq
speedin - 4 freq
speedometer - 4 freq
saftened - 4 freq
septemmer - 9 freq
saftnin - 2 freq
spottin - 4 freq
saftens - 3 freq
spootin - 5 freq
sub-heidings - 3 freq
softened - 1 freq
spittan - 5 freq
shiftan - 4 freq
soften - 2 freq
spittin't - 1 freq
siftan - 1 freq
scabbitness - 1 freq
sputnik - 1 freq
saaften - 1 freq
spitin - 2 freq
spitting - 4 freq
sub-headin - 1 freq
swiftian - 1 freq
september's - 2 freq
spittoon - 1 freq
spouting - 3 freq
speed-mad - 1 freq
siptaimbur - 1 freq
sputum - 1 freq
shifting - 1 freq
spittins - 1 freq
softens - 1 freq
sub-heidin - 1 freq
softening - 1 freq
spoattin - 1 freq
showboating - 1 freq
speeding - 1 freq
swfdnvkrdu - 1 freq
spittingimage - 1 freq
svwdnuceov - 1 freq
skiptomyloulou - 3 freq
szvptdnlay - 1 freq
MetaPhone code - SPTN
spitten - 1 freq
spittin - 35 freq
ceptin - 6 freq
speidin - 1 freq
spoutin - 7 freq
speedin - 4 freq
spottin - 4 freq
spootin - 5 freq
spittan - 5 freq
ceptna - 2 freq
spitin - 2 freq
spittoon - 1 freq
spoattin - 1 freq
SPITTEN
Time to execute Levenshtein function - 0.481026 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.014978 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.078068 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.092254 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001097 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.