A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sabill in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sabill (0) - 1 freq
stabill (1) - 1 freq
squill (2) - 4 freq
shill (2) - 4 freq
sawmill (2) - 9 freq
sailt (2) - 2 freq
sall (2) - 97 freq
haill (2) - 411 freq
ab'll (2) - 2 freq
saicl (2) - 1 freq
daill (2) - 1 freq
sails (2) - 39 freq
bill (2) - 10 freq
spill (2) - 24 freq
skill (2) - 55 freq
vaill (2) - 7 freq
ahill (2) - 1 freq
saidill (2) - 1 freq
saall (2) - 1 freq
sabies (2) - 13 freq
skuill (2) - 10 freq
sabine (2) - 3 freq
saville (2) - 5 freq
skaill (2) - 2 freq
skeill (2) - 11 freq
sabill (0) - 1 freq
stabill (2) - 1 freq
zabell (3) - 1 freq
sabhal (3) - 2 freq
sibiel (3) - 1 freq
smaill (3) - 1 freq
skeill (3) - 11 freq
saville (3) - 5 freq
skaill (3) - 2 freq
scuill (3) - 1 freq
sable (3) - 2 freq
i'bill (3) - 2 freq
say'll (3) - 1 freq
nobill (3) - 1 freq
abell (3) - 1 freq
sybil (3) - 1 freq
bill (3) - 551 freq
skuill (3) - 10 freq
swill (3) - 2 freq
sill (3) - 19 freq
still (3) - 2622 freq
sall (3) - 97 freq
spill (3) - 24 freq
squill (3) - 4 freq
shill (3) - 4 freq
SoundEx code - S140
swivel - 5 freq
soople - 11 freq
safely - 41 freq
supple - 6 freq
spell - 160 freq
spiel - 19 freq
supply - 45 freq
spyle - 17 freq
spoil - 14 freq
spill - 24 freq
souple - 12 freq
sabill - 1 freq
spulyie - 5 freq
shovel - 25 freq
spuil - 1 freq
spaell - 4 freq
sibiel - 1 freq
scuffle - 4 freq
saifly - 1 freq
spall - 1 freq
sable - 2 freq
splh - 1 freq
sieepily - 1 freq
speil - 18 freq
shuffle - 11 freq
saville - 5 freq
'spell - 3 freq
speal - 1 freq
sobel - 1 freq
sweevely - 2 freq
speel - 18 freq
spail - 2 freq
supleh - 1 freq
shuvel - 1 freq
skiffle - 2 freq
spile - 11 freq
spool - 2 freq
spïll - 1 freq
spull - 2 freq
shuvvle - 1 freq
sapple - 1 freq
sibble - 1 freq
ösfil - 2 freq
'spill - 1 freq
squabble - 2 freq
shapely - 2 freq
soupil - 3 freq
swabble - 1 freq
seeable - 1 freq
sibella - 2 freq
seville - 1 freq
'shapely - 1 freq
shevel - 1 freq
spulye - 1 freq
spyhole - 1 freq
saufly - 1 freq
spael - 1 freq
shiffil - 1 freq
supplie - 1 freq
skybal - 1 freq
sabhal - 2 freq
shabbily - 2 freq
sauflie - 1 freq
sybil - 1 freq
Ísabel - 6 freq
sipel - 1 freq
spaley - 3 freq
shivel - 1 freq
spfl - 14 freq
spl - 4 freq
svpl - 1 freq
MetaPhone code - SBL
sabill - 1 freq
sibiel - 1 freq
sable - 2 freq
sobel - 1 freq
sibble - 1 freq
seeable - 1 freq
ys-bloo - 1 freq
sibella - 2 freq
sybil - 1 freq
Ísabel - 6 freq
zabell - 1 freq
SABILL
Time to execute Levenshtein function - 0.266665 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.390186 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032658 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.043157 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000952 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.