A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to sydecar in Corpus

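Results by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Each entry gives the word, the distance in brackets (Levenshtein and Double Levenshtein only) and its frequency.

Levenshtein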
sydecar (0) - 1 freq
sidecar (1) - 3 freq
aydeca (2) - 1 freq
syder (2) - 5 freq
sicecar (2) - 2 freq
siecar (2) - 1 freq
syde (3) - 37 freq
syer (3) - 1 freq
decay (3) - 11 freq
scar (3) - 16 freq
sicecan (3) - 1 freq
syden (3) - 1 freq
decaf (3) - 1 freq
aydea (3) - 1 freq
sear (3) - 2 freq
bynear (3) - 2 freq
shear (3) - 9 freq
denar (3) - 1 freq
endear (3) - 1 freq
syde's (3) - 5 freq
smear (3) - 2 freq
dear (3) - 425 freq
iydeas (3) - 1 freq
year (3) - 2070 freq
syne-an (3) - 1 freq
Double Levenshtein
sydecar (0) - 1 freq
sidecar (1) - 3 freq
siecar (3) - 1 freq
sicecar (3) - 2 freq
syder (3) - 5 freq
succar (4) - 3 freq
siccar (4) - 289 freq
scar (4) - 16 freq
sicear (4) - 6 freq
decor (4) - 3 freq
aydeca (4) - 1 freq
slicer (5) - 1 freq
scaar (5) - 20 freq
secure (5) - 22 freq
saucer (5) - 13 freq
seducin (5) - 1 freq
sodjer (5) - 7 freq
decora (5) - 1 freq
dewar (5) - 9 freq
scair (5) - 2 freq
ryder (5) - 1 freq
sodger (5) - 114 freq
sadler (5) - 1 freq
spacer (5) - 2 freq
scaur (5) - 14 freq
SoundEx code - S326
stauchered - 3 freq
sodjers - 23 freq
swedgers - 1 freq
sodjer - 7 freq
sodgers - 125 freq
scotscreive - 3 freq
stoker - 4 freq
sodger - 114 freq
stauchert - 6 freq
staggered - 7 freq
sodger's - 7 freq
swedgers' - 1 freq
staggers - 5 freq
sidewhuskers - 1 freq
switzerland - 12 freq
staggert - 4 freq
stagger - 15 freq
staucher - 5 freq
sodgers' - 2 freq
staigger - 1 freq
staggeren - 1 freq
sidecar - 3 freq
sydecar - 1 freq
steisher - 3 freq
sticker - 10 freq
steeker - 1 freq
staucherin - 2 freq
stauchran - 1 freq
staaker - 1 freq
stickers - 11 freq
schwytzertütsch - 3 freq
staggern - 1 freq
staggerin - 8 freq
stachered - 4 freq
stacherin - 3 freq
scutcher - 1 freq
sodgerin - 1 freq
stack-yaird - 2 freq
soadjers - 1 freq
scotch-airish - 5 freq
scots-airish - 1 freq
stacher - 3 freq
sudocrem - 2 freq
'scotscreive' - 1 freq
stacheran - 1 freq
stackyaird - 1 freq
stecheran - 1 freq
seedcoarn - 1 freq
stachert - 4 freq
scotcourts - 1 freq
sudgers - 5 freq
sodgiers - 1 freq
sudjers - 2 freq
sweetcorn - 3 freq
stok-whorne - 1 freq
sodger-palmers - 1 freq
sodger-palmer - 1 freq
shot-the-craw - 1 freq
sweatshirt - 4 freq
stockaree - 1 freq
shit-scared - 2 freq
site-search - 1 freq
staggering - 2 freq
seed-corn - 1 freq
schotserib - 1 freq
staceyr - 1 freq
scotsradiomedia - 31 freq
shithousery - 2 freq
scotsscriever - 45 freq
scotiagrannie - 1 freq
scattyscribbler - 4 freq
scotsrenaissance - 1 freq
stjerne - 1 freq
satyagrahalba - 2 freq
scotsquirrels - 4 freq
scotsradio - 1 freq
stgriswalds - 1 freq
scotswirds - 1 freq
sidecarsplz - 1 freq
scottacraigie - 3 freq
scotscores - 3 freq
scottishwriters - 2 freq
MetaPhone code - STKR
stoker - 4 freq
stagger - 15 freq
staigger - 1 freq
sidecar - 3 freq
sydecar - 1 freq
sticker - 10 freq
steeker - 1 freq
staaker - 1 freq
stockaree - 1 freq
Manually curated - SYDECAR
Time to execute Levenshtein function - 0.517878 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
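As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. The function name `levenshtein` is chosen for this example, and the code is not necessarily how this site computes its distances.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far
    # (excluding the current character) and the first j characters of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("sydecar", "sidecar"))  # 1
print(levenshtein("sydecar", "syder"))    # 2
print(levenshtein("sydecar", "scar"))     # 3
```

The three printed values agree with the distances shown in the Levenshtein results for sydecar.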
Time to execute Double Levenshtein function - 1.287208 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
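The description above can be sketched in Python as follows. The names `strip_vowels` and `double_levenshtein` are made up for this example. The vowel set is an assumption: counting y as a vowel reproduces the distances listed for sydecar, but the site's actual vowel list is not stated.

```python
# Assumption: y is treated as a vowel. This is inferred from the listed
# results (sidecar scores 1, not 2), not taken from the site's code.
VOWELS = set("aeiouy")


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words,
    so a consonant change is counted twice and a vowel change once."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("sydecar", "sidecar"))  # 1
print(double_levenshtein("sydecar", "siecar"))   # 3
print(double_levenshtein("sydecar", "scar"))     # 4
```

Under this assumption a vowel swap such as y to i costs 1, while a dropped consonant costs 2, which is why siecar sits further from sydecar here than in the plain Levenshtein list.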
Time to execute SoundEx function - 0.093318 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
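To show how such a code is built, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse adjacent repeats, and pad to four characters. The function name `soundex` and the handling of edge cases are assumptions of this example and may differ from the routine this site uses.

```python
# Digit groups of the classic Soundex scheme.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as 'S326'."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels and unknown letters map to ""
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeat after it is kept
        if len(result) == 4:
            break
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("sydecar"))  # S326
print(soundex("sodger"))   # S326
print(soundex("stoker"))   # S326
```

All three words collapse to S326, matching the SoundEx code shown for sydecar, which also illustrates how coarse the grouping is.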
Time to execute MetaPhone function - 0.120611 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001798 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
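A lookup of this kind can be sketched in Python as a plain dictionary. The table contents and the names `LEXICON` and `lemma_for` are hypothetical and chosen for illustration; only the sydecar entry echoes the result shown on this page, and reading SYDECAR as that word's lemma is itself an assumption.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
# The real table is not shown on this page; these entries are placeholders.
LEXICON = {
    "sydecar": "SYDECAR",
    "sydecars": "SYDECAR",  # illustrative variant, not taken from the corpus
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Sydecar"))  # SYDECAR
print(lemma_for("sonsie"))   # None
```

A direct key lookup involves no string comparison against the rest of the vocabulary, which is consistent with this method having the shortest execution time listed above.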