A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to sodger in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
sodger (0) - 114 freq
codger (1) - 2 freq
sodjer (1) - 7 freq
dodger (1) - 5 freq
rodger (1) - 4 freq
sodgers (1) - 125 freq
lodger (1) - 3 freq
fodder (2) - 9 freq
sedge (2) - 1 freq
ledger (2) - 1 freq
singer (2) - 41 freq
jogger (2) - 1 freq
sober (2) - 26 freq
sotter (2) - 34 freq
sodgers' (2) - 2 freq
dodges (2) - 2 freq
scodge (2) - 2 freq
dodgers (2) - 2 freq
sodgiers (2) - 1 freq
sover (2) - 1 freq
sojier (2) - 1 freq
tadger (2) - 3 freq
lodged (2) - 6 freq
soccer (2) - 4 freq
hodge (2) - 3 freq
sodger (0) - 114 freq
sodgers (2) - 125 freq
rodger (2) - 4 freq
lodger (2) - 3 freq
codger (2) - 2 freq
sodjer (2) - 7 freq
dodger (2) - 5 freq
sodgerin (3) - 1 freq
badger (3) - 6 freq
suger (3) - 1 freq
gadger (3) - 1 freq
ludger (3) - 4 freq
sudgers (3) - 5 freq
syder (3) - 5 freq
wadger (3) - 5 freq
sadler (3) - 1 freq
seeger (3) - 1 freq
tadger (3) - 3 freq
souder (3) - 1 freq
cadger (3) - 5 freq
sanger (3) - 1 freq
sadder (3) - 4 freq
ledger (3) - 1 freq
sodgiers (3) - 1 freq
sedge (3) - 1 freq
SoundEx code - S326
stauchered - 3 freq
sodjers - 23 freq
swedgers - 1 freq
sodjer - 7 freq
sodgers - 125 freq
scotscreive - 3 freq
stoker - 4 freq
sodger - 114 freq
stauchert - 6 freq
staggered - 7 freq
sodger's - 7 freq
swedgers' - 1 freq
staggers - 4 freq
sidewhuskers - 1 freq
switzerland - 12 freq
staggert - 4 freq
stagger - 15 freq
staucher - 5 freq
sodgers' - 2 freq
staigger - 1 freq
staggeren - 1 freq
sidecar - 3 freq
sydecar - 1 freq
steeker - 1 freq
staucherin - 2 freq
stauchran - 1 freq
staaker - 1 freq
stickers - 11 freq
schwytzertütsch - 3 freq
staggern - 1 freq
staggerin - 8 freq
stachered - 4 freq
stacherin - 3 freq
scutcher - 1 freq
sticker - 9 freq
sodgerin - 1 freq
stack-yaird - 2 freq
soadjers - 1 freq
scotch-airish - 5 freq
scots-airish - 1 freq
stacher - 3 freq
sudocrem - 2 freq
'scotscreive' - 1 freq
stacheran - 1 freq
stackyaird - 1 freq
stecheran - 1 freq
seedcoarn - 1 freq
stachert - 4 freq
scotcourts - 1 freq
sudgers - 5 freq
sodgiers - 1 freq
sudjers - 2 freq
sweetcorn - 3 freq
stok-whorne - 1 freq
sodger-palmers - 1 freq
sodger-palmer - 1 freq
shot-the-craw - 1 freq
sweatshirt - 4 freq
stockaree - 1 freq
shit-scared - 2 freq
site-search - 1 freq
staggering - 2 freq
seed-corn - 1 freq
schotserib - 1 freq
staceyr - 1 freq
scotsradiomedia - 31 freq
shithousery - 2 freq
scotsscriever - 45 freq
scotiagrannie - 1 freq
scattyscribbler - 4 freq
scotsrenaissance - 1 freq
stjerne - 1 freq
satyagrahalba - 2 freq
scotsquirrels - 4 freq
scotsradio - 1 freq
stgriswalds - 1 freq
steisher - 2 freq
scotswirds - 1 freq
sidecarsplz - 1 freq
scottacraigie - 3 freq
scotscores - 3 freq
scottishwriters - 2 freq
MetaPhone code - SJR
sodger - 114 freq
sojer - 23 freq
soajer - 1 freq
seeger - 1 freq
suger - 1 freq
sojier - 1 freq
zojyr - 1 freq
sugery - 1 freq
SODGER
Time to execute Levenshtein function - 0.589872 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.021570 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.088986 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.098587 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001126 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.