A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to standart in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
standart (0) - 45 freq
standard (1) - 115 freq
standirt (1) - 65 freq
staundart (1) - 68 freq
stannart (1) - 10 freq
standarts (1) - 9 freq
staundard (2) - 15 freq
staunnart (2) - 6 freq
staandirt (2) - 1 freq
tankart (2) - 1 freq
landart (2) - 2 freq
standand (2) - 1 freq
standard' (2) - 1 freq
stannarts (2) - 6 freq
sandirt (2) - 1 freq
€žstandart (2) - 1 freq
'standard (2) - 2 freq
standards (2) - 35 freq
stannard (2) - 1 freq
stoddart (2) - 4 freq
staunart (2) - 47 freq
standoot (2) - 1 freq
stardirt (2) - 1 freq
'standart' (2) - 1 freq
standan (2) - 22 freq
standart (0) - 45 freq
staundart (1) - 68 freq
standirt (1) - 65 freq
standarts (2) - 9 freq
staandirt (2) - 1 freq
stannart (2) - 10 freq
standard (2) - 115 freq
stoddart (3) - 4 freq
staundarts (3) - 9 freq
staunart (3) - 47 freq
sandirt (3) - 1 freq
stardirt (3) - 1 freq
staundard (3) - 15 freq
staunnart (3) - 6 freq
stannert (3) - 3 freq
standoot (3) - 1 freq
stalwart (4) - 10 freq
stendit (4) - 2 freq
staunnert (4) - 4 freq
stunnert (4) - 1 freq
standan (4) - 22 freq
staunert (4) - 2 freq
sindert (4) - 41 freq
unstaundart (4) - 2 freq
staundoot (4) - 1 freq
SoundEx code - S353
staund - 107 freq
staundin - 114 freq
stent - 30 freq
stents - 10 freq
stoundin - 8 freq
standart - 45 freq
staun-oot - 2 freq
stand - 199 freq
staunds - 29 freq
stained - 26 freq
standarts - 9 freq
suddent - 44 freq
stints - 1 freq
stands - 55 freq
standable - 1 freq
stunned - 15 freq
stentin - 2 freq
stound - 12 freq
suddenty - 40 freq
suddentie - 13 freq
stint - 11 freq
suddentlie - 7 freq
stoundit - 3 freq
standin - 97 freq
standard - 115 freq
stoned - 5 freq
stunt - 3 freq
standing - 33 freq
stentit - 9 freq
suddentlike - 3 freq
syttand - 1 freq
sweet-naitured - 1 freq
stunnt - 1 freq
saddent - 1 freq
stend - 1 freq
stane-dyke - 1 freq
stane-tired - 2 freq
staaand - 1 freq
steamed - 7 freq
suddently - 50 freq
standardisation - 6 freq
stuntit - 1 freq
staandin - 25 freq
staundart - 68 freq
stoonds - 3 freq
stoont - 1 freq
stoon't - 1 freq
standards - 35 freq
sweetned - 3 freq
sweetened - 1 freq
stoondin - 9 freq
staundin' - 2 freq
standby - 4 freq
steem't - 6 freq
standerds - 1 freq
steemt - 1 freq
stunted - 2 freq
stooned - 2 freq
stane-dry - 1 freq
stemmed - 3 freq
'staund - 7 freq
'stands - 1 freq
stand-up - 2 freq
'stand - 1 freq
standardized - 1 freq
'standart' - 1 freq
staint - 3 freq
stounds - 3 freq
stoondit - 2 freq
standardize - 2 freq
'standard - 2 freq
staands - 7 freq
staand - 21 freq
sweet-naitered - 1 freq
ston-dead - 1 freq
staundarts - 9 freq
stintit - 2 freq
staundoot - 1 freq
stained-gless - 1 freq
standan - 22 freq
stendit - 2 freq
stentie - 1 freq
stentless - 2 freq
stoond - 3 freq
sotnethin - 1 freq
standin-up - 1 freq
staands'im - 1 freq
staandirt - 1 freq
stuntmen - 1 freq
standardised - 5 freq
standpretty - 1 freq
staandstill - 1 freq
steenywid - 1 freq
stone-deaf - 1 freq
standardisin' - 1 freq
sutntawaer - 1 freq
stound's - 1 freq
staundardisation - 5 freq
'stent - 1 freq
suddentlik - 1 freq
standirt - 65 freq
sweatmeat - 1 freq
suddent-like - 4 freq
stennit - 1 freq
shuttin-time - 1 freq
stunde - 1 freq
stunden - 1 freq
staundan - 1 freq
stunts - 2 freq
settin-oot - 1 freq
staundardise - 1 freq
staundardised - 4 freq
staundardiation - 1 freq
staundarts' - 1 freq
stouned - 2 freq
sweet-maet - 1 freq
stendin - 1 freq
staundardisin - 1 freq
staundards - 3 freq
scotmid - 1 freq
standardization - 1 freq
stonding - 1 freq
sooth-moothers - 1 freq
standand - 1 freq
sittand - 1 freq
steam-driven - 1 freq
staundard - 15 freq
stuntin - 3 freq
stawnd - 1 freq
stawndin - 6 freq
stawnds - 1 freq
sýstematic - 1 freq
standstill - 1 freq
stane-deefness - 1 freq
standardise - 1 freq
€˜standard - 1 freq
standardisin - 1 freq
standardiesaetion - 2 freq
standardiesation - 2 freq
€žstandart - 1 freq
stoundinly - 1 freq
€œstand - 1 freq
stond - 3 freq
stonds - 1 freq
€˜standardised - 1 freq
staund-alane - 2 freq
scotand - 2 freq
stymied - 3 freq
settint - 1 freq
standartisation - 1 freq
staundartised - 1 freq
stane-daked - 1 freq
standin' - 2 freq
scotnational - 34 freq
stanthemannie - 1 freq
standfree - 43 freq
standupfarmer - 34 freq
standglasgow - 1 freq
scotindustria - 1 freq
standfreeed - 2 freq
standy - 1 freq
standrewsday - 3 freq
standrews - 2 freq
standrewsvoices - 1 freq
standup - 1 freq
standnewcastle - 1 freq
standardnews - 1 freq
‘standardisation’ - 1 freq
‘standardise’ - 1 freq
skottehandelen - 1 freq
scotindortmund - 1 freq
soothends - 1 freq
standforbetter - 1 freq
standard' - 1 freq
standoot - 1 freq
sidneythursday - 4 freq
MetaPhone code - STNTRT
standart - 45 freq
standard - 115 freq
stane-tired - 2 freq
staundart - 68 freq
'standart' - 1 freq
'standard - 2 freq
staandirt - 1 freq
standirt - 65 freq
staundard - 15 freq
€˜standard - 1 freq
€žstandart - 1 freq
standard' - 1 freq
STANDART
standard - 115 freq
standart - 45 freq
staundart - 68 freq
staunart - 47 freq
staunnart - 6 freq
staunnert - 4 freq
staundarts - 9 freq
standards - 35 freq
standarts - 9 freq
staunarts - freq
staunnarts - freq
staunnerts - 2 freq
unstandard - freq
unstaundart - 2 freq
staunertisashun - 1 freq
standirt - 65 freq
standardised - 5 freq
standardisation - 6 freq
standardize - 2 freq
standardization - 1 freq
standardise - 1 freq
non-standard - 2 freq
standardiesation - 2 freq
standardiesaetion - 2 freq
Time to execute Levenshtein function - 0.270003 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.711588 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.066300 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.084058 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001691 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.