A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to scotsman in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in parentheses)
scotsman (0) - 39 freq
scotsmen (1) - 6 freq
scottsman (1) - 1 freq
scotskat (2) - 1 freq
scotsman's (2) - 1 freq
scotsmlk (2) - 6 freq
scotchman (2) - 4 freq
scotsoun (2) - 18 freq
scotssang (2) - 3 freq
sportsman (2) - 2 freq
scotseen (2) - 2 freq
scotsla (2) - 1 freq
scotlan (2) - 63 freq
scotswoman (2) - 1 freq
cotman (2) - 1 freq
footman (3) - 11 freq
scotmid (3) - 1 freq
scotslynn (3) - 3 freq
scotica (3) - 3 freq
scotsed (3) - 1 freq
coalman (3) - 1 freq
scotlang (3) - 5 freq
scotswomen (3) - 1 freq
scotsis (3) - 1 freq
woodsman (3) - 1 freq
Double Levenshtein (combined distance in parentheses)
scotsman (0) - 39 freq
scotsmen (1) - 6 freq
scottsman (2) - 1 freq
scotseen (3) - 2 freq
scotsoun (3) - 18 freq
scotswoman (3) - 1 freq
scotskat (4) - 1 freq
cotman (4) - 1 freq
scotchman (4) - 4 freq
statesman (4) - 3 freq
scotswomen (4) - 1 freq
scotlan (4) - 63 freq
scotstoun (4) - 2 freq
scotsmlk (4) - 6 freq
scotssang (4) - 3 freq
sportsman (4) - 2 freq
scotsla (4) - 1 freq
scotsman's (4) - 1 freq
scotlann (5) - 1 freq
scottm (5) - 1 freq
sportsmen (5) - 3 freq
scots- (5) - 1 freq
thescotsman (5) - 8 freq
salesman (5) - 16 freq
scotspeak (5) - 2 freq
SoundEx code - S325
stickin - 105 freq
steekin - 10 freq
shotgun - 7 freq
scotsoun - 18 freq
stoackins - 1 freq
stigand - 5 freq
scotseen - 2 freq
steggin - 1 freq
scotsman - 39 freq
stockings - 5 freq
switchin - 9 freq
sodie-scone - 1 freq
seetuashun - 1 freq
sitcom - 3 freq
stucken - 5 freq
scotsman's - 1 freq
stocking - 5 freq
'scottish-english' - 1 freq
stukkin - 3 freq
seatchin - 1 freq
stookin - 2 freq
stockins - 18 freq
stagean - 1 freq
scotchman - 4 freq
switchin' - 1 freq
stigmatist - 1 freq
stikken - 3 freq
stecm't - 1 freq
stcem - 1 freq
scutcheons - 1 freq
stegging - 1 freq
scotsmen - 6 freq
stigmata - 1 freq
seducin - 1 freq
stechin - 7 freq
scotchness - 2 freq
stagnant - 2 freq
shot-gun - 1 freq
stuckken - 1 freq
scottishness - 4 freq
stockin - 15 freq
stickan - 4 freq
stackin - 2 freq
sketchin - 2 freq
sitchana - 1 freq
sweet-smellin - 1 freq
stikkin - 2 freq
stachenn - 1 freq
sitchin - 1 freq
stockeens - 2 freq
staagin - 1 freq
stokkins - 1 freq
sketchan - 1 freq
stackan - 2 freq
sweetchan - 2 freq
stagehaunds - 2 freq
swatchin - 1 freq
shotguns - 1 freq
staukin - 1 freq
scotswomen - 1 freq
stigma - 7 freq
setswana - 1 freq
scottish-man - 1 freq
stockans - 1 freq
staignant - 1 freq
stigmatised - 2 freq
stigmatisation - 1 freq
south-central - 1 freq
stukken - 7 freq
stoakins - 1 freq
scutchin - 1 freq
stakin - 1 freq
'scotsoun' - 1 freq
sticking - 8 freq
stagin - 2 freq
scots-medium - 3 freq
stigmas - 1 freq
'staishin' - 1 freq
stookin's - 1 freq
stickiness - 2 freq
side-chaumer - 1 freq
skeitchin - 1 freq
steikin - 1 freq
sweet-scentit - 1 freq
“scots-influenced - 1 freq
scots-influenced - 2 freq
scotsness - 1 freq
switch-ons - 1 freq
stockens - 1 freq
scott-jones - 1 freq
sad-kind - 1 freq
switching - 5 freq
swutchin - 1 freq
shuitgun - 1 freq
stigmatize - 1 freq
“stickin - 1 freq
seth-smith - 1 freq
scotshaunbuik - 1 freq
steckin - 1 freq
scots-inflectit - 1 freq
stockiemuir - 1 freq
stickn - 1 freq
stockin-soles - 1 freq
steekienolan - 1 freq
stjohnstone - 4 freq
sweetconsidine - 1 freq
scottsmackenzie - 4 freq
scotconserv - 1 freq
scottishmam - 1 freq
scotsmlk - 6 freq
stickin' - 1 freq
scotsmerch - 1 freq
scotsonthestreets - 1 freq
scottishindependence - 12 freq
scottsman - 1 freq
'sticking - 1 freq
scotssong - 8 freq
sdkmgoydau - 1 freq
scotscunnered - 1 freq
scotssang - 3 freq
scotswoman - 1 freq
scodgemc - 1 freq
stsimon - 1 freq
scotsmagazine - 2 freq
stockansoatcake - 1 freq
squidgaming - 4 freq
MetaPhone code - SKTSMN
scotsman - 39 freq
scotsmen - 6 freq
scottsman - 1 freq
Manually curated - SCOTSMAN
Time to execute Levenshtein function - 0.469367 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
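
As an illustration of the definition above, here is a minimal Python sketch of the distance calculation. It is not the code this site runs; the function name `levenshtein` and the row-by-row dynamic programming layout are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete a character from a
                current[j - 1] + 1,      # insert a character into a
                previous[j - 1] + cost,  # replace (or keep) a character
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("scotsman", "scotsmen", "scottsman", "sportsman"):
        print(word, levenshtein("scotsman", word))
```

With the query word scotsman, this sketch gives the same distances as the list above for scotsmen (1), scottsman (1) and sportsman (2).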
Time to execute Double Levenshtein function - 0.725265 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
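
The double calculation can be sketched in Python as follows. This is an illustration only: the helper names are hypothetical, and treating exactly a, e, i, o and u as the vowels is an assumption, since the page does not say which letters are stripped.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance, computed one row at a time."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the site's vowel list is not documented


def strip_vowels(word: str) -> str:
    """Remove the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ("scotsmen", "scottsman", "scotchman", "statesman"):
        print(word, double_levenshtein("scotsman", word))
```

Under these assumptions the sketch agrees with the scores listed above: scotsmen scores 1 (only a vowel differs, so the second pass adds nothing), scottsman scores 2, and scotchman and statesman both score 4.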
Time to execute SoundEx function - 0.079821 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
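
A short Python sketch of the standard (American) Soundex rules shows how a code such as S325 is derived. The site's own implementation is not shown on this page, so the function name `soundex` and the handling of non-letters (they are simply dropped) are assumptions.

```python
# Letter groups that share a Soundex digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: apostrophes, hyphens and other non-letters are ignored.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")  # vowels and y have no code
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("scotsman", "stickin", "shotgun"):
        print(word, soundex(word))
```

For scotsman, the leading S is kept, the following c is skipped because it shares the digit 2 with s, and t, s and m supply 3, 2 and 5, which gives S325. The same code results for stickin and shotgun, which is why such unrelated words appear in the SoundEx list.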
Time to execute MetaPhone function - 0.036962 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000758 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
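
The lookup described above can be pictured as a simple dictionary. The sketch below is hypothetical: the table name `LEMMA_TABLE`, the function `lookup_lemma` and the three entries are invented for illustration and are not taken from the site's actual lexicon.

```python
from typing import Optional

# Hypothetical hand-built table: surface form -> lemma.
# These entries are illustrative only, not the site's real data.
LEMMA_TABLE = {
    "scotsman": "SCOTSMAN",
    "scotsmen": "SCOTSMAN",
    "scottsman": "SCOTSMAN",  # treated here as a spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEMMA_TABLE.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Scotsmen"))  # a covered form
    print(lookup_lemma("sonsie"))    # not in this toy table, so None
```

A dictionary lookup takes roughly constant time and involves no string comparison against the rest of the corpus, which is consistent with this method being the fastest of the five timings reported above.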