A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to samaritans in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein - distance in parentheses
samaritans (0) - 4 freq
samaritan (1) - 8 freq
samarites (2) - 1 freq
samericans (2) - 1 freq
jamaicans (3) - 5 freq
samaria (3) - 7 freq
americans (3) - 16 freq
spartans (3) - 5 freq
samarkand (3) - 1 freq
samarkan (3) - 9 freq
africans (4) - 1 freq
salaries (4) - 3 freq
smarties (4) - 6 freq
tartans (4) - 4 freq
scartins (4) - 2 freq
barbarians (4) - 2 freq
maitand (4) - 1 freq
samarkand' (4) - 1 freq
shoartens (4) - 1 freq
marias (4) - 1 freq
sandisans (4) - 1 freq
american (4) - 90 freq
tamara's (4) - 6 freq
soartins (4) - 2 freq
startan (4) - 6 freq
Double Levenshtein - distance in parentheses
samaritans (0) - 4 freq
samaritan (2) - 8 freq
samericans (3) - 1 freq
samarites (3) - 1 freq
spartans (4) - 5 freq
shoartens (5) - 1 freq
scartins (5) - 2 freq
soartins (5) - 2 freq
smarten (5) - 1 freq
smarts (5) - 1 freq
smarties (5) - 6 freq
martens (5) - 4 freq
martins (5) - 2 freq
americans (5) - 16 freq
samarkand (5) - 1 freq
samarkan (5) - 9 freq
smerten (6) - 1 freq
smoorikins (6) - 2 freq
jamaicans (6) - 5 freq
skirtins (6) - 1 freq
ambitons (6) - 1 freq
partans (6) - 5 freq
americanos (6) - 1 freq
smoorikens (6) - 2 freq
tartans (6) - 4 freq
SoundEx code - S563
snorted - 7 freq
smeerit - 1 freq
smoored - 32 freq
skimmered - 2 freq
smarten - 1 freq
scunnert - 78 freq
snortin - 13 freq
smert - 44 freq
snorts - 5 freq
simmertide - 1 freq
smart - 36 freq
smuired - 13 freq
smerter - 5 freq
smairtly - 3 freq
smartest - 4 freq
smertly - 24 freq
smert-like - 3 freq
samaritans - 4 freq
smourit - 4 freq
smert-lyke - 1 freq
smoort - 8 freq
snort - 8 freq
snortit - 16 freq
scunnered - 78 freq
smuirt - 5 freq
smertest - 4 freq
sinnahard - 1 freq
snaw-wraiths - 2 freq
sneered - 10 freq
summerdale - 3 freq
smored - 4 freq
smairt - 26 freq
snored - 7 freq
smoorit - 4 freq
smearit - 1 freq
smuirit - 3 freq
smartie's - 1 freq
snared - 5 freq
smartly - 5 freq
snortan - 2 freq
smirtlin - 2 freq
snirtled - 1 freq
snortled - 1 freq
snirt - 3 freq
scunnért - 5 freq
sinnart - 1 freq
scunner't - 2 freq
simmerd - 1 freq
seunnert - 2 freq
snoartin - 1 freq
snortet - 1 freq
smarties - 6 freq
smert-erses - 1 freq
samaritan - 8 freq
scunnèrt - 1 freq
snor't - 1 freq
smart-arsed - 1 freq
smairter - 1 freq
sneert - 2 freq
smerts - 1 freq
smeared - 6 freq
scunneration - 5 freq
shimmered - 2 freq
seniority - 1 freq
smirtle - 3 freq
simmirdim - 1 freq
snirtle - 3 freq
snirtit - 2 freq
simmertime - 1 freq
smoord - 1 freq
swineherd - 1 freq
snirtl't - 1 freq
snirtlin - 1 freq
samarites - 1 freq
smoorit's - 1 freq
samhradh - 2 freq
smoured - 4 freq
smour't - 1 freq
skunnert - 1 freq
simmer-time - 1 freq
smairten - 2 freq
sconnered - 1 freq
smairtlie - 1 freq
smert's - 1 freq
sinnert - 1 freq
smirtles - 1 freq
smartboards - 1 freq
summertime - 1 freq
smartish - 2 freq
someroad - 1 freq
smart-arse - 1 freq
smirdit - 1 freq
smirtil - 1 freq
smerten - 1 freq
snirtlet - 3 freq
smearedt - 1 freq
smertlike - 1 freq
scunnert- - 1 freq
‘scunnert - 3 freq
‘scunnered - 1 freq
‘scunneration - 1 freq
smartie - 1 freq
snurtysleeves - 24 freq
snurtyleeves - 1 freq
snurty - 1 freq
‘smart - 1 freq
smart-boards - 1 freq
skimmert - 1 freq
simmerdim - 3 freq
smart-chops - 1 freq
smarts - 1 freq
smertened - 1 freq
smaert - 1 freq
seanmorod - 1 freq
scunnerred - 1 freq
smartenergygb - 1 freq
smartphone - 1 freq
snurts - 1 freq
smarty - 1 freq
smarriott - 1 freq
smartwatch - 1 freq
scunnerd - 5 freq
MetaPhone code - SMRTNS
samaritans - 4 freq
Manually curated - SAMARITANS
Time to execute Levenshtein function - 0.207170 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
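As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration, and every edit is assumed to cost 1.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character inserts, deletes or replacements turning a into b."""
    # previous[j] = distance between the processed prefix of a and the first j chars of b
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


print(levenshtein("samaritans", "samaritan"))   # 1
print(levenshtein("samaritans", "samericans"))  # 2
print(levenshtein("samaritans", "spartans"))    # 3
```

The three values agree with the distances listed for those words in the Levenshtein results above.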
Time to execute Double Levenshtein function - 0.375268 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
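The idea can be sketched in Python as follows. This is a guess at the approach rather than the site's own code: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set is assumed to be a, e, i, o, u only.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: each insert, delete or replace costs 1."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("samaritans", "samaritan"))   # 2
print(double_levenshtein("samaritans", "samericans"))  # 3
print(double_levenshtein("samaritans", "spartans"))    # 4
```

Under these assumptions the values match the Double Levenshtein results listed above: a consonant change is counted in both passes, while a vowel change is counted only once.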
Time to execute SoundEx function - 0.029713 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
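For readers who want to see how such a code is built, here is a minimal Python sketch of the common American Soundex rules. It is an illustration, not the function this site calls; the name `soundex` and the handling of hyphens and accented letters (ignored and treated like vowels, respectively) are assumptions made here.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """First letter plus up to three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if ch.isalpha()]  # assumption: skip hyphens etc.
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after it is kept
    return result.ljust(4, "0")


print(soundex("samaritans"))  # S563
print(soundex("scunnert"))    # S563
print(soundex("smart"))       # S563
```

All three words reduce to S563, which is why they appear together in the SoundEx results above even though they are unrelated in meaning.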
Time to execute MetaPhone function - 0.045189 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000817 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
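A lookup of this kind might be sketched in Python as below. The table name `LEMMA_TABLE` and the function `lookup_lemma` are hypothetical, the single entry is taken from the result shown above, and it is an assumption that the upper-case form is the lemma.

```python
from typing import Optional

# Hypothetical hand-built table: word form -> lemma.
# Only one entry is shown; the real lexicon is not reproduced here.
LEMMA_TABLE = {
    "samaritans": "SAMARITANS",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEMMA_TABLE.get(word.strip().lower())


print(lookup_lemma("Samaritans"))  # SAMARITANS
print(lookup_lemma("ablow"))       # None (not in this one-entry table)
```

A plain dictionary lookup takes constant time, which fits with this method having the shortest execution time reported above.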