A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to shorthaund in Corpus

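Results are listed by method in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. Numbers in brackets are distances from shorthaund.
Levenshtein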
shorthaund (0) - 1 freq
shorthaand (1) - 1 freq
shorthorns (3) - 1 freq
sportsound (3) - 1 freq
hinthaund (3) - 1 freq
aforehaund (3) - 7 freq
shaetland (4) - 2 freq
secondhaund (4) - 1 freq
sportan (4) - 3 freq
neirhaund (4) - 1 freq
shortened (4) - 2 freq
lefthaund (4) - 1 freq
soothland (4) - 2 freq
snortan (4) - 2 freq
shoutan (4) - 5 freq
shootan (4) - 5 freq
shotguns (4) - 1 freq
hertland (4) - 3 freq
dortmund (4) - 1 freq
coohaund (4) - 1 freq
aforehaand (4) - 2 freq
scotland (4) - 2210 freq
shetland (4) - 288 freq
shortage (4) - 6 freq
northsound (4) - 2 freq
Double Levenshtein
shorthaund (0) - 1 freq
shorthaand (1) - 1 freq
shortened (5) - 2 freq
hinthaund (5) - 1 freq
shorthorns (5) - 1 freq
sportsound (5) - 1 freq
shortlins (6) - 1 freq
shortbread (6) - 6 freq
shetland (6) - 288 freq
shortenin (6) - 2 freq
shortening (6) - 1 freq
hertland (6) - 3 freq
shortbreed (6) - 3 freq
shortbreid (6) - 12 freq
short-haired (6) - 1 freq
shatland (6) - 1 freq
shaetland (6) - 2 freq
aforehaund (6) - 7 freq
shithered (7) - 2 freq
sortan (7) - 1 freq
scorland (7) - 1 freq
shortages (7) - 1 freq
scotand (7) - 2 freq
shotgun (7) - 7 freq
shortcut (7) - 4 freq
SoundEx code - S635
scartin - 18 freq
skirtin - 12 freq
scrattin - 10 freq
seartin - 1 freq
scriddans - 2 freq
skyrie-tonguit - 1 freq
serten - 4 freq
sorten - 2 freq
scrooteneer's - 4 freq
scrootenized - 1 freq
scrootinizing - 1 freq
scratten - 1 freq
seratonin - 1 freq
squirtin - 4 freq
sardinia - 3 freq
scrutiny - 5 freq
sortin - 24 freq
sweertness - 1 freq
soartin - 5 freq
sartin - 6 freq
sheridan's - 2 freq
sheridan - 5 freq
sardinian - 3 freq
scrittin - 1 freq
scrutinised - 1 freq
sortan - 1 freq
skartin - 4 freq
sardines - 3 freq
skirteen - 2 freq
shreddan - 1 freq
skirteens - 1 freq
shoartens - 1 freq
scartins - 2 freq
shortened - 2 freq
shorthaand - 1 freq
sardane - 1 freq
shortenin - 2 freq
skrattin - 1 freq
sword-dauncin - 1 freq
scartin' - 1 freq
skirtins - 1 freq
shortening - 1 freq
sardine - 1 freq
scrattins - 1 freq
schrödinger - 2 freq
soartins - 2 freq
shoardin - 1 freq
shortness - 2 freq
sorting - 2 freq
shreadin' - 1 freq
scrutinise - 1 freq
shoartenin - 2 freq
sardonic - 1 freq
sardonicism - 1 freq
“scartin - 1 freq
shorthaund - 1 freq
saortony - 1 freq
schrodinger's - 1 freq
MetaPhone code - XR0NT
shorthaand - 1 freq
shorthaund - 1 freq
Manually curated - SHORTHAUND
Time to execute Levenshtein function - 0.382983 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
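As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen only for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the prefix of a seen so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("shorthaund", "shorthaand"))  # 1
print(levenshtein("shorthaund", "shorthorns"))  # 3
```

Both values agree with the distances shown in the Levenshtein list for shorthaand and shorthorns.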
Time to execute Double Levenshtein function - 0.731427 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
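The double-weighting idea can be sketched in Python as follows. This is a reconstruction from the description, not the site's own code. The vowel set (a, e, i, o, u) and the names `levenshtein`, `strip_vowels` and `double_levenshtein` are assumptions made for this example.

```python
VOWELS = set("aeiou")  # assumed vowel set; the site may use a different one


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping consonants in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


print(double_levenshtein("shorthaund", "shorthaand"))  # 1
print(double_levenshtein("shorthaund", "shortened"))   # 5
```

Under these assumptions the sketch gives the same scores as the Double Levenshtein list for shorthaand (1) and shortened (5). A vowel-only difference such as haund/haand costs nothing in the second pass, which is why spelling variants that differ only in vowels stay near the top.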
Time to execute SoundEx function - 0.075279 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
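To show how a code such as S635 arises, here is a minimal Python sketch of the commonly described American Soundex rules. It is an illustration only. The site's own implementation may differ in edge cases, and skipping non-ASCII letters and punctuation is an assumption of this sketch.

```python
def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: anything outside a-z (hyphens, apostrophes, accented
    # letters) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeated code counts again
    return (result + "000")[:4]  # pad with zeros or cut to four characters


print(soundex("shorthaund"))  # S635
print(soundex("scartin"))     # S635
print(soundex("sardinia"))    # S635
```

Only the first three coded consonants after the initial letter survive, which explains why words as different as sardinia and shorthaund share one code.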
Time to execute MetaPhone function - 0.090427 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000884 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
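A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is hypothetical: the table contents, the name `LEXICON`, and the helpers `lemma_for` and `variants_of` are invented for illustration, and the real lexicon's format is not shown on this page.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping a spelling to its lemma.
# The entries are illustrative assumptions, not the site's actual data.
LEXICON = {
    "shorthaund": "SHORTHAUND",
    "shorthaand": "SHORTHAUND",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> list[str]:
    """List every spelling that shares the given word's lemma."""
    lemma = lemma_for(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEXICON.items() if target == lemma)


print(lemma_for("shorthaund"))    # SHORTHAUND
print(variants_of("shorthaand"))  # ['shorthaand', 'shorthaund']
print(lemma_for("ablow"))         # None
```

A dictionary lookup takes constant time on average, which fits with this method having by far the smallest execution time above. The trade-off is coverage: a word absent from the table returns nothing.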