A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to threepit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
threepit (0) - 5 freq
threapit (1) - 47 freq
threipit (1) - 8 freq
threedit (1) - 3 freq
ereepit (2) - 1 freq
wheepit (2) - 2 freq
dreepit (2) - 7 freq
traepit (2) - 2 freq
theekit (2) - 2 freq
threapin (2) - 20 freq
thraipit (2) - 1 freq
threit (2) - 18 freq
creepit (2) - 42 freq
threipt (2) - 1 freq
threidit (2) - 4 freq
threipin (2) - 10 freq
threep (2) - 2 freq
cheepit (2) - 3 freq
threet (2) - 1 freq
three-fit (2) - 2 freq
throupit (2) - 3 freq
treedit (2) - 1 freq
theikit (3) - 6 freq
screevit (3) - 3 freq
shreddit (3) - 2 freq
threepit (0) - 5 freq
threipit (1) - 8 freq
threapit (1) - 47 freq
throupit (2) - 3 freq
thraipit (2) - 1 freq
threipt (2) - 1 freq
threedit (2) - 3 freq
threidit (3) - 4 freq
threipin (3) - 10 freq
threet (3) - 1 freq
threit (3) - 18 freq
threep (3) - 2 freq
traepit (3) - 2 freq
threapin (3) - 20 freq
thumpit (4) - 2 freq
troopit (4) - 1 freq
thrait (4) - 3 freq
thraet (4) - 2 freq
threttie (4) - 4 freq
therapet (4) - 1 freq
thrivit (4) - 1 freq
threap (4) - 34 freq
threaps (4) - 56 freq
threat (4) - 30 freq
threips (4) - 10 freq
SoundEx code - T613
threipit - 8 freq
tribute - 33 freq
terrifee'd - 3 freq
treibute - 1 freq
terrified - 28 freq
threepit - 5 freq
threapit - 47 freq
thrift - 8 freq
trepidation - 4 freq
trapped - 24 freq
threipt - 1 freq
torpedoed - 4 freq
thereaboots - 15 freq
torpedoes - 3 freq
trooped - 4 freq
tripped - 15 freq
turraveed - 2 freq
tributes - 4 freq
trippt - 1 freq
terrifeet - 2 freq
tarbouton - 1 freq
torebodin - 1 freq
thrived - 7 freq
terrifeed - 10 freq
thrivit - 1 freq
trippit - 4 freq
thereafter - 1 freq
trappet - 2 freq
trappit - 8 freq
thereaboot - 5 freq
torpedeo'd - 1 freq
throupit - 3 freq
trap't - 1 freq
turfed - 7 freq
three-fit - 2 freq
thrifty - 7 freq
tarbet - 1 freq
torpedo - 2 freq
terrafeet - 1 freq
terrifiet - 1 freq
thareaboots - 1 freq
traepit - 2 freq
throu-pittin - 1 freq
thare-about - 2 freq
tripp't - 1 freq
turbot - 2 freq
thriftless - 2 freq
trift's - 1 freq
trippet - 1 freq
thraipit - 1 freq
throu-puttin - 1 freq
trivit - 1 freq
thare-efter - 1 freq
tributary - 1 freq
throu-pit - 2 freq
thereaifter - 1 freq
trift - 1 freq
three-paddit - 1 freq
thaireftir - 1 freq
tripod - 1 freq
€œtrip-trip- - 1 freq
€œtrip-trip - 1 freq
trip-trip - 2 freq
tarbat - 1 freq
torpedos - 1 freq
touraboot - 1 freq
tripadvisor - 1 freq
tryptych - 1 freq
therapeutic - 3 freq
trip-trappin - 1 freq
trip-trapped - 1 freq
troupit - 2 freq
troopit - 1 freq
thereabouts - 1 freq
theraboots - 1 freq
‘tribute’ - 1 freq
turbodeb - 1 freq
troubador - 1 freq
therapet - 1 freq
trftw - 1 freq
tripedog - 9 freq
tripawd - 6 freq
MetaPhone code - 0RPT
threipit - 8 freq
threepit - 5 freq
threapit - 47 freq
threipt - 1 freq
throupit - 3 freq
thraipit - 1 freq
throughpit - 1 freq
throu-pit - 2 freq
therapet - 1 freq
THREEPIT
Time to execute Levenshtein function - 0.268790 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.565094 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.056039 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040281 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000965 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.