A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to tf in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
tf (0) - 9 freq
ta (1) - 2534 freq
ff (1) - 7 freq
nf (1) - 2 freq
taf (1) - 1 freq
gf (1) - 5 freq
qf (1) - 3 freq
tl (1) - 9 freq
btf (1) - 8 freq
if (1) - 5948 freq
tfi (1) - 2 freq
tr (1) - 8 freq
ts (1) - 44 freq
th (1) - 2479 freq
tv (1) - 208 freq
tc (1) - 3 freq
jf (1) - 1 freq
yf (1) - 4 freq
df (1) - 2 freq
td (1) - 9 freq
pf (1) - 4 freq
tb (1) - 14 freq
cf (1) - 12 freq
tm (1) - 7 freq
wtf (1) - 14 freq
tf (0) - 9 freq
etf (1) - 1 freq
taf (1) - 1 freq
tfi (1) - 2 freq
tif (1) - 1 freq
vf (2) - 1 freq
tt (2) - 36 freq
to (2) - 4164 freq
t' (2) - 2 freq
wf (2) - 4 freq
tn (2) - 8 freq
hf (2) - 2 freq
te (2) - 1570 freq
tg (2) - 7 freq
tk (2) - 3 freq
ef (2) - 14 freq
f (2) - 187 freq
tp (2) - 5 freq
af (2) - 14 freq
ctf (2) - 1 freq
itfy (2) - 1 freq
lf (2) - 5 freq
itif (2) - 1 freq
tofu (2) - 2 freq
tief (2) - 3 freq
SoundEx code - T100
tap - 773 freq
thief - 66 freq
tapiwa - 19 freq
tap-ee-wah - 1 freq
'thief - 2 freq
'they've - 7 freq
they've - 217 freq
tyaave - 2 freq
tae've - 5 freq
toffee - 28 freq
toap - 44 freq
th've - 1 freq
tf - 9 freq
tv - 208 freq
tie-up - 1 freq
tip - 79 freq
type - 100 freq
tuip - 4 freq
top - 315 freq
they'veee - 1 freq
toffey - 1 freq
tippy - 2 freq
tyauve - 20 freq
tube - 47 freq
toff - 12 freq
t've - 5 freq
tawpie - 1 freq
theif - 4 freq
tub - 22 freq
tubby - 1 freq
taffy - 1 freq
thieve - 3 freq
thay've - 2 freq
'tap' - 1 freq
tup - 6 freq
tubie - 1 freq
tibbie - 10 freq
thiv - 19 freq
tippa - 4 freq
taboo - 4 freq
they'v - 6 freq
tape - 30 freq
thev - 3 freq
tab - 7 freq
taobh - 1 freq
tabie - 1 freq
t'v - 1 freq
thi've - 3 freq
tup- - 1 freq
tippie - 3 freq
tofu - 2 freq
thuv - 5 freq
tovey - 2 freq
tibby - 16 freq
toff' - 1 freq
tif - 1 freq
thay'v - 4 freq
the've - 1 freq
't've - 1 freq
tiff - 1 freq
tb - 14 freq
toby - 7 freq
tabby - 15 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
tabbie - 1 freq
tuba - 1 freq
tove - 3 freq
t'bie - 1 freq
they-ye've - 1 freq
toffy - 2 freq
'tap- - 1 freq
tv' - 1 freq
tiefe - 1 freq
thai've - 2 freq
teeff - 1 freq
thof - 4 freq
tib - 10 freq
toffie - 1 freq
teip - 2 freq
€™tap - 1 freq
taffee - 1 freq
tbe - 1 freq
€˜tip - 1 freq
€¦tap - 11 freq
typhoo - 1 freq
ttip - 2 freq
toyboy - 1 freq
€œthief - 1 freq
€˜tabby - 1 freq
€œtap - 1 freq
teepee - 5 freq
tappy - 1 freq
tubey - 1 freq
toaffay - 1 freq
tdup - 1 freq
tupee - 1 freq
toooooopay - 1 freq
tp - 5 freq
tbh - 35 freq
tfi - 2 freq
tipp - 1 freq
theyÂ’ve - 5 freq
tbw - 1 freq
tpea - 1 freq
tuff - 6 freq
typo - 3 freq
teevee - 1 freq
tvh - 1 freq
taf - 1 freq
tbf - 3 freq
thu've - 4 freq
tyauv - 1 freq
toof - 1 freq
tawbu - 1 freq
'tabu' - 1 freq
tpu - 1 freq
theyve - 4 freq
tuffy - 1 freq
theve - 1 freq
to've - 1 freq
tvah - 1 freq
MetaPhone code - TF
div - 506 freq
tae've - 5 freq
toffee - 28 freq
deafie - 3 freq
dive - 44 freq
tf - 9 freq
tv - 208 freq
dowf - 17 freq
deif - 28 freq
'div - 6 freq
daev - 10 freq
'daev - 2 freq
dowff - 6 freq
douff - 2 freq
deave - 8 freq
defo - 208 freq
toffey - 1 freq
toff - 12 freq
t've - 5 freq
tough - 43 freq
deef - 43 freq
davie - 229 freq
deefie - 3 freq
'tough - 1 freq
duff - 24 freq
taffy - 1 freq
daff - 5 freq
deaf - 27 freq
daffy - 1 freq
defy - 6 freq
dayvee - 1 freq
dav - 1 freq
davee - 1 freq
davey - 13 freq
dif - 2 freq
'davy - 1 freq
'doof - 1 freq
doof' - 1 freq
t'v - 1 freq
dowfie - 1 freq
dovie - 1 freq
dave - 62 freq
doff - 3 freq
divvy - 3 freq
daef - 1 freq
deeve - 5 freq
davy - 70 freq
tofu - 2 freq
duffy - 52 freq
da've - 1 freq
tovey - 2 freq
doof - 1 freq
toff' - 1 freq
tif - 1 freq
diff - 2 freq
dev - 4 freq
hdöv - 1 freq
dove - 7 freq
't've - 1 freq
tiff - 1 freq
tief - 3 freq
teefy - 1 freq
tiffy - 1 freq
daeve - 1 freq
defoe - 3 freq
dewfaa - 1 freq
tove - 3 freq
toffy - 2 freq
tv' - 1 freq
deive - 1 freq
tiefe - 1 freq
teeff - 1 freq
'dave - 1 freq
dovey - 3 freq
daph - 1 freq
toffie - 1 freq
€˜deaf - 4 freq
€˜deif - 2 freq
deifie - 4 freq
daffie- - 1 freq
taffee - 1 freq
typhoo - 1 freq
€œdiva - 1 freq
€œdavie - 3 freq
€œdiv - 11 freq
€œdave - 1 freq
devo - 2 freq
toaffay - 1 freq
teugh - 1 freq
dòigh - 1 freq
divi - 2 freq
diva - 1 freq
def - 4 freq
wtf - 14 freq
”div - 1 freq
tfi - 2 freq
dv - 5 freq
diveÂ’ - 1 freq
dof - 1 freq
deff - 1 freq
dvo - 2 freq
tuff - 6 freq
dfy - 1 freq
teevee - 1 freq
dveo - 1 freq
dvh - 1 freq
'deif - 1 freq
'deif' - 1 freq
tvh - 1 freq
taf - 1 freq
deffo - 3 freq
dvu - 1 freq
toof - 1 freq
df - 2 freq
ytfh - 1 freq
'dove' - 2 freq
'dive' - 2 freq
devi - 1 freq
tuffy - 1 freq
to've - 1 freq
tvah - 1 freq
‘deaf’ - 1 freq
“deaf” - 1 freq
TF
Time to execute Levenshtein function - 0.177886 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.316821 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027455 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037802 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000867 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.