A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to fèisean in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
fèisean (0) - 1 freq
frisian (3) - 29 freq
finishan (3) - 1 freq
fishan (3) - 1 freq
flashan (4) - 2 freq
fishen (4) - 1 freq
ruisan (4) - 1 freq
friesian (4) - 1 freq
fingan (4) - 1 freq
exciseman (4) - 1 freq
arisan (4) - 1 freq
uisein (4) - 9 freq
sean (4) - 16 freq
giean (4) - 4 freq
hissan (4) - 1 freq
ciseau (4) - 1 freq
foreseen (4) - 4 freq
tisen (4) - 1 freq
cailean (4) - 1 freq
missan (4) - 6 freq
fraised (4) - 3 freq
risan (4) - 7 freq
findan (4) - 6 freq
eilean (4) - 1 freq
fittan (4) - 1 freq
fèisean (0) - 1 freq
frisian (5) - 29 freq
fïshin (6) - 1 freq
foreseen (6) - 4 freq
fraesan (6) - 1 freq
fessen (6) - 1 freq
féin (6) - 1 freq
fresian (6) - 4 freq
frisson (6) - 2 freq
foirsein (6) - 1 freq
friesian (6) - 1 freq
fraisin (6) - 1 freq
far-seen (6) - 1 freq
finishan (6) - 1 freq
fishan (6) - 1 freq
feorlean (7) - 3 freq
fallean (7) - 1 freq
freistin (7) - 1 freq
fasson (7) - 6 freq
finishin (7) - 16 freq
frien (7) - 61 freq
finzean (7) - 8 freq
fikeane (7) - 1 freq
pisen (7) - 1 freq
flittan (7) - 5 freq
SoundEx code - F250
fushin - 19 freq
fishin - 90 freq
fashin - 28 freq
face-an - 2 freq
fashion - 71 freq
feezin - 2 freq
fousome - 17 freq
facin - 33 freq
'fuckin - 29 freq
fuckin - 1032 freq
fixin - 17 freq
fykin - 2 freq
face--an - 1 freq
fieechin - 1 freq
fushion - 30 freq
foosion - 1 freq
feign - 3 freq
fusome - 1 freq
fechan - 1 freq
fussin - 6 freq
feishen - 8 freq
fishen - 1 freq
fishin' - 7 freq
f'kn - 2 freq
f'ckin - 1 freq
fuck'n - 1 freq
fook'n - 1 freq
fknnn - 1 freq
fakin - 3 freq
fizzin - 8 freq
fowkin - 1 freq
fukin - 1 freq
fassoun - 3 freq
fackson - 1 freq
fïshin - 1 freq
'fizzin - 1 freq
fusion - 4 freq
feckin - 51 freq
fessen - 1 freq
fisheen - 6 freq
fousum - 1 freq
fashan - 2 freq
fuckan - 7 freq
faggan - 1 freq
fyshin - 4 freq
fishan - 1 freq
fashun - 2 freq
fixin' - 1 freq
facin' - 1 freq
fcnm - 1 freq
fission - 1 freq
feuchin - 1 freq
fasson - 6 freq
fushioun - 3 freq
faushion - 1 freq
€œfuckin - 1 freq
fessin - 1 freq
fèisean - 1 freq
€˜facin - 1 freq
fag-en - 1 freq
€˜fuckin - 16 freq
fecksome - 1 freq
€œfishin - 1 freq
feshin - 2 freq
fauson - 1 freq
fackin - 1 freq
faackin - 2 freq
€™fuckin - 1 freq
feckinÂ’ - 1 freq
foggin - 1 freq
fcxenn - 1 freq
fuckn - 1 freq
fookin - 2 freq
fishinÂ’ - 1 freq
fizzin' - 1 freq
fecken - 3 freq
fuckum - 1 freq
fznoyh - 1 freq
fvckin - 3 freq
fikeane - 1 freq
fjgn - 1 freq
fekkin - 1 freq
fuckinÂ’ - 2 freq
MetaPhone code - FSN
face-an - 2 freq
feezin - 2 freq
facin - 33 freq
face--an - 1 freq
fussin - 6 freq
voicin - 1 freq
fizzin - 8 freq
fassoun - 3 freq
'fizzin - 1 freq
fessen - 1 freq
vissiein - 3 freq
facin' - 1 freq
fission - 1 freq
fasson - 6 freq
fessin - 1 freq
fèisean - 1 freq
€˜facin - 1 freq
fauson - 1 freq
fizzin' - 1 freq
fznoyh - 1 freq
vosene - 1 freq
vsn - 1 freq
FÈISEAN
Time to execute Levenshtein function - 0.220708 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.407933 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028128 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041799 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000878 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.