A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to xiang in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
xiang (0) - 4 freq
kiang (1) - 1 freq
iang (1) - 1 freq
iano (2) - 1 freq
hinng (2) - 1 freq
€™ang (2) - 2 freq
brang (2) - 2 freq
sing (2) - 350 freq
tang (2) - 50 freq
amang (2) - 699 freq
'ian' (2) - 1 freq
dinng (2) - 1 freq
bilang (2) - 7 freq
jang (2) - 3 freq
sang (2) - 623 freq
crang (2) - 2 freq
ding (2) - 87 freq
twang (2) - 13 freq
swang (2) - 5 freq
fing (2) - 25 freq
king (2) - 827 freq
mang (2) - 17 freq
wing (2) - 38 freq
iand (2) - 1 freq
'lang (2) - 5 freq
xiang (0) - 4 freq
iang (2) - 1 freq
kiang (2) - 1 freq
ling (3) - 13 freq
hing (3) - 604 freq
euang (3) - 1 freq
ing (3) - 10 freq
bing (3) - 38 freq
loang (3) - 3 freq
laang (3) - 3 freq
ang (3) - 8 freq
geang (3) - 3 freq
gyang (3) - 177 freq
fang (3) - 12 freq
lang (3) - 3250 freq
alang (3) - 1171 freq
bang (3) - 100 freq
rang (3) - 83 freq
asang (3) - 1 freq
jing (3) - 11 freq
gang (3) - 1111 freq
ping (3) - 12 freq
kang (3) - 1 freq
ting (3) - 49 freq
huang (3) - 2 freq
SoundEx code - X520
xiang - 4 freq
xmas - 48 freq
xxmwj - 1 freq
xmos - 1 freq
xnax - 1 freq
xomwag - 1 freq
xxxmissy - 1 freq
xnx - 1 freq
xmmx - 1 freq
MetaPhone code - SNK
sink - 100 freq
sang - 623 freq
sing - 350 freq
sung - 68 freq
snake - 75 freq
sank - 35 freq
seeing - 60 freq
song - 144 freq
snaik - 2 freq
sync - 6 freq
'sneaky - 1 freq
sneck - 42 freq
sunk - 26 freq
sonic - 6 freq
sneak - 23 freq
snack - 7 freq
snook - 2 freq
snoke - 9 freq
snowk - 6 freq
snug - 30 freq
snog - 7 freq
sneaky - 9 freq
sang' - 3 freq
snag - 1 freq
zoink - 16 freq
seink - 1 freq
snuck - 3 freq
zinc - 5 freq
snek - 11 freq
senga - 35 freq
zing - 3 freq
sincé - 1 freq
sing' - 2 freq
cynic - 3 freq
song' - 1 freq
sankey - 1 freq
sonk - 1 freq
snackie - 1 freq
sneuk - 2 freq
soang - 1 freq
'seeing - 1 freq
'sink' - 1 freq
sneug - 1 freq
xiang - 4 freq
sneg - 1 freq
zink - 1 freq
€˜sang - 2 freq
€œsenga - 2 freq
€œsengaaa - 1 freq
€˜sneak - 1 freq
cinq - 1 freq
sinky - 1 freq
sneckie - 3 freq
snakey - 1 freq
zunc - 1 freq
sunak - 6 freq
snc - 1 freq
sneeky - 1 freq
XIANG
Time to execute Levenshtein function - 0.248331 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.579220 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.065211 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036893 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.004845 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.