A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to giro in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
giro (0) - 12 freq
giros (1) - 1 freq
gro (1) - 3 freq
girn (1) - 59 freq
gino (1) - 1 freq
biro (1) - 6 freq
gio (1) - 1 freq
niro (1) - 1 freq
siro (1) - 1 freq
gird (1) - 48 freq
gigo (1) - 1 freq
girs (1) - 4 freq
girl (1) - 73 freq
garo (1) - 1 freq
mirr (2) - 3 freq
giza (2) - 1 freq
iron (2) - 103 freq
imo (2) - 44 freq
irw (2) - 1 freq
gri (2) - 1 freq
airon (2) - 1 freq
gre (2) - 4 freq
fimo (2) - 1 freq
gord (2) - 1 freq
gink (2) - 1 freq
giro (0) - 12 freq
garo (1) - 1 freq
gro (1) - 3 freq
gier (2) - 2 freq
guru (2) - 11 freq
gra (2) - 3 freq
gre (2) - 4 freq
gar (2) - 162 freq
ger (2) - 2 freq
gore (2) - 5 freq
groo (2) - 4 freq
gory (2) - 16 freq
gare (2) - 1 freq
gair (2) - 3 freq
gary (2) - 103 freq
geir (2) - 14 freq
gr (2) - 4 freq
gri (2) - 1 freq
grou (2) - 11 freq
niro (2) - 1 freq
siro (2) - 1 freq
gird (2) - 48 freq
girs (2) - 4 freq
gio (2) - 1 freq
biro (2) - 6 freq
SoundEx code - G600
grey - 335 freq
grew - 264 freq
gear - 228 freq
grow - 149 freq
gree - 102 freq
growe - 108 freq
gyre - 13 freq
gar - 162 freq
gray - 85 freq
'guir - 1 freq
'growe - 1 freq
grue - 55 freq
gore - 5 freq
giro - 12 freq
gaur - 8 freq
gary - 103 freq
geari - 1 freq
gre - 4 freq
gr - 4 freq
groo - 4 freq
geer - 16 freq
gcer - 2 freq
gory - 16 freq
gerry - 13 freq
'grey - 1 freq
gra - 3 freq
graw - 11 freq
guru - 11 freq
g'ower - 1 freq
grou - 11 freq
greh - 1 freq
gowrie - 1 freq
goor - 2 freq
giaur - 1 freq
gurr - 15 freq
garr - 7 freq
gear' - 1 freq
gurrie - 1 freq
gaer - 7 freq
geir - 14 freq
grae - 4 freq
guare - 1 freq
gurhie - 1 freq
grie - 6 freq
grei - 1 freq
gair - 3 freq
'gary - 2 freq
'grow - 1 freq
garry - 10 freq
grouw - 2 freq
graow - 2 freq
greee- - 1 freq
gier - 2 freq
'gree - 2 freq
gro - 3 freq
gare - 1 freq
guerre - 1 freq
grouwe - 1 freq
gearie - 4 freq
geerie - 1 freq
€œguru - 1 freq
gaar - 1 freq
ger - 2 freq
gruw - 3 freq
€˜gray - 1 freq
ger- - 1 freq
€˜grow - 1 freq
€œgary - 1 freq
grye - 1 freq
gie-owre - 1 freq
€˜guru - 1 freq
€˜growe - 1 freq
grey- - 1 freq
grrrrhh - 1 freq
gri - 1 freq
garya - 2 freq
grrrrrr - 1 freq
grrrrrrrr - 1 freq
grrrrrrrrrrrrrrrrrr - 1 freq
grrr - 2 freq
grrrrrrr - 1 freq
garye - 3 freq
ggrr - 1 freq
gkkxr - 1 freq
garo - 1 freq
gjr - 1 freq
grr - 2 freq
MetaPhone code - JR
gear - 228 freq
jar - 41 freq
jaur - 23 freq
jury - 109 freq
gyre - 13 freq
jerry - 15 freq
jr - 16 freq
giro - 12 freq
jure - 1 freq
geari - 1 freq
geer - 16 freq
jeer - 7 freq
gerry - 13 freq
jury- - 2 freq
giaur - 1 freq
gear' - 1 freq
jer - 1 freq
jaurie - 1 freq
geir - 14 freq
gier - 2 freq
jiyro - 1 freq
jura - 3 freq
jerrie - 5 freq
jour - 1 freq
gearie - 4 freq
jieir - 1 freq
geerie - 1 freq
ger - 2 freq
jirr - 1 freq
ger- - 1 freq
jrr - 1 freq
gie-owre - 1 freq
jerah - 3 freq
jru - 1 freq
jjrea - 1 freq
wwjr - 1 freq
GIRO
Time to execute Levenshtein function - 0.198400 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341500 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027237 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036258 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000850 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.