spell

The first version of spell(1) came in 1975 and was based on affix and prefix rules, which have not proven to be very accurate. Moreover, faster and better techniques have evolved over the decades. For example edit distance based techniques are capable of figuring out about 90% of spelling errors[1]. For more difficult errors which depend on the context, there are more sophisticated models such as n-gram[2]. The SUS has marked spell(1) as legacy because of its inability to offer language sensitive spell checking [3]. Even though language detection is a hard problem, however, the n-gram based model could be generalized to detect languages as well[4].

This implementation proposes to overcome the above mentioned problems along with the aim of offering a comparison of the trade offs of the modern techniques in terms of their accuracy and simplicity in terms of ease of implementation and maintenance.

References

http://norvig.com/spell-correct.html
Kukich, Techniques for Automatically Correcting Words in Text; ACM Comput. Surv. 1992 (http://www.devl.fr/docs/these/bibli/Kukich1992Techniqueforautomatically.pdf)
http://pubs.opengroup.org/onlinepubs/007908799/xcu/spell.html
Cavnar and Trenkle, N-Gram-Based Text Categorization; In Proceedings of SDAIR-94

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
corpus		corpus
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
dictionary.c		dictionary.c
file.txt		file.txt
libspell.c		libspell.c
libspell.h		libspell.h
makedictionary.sh		makedictionary.sh
spell_hash_test.c		spell_hash_test.c
spell_list_test.c		spell_list_test.c
spellutil.c		spellutil.c
spellutil.h		spellutil.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

corpus

corpus

.gitignore

.gitignore

LICENSE

LICENSE

Makefile

Makefile

README.md

README.md

dictionary.c

dictionary.c

file.txt

file.txt

libspell.c

libspell.c

libspell.h

libspell.h

makedictionary.sh

makedictionary.sh

spell_hash_test.c

spell_hash_test.c

spell_list_test.c

spell_list_test.c

spellutil.c

spellutil.c

spellutil.h

spellutil.h

Repository files navigation

spell

References

About

Releases

Packages

Languages

License

abhinav-upadhyay/spell

Folders and files

Latest commit

History

Repository files navigation

spell

References

About

Resources

License

Stars

Watchers

Forks

Languages