Skip to content

zjucsxxd/carmel

 
 

Repository files navigation

Carmel finite-state toolkit - J. Graehl

(carmel includes EM and gibbs-sampled (pseudo-Bayesian) training)

(see carmel/LICENSE - free for research/non-commercial)

(see carmel/README and carmel/carmel-tutorial).

Download pre-built binaries

TODO

Building from source

ln -sf . graehl;cd carmel; INSTALL_PREFIX=/usr/local make -j 4 install

(prerequisites: C++ compiler and Boost, which you probably already have on your linux system; for Mac, you can get them from Homebrew. Windows: you can use Microsoft's tools or cygwin or mingw.

make options

If your system doesn't support static linking, make NOSTATIC=1

If you're trying to modify or troubleshoot the build, take a look at shared/graehl.mk as well as carmel/Makefile; you shouldn't need to manually run make depend.

Subdirectories

  • carmel: finite state transducer toolkit with EM and gibbs-sampled (pseudo-Bayesian) training

  • forest-em: derivation forests EM and gibbs (dirichlet prior bayesian) training

  • shared: utility C++/Make libraries used by carmel and forest-em

  • gextract: some python bayesian syntax MT rule inference

  • sblm: some simple pcfg (e.g. penn treebank parses, but preferably binarized)

  • clm: some class-based LM feature? I forget.

  • cipher: some word-class discovery and unsupervised decoding of simple probabilistic substitution cipher (uses carmel, but look to the tutorial in carmel/ first)

  • util: misc shell/perl scripts

About

finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 49.1%
  • Forth 19.7%
  • Shell 9.1%
  • Python 7.7%
  • C 5.6%
  • Perl 4.5%
  • Other 4.3%