TeluguOCR/antanci_ocr

INSTALLATION INSTRUCTIONS

Note: If you are on a 64-bit Linux machine, you can try running this file, which has been pre-built for you following the instructions below.

  1. Install git, g++, and the development libraries for png, jpeg, tiff and zlib.

    sudo apt-get install git g++ libpng-dev libjpeg-dev libtiff-dev libz-dev
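After the install, a quick check that the command-line tools are on the PATH can save a failed build later. The following is a minimal sketch; the function name `missing_tools` is hypothetical, and it checks only executables, not the `-dev` libraries.

```shell
# Print the name of every command that is not found on PATH.
# Prints nothing when all of the given tools are available.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

missing_tools git g++
```

If the last line prints nothing, both git and g++ were found.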
  2. (Optional) Add SSH keys if you want to develop and contribute.

  3. Clone this project

    git clone git@github.com:TeluguOCR/banti_segmenter.git
  4. Get the latest version of Eclipse for C/C++. Specific instructions for Ubuntu:

    1. Copy extracted directory to /opt
    2. Create a shortcut: sudo ln -s -T /opt/eclipse/eclipse /usr/bin/eclipse
    3. Add application by creating /usr/share/applications/eclipse.desktop
    4. If titles are not showing, specify the following in the above file: Exec=env UBUNTU_MENUPROXY= eclipse
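The contents of eclipse.desktop are not spelled out above. As a rough sketch, a minimal desktop entry could be generated as below. The `Name`, `Icon` and `Categories` values are assumptions (in particular, the icon path depends on the Eclipse archive you extracted), and the function name `eclipse_desktop_entry` is hypothetical.

```shell
# Write a minimal desktop entry for Eclipse to standard output.
# Name, Icon and Categories are assumed values; adjust them as needed.
eclipse_desktop_entry() {
    cat <<'EOF'
[Desktop Entry]
Type=Application
Name=Eclipse
Exec=eclipse
Icon=/opt/eclipse/icon.xpm
Terminal=false
Categories=Development;IDE;
EOF
}

# Usage (needs root):
#   eclipse_desktop_entry | sudo tee /usr/share/applications/eclipse.desktop
```

If the Exec line needs the UBUNTU_MENUPROXY workaround, edit it in the generated file.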
  5. Run eclipse

  6. Set workspace to the directory you cloned to.

  7. File -> Import -> General -> Existing Projects into Workspace

    Specify the directory you cloned to

    Check the option Search for Nested Projects

  8. Install Freetype

    1. Download latest source
    2. Unzip
    3. Run the usual
    ./configure
    make -j4 
    sudo make install
  9. Build all three projects: leptonica, gfft and segmenter. Go to the Project menu and click Build All (Ctrl B).

    Only if you are getting Freetype errors, add the Freetype include paths to the leptonica project:

    1. Right click on the leptonica project and select properties (Alt Enter)
    2. Go to C/C++ Build -> Settings -> Tool Settings -> GCC Compiler -> Includes and add /usr/local/include and /usr/local/include/freetype2
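Before editing the include paths, it may help to confirm where the Freetype headers actually landed. The sketch below looks for `ft2build.h`, Freetype's top-level header, under an install prefix. The function name `freetype_include_dir` is hypothetical, and `/usr/local` is assumed as the default prefix because that is where `sudo make install` normally puts files.

```shell
# Print the directory containing ft2build.h under the given install prefix.
# Returns non-zero if the header is not found.
freetype_include_dir() {
    prefix=${1:-/usr/local}
    header=$(find "$prefix/include" -name ft2build.h 2>/dev/null | head -n 1)
    [ -n "$header" ] && dirname "$header"
}

freetype_include_dir /usr/local || echo "ft2build.h not found under /usr/local/include"
```

The directory it prints is the one the compiler needs in its include list.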
  10. Run segmenter!

    <path_to_cloned_directory>/segmenter/Debug/segmenter images/praasa.tif 6 1	
    # Run with no arguments to see all the options.
    
    # Increase the stack size if you are getting a segmentation fault.
    ulimit -s 1000000
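Running `ulimit -s` directly changes the limit for the rest of the shell session. As an alternative sketch, assuming bash (where `ulimit -s` takes a size in units of 1024 bytes), the limit can be raised in a subshell for a single command. The function name `with_stack_limit` is hypothetical.

```shell
# Run a command with a different soft stack limit. The subshell keeps
# the limit of the calling shell unchanged.
with_stack_limit() {
    limit_kb=$1
    shift
    (
        ulimit -S -s "$limit_kb" || exit 1
        "$@"
    )
}

# Example: roughly 1 GB of stack, the same value as in the command above.
#   with_stack_limit 1000000 <path_to_cloned_directory>/segmenter/Debug/segmenter images/praasa.tif 6 1
```

Raising the soft limit only succeeds up to the hard limit; if `ulimit` fails, the command is not run.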
  11. If you want to run the classifier, the data files (charcodes.txt, cp.bin, sm.bin, wr.bin) must be in a directory named data in the same folder as the executable. A symbolic link works:

    <path_to_git_directory>/segmenter/Debug$ ln -s -T ../../dct_lda/output data
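To confirm that the link points at a complete data set, a small check like the following can be run from the folder containing the executable. This is a sketch only; the file names are the four listed in the step above, and the function name `missing_data_files` is hypothetical.

```shell
# Print the name of each required data file that is missing from the
# given directory (default: ./data). Prints nothing if all are present.
missing_data_files() {
    dir=${1:-data}
    for f in charcodes.txt cp.bin sm.bin wr.bin; do
        [ -e "$dir/$f" ] || printf '%s\n' "$f"
    done
}

missing_data_files data
```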

About

OCR Segmenter (mostly for Indic languages)
