GitHub - TeluguOCR/antanci_ocr: OCR Segmenter (mostly for Indic languages)

INSTALLATION INSTRUCTIONS

Note: If you are on a 64-bit Linux machine, you can try running this file, which has been pre-built for you following the instructions below.

Install git, g++, png, jpeg, tiff, zlib etc.

sudo apt-get install git g++ libpng-dev libjpeg-dev libtiff-dev libz-dev
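Before moving on, it can help to confirm that the command-line tools ended up on your PATH. The snippet below is only a minimal sketch: `missing_tools` is a hypothetical helper, not part of this project, and it checks for executables only, not for the development libraries.

```shell
# Hypothetical helper (not part of this repository): print every
# command name from the argument list that is not found on PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Prints nothing if both git and g++ are installed.
missing_tools git g++
```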

(Optional) Add SSH keys if you want to develop and contribute.

Clone this project

git clone git@github.com:TeluguOCR/banti_segmenter.git

Get latest version of Eclipse C++. Specific instructions for Ubuntu:
1. Copy extracted directory to /opt
2. Create a shortcut: sudo ln -s -T /opt/eclipse/eclipse /usr/bin/eclipse
3. Add application by creating /usr/share/applications/eclipse.desktop
4. If titles are not showing, specify in the above file: Exec=env UBUNTU_MENUPROXY= eclipse
Run eclipse
Set workspace to the directory you cloned to.
File -> Import -> General -> Existing Projects into Workspace

Specify the directory you cloned to

Check the option Search for Nested Projects
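For steps 3 and 4 of the Ubuntu instructions above, the desktop entry could look roughly like the one this sketch writes. The helper name `write_eclipse_desktop`, the `Comment` text, and the `Icon` path are assumptions made for illustration; only the file location and the `Exec` line come from the steps above. If titles show correctly on your system, a plain `Exec=eclipse` should be enough.

```shell
# Hypothetical helper: write a desktop entry for Eclipse to the path in $1.
# The Icon path is an assumption; adjust it to match your installation.
write_eclipse_desktop() {
    cat > "$1" <<'EOF'
[Desktop Entry]
Type=Application
Name=Eclipse
Comment=Eclipse C/C++ IDE
Exec=env UBUNTU_MENUPROXY= eclipse
Icon=/opt/eclipse/icon.xpm
Terminal=false
Categories=Development;IDE;
EOF
}

# Write to a scratch file first, then install it with root privileges:
#   write_eclipse_desktop /tmp/eclipse.desktop
#   sudo cp /tmp/eclipse.desktop /usr/share/applications/eclipse.desktop
```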
Install Freetype
1. Download latest source
2. Unzip
3. Run the usual
```
./configure
make -j4 
sudo make install
```
Build all three projects: leptonica, gfft, and segmenter. Go to the Project menu and click Build All (Ctrl+B).

Only if you are getting Freetype errors, add the Freetype include paths to the leptonica project:
1. Right-click on the leptonica project and select Properties (Alt+Enter)
2. Go to C/C++ Build -> Settings -> Tool Settings -> GCC Compiler -> Includes and add /usr/local/include and /usr/local/include/freetype2
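If you are unsure which of the two include directories actually holds the Freetype headers on your machine, a quick check along these lines may help. It is a sketch only: `freetype_include_dirs` is a hypothetical helper, and it assumes that the presence of Freetype's top-level header `ft2build.h` is a good enough indicator.

```shell
# Hypothetical helper: print each directory from the argument list that
# contains the Freetype top-level header ft2build.h.
freetype_include_dirs() {
    for dir in "$@"; do
        if [ -f "$dir/ft2build.h" ]; then
            printf '%s\n' "$dir"
        fi
    done
}

# Check the two directories mentioned above.
freetype_include_dirs /usr/local/include /usr/local/include/freetype2
```

Any directory it prints is a candidate for the Includes setting; if it prints nothing, the Freetype install step probably used a different prefix.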

Run segmenter!

<path_to_cloned_directory>/segmenter/Debug/segmenter images/praasa.tif 6 1	
# Run with no arguments to see all the options.

# Increase the stack size if you are getting a segmentation fault.
ulimit -s 1000000
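The two commands above can be combined so that the larger stack limit applies only to the segmenter run and not to the rest of your shell session. This is a sketch under assumptions: `run_segmenter` and the `SEGMENTER` variable are hypothetical names, and the default path assumes you are in the cloned directory.

```shell
# Hypothetical wrapper: raise the stack limit inside a subshell, then run
# the segmenter with the given arguments. SEGMENTER is an assumed variable
# holding the path to the built executable.
run_segmenter() (
    ulimit -s 1000000 2>/dev/null ||
        echo "warning: could not raise the stack limit" >&2
    exec "${SEGMENTER:-./segmenter/Debug/segmenter}" "$@"
)

# Example (after building):
#   run_segmenter images/praasa.tif 6 1
```

Because the function body runs in a subshell, the `ulimit` change is discarded when the segmenter exits.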

If you want to run the classifier, you will need the data files (charcodes.txt, cp.bin, sm.bin, wr.bin) in a directory named data, located in the same folder as the executable. A symbolic link to the directory that holds them is enough.
```
<path_to_git_directory>/segmenter/Debug$ ln -s -T ../../dct_lda/output data
```
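To confirm that the data directory is complete before running the classifier, a check like the following may be useful. It is a minimal sketch: `missing_classifier_data` is a hypothetical helper, and the default directory name `data` is taken from the description above.

```shell
# Hypothetical helper: list the classifier data files named above that are
# missing from the directory given as $1 (default: data).
missing_classifier_data() {
    data_dir=${1:-data}
    for name in charcodes.txt cp.bin sm.bin wr.bin; do
        [ -e "$data_dir/$name" ] || printf '%s\n' "$name"
    done
}

# Example, run from the folder that contains the executable:
#   missing_classifier_data data
```

No output means all four files were found; `-e` follows symbolic links, so a linked data directory is handled the same way as a real one.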
