Ringeader (Arabic Sequential Text Reading)

Software to analyze video of scanned printed text, running OCR and turning it into speech.

Prerequisites

CMake: http://www.cmake.org/
Qt 4.7+ : http://qt-project.org/
OpenCV 2.4.x: http://www.opencv.org/
Tesseract OCR: https://code.google.com/p/tesseract-ocr/
Flite text-to-speech engine: http://www.speech.cs.cmu.edu/flite/
Boost (just the System component): http://www.boost.org/

Everything needs to be compiled for the system of your choice. We've successfully built for Windows (via MinGW) and Mac OS 10.7

Building

In high-level, just run CMake and then build

mkdir build ; cd build
cmake ..
make

On Windows we've used MinGW (MSYS build system), built Tessract, Flite and Boost from source, OpenCV and Qt have prebuilt binaries for MinGW. We reccommend using Eclipse for building, by setting up CMake to create an Eclipse/MinGW project.

Building Tesseract on MinGW: http://www.sk-spell.sk.cx/compiling-leptonica-and-tesseract-ocr-with-mingwmsys Building Flite on MinGW is straightforward: http://www.speech.cs.cmu.edu/flite/doc/flite_4.html#SEC4 When building boost, remember to only build System else the build takes forever. Run boostrap script and then use bjam to build System.

Running

The Qt UI should be quite self explanatory. You load a video or connect to a camera (OpenCV does that), and wait for the text to be read out loud. The video should be of a close-up of scanning along a printed text line with the finger. It supports only English` (left-to-right)

To get the big picture: Ringeader Graduation Book https://drive.google.com/file/d/1C49qUUuoit7WyBqkdrXy8NRWsPiYrB_4/view?usp=sharing

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
QExtSerialPort-1.2rc		QExtSerialPort-1.2rc
.gitignore		.gitignore
159158__daenn__din-ding.wav		159158__daenn__din-ding.wav
80360__hyderpotter__nu-tone.wav		80360__hyderpotter__nu-tone.wav
91926__corsica-s__ding.wav		91926__corsica-s__ding.wav
91926__corsica-s__ding_short.wav		91926__corsica-s__ding_short.wav
AbstractAlgorithm.h		AbstractAlgorithm.h
ArduinoDriver.cpp		ArduinoDriver.cpp
ArduinoDriver.h		ArduinoDriver.h
CMakeLists.txt		CMakeLists.txt
EspeakBridge.cpp		EspeakBridge.cpp
EspeakBridge.h		EspeakBridge.h
EspeakTTSWorker.cpp		EspeakTTSWorker.cpp
EspeakTTSWorker.h		EspeakTTSWorker.h
FingertipDetector.cpp		FingertipDetector.cpp
FingertipDetector.h		FingertipDetector.h
MathGLTools.h		MathGLTools.h
OpenCVCameraThread.cpp		OpenCVCameraThread.cpp
OpenCVCameraThread.h		OpenCVCameraThread.h
QTSequentialTextReader.h		QTSequentialTextReader.h
README.md		README.md
SeqentialTextReader.cpp		SeqentialTextReader.cpp
SeqentialTextReader.h		SeqentialTextReader.h
TesseractBridge.cpp		TesseractBridge.cpp
TesseractBridge.h		TesseractBridge.h
Viewcontroller.cpp		Viewcontroller.cpp
Viewcontroller.h		Viewcontroller.h
ViewerInterface.cpp		ViewerInterface.cpp
alphanum.hpp		alphanum.hpp
control.ui		control.ui
filter.cpp		filter.cpp
filter.h		filter.h
std.cpp		std.cpp
std.h		std.h
tone-lowpitch.wav		tone-lowpitch.wav

FEE-MNF/Ringeader

Folders and files

Latest commit

History

Repository files navigation

Ringeader (Arabic Sequential Text Reading)

Prerequisites

Building

Running

About

Resources

Stars

Watchers

Forks

Languages