ocrodjvu (0.4.6) UNRELEASED; urgency=low * -- Jakub Wilk Mon, 24 May 2010 21:52:18 +0200 ocrodjvu (0.4.5) unstable; urgency=low * djvu2hocr: add ocr-system and ocr-capabilities meta information. * Fix handling of ‘deu’ and ‘rus-eng’ languages. Thanks to Kyrill Detinov for the bug report. * Properly handle hOCR with inline formatting. Thanks to Kyrill Detinov for the bug report. -- Jakub Wilk Mon, 24 May 2010 21:22:39 +0200 ocrodjvu (0.4.4) unstable; urgency=low * Document that ocrodjvu honours TMPDIR environment variable. http://bugs.debian.org/575488 * Don't remove temporary directory if ocrodjvu crashed. http://bugs.debian.org/575487 -- Jakub Wilk Fri, 02 Apr 2010 12:00:11 +0200 ocrodjvu (0.4.3) unstable; urgency=low * Don't crash on --version. http://bugs.debian.org/573496 * Give more meaningful error messages on a malformed hOCR produced by Cuneiform. http://bugs.debian.org/572522 * Document how djvu2hocr deals with non-XML characters. -- Jakub Wilk Fri, 19 Mar 2010 01:22:54 +0100 ocrodjvu (0.4.2) unstable; urgency=low * New options for ocrodjvu: + --render=mask, + --render=foreground, + --render=all. http://bugs.debian.org/572081 * Fix off-by-one error in text area coordinates. * Add support for Cuneiform 0.9. -- Jakub Wilk Wed, 03 Mar 2010 21:27:15 +0100 ocrodjvu (0.4.1) unstable; urgency=low * Be stricter when reading hOCR produced by OCRopus 0.3.1. -- Jakub Wilk Fri, 22 Jan 2010 20:25:54 +0100 ocrodjvu (0.4.0) unstable; urgency=low * Add support for the Cuneiform OCR engine. New options for ocrodjvu: + --engine, + --list-engines. * Don't crash on non-ASCII file names. Thanks to Jean-Christophe Heger for the bug report. * hocr2djvused: add the --page-size option. * ocrodjvu: add the -j/--jobs option. -- Jakub Wilk Thu, 21 Jan 2010 23:41:37 +0100 ocrodjvu (0.3.2) unstable; urgency=low * Accept negative numbers in hOCR bounding boxes. * djvu2hocr: fix broken UAX #29 segmentation. * djvu2hocr: provide correct page bounding boxes. -- Jakub Wilk Fri, 08 Jan 2010 17:46:51 +0100 ocrodjvu (0.3.1) unstable; urgency=low * djvu2hocr: fix broken UAX #29 segmentation. -- Jakub Wilk Sun, 03 Jan 2010 12:56:08 +0100 ocrodjvu (0.3.0) unstable; urgency=low * Python ≥ 2.5 is now required. * argparse module in now required. * Add support for OCRopus 0.3.1. * Give better error messages when Tesseract language pack cannot be found. * New options for ocrodjvu: + -t/--details; + --word-segmentation. * New options for hocr2djvused: + --rotation, + -t/--details, + --word-segmentation, * New tool: djvu2hocr. -- Jakub Wilk Wed, 16 Dec 2009 18:42:21 +0100 ocrodjvu (0.2.1) unstable; urgency=low * Give a clearer error message if OCRopus were interrupted by a signal. * Add the --language option. * Add the --list-languages options. -- Jakub Wilk Sat, 17 Oct 2009 17:34:43 +0200 ocrodjvu (0.2.0) unstable; urgency=low * Provide a manual page. * Add the -D/--debug option. * Add options to specify how results are stored: + -o/--save-bundled, + -i/--save-indirect, + --save-script, + --in-place, + --dry-run. * Add the --clear-text option. * Add the --ocr-only option. * Please use the --in-place and --clear-text options to retain compatibility with ocrodjvu < 0.2. -- Jakub Wilk Wed, 14 Oct 2009 20:53:48 +0200 ocrodjvu (0.1.3) unstable; urgency=low * Use ocroscript, rather than ocrocmd. -- Jakub Wilk Sun, 15 Mar 2009 19:01:11 +0100 ocrodjvu (0.1.2) unstable; urgency=low * Make hocr2djvused work with hOCR for multiple pages. * Handle rotated pages correctly. * Ignore IW44-only pages. -- Jakub Wilk Mon, 23 Jun 2008 20:14:42 +0200 ocrodjvu (0.1.1) unstable; urgency=low * Depend on python-lxml. * Better compatibility with Python 2.4. -- Jakub Wilk Wed, 14 May 2008 11:23:13 +0200 ocrodjvu (0.1) unstable; urgency=low * Initial release. -- Jakub Wilk Wed, 07 May 2008 18:29:40 +0200