http://sourceforge.net/projects/ibsuite/
IBSUITE
ibsuite stands for image book suite. It contains a set of tools to convert ebook in various format (pdf, chm, html) into a set of images, reformat the images (crop, embold, divide, etc), and assemble the result images into a new ebook.
COMPONENTS
- ibhtml2img: convert html to image with xulrunner
- ibhtml2pdf: convert html to pdf with xulrunner
- iblineparser: parser input image, extract line information
- ibpdfinfo: get some meta-information from pdf file, such as title, author, table of contents etc.
- ibpy: python module, which is the driver of the whole system, it uses above programs to convert input file to image, extract line information from image, dilate image, and re-assemble lines into a new image, generate output e-book.
- ibtools: A set of utilities and tools, some of them are used internal by ibsuite, others are user command provided by ibsuite.
USAGE
The most important command of ibsuite is ibreformat, the basic usage is as follow:
ibreformat [options] <input file>
In most cases, something like following:
ibreformat -o <output file> <oprof>
--pprof=<pprof>
Where <input file> is input file name, <output file><iprof><pprof>
<input file>
For chm file on PRS505:
ibchm2imb
When it finishs,
ibsuite -o
INSTALL
Currently, only Linux is supported, but I think it may work on some unix enviroment (including cygwin on Windows) after some work. Currently only install from source code is supported.
Pre-requirement:
- gcc, g++, bash, make
- libfontconfig-dev, libnetpbm-dev, libgtk-dev
- python, python-imaging
- imagemagick
- for HTML/CHM support: python-chm, xulrunner
- for scanned book: unpaper
Build:
./configure [--prefix=
make [PREFIX=
Install:
# become root
make install [PREFIX=
1 comment:
Hi there. I just purchased a Sony PRS 300 and am looking for a way to put scientific papers (that contain math formulas and diagrams) on this device, and have them display reasonably well.
I downloaded and compiled your code from the git repository, but I'm not sure if and how it can be used to do achieve what I am trying to do.
In a nutshell, I imagine taking a document, such as:
https://netfiles.uiuc.edu/goldwas1/shared/publications/NAACL10.pdf
and sliding a "window" (whose width is roughly the width of a single text column) from the top of the 1st column to the bottom, taking non-overlapping snapshots. This would be repeated for the 2nd column, and then for subsequent pages. My output file would consist of these snapshots (in the same order they were produced). The format would have to be some sort of image -- not a pdf because the math equations often get messed up.
Can I do something like this with your software? If not, is something like this even possible?
Thanks,
Ken
Post a Comment