RTF TO HTML
-----------
This is based on version 2.7.5 of rtftohtml, which was last modified in
November, 1994.
It contains the text:
###
Initial Implementation by Chris Hector (cjh@cray.com)
(c) 1994 by Chris Hector
May be freely distributed.
This translator was based on rtfskel.
###
The last known email address I can find for him is .
I think that might reach him, but I'm not sure.
Versions 3.0 of rtftohtml and up are released under a shareware license,
and available from http://www.logictran.com (although it now has a new
name). You can download a free binary version of the software for most
platforms from this site.
rtftohtml is based on RTF Tools, which is (c) April 1994 by Paul DuBois
RTF Tools has the notice:
##
Distribution Availability
This software may be redistributed without restriction and used for any
purpose whatsoever.
##
The supplied version understands RTF version 1.2, while the
latest version of the specification is RTF 1.6 (May 1999).
I have added some of the newer rtf codes that were preventing some files from
being parsed properly, but they are not acted on. Most of them do not
actually affect the content anyway. Depending on how the rtf was created,
there might be tags that cause us to trip up and fail. If you have
rtf files that Greenstone is not extracting text from, please send me
some email (jrm21@cs.waikato.ac.nz).
Note - we extract but do not convert Windows Metafile Format (.wmf) images.
On unix systems, these can be converted with newer versions of ImageMagick
(ie "convert" version 4.2.9 or newer).
Also, libwmf (for unix and windows) is available for conversion at:
http://wvware.sourceforge.net/projects/libwmf.html
but we do not include it here due to its dependencies
(eg libttf for truetype fonts).
Embedded images such as jpegs and pngs are extracted fine, though.
John McPherson (jrm21@cs.waikato.ac.nz)
June 2001.