Branches
Comments
[»]
Unique and powerful addition to DocBook toolchain
by Michael(tm) Smith - Sep 5th 2002 23:06:59
This is an important addition to the DocBook toolchain.
It fills a big need and is unique in that (as far as I
know) there are no other tools available -- open-source or
proprietary -- for converting man/roff docs to DocBook.
There's some very clever logic in it for making
inferences about structure from some of the
not-that-explicitly-structured roff markup and turning it
into fairly structured DocBook markup. In particular, it
can:
* parse command/function synopses and convert them into
DocBook markup (using "real" markup like Cmdsynopsis, Arg,
Replaceable, etc.)
* recognize things like use of italics in a FILES
section to mark filenames, and convert them to correct
DocBook markup (e.g., using the Filename element)
* recognize patterns such as URLs, email addresses, man
page references, and C program listings, and convert them
to correct DocBook markup
The only significant problem I've run into with the
1.0.0 version is in the implementation it uses for dealing
with ISO character entities: In some XML instances, it
generates internal DTD subsets that include entity
declarations which reference the SGML versions of the ISO
character-entity sets instead of the XML versions.
A workaround is simply to delete any ISO character
entity declarations from doclifter-generated XML documents.
The declarations are actually redundant at best, because
both the DocBook XML and SGML DTDs already reference the
appropriate sets.
But that's a really minor issue, and one that I'm sure
Eric will probably have fixed in the next release.
[reply]
[top]
[»]
Re: Unique and powerful addition to DocBook toolchain
by Eric S. Raymond - Sep 18th 2002 01:38:23
> The only significant problem I've run
> into with the
> 1.0.0 version is in the implementation
> it uses for dealing
> with ISO character entities: In some XML
> instances, it
> generates internal DTD subsets that
> include entity
> declarations which reference the SGML
> versions of the ISO
> character-entity sets instead of the XML
> versions.
>
> But that's a really minor issue, and one
> that I'm sure
> Eric will probably have fixed in the
> next release.
Your wish is granted. :-)
[reply]
[top]
|