phrogz / docubot Goto Github PK

View Code? Open in Web Editor NEW

24.0 24.0 2.0 1.19 MB

Create CHM documentation from simple hierarchy of plain text files.

License: MIT License

Ruby 82.62% CSS 10.31% JavaScript 7.07%

docubot's People

Contributors

Stargazers

Watchers

Forkers

xingzhe2001

docubot's Issues

Directories inside _static folder create TOC entry

A simple project like this:

Hello World.md
_static
  images
    foo.jpg

ends up creating a TOC like:

Hello World
images

Just as files inside _static should be ignored, so directories therein should be ignored.

Automatic ID generation.

Put HTML id attributes on headings, definitions, fieldset labels, and table captions if they don't already have them. Modify the toc: ... walker to allow quoted sections referring to the titles. Use the same text-to-id algorithm for both, and voila! Markdown will work fine for pages with sub-anchors needed.

Need spec coverage for early features.

The project has become sufficiently complex that it's hard to know if any change will break other code. Must get some specs and test files up ASAP.

Can't make root links to pages.

It's annoying to have to write href="../../../foo/bar/page.html", and fragile.

Worse, however, is that you can't have a single Glossary entry re-used on different pages in the site that links to another page discussing it in detail.

Simple would be some sort of special href="#{root}/foo/bar/page.html" syntax that properly substituted root even in templates that didn't have Ruby interpolation.

Even better would be an addressing system that found a page based on its title (or better yet, some magic internal GUID-like thing) so that the user could rename the page or put it in another directory without all links to it breaking.

Doxygen support

Integrate the output of Doxygen, or something.

Validate img src

Like Bundle#validate_links, look in src attributes of images and validate that the files are there (unless sourced over another protocol). Possibly extend to scripts, too. Add command-line options and Bundle creation options to skip these steps.

Don't validate stylesheets, because they shouldn't be in the page body anyhow.

Optional sub-anchors for headers in TOC

When a page has a meta section like:
toc: sections
have the bundle find—and if necessary, add unique IDs to—every header in the page and create TOC sub-entries for them.

TBD: Where does TOC logic belong? Currently it's implicit in the hierarchy of Pages; perhaps we have a new DocuBot::SubSection that lives as a child of the Pages with a similar interface?

PDFWriter

Use SinglePageHTMLWriter and then generate a PDF

Glossary tab in CHM

Instead of generating a Glossary section, generate a proper (custom) Glossary tab in the sidebar of the CHM, as described here.

reStructuredText converter

Because Sphinx has it. Minor problem: missing a nice cross-platform pure-Ruby converter.

Inline glossary terms

toc.haml links in glossary.js and glossary-terms.js and glossary.css.
HTMLWriter creates glossary-terms.js with every term/definitionHTML pair from the Glossary after crawling.
glossary.js finds all <span class="glossary"> on the page and hooks up interaction to show the definition of the term on click.
glossary.css is separate from nvdevtools.css and puts unique style on glossary terms (e.g. dashed underline).

Resolve template/HTMLWriter bizarreness

Why does the CHMWriter have its own directory for ERB files in lib/docubot/writers, but the HTMLWriter uses hamls from lib/docubot/templates/default?

Allow and fix inter-page links

Allow users to create relative HTML links to the original (markdown/textile/whatever) files in the source.
During write, after generating HTML for pages, use a map between the original file path and output html path to munge <a href="..." links to point to the correct relative file path.

Validation 'fails' for mailto: links

A textile document with a mailto: link shows a warning for "broken link" during conversion.

Generate HTML frameset

For HTML-only output, create an index.html that has a frameset pulling together toc/index and content. (This includes a nice tabbed approach for a nice toc/index display.)

Wrap sections in a .section div

Parse the structure of the final page HTML and wrap <div class="section">...</div> around the contents of each logical section (determined by headings, up until the next heading of equal or greater priority).

Allows simple CSS like .section { margin-left:2em } for clean indentation of page structure.

Superfluous identifiers generated

Currently every page upon creation generates IDs for a host of elements (if not present). The only reason this is used (currently) is for the TOC linking in comma-delimited mode. Perhaps it'd be better to go and find the elements with the requested TOC entry and just auto-id them. If a page doesn't have a toc metaattribute, or the metaattribute doesn't have commas, skip the auto-id altogether.

There is one problem with trying to exactly auto-id only the referenced elements.The following textile code:
h2. What's Up
generates a curly quote entity for the apostrophe in the process of creating the HTML. As a result, a straight text match for "What's Up" would fail on the HTML.

Support hierarchical index entries

The index should be able to have entries like:

Rigid Bodies
   Basics
   Creating
   Rigid Body Modifier

Presumably this would only be through a meta entry—using some TBD syntax—and not through the auto scanner.

Validate links

When writing HTML, look through the HTML and see if there are any relative links that are broken.

Magically work with hhc.exe

Detect lack of hhc.exe in path early on and bail with helpful instructions for downloading it and setting the PATH.
Perhaps go hunting for it in the standard spots and call it directly without PATH modification.