Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
internal_link2 [2009/08/03 08:48] – removed clareinternal_link2 [2022/11/03 15:08] (current) – external edit 127.0.0.1
Line 1: Line 1:
 +====== Convert HTML to DokuWiki ======
  
 +An idea for converting existing documents to text files with [[http://www.dokuwiki.org/tips:dokuwiki|DokuWiki]] [[http://www.dokuwiki.org/tips:syntax|syntax]] using an HTML to Text converter. I've set up a config file for the tool available at [[http://www.mbayer.de/html2text/|http://www.mbayer.de/html2text/]] which produces some usable results: [[http://www.dokuwiki.org/_media/wiki:text2html.rc|text2html.rc]].
 +
 +Thomas J. Messenger made a Perl module to convert HTML to DokuWiki's syntax available at [[http://www.citlink.net/%7Emessengertj/|http://www.citlink.net/~messengertj/]] (see [[http://bugs.splitbrain.org/index.php?do=details&task_id=81|81]], too)
 +
 +Or the one at CPAN: [[http://search.cpan.org/%7Ediberri/HTML-WikiConverter-0.61/lib/HTML/WikiConverter.pm|http://search.cpan.org/~diberri/HTML-WikiConverter-0.61/lib/HTML/WikiConverter.pm]]
 +
 +A web interface used to convert pasted text or a webpage page is available at [[http://diberri.dyndns.org/html2wiki.html|http://diberri.dyndns.org/html2wiki.html]]
 +
 +A Converter Tool (HTML>DokuWiki, UTF8, Tablespacing) [[http://runtime-basic.net/Projekt:WikiTool|WikiTool]] 
 +
 +===== Html2DokuWiki Converter GUI for Win32 =====
 +
 +Html2DokuWiki is a free HTML to DokuWiki converter for Win32 platforms. It is very simple to install and extremely easy to use. Just extract the executable from the archive, double-click to start, and Html2DokuWiki is ready to go.
 +
 +To start converting, just type HTML into the upper edit. The converted DokuWiki syntax will immediately appear in the lower edit. Then select (''CTRL+A'') the converted document and copy / paste it into any DokuWiki site. Larger HTML documents can also be pasted into the HTML input.
 +
 +[[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|Click here to download Html2DokuWiki]].
 +
 +**Supported** **HTML** **Elements**
 +
 +[[http://www.dokuwiki.org/lib/exe/fetch.php?cache=nocache&media=http%3A%2F%2Fwww.yunqa.de%2Fsoftware%2FHtml2DokuWiki.png|{{ Convert%20HTML%20to%20DokuWiki_files:image002.gif?250x208|Html2DokuWiki}}]]Html2DokuWiki converts all HTML elements currently supported by DokuWiki:
 +
 +  * **<A>** → links, outputs multiple links for formatting
 +  * **<B>** → ''%%**%%''
 +  * **<BLOCKQUOTE>** → ''>'', including nested quotes
 +  * **<BR>** → new line ''\\''
 +  * **%%<CODE>%%** → ''%%''%%''
 +  * **<DEL>** → ''<del>''
 +  * **<DL>, <DT>, and <DD>** → Simulate output as simple unnumbered lists with ''<DT>'' as bold.
 +  * **<H1>** … **<H5>** → ''%%======%%'' to ''%%==%%''
 +  * **<I>** → ''%%//%%''
 +  * **<IMG>** → images
 +  * **<LI>** → list items, including nested lists
 +  * **<OL>** → numbered lists
 +  * **<P>** → new paragraph
 +  * **<PRE>** → ''%%<code>%%''
 +  * **<S>** → ''%%**%%''
 +  * **<STRIKE>** → ''%%//%%''
 +  * **<STRONG>** → ''%%**%%''
 +  * **%%<SUB>%%** → ''%%<sub>%%''
 +  * **%%<SUP>%%** → ''%%<sup>%%''
 +  * **<TABLE>** → tables
 +  * **<TBODY>** → recognized, but not output
 +  * **<TD>** → table cell, with ''align'' and ''colspan'' support
 +  * **<TFOOT>** → recognized, but not output
 +  * **<TH>** → table header cell, with ''align'' and ''colspan'' support
 +  * **<THEAD>** → recognized, but not output
 +  * **<TR>** → table row, with ''align'' support
 +  * **<TT>** → to ''%%''%%''
 +  * **<U>** → to ''%%__%%''
 +
 +**Special Features**
 +
 +  * Internal links are converted to DokuWiki's '':'' style, external ones are left unchanged. ''mailto'''':'' is removed from e-mail links.
 +  * Support for alignment in table cells and rows.
 +  * Image properties are converted, including alignment, width, and height.
 +  * Formatting is only applied where accepted by DokuWiki, but not to ''%%===%% … ==='' and ''%%<code>%% … </code>'', for example.
 +  * Full Unicode support, with optional UTF-8 input or output encoding.
 +  * Option to hide links from DokuWiki syntax.
 +
 +**Author, Contact, and Development**
 +
 +Html2DokuWiki is developed by Ralf Junker. You can contact the author via the [[http://www.freelists.org/list/yunqa|Yunqa]][[http://www.freelists.org/list/yunqa| mailing list]]. Feel free to report praise, bugs, or suggestions about Html2DokuWiki.
 +
 +The converter might also be available as a software library %%(*%%.DLL, *.DCU, or Delphi source code). Please get in touch if interested.
 +
 +**Version History**
 +
 +2007-08-27
 +
 +  * Add inline formatting to table cells.
 +  * New UTF-8 input encoding.
 +  * Fix paragraph problems with alternating inline-tags and block-tags.
 +  * Do not escape ''%%//%%'' to ''<nowiki>%%</nowiki>%%//%%<nowiki>%%</nowiki>'' if part of an external URI.
 +  * Escape ''<nowiki>%%</nowiki>'' to ''<nowiki>%%<nowiki>''.
 +  * Empty heading elements separate paragraphs.
 +
 +2007-10-22
 +
 +  * New: Support for ''<DL>'', ''<DT>'', and ''<DD>''. Simulate output as simple unnumbered lists with ''<DT>'' as bold.
 +  * Improve: Empty paragraph inserts line break.
 +  * Improve: Recognize DokuWiki internal escapes ''<nowiki>%%</nowiki>'', ''<nowiki>'', and ''</nowiki>'' and escape them properly.
 +  * Improve: Escape table markup (''%%|%%'' and ''%%^%%'') when inside a table.
 +  * Improve: Escape double parenthesis ''%%((%%'', which starts a DokuWiki footnote.
 +  * Improve: Suppress DokuWiki escapes and typography in ''<PRE>'' blocks.
 +  * Fix: Newline output for HTML like ''<P><PRE>one</PRE>two</P><BR>three''.
 +  * Fix: ''<TBODY>'' table problem where a new row did not ouput a linebreak.
 +
 +===== Workflow Microsoft Word 2 DokuWiki using html2wiki-GUI =====
 +
 +I was looking for a way to convert about 150 DOC-files (Microsoft Word2000 or 2003) in our new wiki without too much hazzle. The macros available didn´t work for me.
 +
 +**Specifications**
 +
 +  * usable for multiple files
 +  * converting tables
 +  * converting images
 +
 +**suggested** **workflow** (not used excessively yet)
 +
 +  - open word-document in OpenOffice
 +  - save as html (the pictures will be stored in the same folder as the html)
 +  - open html2dokuwiki plugin[[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|Click]] here to download Html2DokuWiki
 +  - [[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|paste and copy html-code in the ]][[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|GUI]]
 +  - [[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|save as txt-file (or create new page for integrity)]]
 +  - [[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|copy txt-file in the DATA/PAGES/NAMESPACE-folder and Pictures in the DATA/MEDIA/NAMESPACE-folder]]
 +
 +[[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|Is there a faster way (2009/07/18 by ]][[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|bobeck]][[http://www.yunqa.de/delphi/downloads/Html2DokuWiki.zip|)]]

Personal Tools