Banner showing base of Eiffel tower

Github

Download version 1.4.8: Windows or Linux

Persistency Library: Markup Document Processing

The Markup Document Processing library has 28 classes.

ECF: markup-docs.ecf

Classes for processing documents encoded with various kinds of markup language.

1. OpenDocument Flat XML spreadsheets using VTD-XML.

2. Read and export emails from the Thunderbird email client including a class to generate a Kindle book from a folder of chapter emails.

3. Classes for generating Kindle OPF packages from HTML content.

Directory: library/persistency

. /kindle-book

. /thunderbird/account

. /thunderbird/reader

. /thunderbird/reader/exporter

. /thunderbird/support

. /thunderbird/test

. /xml/open-office-spreadsheet

kindle-book

EL_BOOK_ASSEMBLY

Assembly of all book components including OPF package, navigation control file and HTML table of contents

EL_BOOK_CHAPTER

Book chapter generated from Thunderbird email

EL_BOOK_HTML_CONTENTS_TABLE

Book html contents table

EL_BOOK_INFO

Published book information reflectively settable from XML context

EL_BOOK_NAVIGATION_CONTROL_FILE

Object to create book NCX (navigation control file)

EL_BOOK_PACKAGE

Kindle book package serializeable as Open Packaging Format (OPF)

EL_MEDIA_TYPE_CONSTANTS

Media type constants

EL_OPF_MANIFEST_ITEM

Manifest item in OPF package

EL_OPF_MANIFEST_LIST

List of manifest items in OPF package

EL_SERIALIZEABLE_BOOK_INDEXING

Serializeable book indexing

Further Information

Click on class link to see descendants.

thunderbird/account

EL_ML_THUNDERBIRD_ACCOUNT_BOOK_EXPORTER

Multi-lingual (ML) Thunderbird account book exporter.

Merge localized folder of emails into a single HTML book with chapter numbers and titled derived from subject line.

EL_ML_THUNDERBIRD_ACCOUNT_READER

Multi-lingual (ML) Thunderbird email account reader.

Reads Thunderbird HTML email documents from a selected account where content folders are organized with sub-folders named as 2 letter language code to hold localized versions of documents.

foo/en
foo/de
foo/fr

bar/en
bar/de
bar/fr

Each document folder is read and processed by a class conforming to EL_THUNDERBIRD_FOLDER_READER

EL_ML_THUNDERBIRD_ACCOUNT_XHTML_BODY_EXPORTER

Multi-lingual (ML) Thunderbird account XHTML body exporter

EL_THUNDERBIRD_ACCOUNT_READER

Reads Thunderbird HTML email documents from a selected account and configured by a Pyxis document.

pyxis-doc:
   version = 1.0; encoding = "UTF-8"

thunderbird:
   account = "<email account name>"; export_dir = "<export path>"
   language = "<optional language code"
   folders:
      "<folder name 1>"
      "<folder name 2>"

thunderbird/reader

EL_THUNDERBIRD_FOLDER_READER

Read folder of Thunderbird HTML email content and collects email headers in field_table HTML content is collected in line list html_lines and then event handler on_email_end is called, before processing the next email.

thunderbird/reader/exporter

EL_THUNDERBIRD_BOOK_EXPORTER

Merge Thunderbird folder of numbered chapter emails into a HTML book

EL_THUNDERBIRD_XHTML_BODY_EXPORTER

Export Thunderbird email folders as HTML body content between <body> and </body> tags and output as <subject name>.body. Insert a page anchor before each h2 heading

<a id="Title_1"/>
<h2>Title 1</h2>

EL_THUNDERBIRD_XHTML_DOC_EXPORTER

Export contents of Thunderbird email folder as XHTML document files

EL_THUNDERBIRD_XHTML_EXPORTER

Thunderbird folder to xhtml

thunderbird/support

EL_SUBJECT_LINE_DECODER

Decode internal Thunderbird subject lines Example:

"=?ISO-8859-15?Q?=DCber_My_Ching?=" -> "Über My Ching"

"=?UTF-8?B?w5xiZXLigqwgTXkgQ2hpbmc=?=" -> Über€ My Ching

"=?UTF-8?Q?3.Journaleintr=c3=a4ge_bearbeiten?=" -> "Journaleinträge bearbeiten"

EL_SUBJECT_LIST

Subject list

EL_THUNDERBIRD_CONSTANTS

Thunderbird constants

thunderbird/test

EL_SUBJECT_LINE_DECODER_TEST_SET

Subject line decoder test set

xml/open-office-spreadsheet

EL_OPEN_OFFICE

Open office

EL_SPREAD_SHEET

Object representing OpenDocument Flat XML spreadsheets as tables of rows of data strings.

XML namespace

xmlns:office="urn:oasis:names:tc:opendocument:xmlns:office:1.0"
office:mimetype="application/vnd.oasis.opendocument.spreadsheet"
office:version="1.2"

EL_SPREAD_SHEET_DATA_CELL

Object representing table data cell in OpenDocument Flat XML format spreadsheet

EL_SPREAD_SHEET_ROW

Object representing table row in OpenDocument Flat XML format spreadsheet

EL_SPREAD_SHEET_TABLE

Object representing table in OpenDocument Flat XML format spreadsheet