An EPUB Experience

A few days ago I had the pleasure of being invited up to Stockholm to sit with a bunch of like minded people and talk about eBooks – specifically the ePub format. This was a very eye-opening experience indeed.

I was invited to Sweden by Publit, a company who have set themselves the task of making all the Swedish out-of-print books available as PoD (Print on Demand) titles. Considering that 95% of all Swedish books ever in existence are now out of print, this is a very worthy project, if perhaps somewhat daunting. Although Publit’s main business is PoD, they are making use of this opportunity to also provide these titles as ePub eBooks.

During my time in Sweden we discussed the many different areas of the eBook world, including DRM (of course), the processes involved in going from scanned document (TIFF/PDF/DOC) to an eBook Master format and onto ePub creation itself.

Now, the people at Publit are a group of very talented individuals with plenty of technical knowledge, yet there were aspects of ePub which has left them somewhat perplexed. There were two main points which I found interesting and have heard before around the web so I thought I would share them here. Continue reading “An EPUB Experience”

Using Q tags in ePub documents

Recently I’ve been making great progress on my ePub converter but in the process I’ve come across a few areas that have needed some thought. For this short article I’m going to discuss the XHTML Quote tag.

When looking though the IDPF OPS specs I noticed that <q> tags are allowed within ePub documents. As I mark up all quotes in my master TEI documents with a <q> tag I was really happy to see this. However, when I came to test my files in Adobe Digital Editions none of my quotes showed.

After seaching around the net I also noticed that <q> tags are not supported by all web browsers (i.e. Internet Explorer). Liza Daly has also noted that the HTML 5.0 specs will depreciate Q tags – will a version of XHTML follow suite in the future? Will a future IDPF OPS spec implement that future XHTML spec?

(FYI, Liza mentioned that she is currently working on implementing <q> tags in her Bookworm reader, with a work around for IE.)

This is an area which I believe ePub developers should give some serious thought over. Even if Adobe DE go ahead and implement this feature, other readers may not necessarily follow suite.

For my own project I have decided to convert all my <q> tags into regular “quote” characters. Providing more safety both now and in the long term.

Creating an ePub document from XHTML

In my last post I talked about the epubBooks Project and how I plan to convert Project Gutenberg .txt eBooks to the ePub format and how I will make these eBooks available for download from ePubBooks.com.

I already have in place a converter to transform the PG .txt files to a TEI Master Format and also an XSLT script to convert these into XHTML. The final task now is to create a converter for TEI to the ePub format.

Before I attempt to write this converter I will need to have a much better understanding on how a book is laid out inside the ePub OEBPS Container Format (OCF) .zip archive. So I set about taking my XHTML output file and breaking it up into the appropriate parts ready to be packaged in to an .epub file.

On the whole this went fairly smoothly, although I did encounter a couple of issues, which I’ll explain at the end of this article.

Continue reading “Creating an ePub document from XHTML”