Now each file contains just one article, and you want to extract three things from each file:
- Title
- Date
- Text
Assume again that, by inspecting the files, you find these three items in lines like this:
<h2><a href="...">Need to repare my rig</a></h2>
<div class="entry_text">When I switched on my rig this morning...</div>
<li>[2014/03/01 05:43]
A short awk program will do the job: it converts each file into plain HTML, which can then be fed to ebook authoring tools.
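
As a rough illustration, such a program might look like the sketch below. It is not necessarily the exact script used here. It assumes the markup shown above, that each of the three items opens and closes on a single line, and that the entry text is the only div on its line. The function name extract_article and the layout of the generated HTML are made up for this example.

```shell
#!/bin/sh
# Sketch only: extract title, date, and text from one saved article
# and print a minimal HTML page. extract_article is a hypothetical name.
extract_article() {
    awk '
    # Title: the link text inside <h2><a ...> ... </a></h2>.
    match($0, /<h2><a [^>]*>[^<]*<\/a><\/h2>/) {
        s = substr($0, RSTART, RLENGTH)
        sub(/^<h2><a [^>]*>/, "", s)
        sub(/<\/a><\/h2>$/, "", s)
        title = s
    }
    # Text: the body of the entry_text div (assumed to sit on one line).
    match($0, /<div class="entry_text">.*<\/div>/) {
        s = substr($0, RSTART, RLENGTH)
        sub(/^<div class="entry_text">/, "", s)
        sub(/<\/div>$/, "", s)
        text = s
    }
    # Date: the bracketed timestamp after <li>, e.g. [2014/03/01 05:43].
    match($0, /<li>\[[0-9]+\/[0-9]+\/[0-9]+ [0-9]+:[0-9]+\]/) {
        s = substr($0, RSTART, RLENGTH)
        sub(/^<li>\[/, "", s)
        sub(/\]$/, "", s)
        date = s
    }
    # Output layout is an assumption; adjust it to what your ebook tool expects.
    END {
        print "<html><head><title>" title "</title></head><body>"
        print "<h1>" title "</h1>"
        print "<p class=\"date\">" date "</p>"
        print "<div>" text "</div>"
        print "</body></html>"
    }
    ' "$1"
}
```

You could then run it on one file with something like extract_article article.html > article_clean.html, where both file names are placeholders. Because each rule tests its own pattern independently, the sketch works whether the three items share one line or sit on separate lines.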