
16 commits

Author SHA1 Message Date
Domenic Denicola
41a6bfddb9 Fix some backward apostrophes 2015-05-20 23:43:33 -04:00
Domenic Denicola
5d86661c55 Fix end-of-thought punctuation being left out of the <em> 2015-05-20 23:43:20 -04:00
Domenic Denicola
baa98156b1 More <em> work 2015-05-20 23:43:07 -04:00
Domenic Denicola
33aaecb2ce Tweak he asked/she asked replacement 2015-05-20 23:42:32 -04:00
Domenic Denicola
a4e722ddea Fix <em>[single non-letter character]</em>
Move <em>-fixing before most others so that subsequent quote-fixing can apply.
2015-05-20 23:42:26 -04:00
Domenic Denicola
601d200c6b Try to fix the rest of the hyphen-minuses that should be dashes 2015-05-17 16:38:56 -04:00
Domenic Denicola
4019f5d1e6 Tweaks and bug-fixes for the cleanups
Several notable fixes:

- Fixed a bad bug with the <span> remover: since moving the child node to a document fragment changes the indices of the childNodes collection, this would leave several nodes in limbo, with the net effect of removing their text from the document.
- Fixed the empty-<em> remover to replace the empty <em> with a space, instead of removing it entirely; this leads to far fewer words stuck together, which were starting to accumulate erroneously in substitutions.json.
- Warn instead of error on bad substitutions: this makes it easier to actually find the bad substitution afterward, since then the output still happens.
2015-05-17 16:19:23 -04:00
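The <span> remover bug described in commit 4019f5d1e6 is a common pitfall with live child collections, and a small sketch may make it easier to see. The code below is not the repository's code: `FakeNode`, `makeSpan`, `moveChildrenBuggy`, and `moveChildrenFixed` are hypothetical names, and `FakeNode` is a stand-in that only imitates the one DOM behavior that matters here (appending a child detaches it from its previous parent, so the parent's child list shrinks during iteration).

```typescript
// Hypothetical stand-in for a DOM node. appendChild() detaches the child from
// its previous parent first, so childNodes behaves like a live collection.
class FakeNode {
  text: string;
  parent: FakeNode | null = null;
  childNodes: FakeNode[] = [];

  constructor(text: string = "") {
    this.text = text;
  }

  appendChild(child: FakeNode): FakeNode {
    if (child.parent !== null) {
      const index = child.parent.childNodes.indexOf(child);
      child.parent.childNodes.splice(index, 1);
    }
    child.parent = this;
    this.childNodes.push(child);
    return child;
  }
}

// Builds a pretend <span> with one child per word.
function makeSpan(words: string[]): FakeNode {
  const span = new FakeNode();
  for (const word of words) {
    span.appendChild(new FakeNode(word));
  }
  return span;
}

// Buggy: every move shrinks span.childNodes, so the rising index skips
// every other child and leaves it behind in the span.
function moveChildrenBuggy(span: FakeNode): FakeNode {
  const fragment = new FakeNode();
  for (let i = 0; i < span.childNodes.length; i++) {
    fragment.appendChild(span.childNodes[i]);
  }
  return fragment;
}

// One possible fix: always take the current first child until none remain.
function moveChildrenFixed(span: FakeNode): FakeNode {
  const fragment = new FakeNode();
  while (span.childNodes.length > 0) {
    fragment.appendChild(span.childNodes[0]);
  }
  return fragment;
}

// Joins the text of a node's children with single spaces.
function textOf(node: FakeNode): string {
  return node.childNodes.map((child) => child.text).join(" ");
}

const demoBuggySpan = makeSpan(["a", "b", "c", "d"]);
console.log(textOf(moveChildrenBuggy(demoBuggySpan))); // prints "a c"
console.log(textOf(demoBuggySpan)); // prints "b d" (left in limbo)

const demoFixedSpan = makeSpan(["a", "b", "c", "d"]);
console.log(textOf(moveChildrenFixed(demoFixedSpan))); // prints "a b c d"
```

If the span is then replaced by the fragment, the children left behind by the buggy loop drop out of the document along with the span, which matches the lost text the commit message describes. Iterating over a snapshot copy of the child list would be another way to avoid the problem.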
Domenic Denicola
801e28d602 A lot more cleanups 2015-05-15 01:29:34 -04:00
Domenic Denicola
247f713e13 Better cleanup in the convert step 2015-05-11 22:52:14 -04:00
Domenic Denicola
cf6b5c9ab9 Update conversion to work with cache manifest 2015-05-11 21:24:16 -04:00
Domenic Denicola
b68b88e17e While downloading, save a cache manifest alongside
This allows us to keep track of the chapter title after the fact.
2015-05-11 21:24:15 -04:00
Domenic Denicola
ba1e7b956f More cleanup in conversion 2015-05-09 01:28:43 +02:00
Domenic Denicola
1f241b85ac Serialize body as XHTML, not HTML 2015-05-09 00:40:54 +02:00
Domenic Denicola
f932be159c More clean-ups; do these at a textual level. 2015-05-09 00:21:05 +02:00
Domenic Denicola
64de4a27e5 Clean up better 2015-05-08 00:19:06 +02:00
Domenic Denicola
8bb41473d2 Throttle conversions; move to a separate file 2015-05-07 23:56:53 +02:00