worm-scraper/lib/substitutions.json
Domenic Denicola 4019f5d1e6 Tweaks and bug-fixes for the cleanups
Several notable fixes:

- Fixed a bad bug with <span> remover: since moving the child node to a document fragment changes the indices of the childNodes collection, this would leave several nodes in limbo, with the net effect of removing their text from the document.
- Fixed the empty-<em> remover to replace the empty <em> with a space, instead of a removing it entirely; this leads to a lot fewer wordsstuck together, which were starting to accumulate erroneously in substitutions.json.
- Warn instead of error on bad substitutions: this makes it easier to actually find the bad substitution afterward, since then the output still happens.
2015-05-17 16:19:23 -04:00

288 lines
7.5 KiB
JSON
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

{
"https://parahumans.wordpress.com/2011/08/06/interlude-2/": [
{
"before": "<em>principles, </em>Glory",
"after": "<em>principles</em>, Glory"
}
],
"https://parahumans.wordpress.com/2011/12/03/hive-5-9/": [
{
"before": "insult. An exc</em>use to trounce me physically<em>.</em>",
"after": "insult.</em> An excuse to trounce me physically."
}
],
"https://parahumans.wordpress.com/2012/03/13/extermination-8-4/": [
{
"before": "Endbringer<em>. </em>After the laser petered out<em>, h</em>e",
"after": "Endbringer. After the laser petered out, he"
}
],
"https://parahumans.wordpress.com/2012/03/17/extermination-8-5/": [
{
"before": "past tense-, or",
"after": "past tense—or"
}
],
"https://parahumans.wordpress.com/2012/08/11/snare-13-5/": [
{
"before": "<em>similar. </em>",
"after": "<em>similar</em>. "
}
],
"https://parahumans.wordpress.com/2012/09/11/prey-14-3/": [
{
"before": "truck reached<br/>\nthe other Nine",
"after": "truck reached the other Nine"
}
],
"https://parahumans.wordpress.com/2012/10/18/interlude-15-donation-bonus/": [
{
"before": "volunteered, <em>asked<br/>\n</em> to",
"after": "volunteered, <em>asked</em> to"
}
],
"https://parahumans.wordpress.com/2012/10/27/colony-15-4/": [
{
"before": "Victfor",
"after": "Victor"
}
],
"https://parahumans.wordpress.com/2012/10/30/colony-15-5/": [
{
"before": "wasnt unfamiliar",
"after": "was unfamiliar"
}
],
"https://parahumans.wordpress.com/2012/11/06/colony-15-7/": [
{
"before": "this sort of resistance.",
"after": "this sort of resistance?"
}
],
"https://parahumans.wordpress.com/2012/12/04/monarch-16-4/": [
{
"before": "well, with Bitchs civilian",
"after": "well, while Bitchs civilian"
}
],
"https://parahumans.wordpress.com/2012/12/08/monarch-16-5/": [
{
"before": "and It was powerful enough",
"after": "and it was powerful enough"
}
],
"https://parahumans.wordpress.com/2012/12/11/monarch-16-6/": [
{
"before": "Lost in thought”",
"after": "Lost in thought.”"
}
],
"https://parahumans.wordpress.com/2012/12/13/interlude-16-donation-bonus-2/": [
{
"before": "<i>“</i>",
"after": "”"
}
],
"https://parahumans.wordpress.com/2012/12/15/monarch-16-7/": [
{
"before": "Brockton bay",
"after": "Brockton Bay"
}
],
"https://parahumans.wordpress.com/2012/12/29/monarch-16-11/": [
{
"before": "attemtps",
"after": "attempts"
}
],
"https://parahumans.wordpress.com/2013/01/09/migration-17-2/": [
{
"before": "left now.</em></p>",
"after": "left now.</em></p>"
}
],
"https://parahumans.wordpress.com/2013/01/10/migration-17-3/": [
{
"before": "divots intp",
"after": "divots into"
},
{
"before": "No. the",
"after": "No. The"
}
],
"https://parahumans.wordpress.com/2013/01/12/migration-17-5/": [
{
"before": "been replying",
"after": "been replaying"
},
{
"before": "sort of stuff that , and",
"after": "sort of stuff that ???, and"
},
{
"before": "stirred. her eyes",
"after": "stirred. Her eyes"
}
],
"https://parahumans.wordpress.com/2013/01/13/migration-17-6/": [
{
"before": "ground. Not exactly",
"after": "ground. “Not exactly"
},
{
"before": "MWBB <em>",
"after": "<em>MWBB "
}
],
"https://parahumans.wordpress.com/2013/01/14/migration-17-7/": [
{
"before": "focus</em>. This is because of <em>Noelle",
"after": "focus. This is because of Noelle"
}
],
"https://parahumans.wordpress.com/2013/01/19/queen-18-1/": [
{
"before": "<em>untrustworthy.</em>",
"after": "<em>untrustworthy</em>."
}
],
"https://parahumans.wordpress.com/2013/01/24/interlude-18x/": [
{
"before": "french accent",
"after": "French accent"
}
],
"https://parahumans.wordpress.com/2013/02/12/queen-18-8/": [
{
"before": "dragged long",
"after": "dragged along"
}
],
"https://parahumans.wordpress.com/2013/02/14/interlude-18-donation-bonus-4/": [
{
"before": "C.U.l China",
"after": "C.U.I. China"
},
{
"before": "Faulltine asked",
"after": "Faultline asked"
},
{
"before": "through Faulltine",
"after": "through Faultline"
}
],
"https://parahumans.wordpress.com/2013/02/28/interlude-19-donation-bonus-1/": [
{
"before": "and be brought it",
"after": "and he brought it"
}
],
"https://parahumans.wordpress.com/2013/03/09/scourge-19-6/": [
{
"before": "heailng",
"after": "healing"
}
],
"https://parahumans.wordpress.com/2013/03/12/scourge-19-7/": [
{
"before": "they<em> did, but how we</em> dealt",
"after": "they <em>did</em>, but <em>how</em> we dealt"
}
],
"https://parahumans.wordpress.com/2013/03/16/interlude-19-y/": [
{
"before": "Brockton bay",
"after": "Brockton Bay"
}
],
"https://parahumans.wordpress.com/2013/03/30/chrysalis-20-4/": [
{
"before": "guess-, while",
"after": "guess—while"
}
],
"https://parahumans.wordpress.com/2013/04/04/interlude-20-donation-bonus-1/": [
{
"before": "20th",
"after": "20th"
}
],
"https://parahumans.wordpress.com/2013/04/06/interlude-20/": [
{
"before": "the costume were studded",
"after": "the costume was studded"
}
],
"https://parahumans.wordpress.com/2013/04/13/imago-21-2/": [
{
"before": "Captains hill",
"after": "Captains Hill"
}
],
"https://parahumans.wordpress.com/2013/04/25/imago-21-7/": [
{
"before": "Dinah</em>. <em>because this",
"after": "Dinah. Because this"
}
],
"https://parahumans.wordpress.com/2013/04/30/interlude-21/": [
{
"before": "But…Maybe she",
"after": "But… Maybe she"
}
],
"https://parahumans.wordpress.com/2013/05/07/cell-22-2/": [
{
"before": "said, “Is to set",
"after": "said, “is to set"
}
],
"https://parahumans.wordpress.com/2013/05/18/interlude-22/": [
{
"before": "long,” <em>she</em> warned",
"after": "long,” she warned"
}
],
"https://parahumans.wordpress.com/2013/05/21/interlude-22-donation-bonus-1/": [
{
"before": "“is it reassuring",
"after": "“Is it reassuring"
}
],
"https://parahumans.wordpress.com/2013/05/25/drone-23-1/": [
{
"before": "Defiant spoke , “Lets",
"after": "Defiant spoke. “Lets"
}
],
"https://parahumans.wordpress.com/2013/05/28/drone-23-2/": [
{
"before": "of the ship. Its",
"after": "of the ship. Its"
}
],
"https://parahumans.wordpress.com/2013/06/01/drone-23-4/": [
{
"before": "better off?'”",
"after": "better off?’”"
},
{
"before": "said. Someone",
"after": "said. “Someone"
}
],
"https://parahumans.wordpress.com/2013/09/28/venom-29-5/": [
{
"before": "<em>Losing y</em>ou as you get further down<em>.”</em>",
"after": "<em>Losing you as you get further down.</em>”"
}
],
"https://parahumans.wordpress.com/2013/11/05/teneral-e-2/": [
{
"before": "<em>property of Ner</em>o",
"after": "<em>property of Nero</em>"
}
]
}