Fix raw html reference issue #585

facelessuser · 2017-10-07T03:25:02Z

Preserve the line which a reference was on to prevent raw HTML indexing issue. Ref #584.

facelessuser · 2017-10-07T06:14:38Z

Just occurred to me that abbreviations and footnotes should be checked for this kind of breakage and fixed if it occurs. I will verify and patch those if necessary in this pull as well.

Peserve abbreviation line when stripping and preserve a line for each footnote block. Footnotes should also accumulate the extraneous padding.

facelessuser · 2017-10-07T14:35:35Z

Abbreviations and footnotes are now handled. Footnotes were a little special as we had to preserve a line for each block. We also had to account for unnecessary trailing empty lines. Now that I say it out loud, I should get a test in for trailing empty lines as well....

facelessuser · 2017-10-07T14:47:30Z

Looks like footnotes changed in the tests slightly. Ugh, forgot to run the tests one more time...

When processing footnotes, we don't actually care to process the extra whitespace at the end of a footnote, but we want it to calculate lines to preserve.

facelessuser · 2017-10-07T14:59:36Z

Tests are now passing.

waylan · 2017-10-07T20:12:10Z

This looks good. One question though: why not just modify the raw HTML processor so that this didn’t matter? To be clear, I haven’t looked into it myself. It just seems like that might have been the first approach I would have explored. Maybe there’s a good reason?

facelessuser · 2017-10-07T20:29:31Z

One question though: why not just modify the raw HTML processor so that this didn’t matter?

I am open to suggestions, but the reason this issue occurs is that the raw HTML preprocessor is not aware of references. It parses the blocks and such first populating the tag_data. Then the footnote, abbr, and link reference preprocessor gets run and then removes entire blocks from the preprocessed file. Then the raw HTML parser references indexes it thought was good that are no longer good.

At least with abbr and link references, maybe you could post process the tag data after the references get stripped and rebuild them proper. With the footnote references, the extension actually utilizes the tag_data when constructing the footnote which adds even more complications.

Honestly, its a messy situations and this was the easiest way to solve the issue. Maybe there is a "better" way to approach it, but this seemed like the least invasive approach. Maybe if I spent more time getting to understand the raw HTML parser another cleaner (more involved) approach may make itself manifest.

Any ideas?

facelessuser · 2017-10-07T20:33:20Z

I'm not in a hurry to get this merged. I can mull over this more and see if there is a better way. We can consider this a first draft. If we can't come up with something better, this may be decent stopgap solution.

Fix raw html reference issue

9b43efb

Preserve the line which a reference was on to prevent raw HTML indexing issue. Ref #584.

Prevent raw HTML parsing issue in abbr and footnotes

cfe3376

Peserve abbreviation line when stripping and preserve a line for each footnote block. Footnotes should also accumulate the extraneous padding.

Test extra lines at the end of references

08abeb5

Strip the gathered extraneous whitespace

c55c50c

When processing footnotes, we don't actually care to process the extra whitespace at the end of a footnote, but we want it to calculate lines to preserve.

facelessuser mentioned this pull request Nov 15, 2017

Additional paragraph when using Markdown in raw HTML #595

Closed

waylan merged commit 1de595a into master Jan 4, 2018

waylan mentioned this pull request Sep 2, 2020

Refactor HTML Parser #803

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix raw html reference issue #585

Fix raw html reference issue #585

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017 •

edited

Loading

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

waylan commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix raw html reference issue #585

Fix raw html reference issue #585

Uh oh!

Conversation

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

waylan commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

facelessuser commented Oct 7, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

facelessuser commented Oct 7, 2017 •

edited

Loading