fix for TIKA-1841 contributed by zetisam
Is there a reason this is being held off on? The current output from XSLF extraction is very hard to parse to do anything useful with since it's a flat structure that may or may not contain a notes block.
Hi @dstevenson - the last activity on this (via TIKA-1841 was @chrismattmann requesting an update based on his input (this was back in Aug 2016). I think that's why it stalled out.
I'm guessing it wouldn't be very hard to bring the patch into alignment with the current code base, but it's not mergeable currently.
@zetisam are you able to merge master to resolve conflicts here? Otherwise we can maybe close this and reopen a new one we have access to
I'll see if I can get to it somewhere in the coming days.