Hello,
I noticed that "open-dev-v1" upgraded PDFBOX to version 2.0.21
I was upgrading some libraries, then so give PDFBOX 2.0.21 a try too.
Unfortunately, I found a regression.
I created a reproducer to illustrate the issue:
minimal_reproducer_missing_words.txt
Using openhtml 1.0.4 with PDFBOX 2.0.20, works:

But using openhtml 1.0.4 with PDFBOX 2.0.21, not all words appear on generated PDF:

I am not sure if this issue is on openhtml or PDFBOX side, so I am describing it here in the hope that someone with proper knowledge could wheighting in.
Thanks for this great library. Hope you can continue this great work.
Best wishes,
lagar84.