I am converting IMS Content Package HTML to Word documents. I have some discovered an issue with some of the HTML entities. For example the single down arrow ↓ (& darr;) works fine in the conversion. However, the double down arrow ⇓(& dArr;) breaks in the word document. I tried converting from the named HTML entities to HTML decimal (e.g. & #8594;) (And note that the spaces aren't being used in the actual entities in the code).
The only thing that I have found that works is to do a string replace in the code:
$output = str_replace("& dArr;","& darr;",$output);
But obviously I don't want to do this for every case like this that turns up. Is there anything else I can try?