|Subject:||Warning messages when parsing questionable entities|
When parsing the text: �� one gets warnings: UTF-16 surrogate 0xdbc0 at [...] UTF-16 surrogate 0xdc85 at [...] There are two issues here. One, while this encoding is highly questionable, it would be good if it could be interpreted the same as "􀂅". Two, when there is an unpaired surrogate or an illegal character (such as "") there should be no warning. It probably should interpret all such junk as �, "REPLACEMENT CHARACTER".