S
Stanley Xu
[Note: parts of this message were removed to make it a legal post.]
Dear Buddies,
Yesterday, I sent a mail of let the split ignore the error utf-8 bytes
sequences. And I checked the string I wanted to parse in Java and found out
that the string is encoded in gbk and part of the string is encoded in
utf-8.
I am wondering if I could find a way to still split the string by split
method, and then I could try to force_encoding part of the string that might
encoded in gbk and resolve the problem.
I am wondering if there is a way I could do so without the "invalid bytes
sequence" error?
Thanks.
Best wishes,
Stanley Xu
Dear Buddies,
Yesterday, I sent a mail of let the split ignore the error utf-8 bytes
sequences. And I checked the string I wanted to parse in Java and found out
that the string is encoded in gbk and part of the string is encoded in
utf-8.
I am wondering if I could find a way to still split the string by split
method, and then I could try to force_encoding part of the string that might
encoded in gbk and resolve the problem.
I am wondering if there is a way I could do so without the "invalid bytes
sequence" error?
Thanks.
Best wishes,
Stanley Xu