Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16

Post Reply

Threaded Mode | Linear Mode

Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16

Jun. 14, 2009, 04:08 PM (This post was last modified: Jun. 14, 2009 04:16 PM by sidki3003.)

Post: #10

sidki3003 Offline

Offline

Administrator

Posts: 1,458
Joined: Mar 2004

RE: Proxomitron cannot read Unicode UTF-16 Hebrew

Actually, you can filter UTF-16 with Proxomitron. I'm doing so.

However, you need a webfilter to convert the page (i don't have a config independent version to post), *and* you're losing any double-byte information that goes beyond UTF-8. For the little-endian case that means the second byte is supposed to be x00. If not, the double byte will be replaced by a dummy char.

Luckily, most little-endian and all big-endian pages i've seen are indeed using just one byte for char information. But not in your example. I've once written a UTF-16 example to test with Proxomitron (little-endian).

Add Thank You

Quote this message in a reply

« Next Oldest | Next Newest »

Post Reply

Messages In This Thread

Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - bugmenot - Jun. 11, 2009, 07:57 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - lnminente - Jun. 11, 2009, 08:41 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - bugmenot - Jun. 11, 2009, 10:51 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - lnminente - Jun. 11, 2009, 11:12 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - bugmenot - Jun. 11, 2009, 11:23 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - Graycode - Jun. 12, 2009, 01:00 AM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - lnminente - Jun. 12, 2009, 02:35 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - bugmenot - Jun. 14, 2009, 12:39 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - sidki3003 - Jun. 16, 2009, 10:10 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - lnminente - Jun. 14, 2009, 03:00 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - sidki3003 - Jun. 14, 2009 04:08 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - lnminente - Jun. 14, 2009, 05:13 PM

RE: Proxomitron cannot read Unicode UTF-16 Hebrew - bugmenot - Jun. 16, 2009, 08:09 PM

RE: Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - bugmenot - Jun. 16, 2009, 10:48 PM

RE: Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - sidki3003 - Jun. 16, 2009, 11:15 PM

RE: Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - bugmenot - Jun. 16, 2009, 11:26 PM

RE: Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - lnminente - Jun. 17, 2009, 12:19 AM

RE: Converting non latin characters to UTF-8 as Proxomitron can't read Unicode UTF-16 - bugmenot - Jun. 17, 2009, 11:43 AM

Contact Us | The Un-Official Proxomitron Site | Return to Top | Return to Content | Lite (Archive) Mode | RSS Syndication

Powered By MyBB, © 2002-2026 MyBB Group.
Favicon by Mizz Mona.