Kanji in UCS or UTF-16 sort order


rshiplett
Established Presence
Posts: 69
Joined: March 4th, 2009 9:46 pm

Kanji in UCS or UTF-16 sort order

Post by rshiplett » October 11th, 2012 4:58 pm

I have posted a short list of some basic kanji sorted by their UCS code over at http://kanjirecog.blogspot.ca/2012/10/kanji-by-ucs.html.

That list is only:

一 丁 七 万 丈 三 上 下 不 与 世 丘 丙 両 並 中 丸 丹 主 久 乏 乗 乙 九 乱 乳 乾 事 二 亜 享 京 亭 人 仁 今
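As a rough illustration of what "sorted by UCS code" means, here is a minimal Python sketch. The function name `sort_by_code_point` and the six-character sample are just for illustration and are not taken from the blog page.

```python
def sort_by_code_point(chars):
    """Return the characters ordered by their UCS (Unicode) code point."""
    return sorted(chars, key=ord)


# A few characters from the list above, deliberately shuffled.
sample = ["三", "一", "下", "七", "上", "丁"]

for ch in sort_by_code_point(sample):
    # Print the code point in the usual U+XXXX hex notation next to the kanji.
    print(f"U+{ord(ch):04X} {ch}")
```

The order follows the code charts, not stroke count, grade level or reading, which is why 七 comes before 三 here.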

I will try to find a moment to put up a complete view of the jōyō kanji sorted on that basis.

Feedback most welcome.

Several of my pages at http://kanji.aule-browser.com display the UCS or UTF-16 hex value - and I have a little utility ready to flip between kanji, code and the web urlencoded UTF-8 version for URLs.
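For anyone who wants to do the same flipping themselves, something along these lines would work in Python. This is only a sketch using the standard library, not the utility mentioned above, and the four function names are made up for the example.

```python
from urllib.parse import quote, unquote


def kanji_to_hex(ch):
    """Kanji -> hex code point, e.g. for a U+XXXX label."""
    return f"{ord(ch):04X}"


def hex_to_kanji(code):
    """Hex code point -> kanji."""
    return chr(int(code, 16))


def kanji_to_urlencoded(ch):
    """Kanji -> percent-encoded UTF-8 bytes, as used in URLs."""
    return quote(ch, safe="")


def urlencoded_to_kanji(text):
    """Percent-encoded UTF-8 -> kanji."""
    return unquote(text)


print(kanji_to_hex("一"), kanji_to_urlencoded("一"))
```

Note that the hex code (one value per character) and the urlencoded form (three UTF-8 bytes for these kanji) are different numbers for the same character.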

cheers

Robert


Re: Kanji in UCS or UTF-16 sort order

Post by rshiplett » October 12th, 2012 1:55 pm

I now have a page of the Henshall kanji (1,945 definitions - roughly the jōyō set) as an HTML page sorted by UCS or Unicode UTF-16.
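On "UCS or UTF-16": for characters in the Basic Multilingual Plane the two orders are identical, so for most kanji lists the distinction does not matter. They only diverge when a character needs a surrogate pair in UTF-16. The sketch below shows the difference; the two test characters are chosen purely for illustration, and whether any character on the page falls outside the BMP is not something this example assumes.

```python
def code_point_key(ch):
    """Sort key: the UCS code point itself."""
    return ord(ch)


def utf16_key(ch):
    """Sort key: the big-endian UTF-16 bytes, i.e. code-unit order."""
    return ch.encode("utf-16-be")


fullwidth_a = "\uFF21"        # FULLWIDTH LATIN CAPITAL LETTER A, inside the BMP
rare_kanji = "\U00020B9F"     # a kanji outside the BMP, stored as a surrogate pair

pair = [rare_kanji, fullwidth_a]

by_code_point = sorted(pair, key=code_point_key)
by_utf16 = sorted(pair, key=utf16_key)

# By code point U+FF21 sorts first; by UTF-16 code units the surrogate
# pair D842 DF9F sorts ahead of FF21, so the two orders disagree.
print([f"U+{ord(c):04X}" for c in by_code_point])
print([f"U+{ord(c):04X}" for c in by_utf16])
```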

The page is http://www.aule-browser.com/kanji/henshall-sorted-by-unicode.html and has NO JavaScript or nuisance images or links - just simple HTML.

Font feedback etc most welcome.

I will build a 2010 jōyō page shortly.

The idea is that some people may find this order helpful.

I will soon add a page using the Tokyo SCSK Corp's "Curl web content" markup language that will let the user maintain the preferred order - but that will require a browser with the Curl plugin.

You can always take my HTML page and just edit it - the page is just in simple rows, so this is easy to do by turning off "word wrap" in your editor while you work. It can be useful.

I will do one with JUST the kanji, the UCS code and ONYOMI and/or kunyomi, which requires reading to do your sorting ;-)

If you would like to see a page with the urlencoding for the web (hex UTF-8 with % as the delimiter), let me know. It only takes a few minutes now that I have an array of "objects" for the script that generates the HTML.
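To give a feel for how an array of "objects" can be turned into simple HTML rows, here is a small Python sketch. It is not the script used for the pages above: the field names (`kanji`, `meaning`), the two sample entries and the row layout are all assumptions made for the example.

```python
from html import escape
from urllib.parse import quote

# Hypothetical records; the real data and field names are not shown in this thread.
entries = [
    {"kanji": "上", "meaning": "up, above"},
    {"kanji": "一", "meaning": "one"},
]


def render_row(entry):
    """Build one plain HTML row: a text span, then the urlencoded form, then a break."""
    ch = entry["kanji"]
    code = f"{ord(ch):04X}"
    encoded = quote(ch, safe="")  # percent-encoded UTF-8, e.g. %E4%B8%80
    text = f"{escape(ch)} U+{code} {escape(entry['meaning'])}"
    return f"<span>{text}</span> {escape(encoded)}<br>"


def render_page(records):
    """Sort the records by code point and join the rows, one per line."""
    ordered = sorted(records, key=lambda e: ord(e["kanji"]))
    return "\n".join(render_row(e) for e in ordered)


print(render_page(entries))
```

Keeping one record per line is what makes the page easy to rearrange by hand in a text editor with word wrap turned off.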

Robert
Fredericton, Canada


Re: Kanji in UCS or UTF-16 sort order + urlencoded char

Post by rshiplett » October 12th, 2012 6:26 pm

I have added a plain HTML page of the 1,945 basic kanji sorted by UTF-16, with each character's UTF-8 value given in urlencoded form.

You will find the page at http://www.aule-browser.com/kanji/henshall-sorted-urlencoded.html.

Escaping the urlencoding within the HTML is a bit of a pain, but I have reformatted each row so that if you rearrange the page locally as your own (say, to add a note, a mnemonic or whatever), you can see which bits to leave alone at the end of the row, outside the text span and just before the line break.
