permalink: "/{{ year }}/{{ month }}/{{ day }}/unicode-codepoints-in-ruby"
title: Unicode codepoints in ruby
published_date: "2013-11-06 12:04:00 +0100"
layout: post.liquid
data:
  route: blog

Another post in the category "better write it down before you forget it".

I ❤ Unicode. At least most of the time. That's why I have things like ✓, ✗ and ツ mapped directly on my keyboard.

But sometimes you need not only the symbol itself but its codepoint as well. That's easy in Ruby:

irb> "❤".codepoints
=> [10084]
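
The same call works on longer strings, where you get one integer per character. Here is a small sketch of that; the helper name `char_codepoints` is made up for this example, and it assumes the script is saved as UTF-8 (the default source encoding since Ruby 2.0):

```ruby
# Hypothetical helper: pair each character with its codepoint.
def char_codepoints(str)
  str.each_char.map { |char| [char, char.ord] }
end

p "✓✗ツ".codepoints        # => [10003, 10007, 12484]
p "❤".ord                  # => 10084 (codepoint of the first character only)
p char_codepoints("✓✗")    # => [["✓", 10003], ["✗", 10007]]
```

`String#ord` is handy when you only care about a single character and don't want an array back.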

Got some codepoints and need to map them back to their symbols? Easy:

irb> [10084, 10003].pack("U*")
=> "❤✓"
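
To convince yourself that the two directions really are inverses, a round trip is a quick check. This is only a sketch; `from_codepoints` is a name I made up, not something Ruby ships with:

```ruby
# Hypothetical helper: build a UTF-8 string from a list of codepoints.
def from_codepoints(codepoints)
  codepoints.pack("U*")
end

text = "I ❤ Unicode ✓"
round_trip = from_codepoints(text.codepoints)
puts round_trip == text            # prints "true"

# A single codepoint can also be converted with Integer#chr,
# as long as the target encoding is passed explicitly.
puts 10084.chr(Encoding::UTF_8)    # prints "❤"
```

For a single codepoint `Integer#chr` is arguably more readable than `pack`, but it needs the encoding argument for anything outside the one-byte range.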

Oh, and of course the usual \uXXXX escape syntax works as well, but you need the hex string for that:

irb> 10084.to_s 16
=> "2764"
irb> "\u{2764}"
=> "❤"
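
If you juggle decimal and hex values a lot, a tiny formatter saves some typing. A minimal sketch, with `u_notation` being a made-up helper name:

```ruby
# Hypothetical helper: format a codepoint in the usual U+XXXX notation.
def u_notation(codepoint)
  format("U+%04X", codepoint)
end

puts u_notation("❤".ord)      # prints "U+2764"
puts "2764".hex                # prints 10084 (hex string back to decimal)
puts "\u2764" == "\u{2764}"    # prints "true", both escapes give the same string
```

The braces form `\u{...}` is the one to reach for with codepoints above U+FFFF, since the plain `\u` form takes exactly four hex digits.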

Sometimes you need to see the actual bytes. That's easy in Ruby as well:

irb> "❤".bytes
=> [226, 157, 164]
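
Those three numbers are the UTF-8 encoding of the single character. A short sketch that shows the difference between characters and bytes, and prints the bytes in hex the way a hex dump would (again assuming a UTF-8 source file):

```ruby
heart = "❤"

puts heart.length      # prints 1 (one character)
puts heart.bytesize    # prints 3 (three bytes in UTF-8)

# Show each byte in hex, the form usually seen in hex dumps.
hex_bytes = heart.bytes.map { |byte| format("%02X", byte) }
puts hex_bytes.join(" ")   # prints "E2 9D A4"

# The raw bytes can be turned back into the string as well.
# pack("C*") yields a binary string, so it has to be tagged as UTF-8.
restored = heart.bytes.pack("C*").force_encoding(Encoding::UTF_8)
puts restored == heart     # prints "true"
```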

There is documentation on these things: see String#codepoints, Array#pack and String#bytes in the Ruby core docs.

Enjoy the world of Unicode!