Creating Unicode character from its number

Question

I want to display a Unicode character in Java   If I do this  it works just fine   String symbol     u2202    symbol is equal to       That s what I want   The problem is that I know the Unicode number and need to create the Unicode symbol from that   I tried  to me  the obvious thing   int c   2202  String symbol       u    c    However  in this case  symbol is equal to   u2202    That s not what I want   How can I construct the symbol if I know its Unicode number  but only at run-time---I can t hard-code it in like the first example

User · Answer

String st  2202   int cp Integer parseInt st 16     it convert st into hex number  char c   Character toChars cp   System out println c     its display the character corresponding to   u2202

User · Answer

The code below will write the 4 unicode chars  represented by decimals  for the word  be  in Japanese   Yes  the verb  be  in Japanese has 4 chars  The value of characters is in decimal and it has been read into an array of String   -- using split for instance   If you have Octal or Hex  parseInt take a radix as well      pseudo code    1  init the String   containing the 4 unicodes in decima    intsInStrs     2  allocate the proper number of character pairs    c2s    3  Using Integer parseInt      with radix or not  get the right int value    4  place it in the correct location of in the array of character pairs    5  convert c2s   to String    6  print   String   intsInStrs     12354    12426    12414    12377       1  char    c2s   new char  intsInStrs length   2       2   two chars per unicode  int ii   0  for  String intString   intsInStrs           3  NB ii 2 because the 16 bit value of Unicode is written in 2 chars     Character toChars Integer parseInt intsInStrs ii    c2s  ii   2       3   4       ii     advance to the next char    String symbols   new String c2s       5  System out println   nLooooonger code point      symbols      6     I tested it in Eclipse and Java 7 and it works   Enjoy

User · Answer

The other answers here either only support unicode up to U FFFF  the answers dealing with just one instance of char  or don t tell how to get to the actual symbol  the answers stopping at Character toChars   or using incorrect method after that   so adding my answer here  too  To support supplementary code points also  this is what needs to be done     this character     http   www isthisthingon org unicode index php page 1F amp subpage 4 amp glyph 1F495    using code points here  not U n notation    for equivalence with U n  below would be 0xnnnn int codePoint   128149     converting to char   pair char   charPair   Character toChars codePoint      and to String  containing the character we want String symbol   new String charPair       we now have str with the desired character as the first item    confirm that we indeed have character with code point 128149 System out println  quot First code point   quot    symbol codePointAt 0     I also did a quick test as to which conversion methods work and which don t int codePoint   128149  char   charPair   Character toChars codePoint    System out println new String charPair  0  2  codePointAt 0       128149  worked System out println charPair toString   codePointAt 0              91  didn t work System out println new String charPair  codePointAt 0             128149  worked System out println String valueOf codePoint  codePointAt 0        49  didn t work System out println new String new int    codePoint   0  1  codePointAt 0                                                                      128149  worked

User · Answer

Remember that char is an integral type  and thus can be given an integer value  as well as a char constant   char c   0x2202   aka 8706 in decimal   u codepoints are in hex  String s   String valueOf c

User · Answer

ANSWER IS IN DOT NET 4 5 and in java  there must be a similar approach exist   I am from West Bengal in INDIA   As I understand your problem is     You want to produce similar to        It is a letter in Bengali language   which has Unicode HEX   0X0985   Now if you know this value in respect of your language then how will you produce that language specific Unicode symbol right    In Dot Net it is as simple as this    int c   0X0985  string x   Char ConvertFromUtf32 c     Now x is your answer  But this is HEX by HEX convert and sentence to sentence conversion is a work for researchers  P

User · Answer

char c  char 0x2202  String s    c

User · Answer

Although this is an old question  there is a very easy way to do this in Java 11 which was released today  you can use a new overload of Character toString     public static String toString  int codePoint   Returns a String object representing the specified character  Unicode code point   The result is a string of length 1 or 2  consisting solely of the specified codePoint   Parameters  codePoint - the codePoint to be converted  Returns  the string representation of the specified codePoint  Throws  IllegalArgumentException - if the specified codePoint is not a valid Unicode code point   Since  11   Since this method supports any Unicode code point  the length of the returned String is not necessarily 1   The code needed for the example given in the question is simply       int codePoint     u2202       String s   Character toString codePoint       lt  lt  lt  Requires JDK 11         System out println s      Prints     This approach offers several advantages    It works for any Unicode code point rather than just those that can be handled using a char   It s concise  and it s easy to understand what the code is doing  It returns the value as a string rather than a char    which is often what you want  The answer posted by McDowell is appropriate if you want the code point returned as char

User · Answer

If you want to get a UTF-16 encoded code unit as a char  you can parse the integer and cast to it as others have suggested   If you want to support all code points  use Character toChars int   This will handle cases where code points cannot fit in a single char value   Doc says      Converts the specified character  Unicode code point  to its UTF-16 representation stored in a char array  If the specified code point is a BMP  Basic Multilingual Plane or Plane 0  value  the resulting char array has the same value as codePoint  If the specified code point is a supplementary code point  the resulting char array has the corresponding surrogate pair

User · Answer

This is how you do it   int cc   0x2202  char ccc    char  Integer parseInt String valueOf cc   16   final String text   String valueOf ccc     This solution is by Arne Vajh  j

User · Answer

This one worked fine for me     String cc2    2202     String text2   String valueOf Character toChars Integer parseInt cc2  16       Now text2 will have

User · Answer

Here is a block to print out unicode chars between  u00c0 to  u00ff   char   ca      u00c0    for  int i   0  i  lt  4  i          for  int j   0  j  lt  16  j              String sc   new String ca           System out print sc                 ca 0               System out println

User · Answer

Just cast your int to a char  You can convert that to a String using Character toString     String s   Character toString  char c     EDIT   Just remember that the escape sequences in Java source code  the  u bits  are in HEX  so if you re trying to reproduce an escape sequence  you ll need something like int c   0x2202

User · Answer

Unfortunatelly  to remove one backlash as mentioned in first comment  newbiedoodle  don t lead to good result  Most  if not all  IDE issues syntax error  The reason is in this  that Java Escaped Unicode format expects syntax   uXXXX   where XXXX are 4 hexadecimal digits  which are mandatory  Attempts to fold this string from pieces fails  Of course    u  is not the same as    u   The first syntax means escaped  u   second means escaped backlash  which is backlash  followed by  u   It is strange  that on the Apache pages is presented utility  which doing exactly this behavior  But in reality  it is Escape mimic utility  Apache has some its own utilities  i didn t testet them   which do this work for you  May be  it is still not that  what you want to have  Apache Escape Unicode utilities But this utility 1 have good approach to the solution  With combination described above  MeraNaamJoker   My solution is create this Escaped mimic string and then convert it back to unicode  to avoid real Escaped Unicode restriction   I used it for copying text  so it is possible  that in uencode method will be better to use    u  except      u   Try it              Converts character to the mimic unicode format i e     u0020              This format is the Java source code format               CharUtils unicodeEscaped           u0020         CharUtils unicodeEscaped  A        u0041              param ch  the character to convert       return is in the mimic of escaped unicode string           public static String unicodeEscaped char ch        String returnStr        String uniTemplate     u0000       final static String charEsc      u        if  ch  lt  0x10          returnStr    000    Integer toHexString ch             else if  ch  lt  0x100          returnStr    00    Integer toHexString ch             else if  ch  lt  0x1000          returnStr    0    Integer toHexString ch             else       returnStr        Integer toHexString ch        return charEsc   returnStr                  Converts the string from UTF8 to mimic unicode format i e     u0020        notice  i cannot use real unicode format  because this is immediately translated      to the character in time of compiling and editor  i e  netbeans  checking it      instead reaal unicode format i e    u0020  i using mimic unicode format    u0020       as a string  but it doesn t gives the same results  of course            This format is the Java source code format               CharUtils unicodeEscaped           u0020         CharUtils unicodeEscaped  A        u0041              param String - nationalString in the UTF8 string to convert       return is the string in JAVA unicode mimic escaped         public String encodeStr String nationalString  throws UnsupportedEncodingException       String convertedString            for  int i   0  i  lt  nationalString length    i            Character chs   nationalString charAt i         convertedString    unicodeEscaped chs             return convertedString                  Converts the string from mimic unicode format i e     u0020  back to UTF8             This format is the Java source code format               CharUtils unicodeEscaped           u0020         CharUtils unicodeEscaped  A        u0041              param String - nationalString in the JAVA unicode mimic escaped       return is the string in UTF8 string         public String uencodeStr String escapedString  throws UnsupportedEncodingException       String convertedString            String   arrStr   escapedString split      u        String str  istr      for  int i   1  i  lt  arrStr length  i            str   arrStr i         if   str isEmpty              Integer iI   Integer parseInt str  16           char   chaCha   Character toChars iI           convertedString    String valueOf chaCha                     return convertedString

[java] Creating Unicode character from its number

Examples related to java

Examples related to string

Examples related to unicode

Examples related to character