Replacing accented characters php

Question

I am trying to replace accented characters with the normal replacements  Below is what I am currently doing        string      ric Cantona        strict   strtolower  string        echo  After Lower     strict        patterns 0                               patterns 1                               patterns 2                            patterns 3                                  patterns 4                            patterns 5                 patterns 6                 patterns 7                 replacements 0     a        replacements 1     e        replacements 2     i        replacements 3     o        replacements 4     u        replacements 5     ae        replacements 6     c        replacements 7     ss         strict   preg replace  patterns   replacements   strict       echo  Final     strict    This gives me       After Lower    ric cantona     Final  ric cantona   The above gives me ric cantona I want the output to be eric cantona   can anyone help me with where I am going wrong

User · Answer

if you have http   php net manual en book intl php available  this will solve your problem    string      ric Cantona    transliterator   Transliterator  createFromRules     NFD       Nonspacing Mark   Remove     Lower       NFC    Transliterator  FORWARD   echo  normalized    transliterator- gt transliterate  string     EDIT  To install the php extension in ubuntu   apt-get install php-intl   Don t forget  in composer  to require the extension ext-intl to ensure it properly fits into deployed systems

User · Answer

I ve searched and your idea for accent striping is quite awesome and cost-effective but your regex is wrongly done and misses 2 extra params  Long story short the regex must be    patterns 0                   ui    patterns 1                   ui    patterns 2                 ui    patterns 3                     ui    patterns 4                 ui    patterns 5         ui    patterns 6         ui    patterns 7         ui    replacements 0     a    replacements 1     e    replacements 2     i    replacements 3     o    replacements 4     u    replacements 5     ae    replacements 6     c    replacements 7     ss     As you can see is quite similar but the most important thing is the paramas after the second slash of the regular expression  When a regualr expression is like this   someCoolRegex  ui the u specifies that it must use unicode and the i specifies that is case insensitive  I ve tested my own and with the ansewer in this forum I must say is more cost efective than using strtr   Hope someone reads this answer

User · Answer

This worked for me    lt  php setlocale LC ALL   en US utf8      val   iconv  UTF-8   ASCII  TRANSLIT   val     gt

User · Answer

Disclaimer  I m not supporting this answer anymore  I was blind at that time   But thanks for the up-votes  P   You can take this as basis  From WordPress  used to generate pretty urls  the entry point is the slugify   function           Converts all accent characters to ASCII characters        If there are no accent characters  then the string given is just returned         param string  string Text that might have accent characters     return string Filtered string with replaced  nice  characters       function remove accents  string     if   preg match     x80- xff      string     return  string   if  seems utf8  string        chars   array       Decompositions for Latin-1 Supplement   chr 195  chr 128    gt   A   chr 195  chr 129    gt   A     chr 195  chr 130    gt   A   chr 195  chr 131    gt   A     chr 195  chr 132    gt   A   chr 195  chr 133    gt   A     chr 195  chr 135    gt   C   chr 195  chr 136    gt   E     chr 195  chr 137    gt   E   chr 195  chr 138    gt   E     chr 195  chr 139    gt   E   chr 195  chr 140    gt   I     chr 195  chr 141    gt   I   chr 195  chr 142    gt   I     chr 195  chr 143    gt   I   chr 195  chr 145    gt   N     chr 195  chr 146    gt   O   chr 195  chr 147    gt   O     chr 195  chr 148    gt   O   chr 195  chr 149    gt   O     chr 195  chr 150    gt   O   chr 195  chr 153    gt   U     chr 195  chr 154    gt   U   chr 195  chr 155    gt   U     chr 195  chr 156    gt   U   chr 195  chr 157    gt   Y     chr 195  chr 159    gt   s   chr 195  chr 160    gt   a     chr 195  chr 161    gt   a   chr 195  chr 162    gt   a     chr 195  chr 163    gt   a   chr 195  chr 164    gt   a     chr 195  chr 165    gt   a   chr 195  chr 167    gt   c     chr 195  chr 168    gt   e   chr 195  chr 169    gt   e     chr 195  chr 170    gt   e   chr 195  chr 171    gt   e     chr 195  chr 172    gt   i   chr 195  chr 173    gt   i     chr 195  chr 174    gt   i   chr 195  chr 175    gt   i     chr 195  chr 177    gt   n   chr 195  chr 178    gt   o     chr 195  chr 179    gt   o   chr 195  chr 180    gt   o     chr 195  chr 181    gt   o   chr 195  chr 182    gt   o     chr 195  chr 182    gt   o   chr 195  chr 185    gt   u     chr 195  chr 186    gt   u   chr 195  chr 187    gt   u     chr 195  chr 188    gt   u   chr 195  chr 189    gt   y     chr 195  chr 191    gt   y        Decompositions for Latin Extended-A   chr 196  chr 128    gt   A   chr 196  chr 129    gt   a     chr 196  chr 130    gt   A   chr 196  chr 131    gt   a     chr 196  chr 132    gt   A   chr 196  chr 133    gt   a     chr 196  chr 134    gt   C   chr 196  chr 135    gt   c     chr 196  chr 136    gt   C   chr 196  chr 137    gt   c     chr 196  chr 138    gt   C   chr 196  chr 139    gt   c     chr 196  chr 140    gt   C   chr 196  chr 141    gt   c     chr 196  chr 142    gt   D   chr 196  chr 143    gt   d     chr 196  chr 144    gt   D   chr 196  chr 145    gt   d     chr 196  chr 146    gt   E   chr 196  chr 147    gt   e     chr 196  chr 148    gt   E   chr 196  chr 149    gt   e     chr 196  chr 150    gt   E   chr 196  chr 151    gt   e     chr 196  chr 152    gt   E   chr 196  chr 153    gt   e     chr 196  chr 154    gt   E   chr 196  chr 155    gt   e     chr 196  chr 156    gt   G   chr 196  chr 157    gt   g     chr 196  chr 158    gt   G   chr 196  chr 159    gt   g     chr 196  chr 160    gt   G   chr 196  chr 161    gt   g     chr 196  chr 162    gt   G   chr 196  chr 163    gt   g     chr 196  chr 164    gt   H   chr 196  chr 165    gt   h     chr 196  chr 166    gt   H   chr 196  chr 167    gt   h     chr 196  chr 168    gt   I   chr 196  chr 169    gt   i     chr 196  chr 170    gt   I   chr 196  chr 171    gt   i     chr 196  chr 172    gt   I   chr 196  chr 173    gt   i     chr 196  chr 174    gt   I   chr 196  chr 175    gt   i     chr 196  chr 176    gt   I   chr 196  chr 177    gt   i     chr 196  chr 178    gt   IJ  chr 196  chr 179    gt   ij     chr 196  chr 180    gt   J   chr 196  chr 181    gt   j     chr 196  chr 182    gt   K   chr 196  chr 183    gt   k     chr 196  chr 184    gt   k   chr 196  chr 185    gt   L     chr 196  chr 186    gt   l   chr 196  chr 187    gt   L     chr 196  chr 188    gt   l   chr 196  chr 189    gt   L     chr 196  chr 190    gt   l   chr 196  chr 191    gt   L     chr 197  chr 128    gt   l   chr 197  chr 129    gt   L     chr 197  chr 130    gt   l   chr 197  chr 131    gt   N     chr 197  chr 132    gt   n   chr 197  chr 133    gt   N     chr 197  chr 134    gt   n   chr 197  chr 135    gt   N     chr 197  chr 136    gt   n   chr 197  chr 137    gt   N     chr 197  chr 138    gt   n   chr 197  chr 139    gt   N     chr 197  chr 140    gt   O   chr 197  chr 141    gt   o     chr 197  chr 142    gt   O   chr 197  chr 143    gt   o     chr 197  chr 144    gt   O   chr 197  chr 145    gt   o     chr 197  chr 146    gt   OE  chr 197  chr 147    gt   oe     chr 197  chr 148    gt   R  chr 197  chr 149    gt   r     chr 197  chr 150    gt   R  chr 197  chr 151    gt   r     chr 197  chr 152    gt   R  chr 197  chr 153    gt   r     chr 197  chr 154    gt   S  chr 197  chr 155    gt   s     chr 197  chr 156    gt   S  chr 197  chr 157    gt   s     chr 197  chr 158    gt   S  chr 197  chr 159    gt   s     chr 197  chr 160    gt   S   chr 197  chr 161    gt   s     chr 197  chr 162    gt   T   chr 197  chr 163    gt   t     chr 197  chr 164    gt   T   chr 197  chr 165    gt   t     chr 197  chr 166    gt   T   chr 197  chr 167    gt   t     chr 197  chr 168    gt   U   chr 197  chr 169    gt   u     chr 197  chr 170    gt   U   chr 197  chr 171    gt   u     chr 197  chr 172    gt   U   chr 197  chr 173    gt   u     chr 197  chr 174    gt   U   chr 197  chr 175    gt   u     chr 197  chr 176    gt   U   chr 197  chr 177    gt   u     chr 197  chr 178    gt   U   chr 197  chr 179    gt   u     chr 197  chr 180    gt   W   chr 197  chr 181    gt   w     chr 197  chr 182    gt   Y   chr 197  chr 183    gt   y     chr 197  chr 184    gt   Y   chr 197  chr 185    gt   Z     chr 197  chr 186    gt   z   chr 197  chr 187    gt   Z     chr 197  chr 188    gt   z   chr 197  chr 189    gt   Z     chr 197  chr 190    gt   z   chr 197  chr 191    gt   s        Euro Sign   chr 226  chr 130  chr 172    gt   E        GBP  Pound  Sign   chr 194  chr 163    gt          string   strtr  string   chars      else        Assume ISO-8859-1 if not UTF-8    chars  in     chr 128  chr 131  chr 138  chr 142  chr 154  chr 158      chr 159  chr 162  chr 165  chr 181  chr 192  chr 193  chr 194      chr 195  chr 196  chr 197  chr 199  chr 200  chr 201  chr 202      chr 203  chr 204  chr 205  chr 206  chr 207  chr 209  chr 210      chr 211  chr 212  chr 213  chr 214  chr 216  chr 217  chr 218      chr 219  chr 220  chr 221  chr 224  chr 225  chr 226  chr 227      chr 228  chr 229  chr 231  chr 232  chr 233  chr 234  chr 235      chr 236  chr 237  chr 238  chr 239  chr 241  chr 242  chr 243      chr 244  chr 245  chr 246  chr 248  chr 249  chr 250  chr 251      chr 252  chr 253  chr 255      chars  out      EfSZszYcYuAAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy      string   strtr  string   chars  in     chars  out        double chars  in     array chr 140   chr 156   chr 198   chr 208   chr 222   chr 223   chr 230   chr 240   chr 254       double chars  out     array  OE    oe    AE    DH    TH    ss    ae    dh    th       string   str replace  double chars  in     double chars  out     string       return  string            Checks to see if a string is utf8 encoded         author bmorel at ssi dot fr        param string  Str The string to be checked     return bool True if  Str fits a UTF-8 model  false otherwise      function seems utf8  Str      by bmorel at ssi dot fr   length   strlen  Str    for   i   0   i  lt   length   i        if  ord  Str  i    lt  0x80  continue    0bbbbbbb   elseif   ord  Str  i    amp  0xE0     0xC0   n   1    110bbbbb   elseif   ord  Str  i    amp  0xF0     0xE0   n   2    1110bbbb   elseif   ord  Str  i    amp  0xF8     0xF0   n   3    11110bbb   elseif   ord  Str  i    amp  0xFC     0xF8   n   4    111110bb   elseif   ord  Str  i    amp  0xFE     0xFC   n   5    1111110b   else return false    Does not match any model   for   j   0   j  lt   n   j        n bytes matching 10bbbbbb follow      if      i     length       ord  Str  i    amp  0xC0     0x80      return false          return true     function utf8 uri encode  utf8 string   length   0      unicode         values   array      num octets   1    unicode length   0    string length   strlen  utf8 string    for   i   0   i  lt   string length   i         value   ord  utf8 string  i      if   value  lt  128       if   length  amp  amp    unicode length  gt    length       break      unicode    chr  value       unicode length        else      if  count  values     0   num octets     value  lt  224    2   3      values      value     if   length  amp  amp    unicode length     num octets   3    gt   length      break     if  count   values       num octets        if   num octets    3          unicode          dechex  values 0           dechex  values 1           dechex  values 2          unicode length    9        else         unicode          dechex  values 0           dechex  values 1          unicode length    6             values   array         num octets   1               return  unicode            Sanitizes title  replacing whitespace with dashes        Limits the output to alphanumeric characters  underscore     and dash  -      Whitespace becomes a dash         param string  title The title to be sanitized      return string The sanitized title      function slugify  title      title   strip tags  title       Preserve escaped octets    title   preg replace      a-fA-F0-9  a-fA-F0-9       --- 1---    title       Remove percent signs that are not part of an octet    title   str replace           title       Restore octets    title   preg replace   ---  a-fA-F0-9  a-fA-F0-9  ---       1    title     title   remove accents  title    if  seems utf8  title       if  function exists  mb strtolower          title   mb strtolower  title   UTF-8           title   utf8 uri encode  title  200        title   strtolower  title     title   preg replace    amp              title      kill entities   title   preg replace      a-z0-9  -          title     title   preg replace    s      -    title     title   preg replace   -      -    title     title   trim  title   -     return  title

User · Answer

I have tried all sorts based on the variations listed in the answers  but the following worked    unwanted array   array           gt  S         gt  s         gt  Z         gt  z         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                                     gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O         gt  O         gt  O         gt  O         gt  U                                     gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c                                     gt  e         gt  e         gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o         gt  o         gt  o                                     gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  b         gt  y      str   strtr   str   unwanted array

User · Answer

You can use PHP strtr   function to get rid of accented characters      string      ric Cantona    accented array   array       gt  S         gt  s         gt  Z         gt  z         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E        gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O         gt  O         gt  O         gt  O         gt  U        gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c        gt  e         gt  e         gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o         gt  o         gt  o        gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  b         gt  y       required str   strtr   string   accented array

User · Answer

I know  that question has been asked a long long time ago     I was looking for a short and elegant solution  but couldn t find satisfaction for two reasons   First  most of the existing solutions replace a list of characters by a list of other characters  Unfortunately  it require to use a specific encoding for the php script file itself which might be unwanted   Second  using iconv seems to be a good way  but it s not enough as the result of a converted character could be one or two characters  or a Fatal Exception   So I wrote that small function which does the job    function replaceAccent  string   replacement               alnumPattern       a-zA-Z0-9             if  preg match  alnumPattern   string             return  string              ret   array map          function   chr  use   alnumPattern   replacement                if  preg match  alnumPattern   chr                     return  chr                else                    chr    iconv  ISO-8859-1    ASCII  TRANSLIT    chr                   if  strlen  chr     1                        return  chr                    elseif  strlen  chr   gt  1                         ret                           foreach  str split  chr  as  char2                            if  preg match  alnumPattern   char2                                  ret     char2                                                                      return  ret                    else                          replace whatever iconv fail to convert by something else                     return  replacement                                                     str split  string              return implode  ret

User · Answer

In PHP 5 4 the intl extension provides a new class named Transliterator   I believe that s the best way to remove diacritics for two reasons    Transliterator is based on ICU  so you re using the tables of the ICU library  ICU is a great project  developed over the year to provide comprehensive tables and functionalities  Whatever table you want to write yourself  it will never be as complete as the one from ICU  In UTF-8  characters could be represented differently  For example  the character    could be saved as a single  multi-byte  character  or as the combination of characters     multibyte  and n  In addition to this  some characters in Unicode are homograph  they look the same while having different codepoints  For this reason it s also important to normalize the string    Here s a sample code  taken from an old answer of mine    lt  php  transliterator   Transliterator  createFromRules     NFD       Nonspacing Mark   Remove     NFC    Transliterator  FORWARD    test     abcd      e                                               ti  sto    foreach  test as  e         normalized    transliterator- gt transliterate  e       echo  e    -- gt     normalized   n       gt    Result   abcd -- gt  abcd   e -- gt  ee     -- gt                     -- gt  aouieeu                -- gt  aouieeu ti  sto -- gt  tiesto   The first argument for the Transliterator class performs the removal of diacritics as well as the normalization of the string

User · Answer

Adding a little bit to what Lizard said  it worked to display correctly on web page  but added some other codes to complete what I was looking for replacing my tags to search correctly into my database with special characters  Thanks in advance    unwanted array   array           gt  S         gt  s         gt  Z         gt  z         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                                     gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O         gt  O         gt  O         gt  O         gt  U                                     gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c                                     gt  e         gt  e         gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o         gt  o         gt  o                                     gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  b         gt  y                                  amp  225    gt  a     amp  233    gt  e     amp  237    gt  i     amp  243    gt  o     amp  250    gt  u                                   amp  193    gt  A     amp  201    gt  E     amp  205    gt  I     amp  211    gt  O     amp  218    gt  U                                 amp  209    gt  N     amp  241    gt  n      newtag   strtr   newtag   unwanted array

User · Answer

protected   convertTable   array        amp amp     gt   and           gt   at             gt   c          gt   r          gt   a              gt   a          gt   a          gt   a          gt   a          gt   ae         gt   c              gt   e          gt   e          gt   e          gt   i          gt   i          gt   i              gt   i          gt   o          gt   o          gt   o          gt   o          gt   o              gt   o          gt   u          gt   u          gt   u          gt   u          gt   y              gt   ss         gt   a          gt   a          gt   a          gt   a          gt   a              gt   ae         gt   c          gt   e          gt   e          gt   e          gt   e              gt   i          gt   i          gt   i          gt   i          gt   o          gt   o              gt   o          gt   o          gt   o          gt   o          gt   u          gt   u              gt   u          gt   u          gt   y          gt   p          gt   y    A    gt   a        a    gt   a    A    gt   a    a    gt   a    A    gt   a    a    gt   a    C    gt   c        c    gt   c    C    gt   c    c    gt   c    C    gt   c    c    gt   c    C    gt   c        c    gt   c    D    gt   d    d    gt   d          gt   d    d    gt   d    E    gt   e        e    gt   e    E    gt   e    e    gt   e    E    gt   e    e    gt   e    E    gt   e        e    gt   e    E    gt   e    e    gt   e    G    gt   g    g    gt   g    G    gt   g        g    gt   g    G    gt   g    g    gt   g    G    gt   g    g    gt   g    H    gt   h        h    gt   h    H    gt   h    h    gt   h    I    gt   i    i    gt   i    I    gt   i        i    gt   i    I    gt   i    i    gt   i    I    gt   i    i    gt   i    I    gt   i        i    gt   i         gt   ij        gt   ij   J    gt   j    j    gt   j    K    gt   k        k    gt   k         gt   k    L    gt   l    l    gt   l    L    gt   l    l    gt   l        L    gt   l    l    gt   l         gt   l         gt   l    L    gt   l    l    gt   l        N    gt   n    n    gt   n    N    gt   n    n    gt   n    N    gt   n    n    gt   n             gt   n         gt   n         gt   n    O    gt   o    o    gt   o    O    gt   o        o    gt   o    O    gt   o    o    gt   o          gt   oe         gt   oe   R    gt   r        r    gt   r    R    gt   r    r    gt   r    R    gt   r    r    gt   r    S    gt   s        s    gt   s    S    gt   s    s    gt   s    S    gt   s    s    gt   s          gt   s              gt   s    T    gt   t    t    gt   t    T    gt   t    t    gt   t    T    gt   t        t    gt   t    U    gt   u    u    gt   u    U    gt   u    u    gt   u    U    gt   u        u    gt   u    U    gt   u    u    gt   u    U    gt   u    u    gt   u    U    gt   u        u    gt   u    W    gt   w    w    gt   w    Y    gt   y    y    gt   y          gt   y        Z    gt   z    z    gt   z    Z    gt   z    z    gt   z          gt   z          gt   z             gt   z         gt   e          gt   f    O    gt   o    o    gt   o    U    gt   u        u    gt   u    A    gt   a    a    gt   a    I    gt   i    i    gt   i    O    gt   o        o    gt   o    U    gt   u    u    gt   u    U    gt   u    u    gt   u    U    gt   u        u    gt   u    U    gt   u    u    gt   u    U    gt   u    u    gt   u         gt   a             gt   a         gt   ae        gt   ae        gt   o         gt   o         gt   e             gt   jo        gt   e         gt   i         gt   i         gt   a         gt   b             gt   v         gt   g         gt   d         gt   e         gt   zh        gt   z             gt   i         gt   j         gt   k         gt   l         gt   m         gt   n             gt   o         gt   p         gt   r         gt   s         gt   t         gt   u             gt   f         gt   h         gt   c         gt   ch        gt   sh        gt   sch             gt   -         gt   y         gt   -         gt   je        gt   ju        gt   ja             gt   a         gt   b         gt   v         gt   g         gt   d         gt   e             gt   zh        gt   z         gt   i         gt   j         gt   k         gt   l             gt   m         gt   n         gt   o         gt   p         gt   r         gt   s             gt   t         gt   u         gt   f         gt   h         gt   c         gt   ch             gt   sh        gt   sch        gt   -        gt   y         gt   -         gt   je             gt   ju        gt   ja        gt   jo        gt   e         gt   i         gt   i             gt   g         gt   g         gt   a         gt   b         gt   g         gt   d             gt   h         gt   v         gt   z         gt   h         gt   t         gt   i             gt   k         gt   k         gt   l         gt   m         gt   m         gt   n             gt   n         gt   s         gt   e         gt   p         gt   p         gt   C             gt   c         gt   q         gt   r         gt   w         gt   t           gt   tm        From magento  im using it for basically everything

User · Answer

I just came accross the answer from Lizard which is extremely helpful - especially when you do some sorting  Isn t is beautiful how many chars we need to say mostly the same     If anyone else if looking for a all-in solution  as far as the comments above tell   here s the copy amp paste          Replace language-specific characters by ASCII-equivalents      param string  s     return string     public static function normalizeChars  s         replace   array               gt  -        gt  -        gt  -        gt  -            A   gt  A    A   gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  Ae                 gt  B            C   gt  C        gt  C         gt  C                 gt  E    E   gt  E         gt  E         gt  E         gt  E            G   gt  G            I   gt  I         gt  I         gt  I         gt  I         gt  I            L   gt  L                 gt  N    N   gt  N                 gt  O         gt  O         gt  O         gt  O         gt  O         gt  Oe            S   gt  S    S   gt  S        gt  S         gt  S                gt  T                 gt  U         gt  U         gt  U         gt  Ue                 gt  Y            Z   gt  Z         gt  Z    Z   gt  Z                 gt  a    a   gt  a    a   gt  a         gt  a    a   gt  a         gt  a    A   gt  a        gt  a        gt  a         gt  a         gt  a        gt  a        gt  a    A   gt  a        gt  a    a   gt  a         gt  ae         gt  ae        gt  ae        gt  ae                gt  b        gt  b        gt  b         gt  b            c   gt  c    C   gt  c    C   gt  c    c   gt  c         gt  c        gt  c        gt  c    c   gt  c        gt  c    C   gt  c    c   gt  c        gt  ch        gt  ch                gt  d    d   gt  d         gt  d    D   gt  d    d   gt  d        gt  d        gt  D         gt  d                gt  e        gt  e        gt  e        gt  e        gt  e    e   gt  e    e   gt  e    e   gt  e    E   gt  e    E   gt  e    e   gt  e    e   gt  e    E   gt  e        gt  e    E   gt  e         gt  e        gt  e         gt  e         gt  e         gt  e                gt  f         gt  f        gt  f            g   gt  g    G   gt  g    G   gt  g    G   gt  g        gt  g        gt  g    g   gt  g    g   gt  g        gt  g        gt  g        gt  g    g   gt  g                gt  h    h   gt  h        gt  h    H   gt  h    H   gt  h    h   gt  h        gt  h        gt  h                 gt  i         gt  i         gt  i         gt  i    i   gt  i    i   gt  i    i   gt  i    I   gt  i        gt  i    i   gt  i    i   gt  i    I   gt  i    I   gt  i        gt  i    I   gt  i        gt  i        gt  i    I   gt  i        gt  i        gt  i        gt  i    i   gt  i        gt  ij        gt  ij                gt  j        gt  j    J   gt  j    j   gt  j        gt  ja        gt  ja        gt  je        gt  je        gt  jo        gt  jo        gt  ju        gt  ju                gt  k        gt  k    K   gt  k        gt  k        gt  k    k   gt  k        gt  k                gt  l        gt  l        gt  l    l   gt  l    l   gt  l    l   gt  l    L   gt  l    L   gt  l        gt  l    L   gt  l    l   gt  l        gt  l                gt  m        gt  m        gt  m        gt  m                 gt  n        gt  n    N   gt  n        gt  n        gt  n        gt  n        gt  n    n   gt  n        gt  n    n   gt  n        gt  n    N   gt  n    n   gt  n                gt  o        gt  o    o   gt  o         gt  o         gt  o    O   gt  o    o   gt  o    O   gt  o    O   gt  o    o   gt  o         gt  o        gt  o    o   gt  o         gt  o        gt  o    O   gt  o    o   gt  o         gt  o    O   gt  o         gt  oe         gt  oe         gt  oe                gt  p        gt  p        gt  p        gt  p                gt  q            r   gt  r    r   gt  r    R   gt  r    r   gt  r    R   gt  r        gt  r    R   gt  r        gt  r        gt  r                gt  s        gt  s    S   gt  s         gt  s    s   gt  s        gt  s    s   gt  s        gt  s    s   gt  s        gt  sch        gt  sch        gt  sh        gt  sh         gt  ss                gt  t        gt  t    t   gt  t        gt  t    t   gt  t    t   gt  t    T   gt  t        gt  t        gt  t    T   gt  t    T   gt  t          gt  tm            u   gt  u        gt  u    U   gt  u    u   gt  u    U   gt  u    u   gt  u    U   gt  u    U   gt  u    u   gt  u    U   gt  u    u   gt  u    U   gt  u    U   gt  u    u   gt  u    u   gt  u    U   gt  u    U   gt  u    u   gt  u    U   gt  u         gt  u         gt  u         gt  u        gt  u    u   gt  u    u   gt  u    U   gt  u    U   gt  u    u   gt  u    u   gt  u         gt  ue                gt  v        gt  v        gt  v                gt  w    w   gt  w    W   gt  w                gt  y    y   gt  y         gt  y         gt  y         gt  y    Y   gt  y                gt  y         gt  z        gt  z        gt  z    z   gt  z        gt  z    z   gt  z        gt  z        gt  zh        gt  zh             return strtr  s   replace       Note some slight changes regarding the German umlauts        ae   Edit  Included more characters based on the posting from user3682119  except for the copyright symbol  and the comment from daker

User · Answer

To remove the diacritics  use iconv    val   iconv  ISO-8859-1   ASCII  TRANSLIT   val     or   val   iconv  UTF-8   ASCII  TRANSLIT   val     note that php has some weird bug in that it  sometimes   needs to have a locale set to make these conversions work  using setlocale     edit tested  it gets all of your diacritics out of the box    val                                                                                     abc ABC 123   echo iconv  UTF-8   ASCII  TRANSLIT   val      output  updated 2019-12-30   a a a a a d e e e e i i i i o o o o o o u u u u ae c ss abc ABC 123   Note that    is correctly transliterated to d instead of o  as in the accepted answer

User · Answer

I found this way to be a good one  without having to worry too much about charsets and arrays  or iconv  function replace accents  str        str   htmlentities  str  ENT COMPAT   quot UTF-8 quot        str   preg replace    amp   a-zA-Z   uml acute grave circ tilde ring       1   str      return html entity decode  str

User · Answer

An updated answer based on  BurninLeo s answer  function replace spec char  subject         char map   array                gt   -         gt   -         gt   -         gt   -                 gt   A    A    gt   A    A    gt   A    A    gt   A          gt   A          gt   A          gt   A          gt   A          gt   A          gt   A         gt   A    A    gt   A         gt   A                 gt   B         gt   B          gt   B            C    gt   C    C    gt   C          gt   C         gt   C         gt   C    C    gt   C    C    gt   C          gt   C         gt   C                 gt   D    D    gt   D          gt   D         gt   D          gt   D                  gt   E    E    gt   E          gt   E          gt   E          gt   E         gt   E    E    gt   E    E    gt   E    E    gt   E    E    gt   E         gt   E         gt   E         gt   E                 gt   F          gt   F            G    gt   G    G    gt   G    G    gt   G    G    gt   G         gt   G         gt   G         gt   G                 gt   H    H    gt   H         gt   H    H    gt   H         gt   H            I    gt   I          gt   I          gt   I          gt   I          gt   I    I    gt   I    I    gt   I    I    gt   I         gt   I    I    gt   I    I    gt   I         gt   I         gt   I    I    gt   I         gt   I                 gt   J    J    gt   J                 gt   K         gt   K    K    gt   K         gt   K         gt   K            L    gt   L         gt   L         gt   L    L    gt   L    L    gt   L    L    gt   L         gt   L                 gt   M         gt   M         gt   M                  gt   N    N    gt   N         gt   N    N    gt   N         gt   N         gt   N         gt   N         gt   N    N    gt   N                  gt   O          gt   O          gt   O          gt   O          gt   O         gt   O    O    gt   O    O    gt   O    O    gt   O         gt   O    O    gt   O    O    gt   O                 gt   P         gt   P         gt   P                 gt   Q            R    gt   R    R    gt   R    R    gt   R         gt   R         gt   R          gt   R            S    gt   S    S    gt   S         gt   S          gt   S         gt   S    S    gt   S         gt   S                 gt   T         gt   T         gt   T    T    gt   T         gt   T    T    gt   T    T    gt   T                  gt   U          gt   U          gt   U    U    gt   U         gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U                 gt   V         gt   V                  gt   Y         gt   Y    Y    gt   Y          gt   Y            Z    gt   Z          gt   Z    Z    gt   Z         gt   Z         gt   Z                 gt   a    a    gt   a    a    gt   a    a    gt   a          gt   a          gt   a          gt   a          gt   a          gt   a          gt   a         gt   a    a    gt   a         gt   a                 gt   b         gt   b          gt   b            c    gt   c    c    gt   c          gt   c         gt   c         gt   c    c    gt   c    c    gt   c          gt   c         gt   c                 gt   ch         gt   ch                 gt   d    d    gt   d    d    gt   d         gt   d          gt   d                  gt   e    e    gt   e          gt   e          gt   e          gt   e         gt   e    e    gt   e    e    gt   e    e    gt   e    e    gt   e         gt   e         gt   e         gt   e                 gt   f          gt   f            g    gt   g    g    gt   g    g    gt   g    g    gt   g         gt   g         gt   g         gt   g                 gt   h    h    gt   h         gt   h    h    gt   h         gt   h            i    gt   i          gt   i          gt   i          gt   i          gt   i    i    gt   i    i    gt   i    i    gt   i         gt   i    i    gt   i    i    gt   i         gt   i         gt   i    i    gt   i         gt   i                 gt   j         gt   j    J    gt   j    j    gt   j                 gt   k         gt   k    k    gt   k         gt   k         gt   k            l    gt   l         gt   l         gt   l    l    gt   l    l    gt   l    l    gt   l         gt   l                 gt   m         gt   m         gt   m                  gt   n    n    gt   n         gt   n    n    gt   n         gt   n         gt   n         gt   n         gt   n    n    gt   n                  gt   o          gt   o          gt   o          gt   o          gt   o         gt   o    o    gt   o    o    gt   o    o    gt   o         gt   o    o    gt   o    o    gt   o                 gt   p         gt   p         gt   p                 gt   q            r    gt   r    r    gt   r    r    gt   r         gt   r         gt   r          gt   r            s    gt   s    s    gt   s         gt   s          gt   s         gt   s    s    gt   s         gt   s                 gt   t         gt   t         gt   t    t    gt   t         gt   t    t    gt   t    t    gt   t                  gt   u          gt   u          gt   u    u    gt   u         gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u    u    gt   u                 gt   v         gt   v                  gt   y         gt   y    y    gt   y          gt   y            z    gt   z          gt   z    z    gt   z         gt   z         gt   z         gt   z                   gt   tm                 gt   at                  gt   ae         gt   ae          gt   ae          gt   ae         gt   ae                 gt   ij         gt   ij                 gt   ja         gt   ja                 gt   je         gt   je                 gt   jo         gt   jo                 gt   ju         gt   ju                  gt   oe          gt   oe          gt   oe          gt   oe                 gt   sch         gt   sch                 gt   sh         gt   sh                  gt   ss                  gt   ue                 gt   zh         gt   zh              return strtr  subject   char map       string    H   th          t    test    echo replace spec char  string     H   th          t    test     Hi there  jusst a test   This does not mix up upper and lower case chars except for longer chars  eg  ss ch  sch    added          Also if you want to build regex matching regardless to special chars    rss   gt    rrrRrR R       s  S  s s s  s  S  s s s          A vala implementation of this   https   code launchpad net  jeremy-munsch synapse-project ascii-smart  merge 277477  Here is the base list you could work with  with regex replacing  in sublime text  or small script you can build anything from this array to fill your needs    -    gt           A    gt    AAA             A     B    gt           C    gt   CC    CC       D    gt    D         E    gt     E       EEEE       F    gt          G    gt   GGGG       H    gt    H H     I    gt   I        III II  I     J    gt    J    K    gt     K      L    gt   L  LLL     M    gt          N    gt     N N    N    O    gt              OOO OO    P    gt          Q    gt        R    gt   RRR        S    gt   SS    S     T    gt      T TT    U    gt         U UUUUUUUUUUU    V    gt         Y    gt      Y      Z    gt   Z  Z      a    gt    aaa             a     b    gt           c    gt   cc    cc       ch    gt        d    gt    dd       e    gt     e       eeee       f    gt          g    gt   gggg       h    gt    h h     i    gt   i        iii ii  i     j    gt    j    k    gt     k      l    gt   l  lll     m    gt          n    gt     n n    n    o    gt              ooo oo    p    gt          q    gt        r    gt   rrr        s    gt   ss    s     t    gt      t tt    u    gt         u uuuuuuuuuuu    v    gt         y    gt      y      z    gt   z  z       tm    gt          at    gt        ae    gt               ch    gt         ij    gt         j    gt     Jj    ja    gt         je    gt         jo    gt         ju    gt         oe    gt               sch    gt         sh    gt         ss    gt         tm    gt          ue    gt         zh    gt

User · Answer

Vietnamese characters for those who need them        gt  S         gt  s         gt  Z         gt  z         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                                     gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O         gt  O         gt  O         gt  O         gt  U                                     gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c                                     gt  e         gt  e         gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o         gt  o         gt  o                                     gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  b         gt  y      str   strtr   str   unwanted array

User · Answer

strtolower only works on iso-8859-1 encoded strings  You could try with mb strtolower   Or  if you have to mangle with multibyte-extensions  you might as well use iconv s transliteration support   iconv  UTF-8    ISO-8859-1  TRANSLIT    text     Edit   It seems I was a bit fast  You appear to use iso-8859-1  so your current strategy will work  You just need to write the regexp s properly  Eg                           not

User · Answer

It s worked like magically  i have used only array  this pattern is worked for me  check this pattern

User · Answer

As an alternative  a bit more complex in nature through   have a look at how wordpress does accent removal  Made some changes below to make it run independently without referencing wordpress functions         function mbstring binary safe encoding  reset   false        static  encodings    array        static  overloaded   null       if  is null  overloaded              overloaded   function exists  mb internal encoding    amp  amp   ini get  mbstring func overload    amp  2              if  false      overloaded            return             if    reset             encoding   mb internal encoding            array push  encodings   encoding           mb internal encoding  ISO-8859-1               if   reset  amp  amp   encodings             encoding   array pop  encodings           mb internal encoding  encoding            function seems utf8  str        mbstring binary safe encoding         length   strlen  str       mbstring binary safe encoding true       for   i   0   i  lt   length   i               c   ord  str  i            if   c  lt  0x80                 n   0                       0bbbbbbb         elseif    c  amp  0xE0     0xC0                 n   1                       110bbbbb         elseif    c  amp  0xF0     0xE0                 n   2                       1110bbbb         elseif    c  amp  0xF8     0xF0                 n   3                       11110bbb         elseif    c  amp  0xFC     0xF8                 n   4                       111110bb         elseif    c  amp  0xFE     0xFC                 n   5                       1111110b         else                   return false                               Does not match any model             for   j   0   j  lt   n   j                         n bytes matching 10bbbbbb follow                   if      i     length       ord  str  i    amp  0xC0     0x80                         return false                                                     return true             function remove accents  string            if   preg match     x80- xff      string                 return  string                     if  seems utf8  string                  chars   array                     Decompositions for Latin-1 Supplement                        gt   a           gt   o                          gt   A           gt   A                          gt   A           gt   A                          gt   A           gt   A                          gt   AE          gt   C                          gt   E           gt   E                          gt   E           gt   E                          gt   I           gt   I                          gt   I           gt   I                          gt   D           gt   N                          gt   O           gt   O                          gt   O           gt   O                          gt   O           gt   U                          gt   U           gt   U                          gt   U           gt   Y                          gt   TH          gt   s                          gt   a           gt   a                          gt   a           gt   a                          gt   a           gt   a                          gt   ae          gt   c                          gt   e           gt   e                          gt   e           gt   e                          gt   i           gt   i                          gt   i           gt   i                          gt   d           gt   n                          gt   o           gt   o                          gt   o           gt   o                          gt   o           gt   o                          gt   u           gt   u                          gt   u           gt   u                          gt   y           gt   th                          gt   y           gt   O                      Decompositions for Latin Extended-A                  A    gt   A    a     gt   a                    A    gt   A    a     gt   a                    A    gt   A    a     gt   a                    C    gt   C    c     gt   c                    C    gt   C    c     gt   c                    C    gt   C    c     gt   c                    C    gt   C    c     gt   c                    D    gt   D    d     gt   d                          gt   D    d     gt   d                    E    gt   E    e     gt   e                    E    gt   E    e     gt   e                    E    gt   E    e     gt   e                    E    gt   E    e     gt   e                    E    gt   E    e     gt   e                    G    gt   G    g     gt   g                    G    gt   G    g     gt   g                    G    gt   G    g     gt   g                    G    gt   G    g     gt   g                    H    gt   H    h     gt   h                    H    gt   H    h     gt   h                    I    gt   I    i     gt   i                    I    gt   I    i     gt   i                    I    gt   I    i     gt   i                    I    gt   I    i     gt   i                    I    gt   I    i     gt   i                         gt   IJ         gt   ij                    J    gt   J    j     gt   j                    K    gt   K    k     gt   k                         gt   k    L     gt   L                    l    gt   l    L     gt   L                    l    gt   l    L     gt   L                    l    gt   l          gt   L                         gt   l    L     gt   L                    l    gt   l    N     gt   N                    n    gt   n    N     gt   N                    n    gt   n    N     gt   N                    n    gt   n          gt   n                         gt   N          gt   n                    O    gt   O    o     gt   o                    O    gt   O    o     gt   o                    O    gt   O    o     gt   o                          gt   OE          gt   oe                    R    gt   R    r     gt   r                    R    gt   R    r     gt   r                    R    gt   R    r     gt   r                    S    gt   S    s     gt   s                    S    gt   S    s     gt   s                    S    gt   S    s     gt   s                          gt   S           gt   s                    T    gt   T    t     gt   t                    T    gt   T    t     gt   t                    T    gt   T    t     gt   t                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                    W    gt   W    w     gt   w                    Y    gt   Y    y     gt   y                          gt   Y    Z     gt   Z                    z    gt   z    Z     gt   Z                    z    gt   z           gt   Z                          gt   z          gt   s                      Decompositions for Latin Extended-B                       gt   S          gt   s                         gt   T          gt   t                      Euro Sign                         gt   E                      GBP  Pound  Sign                        gt                         Vowels with diacritic  Vietnamese                     unmarked                  O    gt   O    o     gt   o                    U    gt   U    u     gt   u                      grave accent                       gt   A          gt   a                         gt   A          gt   a                         gt   E          gt   e                         gt   O          gt   o                         gt   O          gt   o                         gt   U          gt   u                         gt   Y          gt   y                      hook                       gt   A          gt   a                         gt   A          gt   a                         gt   A          gt   a                         gt   E          gt   e                         gt   E          gt   e                         gt   I          gt   i                         gt   O          gt   o                         gt   O          gt   o                         gt   O          gt   o                         gt   U          gt   u                         gt   U          gt   u                         gt   Y          gt   y                      tilde                       gt   A          gt   a                         gt   A          gt   a                         gt   E          gt   e                         gt   E          gt   e                         gt   O          gt   o                         gt   O          gt   o                         gt   U          gt   u                         gt   Y          gt   y                      acute accent                       gt   A          gt   a                         gt   A          gt   a                         gt   E          gt   e                         gt   O          gt   o                         gt   O          gt   o                         gt   U          gt   u                      dot below                       gt   A          gt   a                         gt   A          gt   a                         gt   A          gt   a                         gt   E          gt   e                         gt   E          gt   e                         gt   I          gt   i                         gt   O          gt   o                         gt   O          gt   o                         gt   O          gt   o                         gt   U          gt   u                         gt   U          gt   u                         gt   Y          gt   y                      Vowels with diacritic  Chinese  Hanyu Pinyin                        gt   a                      macron                  U    gt   U    u     gt   u                      acute accent                  U    gt   U    u     gt   u                      caron                  A    gt   A    a     gt   a                    I    gt   I    i     gt   i                    O    gt   O    o     gt   o                    U    gt   U    u     gt   u                    U    gt   U    u     gt   u                      grave accent                  U    gt   U    u     gt   u                                string   strtr  string   chars             else                chars   array                   Assume ISO-8859-1 if not UTF-8              chars  in       x80 x83 x8a x8e x9a x9e                      x9f xa2 xa5 xb5 xc0 xc1 xc2                      xc3 xc4 xc5 xc7 xc8 xc9 xca                      xcb xcc xcd xce xcf xd1 xd2                      xd3 xd4 xd5 xd6 xd8 xd9 xda                      xdb xdc xdd xe0 xe1 xe2 xe3                      xe4 xe5 xe7 xe8 xe9 xea xeb                      xec xed xee xef xf1 xf2 xf3                      xf4 xf5 xf6 xf8 xf9 xfa xfb                      xfc xfd xff                 chars  out      EfSZszYcYuAAAAAACEEEEIIIINOOOOOOUUUUYaaaaaaceeeeiiiinoooooouuuuyy                 string                strtr  string   chars  in     chars  out                  double chars          array                 double chars  in      array   x8c     x9c     xc6     xd0     xde     xdf     xe6     xf0     xfe                 double chars  out     array  OE    oe    AE    DH    TH    ss    ae    dh    th                 string                str replace  double chars  in     double chars  out     string                      return  string

User · Answer

So I found this on php net page for preg replace function     replace accented chars   string    Zacar  as Ferre  ra      my definition for string variable  accents      amp   A-Za-z  1 2   grave acute circ cedil uml lig        string encoded   htmlentities  string ENT NOQUOTES  UTF-8      string   preg replace  accents   1   string encoded     If you have encoding issues you may get someting like this  Zacar        as Ferre        ra   just decode the string and use said code above   string   utf8 decode  Zacar        as Ferre        ra

User · Answer

You can try this one   class Diacritic       public function replaceDiacritic  input                 input   iconv  UTF-8   ASCII  TRANSLIT   input            input   preg replace                    input            input   preg replace             input           return preg replace                input

[php] Replacing accented characters php

EDIT

Examples related to php

Examples related to string

Examples related to preg-replace

Examples related to non-ascii-characters