PHP function to make slug URL string

Question

I want to have a function to create slugs from Unicode strings  e g  gen slug  Andr  s Cortez   should return andres-cortez  How should I do that

User · Answer

If you have intl extension installed  you can use Transliterator  transliterate function to create a slug easily    lt  php  string    Namnet p   bildt  vlingen    slug    Transliterator  createFromRules          Any-Latin             NFD               Nonspacing Mark   Remove             NFC               Punctuation   Remove             Lower              Separator    gt    -          - gt transliterate   string    echo  slug     namnet-pa-bildtavlingen   gt

User · Answer

I didn t know which one to use so I made a quick bench on phptester net  lt  php     First test    https   stackoverflow com a 42740874 10232729 function slugify STRING  string  STRING  separator    -              accents regex      amp   a-z  1 2     acute cedil circ grave lig orn ring slash th tilde uml   i        special cases       amp     gt   and    quot   quot    gt            string   mb strtolower  trim   string     UTF-8          string   str replace  array keys  special cases   array values   special cases    string         string   preg replace   accents regex    1   htmlentities   string  ENT QUOTES   UTF-8            string   preg replace     a-z0-9  u    separator   string            return preg replace       separator     u    separator   string         Second test    https   stackoverflow com a 13331948 10232729 function slug STRING  string  STRING  separator    -              string   transliterator transliterate  Any-Latin  NFD    Nonspacing Mark   Remove  NFC    Punctuation   Remove  Lower       string                return str replace       separator   string          Third test - My choice    https   stackoverflow com a 38066136 10232729 function slugbis  text         replace               lt     gt        gt     gt       -    gt         amp     gt                quot     gt             gt   A          gt   A          gt   A          gt   A         gt   Ae                  gt   A          gt   A    A    gt   A    A    gt   A    A    gt   A          gt   Ae                  gt   C    C    gt   C    C    gt   C    C    gt   C    C    gt   C    D    gt   D          gt   D                  gt   D          gt   E          gt   E          gt   E          gt   E    E    gt   E            E    gt   E    E    gt   E    E    gt   E    E    gt   E    G    gt   G    G    gt   G            G    gt   G    G    gt   G    H    gt   H    H    gt   H          gt   I          gt   I                  gt   I          gt   I    I    gt   I    I    gt   I    I    gt   I    I    gt   I            I    gt   I         gt   IJ    J    gt   J    K    gt   K    L    gt   K    L    gt   K            L    gt   K    L    gt   K         gt   K          gt   N    N    gt   N    N    gt   N            N    gt   N         gt   N          gt   O          gt   O          gt   O          gt   O                  gt   Oe          gt   Oe          gt   O    O    gt   O    O    gt   O    O    gt   O                  gt   OE    R    gt   R    R    gt   R    R    gt   R    S    gt   S          gt   S            S    gt   S    S    gt   S         gt   S    T    gt   T    T    gt   T    T    gt   T                 gt   T          gt   U          gt   U          gt   U          gt   Ue    U    gt   U                  gt   Ue    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U            W    gt   W          gt   Y    Y    gt   Y          gt   Y    Z    gt   Z          gt   Z            Z    gt   Z          gt   T          gt   a          gt   a          gt   a          gt   a                  gt   ae          gt   ae          gt   a    a    gt   a    a    gt   a    a    gt   a                  gt   ae          gt   c    c    gt   c    c    gt   c    c    gt   c    c    gt   c            d    gt   d    d    gt   d          gt   d          gt   e          gt   e          gt   e                  gt   e    e    gt   e    e    gt   e    e    gt   e    e    gt   e    e    gt   e                  gt   f    g    gt   g    g    gt   g    g    gt   g    g    gt   g    h    gt   h            h    gt   h          gt   i          gt   i          gt   i          gt   i    i    gt   i            i    gt   i    i    gt   i    i    gt   i    i    gt   i         gt   ij    j    gt   j            k    gt   k         gt   k    l    gt   l    l    gt   l    l    gt   l    l    gt   l                 gt   l          gt   n    n    gt   n    n    gt   n    n    gt   n         gt   n                 gt   n          gt   o          gt   o          gt   o          gt   o          gt   oe                  gt   oe          gt   o    o    gt   o    o    gt   o    o    gt   o          gt   oe            r    gt   r    r    gt   r    r    gt   r          gt   s          gt   u          gt   u                  gt   u          gt   ue    u    gt   u          gt   ue    u    gt   u    u    gt   u            u    gt   u    u    gt   u    u    gt   u    w    gt   w          gt   y          gt   y            y    gt   y          gt   z    z    gt   z    z    gt   z          gt   t          gt   ss                 gt   ss          gt   iy         gt   A         gt   B         gt   V         gt   G                 gt   D         gt   E         gt   YO         gt   ZH         gt   Z         gt   I                 gt   Y         gt   K         gt   L         gt   M         gt   N         gt   O                 gt   P         gt   R         gt   S         gt   T         gt   U         gt   F                 gt   H         gt   C         gt   CH         gt   SH         gt   SCH         gt                    gt   Y         gt            gt   E         gt   YU         gt   YA         gt   a                 gt   b         gt   v         gt   g         gt   d         gt   e         gt   yo                 gt   zh         gt   z         gt   i         gt   y         gt   k         gt   l                 gt   m         gt   n         gt   o         gt   p         gt   r         gt   s                 gt   t         gt   u         gt   f         gt   h         gt   c         gt   ch                 gt   sh         gt   sch         gt            gt   y         gt            gt   e                 gt   yu         gt   ya                 make a human readable string      text   strtr  text   replace           replace non letter or digits by -      text   preg replace      pL d    u    -    text           trim      text   trim  text   -            remove unwanted characters      text   preg replace     - w            text        return strtolower  text         Fourth test    https   stackoverflow com a 2955521 10232729 function slugagain  string             table                   gt  S         gt  s         gt  Dj    d   gt  dj         gt  Z         gt  z    C   gt  C    c   gt  c    C   gt  C    c   gt  c                 gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                 gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O                 gt  O         gt  O         gt  O         gt  U         gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss                 gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c         gt  e         gt  e                 gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o                 gt  o         gt  o         gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  y         gt  b                 gt  y    R   gt  R    r   gt  r        gt  -              return strtr  string   table         Fifth test    https   stackoverflow com a 27396804 10232729 function slugifybis  url        url   trim  url         url   str replace       -    url        url   str replace       -slash-    url            return rawurlencode  url         Sixth and last test    https   stackoverflow com a 39442034 10232729 setlocale  LC ALL   quot en US UTF8 quot       function slugifyagain  string             string   iconv  utf-8    us-ascii  translit  ignore    string      transliterate      string   str replace  quot   quot        string        string   preg replace      pL d   u    -    string      replace non letter or non digits by  quot - quot       string   preg replace     - w           string      remove unwanted characters      string   preg replace   -      -    string      remove duplicate  quot - quot       string   trim  string   -       trim  quot - quot       string   trim  string      trim      string   mb strtolower  string   utf-8       lowercase          return urlencode  string      safe       string    newString    quot        dr     l affreux gar  on  amp  n    l en for  t   quot     max   10000   echo   lt pre gt    echo  Beginning     echo   lt br   gt    echo   lt br   gt        echo   gt  Slugging    max   iterations of following     echo   lt br   gt    echo   gt  gt       string  echo   lt br   gt      echo   lt br   gt    echo  Output results     echo   lt br   gt    echo   lt br   gt        start   microtime true    for  i   0    i  lt   max    i               newString   slugify  string       time    microtime true  -  start    1000   echo   gt  First test passed in       round  time  2     ms     echo   lt br   gt      echo   gt  gt  Result        newString  echo   lt br   gt    echo   lt br   gt      start   microtime true    for  i   0    i  lt   max    i               newString   slug  string       time    microtime true  -  start    1000   echo   gt  Second test passed in       round  time  2     ms     echo   lt br   gt    echo   gt  gt  Result        newString  echo   lt br   gt    echo   lt br   gt      start   microtime true    for  i   0    i  lt   max    i               newString   slugbis  string       time    microtime true  -  start    1000   echo   gt  Third test passed in       round  time  2     ms     echo   lt br   gt    echo   gt  gt  Result        newString  echo   lt br   gt    echo   lt br   gt      start   microtime true    for  i   0    i  lt   max    i               newString   slugagain  string       time    microtime true  -  start    1000   echo   gt  Fourth test passed in       round  time  2     ms     echo   lt br   gt    echo   gt  gt  Result        newString  echo   lt br   gt    echo   lt br   gt      start   microtime true    for  i   0    i  lt   max    i               newString   slugifybis  string       time    microtime true  -  start    1000   echo   gt  Fifth test passed in       round  time  2     ms     echo   lt br   gt    echo   gt  gt  Result        newString  echo   lt br   gt    echo   lt br   gt      start   microtime true    for  i   0    i  lt   max    i               newString   slugifyagain  string       time    microtime true  -  start    1000   echo   gt  Sixth test passed in       round  time  2     ms     echo   lt br   gt    echo   gt  gt  Result        newString  echo   lt  pre gt     Beginning    Slugging 10000 iterations of following           dr     l affreux gar  on  amp  n    l en for  t     Output results    First test passed in 120 78ms  Result   -iquest-andresz-laffreux-arcon-and-noel-en-foret-  Second test passed in 3883 82ms  Result   -andre  -laffreux-garcon--n  el-en-foret-  Third test passed in 56 83ms  Result   andress-l-affreux-garcon-noel-en-foret  Fourth test passed in 18 93ms  Result     -AndreSs-l affreux-garcon- amp -noel-en-foret-   Fifth test passed in 6 45ms  Result    C2 BF- C3 80 C3 B1dr C3 A9 C3 9F-l 27affreux- C4 9Far C3 A7on- 26-n C3 B8 C3 ABl-en-for C3 AAt- 21  Sixth test passed in 112 42ms  Result   andress-laffreux-garcon-n-el-en-foret   Further tests needed  Edit   less iterations test Beginning    Slugging 100 iterations of following           dr     l affreux gar  on  amp  n    l en for  t     Output results    First test passed in 1 72ms  Result   -iquest-andresz-laffreux-arcon-and-noel-en-foret-  Second test passed in 48 59ms  Result   -andre  -laffreux-garcon--n  el-en-foret-  Third test passed in 0 91ms  Result   andress-l-affreux-garcon-noel-en-foret  Fourth test passed in 0 3ms  Result     -AndreSs-l affreux-garcon- amp -noel-en-foret-   Fifth test passed in 0 14ms  Result    C2 BF- C3 80 C3 B1dr C3 A9 C3 9F-l 27affreux- C4 9Far C3 A7on- 26-n C3 B8 C3 ABl-en-for C3 AAt- 21  Sixth test passed in 1 4ms  Result   andress-laffreux-garcon-n-el-en-foret

User · Answer

This may be a way to do it too  Inspired from these links Experts-exchange and alinalexander  function slugifier  txt          Get rid of accented characters        search   explode                                                                                 e i    u        replace   explode      c ae oe a e i o u a e i o u a e i o u y a e i o u a e i o u        txt   str replace  search   replace   txt          Lowercase all the characters        txt   strtolower  txt          Avoid whitespace at the beginning and the ending        txt   trim  txt          Replace all the characters that are not in a-z or 0-9 by a hyphen        txt   preg replace     a-z0-9      -    txt         Remove hyphen anywhere it s more than one        txt   preg replace     -       -    txt      return  txt

User · Answer

Don t use preg replace for this  There s a php function built just for the task  strtr   http   php net manual en function strtr php  Taken from the comments in the above link  and I tested it myself  it works   function normalize   string         table   array                gt  S         gt  s         gt  Dj    d   gt  dj         gt  Z         gt  z    C   gt  C    c   gt  c    C   gt  C    c   gt  c                 gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                 gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O                 gt  O         gt  O         gt  O         gt  U         gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss                 gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c         gt  e         gt  e                 gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o                 gt  o         gt  o         gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  y         gt  b                 gt  y    R   gt  R    r   gt  r               return strtr  string   table

User · Answer

Update Since this answer is getting some attention  I m adding some explanation  The solution provided will essentially replace everything except A-Z  a-z  0-9   amp  -  hyphen  with -  hyphen   So  it won t work properly with other unicode characters  which are valid characters for a URL slug string   A common scenario is when the input string contains non-English characters  Only use this solution if you re confident that the input string won t have unicode characters which you might want to be a part of output slug  Eg   quot            quot  will become  quot ---------- quot   all hyphens  instead of  quot     -      quot   valid URL slug   Original Answer How about     slug   strtolower trim preg replace     A-Za-z0-9-       -    string

User · Answer

There s a good solution here that deals with special characters as well   Texto Fant  stico    texto-fantastico  function slugify   string   separator    -           accents regex      amp   a-z  1 2     acute cedil circ grave lig orn ring slash th tilde uml   i        special cases   array    amp     gt   and         gt            string   mb strtolower  trim   string     UTF-8          string   str replace  array keys  special cases   array values   special cases    string         string   preg replace   accents regex    1   htmlentities   string  ENT QUOTES   UTF-8            string   preg replace     a-z0-9  u     separator    string        string   preg replace     separator   u     separator    string       return  string      Author  Natxet

User · Answer

Since I ve Seen a lot of methods here but I ve found a simplest method for myself Maybe it will help someone    slug   strtolower preg replace     a-zA-Z0-9 -        preg replace    s      -    string

User · Answer

What about using something that is already implemented in Core     Clean non UTF-8 characters     Mage  getHelper  core string  - gt cleanString  str    Or one of the core url  url rewrite methods

User · Answer

Here is an other one  for example      Title   with strange characters        A    X    Z  becomes  title-with-strange-characters-eee-a-x-z            Function used to create a slug associated to an  ugly  string         param string  string the string to transform         return string the resulting slug      public static function createSlug  string          table   array                    gt  S         gt  s         gt  Dj    d   gt  dj         gt  Z         gt  z    C   gt  C    c   gt  c    C   gt  C    c   gt  c                     gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  A         gt  C         gt  E         gt  E                     gt  E         gt  E         gt  I         gt  I         gt  I         gt  I         gt  N         gt  O         gt  O         gt  O                     gt  O         gt  O         gt  O         gt  U         gt  U         gt  U         gt  U         gt  Y         gt  B         gt  Ss                     gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  a         gt  c         gt  e         gt  e                     gt  e         gt  e         gt  i         gt  i         gt  i         gt  i         gt  o         gt  n         gt  o         gt  o                     gt  o         gt  o         gt  o         gt  o         gt  u         gt  u         gt  u         gt  y         gt  y         gt  b                     gt  y    R   gt  R    r   gt  r         gt   -         gt   -                 -- Remove duplicated spaces      stripped   preg replace array    s 2          t n            string           -- Returns the slug     return strtolower strtr  string   table

User · Answer

Instead of a lengthy replace  try this one   public static function slugify  text         replace non letter or digits by -    text   preg replace      pL d   u    -    text         transliterate    text   iconv  utf-8    us-ascii  TRANSLIT    text         remove unwanted characters    text   preg replace     - w           text         trim    text   trim  text   -          remove duplicate -    text   preg replace   -      -    text         lowercase    text   strtolower  text      if  empty  text         return  n-a          return  text      This was based off the one in Symfony s Jobeet tutorial

User · Answer

Note  I have taken this from wordpress and it works     Use it like this   echo sanitize  testing this link      Code    taken from wordpress function utf8 uri encode   utf8 string   length   0          unicode            values   array         num octets   1       unicode length   0        string length   strlen   utf8 string        for   i   0   i  lt   string length   i                 value   ord   utf8 string   i               if    value  lt  128                 if    length  amp  amp     unicode length  gt    length                     break               unicode    chr  value                unicode length              else               if   count   values      0    num octets      value  lt  224     2   3                values      value               if    length  amp  amp     unicode length     num octets   3     gt   length                   break              if   count   values       num octets                     if   num octets    3                         unicode          dechex  values 0           dechex  values 1           dechex  values 2                         unicode length    9                    else                        unicode          dechex  values 0           dechex  values 1                         unicode length    6                                      values   array                     num octets   1                                     return  unicode       taken from wordpress function seems utf8  str         length   strlen  str       for   i 0   i  lt   length   i               c   ord  str  i            if   c  lt  0x80   n   0    0bbbbbbb         elseif    c  amp  0xE0     0xC0   n 1    110bbbbb         elseif    c  amp  0xF0     0xE0   n 2    1110bbbb         elseif    c  amp  0xF8     0xF0   n 3    11110bbb         elseif    c  amp  0xFC     0xF8   n 4    111110bb         elseif    c  amp  0xFE     0xFC   n 5    1111110b         else return false    Does not match any model         for   j 0   j lt  n   j        n bytes matching 10bbbbbb follow               if      i     length       ord  str  i    amp  0xC0     0x80                   return false                      return true       function sanitize title with dashes taken from wordpress function sanitize  title         title   strip tags  title          Preserve escaped octets       title   preg replace      a-fA-F0-9  a-fA-F0-9       --- 1---    title          Remove percent signs that are not part of an octet       title   str replace           title          Restore octets       title   preg replace   ---  a-fA-F0-9  a-fA-F0-9  ---       1    title        if  seems utf8  title             if  function exists  mb strtolower                   title   mb strtolower  title   UTF-8                       title   utf8 uri encode  title  200               title   strtolower  title        title   preg replace    amp              title      kill entities      title   str replace       -    title        title   preg replace      a-z0-9  -          title        title   preg replace    s      -    title        title   preg replace   -      -    title        title   trim  title   -         return  title

User · Answer

Since gTLDs and IDNs are becoming more and more used I cannot see why URL shouldn t contain Andr  s   Just rawurlencode  URL you want instead  Most browsers show UTF-8 characters in URLs  not some ancient IE6 maybe  and bit ly   goo gl can be used to make it short in cases like Russian and Arabic if need may be for ad purposes or just write them in ads like user would write them on browser URL    Only difference is spaces     it might be good idea to replace them with  -  and     if you don t want to allow those    lt  php function slugify  url         url   trim  url         url   str replace      -   url        url   str replace      -slash-   url        url   rawurlencode  url       gt    Url as encoded  http   www hurtta com RU  D0 9F D1 80 D0 BE D0 B4 D1 83 D0 BA D1 82 D1 8B   Url as written http   www hurtta com RU

User · Answer

You could have a look at Normalizer  normalize    see here  It just needs to load the intl module for PHP

User · Answer

if your slug contain only A-Za-z0-9- then it is ok for you function sanitize slug  text         text   preg replace     A-Za-z0-9-       -    text        text   trim  text   -         text   preg replace   -      -    text       return  text

User · Answer

For me this variant is perfect  also it change  amp  to and  Here is code  function dSlug  string        return strtolower trim preg replace     0-9a-z   i    -   html entity decode preg replace    amp   a-z  1 2     acute cedil circ grave lig orn ring slash th tilde uml   i     1  htmlentities preg replace     amp        and     title   ENT QUOTES   UTF-8     ENT QUOTES   UTF-8      -

User · Answer

I wrote this based on Maerlyn s response  This function will work regardless of the character encoding on the page  It also won t turn single quotes in to dashes     function slugify   string         string   utf8 encode  string        string   iconv  UTF-8    ASCII  TRANSLIT    string           string   preg replace     a-z0-9-   i        string        string   str replace       -    string        string   trim  string   -         string   strtolower  string        if  empty  string             return  n-a              return  string

User · Answer

The most elegant way I think is using a Behat Transliterator Transliterator   I need to extends this class by your class because it is an Abstract  some like this    lt  php use Behat Transliterator Transliterator   class Urlizer extends Transliterator       And then  just use it    text    Master   piu    urlizer   new Urlizer     slug    urlizer- gt transliterate  slug   -    echo  slug     master-apiu   Of course you should put this things in your composer as well    composer require behat transliterator   More info here https   github com Behat Transliterator

User · Answer

public static function slugify   text          replace               amp lt     gt        amp gt     gt        amp  039     gt        amp amp     gt                amp quot     gt             gt   A          gt   A          gt   A          gt   A         gt   Ae             amp Auml     gt   A          gt   A    A    gt   A    A    gt   A    A    gt   A          gt   Ae                  gt   C    C    gt   C    C    gt   C    C    gt   C    C    gt   C    D    gt   D          gt   D                  gt   D          gt   E          gt   E          gt   E          gt   E    E    gt   E            E    gt   E    E    gt   E    E    gt   E    E    gt   E    G    gt   G    G    gt   G            G    gt   G    G    gt   G    H    gt   H    H    gt   H          gt   I          gt   I                  gt   I          gt   I    I    gt   I    I    gt   I    I    gt   I    I    gt   I            I    gt   I         gt   IJ    J    gt   J    K    gt   K    L    gt   K    L    gt   K            L    gt   K    L    gt   K         gt   K          gt   N    N    gt   N    N    gt   N            N    gt   N         gt   N          gt   O          gt   O          gt   O          gt   O                  gt   Oe     amp Ouml     gt   Oe          gt   O    O    gt   O    O    gt   O    O    gt   O                  gt   OE    R    gt   R    R    gt   R    R    gt   R    S    gt   S          gt   S            S    gt   S    S    gt   S         gt   S    T    gt   T    T    gt   T    T    gt   T                 gt   T          gt   U          gt   U          gt   U          gt   Ue    U    gt   U             amp Uuml     gt   Ue    U    gt   U    U    gt   U    U    gt   U    U    gt   U    U    gt   U            W    gt   W          gt   Y    Y    gt   Y          gt   Y    Z    gt   Z          gt   Z            Z    gt   Z          gt   T          gt   a          gt   a          gt   a          gt   a                  gt   ae     amp auml     gt   ae          gt   a    a    gt   a    a    gt   a    a    gt   a                  gt   ae          gt   c    c    gt   c    c    gt   c    c    gt   c    c    gt   c            d    gt   d    d    gt   d          gt   d          gt   e          gt   e          gt   e                  gt   e    e    gt   e    e    gt   e    e    gt   e    e    gt   e    e    gt   e                  gt   f    g    gt   g    g    gt   g    g    gt   g    g    gt   g    h    gt   h            h    gt   h          gt   i          gt   i          gt   i          gt   i    i    gt   i            i    gt   i    i    gt   i    i    gt   i    i    gt   i         gt   ij    j    gt   j            k    gt   k         gt   k    l    gt   l    l    gt   l    l    gt   l    l    gt   l                 gt   l          gt   n    n    gt   n    n    gt   n    n    gt   n         gt   n                 gt   n          gt   o          gt   o          gt   o          gt   o          gt   oe             amp ouml     gt   oe          gt   o    o    gt   o    o    gt   o    o    gt   o          gt   oe            r    gt   r    r    gt   r    r    gt   r          gt   s          gt   u          gt   u                  gt   u          gt   ue    u    gt   u     amp uuml     gt   ue    u    gt   u    u    gt   u            u    gt   u    u    gt   u    u    gt   u    w    gt   w          gt   y          gt   y            y    gt   y          gt   z    z    gt   z    z    gt   z          gt   t          gt   ss                 gt   ss          gt   iy         gt   A         gt   B         gt   V         gt   G                 gt   D         gt   E         gt   YO         gt   ZH         gt   Z         gt   I                 gt   Y         gt   K         gt   L         gt   M         gt   N         gt   O                 gt   P         gt   R         gt   S         gt   T         gt   U         gt   F                 gt   H         gt   C         gt   CH         gt   SH         gt   SCH         gt                    gt   Y         gt            gt   E         gt   YU         gt   YA         gt   a                 gt   b         gt   v         gt   g         gt   d         gt   e         gt   yo                 gt   zh         gt   z         gt   i         gt   y         gt   k         gt   l                 gt   m         gt   n         gt   o         gt   p         gt   r         gt   s                 gt   t         gt   u         gt   f         gt   h         gt   c         gt   ch                 gt   sh         gt   sch         gt            gt   y         gt            gt   e                 gt   yu         gt   ya                 make a human readable string      text   strtr  text   replace           replace non letter or digits by -      text   preg replace       pL d    u    -    text           trim      text   trim  text   -            remove unwanted characters      text   preg replace     - w            text         text   strtolower  text        return  text

User · Answer

I am using   function slugify  text          text   iconv  utf-8    us-ascii  TRANSLIT    text       return strtolower preg replace     A-Za-z0-9-       -    text        Only fallback is that Cyrillic characters will not be converted  and I am searching now for solution that is not long str replace for every single Cyrillic character

User · Answer

An updated version of  Imran Omar Bukhsh code  from the latest Wordpress  4 0  branch     lt  php     Add methods to slugify taken from Wordpress     - https   github com WordPress WordPress blob master wp-includes formatting php     - https   github com WordPress WordPress blob master wp-includes functions php         Set the mbstring internal encoding to a binary safe encoding when func overload    is enabled        When mbstring func overload is in use for multi-byte encodings  the results from    strlen   and similar functions respect the utf8 characters  causing binary data    to return incorrect lengths        This function overrides the mbstring encoding to a binary-safe encoding  and    resets it to the users expected encoding afterwards through the     reset mbstring encoding  function        It is safe to recursively call this function  however each     mbstring binary safe encoding    call must be followed up with an equal number    of  reset mbstring encoding    calls         since 3 7 0        see reset mbstring encoding          param bool  reset Optional  Whether to reset the encoding back to a previously-set encoding                        Default false      function mbstring binary safe encoding   reset   false       static  encodings   array      static  overloaded   null     if   is null   overloaded          overloaded   function exists   mb internal encoding     amp  amp    ini get   mbstring func overload     amp  2       if   false      overloaded       return     if      reset          encoding   mb internal encoding        array push   encodings   encoding        mb internal encoding   ISO-8859-1            if    reset  amp  amp   encodings          encoding   array pop   encodings        mb internal encoding   encoding                  Reset the mbstring internal encoding to a users previously set encoding         see mbstring binary safe encoding          since 3 7 0     function reset mbstring encoding       mbstring binary safe encoding  true               Checks to see if a string is utf8 encoded        NOTE  This function checks for 5-Byte sequences  UTF8          has Bytes Sequences with a maximum length of 4         author bmorel at ssi dot fr  modified      since 1 2 1        param string  str The string to be checked     return bool True if  str fits a UTF-8 model  false otherwise      function seems utf8  str      mbstring binary safe encoding       length   strlen  str     reset mbstring encoding      for   i 0   i  lt   length   i           c   ord  str  i        if   c  lt  0x80   n   0    0bbbbbbb     elseif    c  amp  0xE0     0xC0   n 1    110bbbbb     elseif    c  amp  0xF0     0xE0   n 2    1110bbbb     elseif    c  amp  0xF8     0xF0   n 3    11110bbb     elseif    c  amp  0xFC     0xF8   n 4    111110bb     elseif    c  amp  0xFE     0xFC   n 5    1111110b     else return false    Does not match any model     for   j 0   j lt  n   j        n bytes matching 10bbbbbb follow         if      i     length       ord  str  i    amp  0xC0     0x80           return false              return true             Encode the Unicode values to be used in the URI         since 1 5 0        param string  utf8 string     param int  length Max length of the string     return string String with Unicode encoded for URI      function utf8 uri encode   utf8 string   length   0        unicode          values   array       num octets   1     unicode length   0     mbstring binary safe encoding       string length   strlen   utf8 string      reset mbstring encoding       for   i   0   i  lt   string length   i             value   ord   utf8 string   i           if    value  lt  128           if    length  amp  amp     unicode length  gt    length             break         unicode    chr  value          unicode length          else         if   count   values      0    num octets      value  lt  224     2   3          values      value         if    length  amp  amp     unicode length     num octets   3     gt   length           break        if   count   values       num octets             if   num octets    3               unicode          dechex  values 0           dechex  values 1           dechex  values 2               unicode length    9            else              unicode          dechex  values 0           dechex  values 1               unicode length    6                      values   array             num octets   1                       return  unicode             Sanitizes a title  replacing whitespace and a few other characters with dashes        Limits the output to alphanumeric characters  underscore     and dash  -      Whitespace becomes a dash         since 1 2 0        param string  title The title to be sanitized      param string  raw title Optional  Not used      param string  context Optional  The operation for which the string is sanitized      return string The sanitized title      function sanitize title with dashes   title   raw title        context    display         title   strip tags  title        Preserve escaped octets     title   preg replace      a-fA-F0-9  a-fA-F0-9       --- 1---    title        Remove percent signs that are not part of an octet     title   str replace           title        Restore octets     title   preg replace   ---  a-fA-F0-9  a-fA-F0-9  ---       1    title      if  seems utf8  title         if  function exists  mb strtolower             title   mb strtolower  title   UTF-8               title   utf8 uri encode  title  200           title   strtolower  title      title   preg replace    amp              title      kill entities    title   str replace       -    title      if    save      context            Convert nbsp  ndash and mdash to hyphens      title   str replace  array    c2 a0     e2 80 93     e2 80 94      -    title            Strip these characters entirely      title   str replace  array           iexcl and iquest         c2 a1     c2 bf            angle quotes         c2 ab     c2 bb     e2 80 b9     e2 80 ba            curly quotes         e2 80 98     e2 80 99     e2 80 9c     e2 80 9d           e2 80 9a     e2 80 9b     e2 80 9e     e2 80 9f            copy  reg  deg  hellip and trade         c2 a9     c2 ae     c2 b0     e2 80 a6     e2 84 a2            acute accents         c2 b4     cb 8a     cc 81     cd 81            grave accent  macron  caron         cc 80     cc 84     cc 8c               title            Convert times to x      title   str replace    c3 97    x    title            title   preg replace      a-z0-9  -          title      title   preg replace    s      -    title      title   preg replace   -      -    title      title   trim  title   -       return  title      title     PFW Alexander McQueen Spring Summer 2015   echo  title - gt  slug   n    title    - gt     sanitize title with dashes  title   echo   n n    title      GQ    Elyas M  Barek geh  rt zu M  nnern des Jahres   echo  title - gt  slug   n    title    - gt     sanitize title with dashes  title     View online example

User · Answer

It is always a good idea to use existing solutions that are being supported by a lot of high-level developers  The most popular one is https   github com cocur slugify  First of all  it supports more than one language  and it is being updated    If you do not want to use the whole package  you can copy the part that you need

User · Answer

On my localhost everything was ok  but on server it helped me    set locale    and    utf-8    at    mb strtolower      lt   setlocale  LC ALL   quot en US UTF8 quot     function slug   str   char    quot - quot    tf    quot lowercase quot           str   iconv   quot utf-8 quot    quot us-ascii  translit  ignore quot    str       transliterate      str   str replace   quot   quot    quot  quot    str       remove         generated by iconv      str   preg replace   quot    a-z0-9   ui quot    char   str       replace unwanted by single    -         str   trim   str   char       trim    -        if   tf     quot lowercase quot     str   mb strtolower   str   quot utf-8 quot        lowercase     elseif   tf     quot uppercase quot     str   mb strtoupper   str   quot utf-8 quot         return  str      gt   Test  string    quot -- e  cr      091354--          -6          ac          elnszzAC    ELNS   Z            Z quot   echo slug   string    echo slug   string   quot   quot    quot uppercase quot           escrzya091354-6iessac-elnszzac-elns-z-z      ESCRZYA091354 6IESSAC ELNSZZAC ELNS Z Z

[php] PHP function to make slug (URL string)

Examples related to php

Examples related to internationalization

Examples related to slug