Parsing domain from a URL

Question

I need to build a function which parses the domain from a URL.

So, with

    http://google.com/dhasjkdas/sadsdds/sdda/sdads.html

or

    http://www.google.com/dhasjkdas/sadsdds/sdda/sdads.html

it should return google.com; with

    http://google.co.uk/dhasjkdas/sadsdds/sdda/sdads.html

it should return google.co.uk.

User · Answer

You can pass PHP_URL_HOST into the parse_url function as the second parameter:

    $url = 'http://google.com/dhasjkdas/sadsdds/sdda/sdads.html';
    $host = parse_url($url, PHP_URL_HOST);
    print $host; // prints 'google.com'

User · Answer

Check out parse_url():

    $url = 'http://google.com/dhasjkdas/sadsdds/sdda/sdads.html';
    $parse = parse_url($url);
    echo $parse['host']; // prints 'google.com'

parse_url doesn't handle really badly mangled URLs very well, but is fine if you generally expect decent URLs.
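To make the "badly mangled URLs" caveat concrete, here is a minimal sketch of a guard around parse_url(). The helper name host_or_null is hypothetical (it is not part of PHP), and returning null for unusable input is an assumption about what the caller wants.

```php
<?php
// Hypothetical helper: returns the host of a URL, or null when parse_url()
// cannot find one (seriously malformed input, or no scheme present).
function host_or_null(string $url): ?string
{
    $host = parse_url(trim($url), PHP_URL_HOST);
    // parse_url() returns false on seriously malformed URLs and null when
    // the requested component is absent.
    return is_string($host) && $host !== '' ? $host : null;
}

var_dump(host_or_null('http://google.com/dhasjkdas/sadsdds/sdda/sdads.html')); // string "google.com"
var_dump(host_or_null('google.com/path'));     // NULL: no scheme, so no host component
var_dump(host_or_null('http:///example.com')); // NULL: parse_url() returns false here
```

Checking the return value this way avoids reading a missing 'host' key from the array form of parse_url().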

User · Answer

    $domain = str_ireplace('www.', '', parse_url($url, PHP_URL_HOST));

This would return google.com for both http://google.com/... and http://www.google.com/...
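One caveat with str_ireplace is that it removes every occurrence of "www." in the host, not only a leading one. The sketch below shows an anchored alternative; strip_leading_www is a hypothetical helper name chosen for illustration.

```php
<?php
// Hypothetical helper: drop a single leading "www." label, case-insensitively,
// leaving any other occurrence of "www." in the host untouched.
function strip_leading_www(string $host): string
{
    return preg_replace('/^www\./i', '', $host);
}

$host = parse_url('http://www.google.com/dhasjkdas/sadsdds/sdda/sdads.html', PHP_URL_HOST);
echo strip_leading_www($host), "\n";                     // google.com
echo strip_leading_www('awww.example.com'), "\n";        // awww.example.com (unchanged)
echo str_ireplace('www.', '', 'awww.example.com'), "\n"; // aexample.com (unanchored replace)
```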

User · Answer

    function get_domain($url = SITE_URL)
    {
        preg_match("/[a-z0-9\-]{1,63}\.[a-z\.]{2,6}$/", parse_url($url, PHP_URL_HOST), $_domain_tld);
        return $_domain_tld[0];
    }

    get_domain('http://www.cdl.gr'); // cdl.gr
    get_domain('http://cdl.gr'); // cdl.gr
    get_domain('http://www2.cdl.gr'); // cdl.gr

User · Answer

I'm adding this answer late since this is the answer that pops up most on Google.

You can use PHP to

    $url = "http://www.google.co.uk";
    $host = parse_url($url, PHP_URL_HOST);
    // $host == "www.google.co.uk"

grab the host, but not the private domain to which the host refers. (Example: www.google.co.uk is the host, but google.co.uk is the private domain.)

To grab the private domain, you need to know the list of public suffixes under which one can register a private domain. This list is curated by Mozilla at https://publicsuffix.org/.

The code below works once an array of public suffixes has been created. Simply call

    $domain = get_private_domain("www.google.co.uk");

with the remaining code:

    // find some way to parse the above list of public suffixes,
    // then add them to a PHP array keyed by suffix
    $suffix = [/* ... all valid public suffixes, e.g. 'co.uk' => true ... */];

    function get_public_suffix($host) {
      $parts = explode(".", $host);
      while (count($parts) > 0) {
        if (is_public_suffix(join(".", $parts)))
          return join(".", $parts);

        array_shift($parts);
      }

      return false;
    }

    function is_public_suffix($host) {
      global $suffix;
      return isset($suffix[$host]);
    }

    function get_private_domain($host) {
      $public = get_public_suffix($host);
      $public_parts = explode(".", $public);
      $all_parts = explode(".", $host);

      $private = [];

      // take as many labels as the public suffix has ...
      for ($x = 0; $x < count($public_parts); ++$x)
        $private[] = array_pop($all_parts);

      // ... plus one more label, the registered name
      if (count($all_parts) > 0)
        $private[] = array_pop($all_parts);

      return join(".", array_reverse($private));
    }
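The step "find some way to parse the list" could be sketched as follows. This is an illustration under stated assumptions, not the answer author's code: parse_suffix_list and registrable_domain are hypothetical names, the $sample string is a tiny made-up excerpt rather than the real list, and wildcard ("*.") and exception ("!") rules of the Public Suffix List are deliberately skipped.

```php
<?php
// Parse Public Suffix List text into a lookup array keyed by suffix.
// Simplification (assumption): wildcard ("*.") and exception ("!") rules are skipped.
function parse_suffix_list(string $text): array
{
    $suffix = [];
    foreach (preg_split('/\R/', $text) as $line) {
        $line = trim($line);
        if ($line === '' || strpos($line, '//') === 0) {
            continue; // blank line or comment
        }
        if ($line[0] === '*' || $line[0] === '!') {
            continue; // not handled in this sketch
        }
        $suffix[strtolower($line)] = true;
    }
    return $suffix;
}

// Returns the longest matching public suffix plus one more label,
// or null when the host is itself a suffix or no suffix is known.
function registrable_domain(string $host, array $suffix): ?string
{
    $labels = explode('.', strtolower($host));
    for ($i = 0; $i < count($labels); $i++) {
        // Candidates run from longest to shortest, so the first hit is the longest suffix.
        $candidate = implode('.', array_slice($labels, $i));
        if (isset($suffix[$candidate])) {
            return $i === 0 ? null : implode('.', array_slice($labels, $i - 1));
        }
    }
    return null;
}

$sample = "// sample excerpt, not the real list\ncom\nuk\nco.uk\n";
$suffix = parse_suffix_list($sample);
echo registrable_domain('www.google.co.uk', $suffix), "\n"; // google.co.uk
echo registrable_domain('www.google.com', $suffix), "\n";   // google.com
```

In practice the list text would come from a locally cached copy of the file published at publicsuffix.org rather than from an inline string.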

User · Answer

    function getTrimmedUrl($link)
    {
        $str = str_replace(["www.", "https://", "http://"], [''], $link);
        $link = explode("/", $str);
        return strtolower($link[0]);
    }

User · Answer

The code that was meant to work 100% didn't seem to cut it for me. I did patch the example a little, but found code that wasn't helping and problems with it, so I changed it into a couple of functions (to save asking for the list from Mozilla all the time, and removing the cache system). This has been tested against a set of 1000 URLs and seemed to work.

    function domain($url)
    {
        global $subtlds;
        $slds = "";
        $url = strtolower($url);

        $host = parse_url('http://'.$url, PHP_URL_HOST);

        preg_match("/[^\.\/]+\.[^\.\/]+$/", $host, $matches);
        foreach ($subtlds as $sub) {
            if (preg_match('/\.'.preg_quote($sub).'$/', $host, $xyz)) {
                preg_match("/[^\.\/]+\.[^\.\/]+\.[^\.\/]+$/", $host, $matches);
            }
        }

        return @$matches[0];
    }

    function get_tlds()
    {
        $address = 'http://mxr.mozilla.org/mozilla-central/source/netwerk/dns/effective_tld_names.dat?raw=1';
        $content = file($address);
        foreach ($content as $num => $line) {
            $line = trim($line);
            if ($line == '') continue;
            if (@substr($line[0], 0, 2) == '/') continue;
            $line = @preg_replace("/[^a-zA-Z0-9\.]/", '', $line);
            if ($line == '') continue;  //$line = '.'.$line;
            if (@$line[0] == '.') $line = substr($line, 1);
            if (!strstr($line, '.')) continue;
            $subtlds[] = $line;
            //echo "{$num}: '{$line}'"; echo "<br>";
        }

        $subtlds = array_merge(array(
                'co.uk', 'me.uk', 'net.uk', 'org.uk', 'sch.uk', 'ac.uk',
                'gov.uk', 'nhs.uk', 'police.uk', 'mod.uk', 'asn.au', 'com.au',
                'net.au', 'id.au', 'org.au', 'edu.au', 'gov.au', 'csiro.au'
            ), $subtlds);

        $subtlds = array_unique($subtlds);

        return $subtlds;
    }

Then use it like:

    $subtlds = get_tlds();
    echo domain('www.example.com');    // outputs: example.com
    echo domain('www.example.uk.com'); // outputs: example.uk.com
    echo domain('www.example.fr');     // outputs: example.fr

I know I should have turned this into a class, but didn't have time.

User · Answer

If you want to extract the host from the string http://google.com/dhasjkdas/sadsdds/sdda/sdads.html, parse_url() is an acceptable solution.

But if you want to extract the domain or its parts, you need a package that uses the Public Suffix List. Yes, you can use string functions around parse_url(), but they will sometimes produce incorrect results.

I recommend TLDExtract for domain parsing. Here is sample code that shows the difference:

    $extract = new LayerShifter\TLDExtract\Extract();

    # For 'http://google.com/dhasjkdas/sadsdds/sdda/sdads.html'

    $url = 'http://google.com/dhasjkdas/sadsdds/sdda/sdads.html';

    parse_url($url, PHP_URL_HOST); // will return 'google.com'

    $result = $extract->parse($url);
    $result->getFullHost(); // will return 'google.com'
    $result->getRegistrableDomain(); // will return 'google.com'
    $result->getSuffix(); // will return 'com'

    # For 'http://search.google.com/dhasjkdas/sadsdds/sdda/sdads.html'

    $url = 'http://search.google.com/dhasjkdas/sadsdds/sdda/sdads.html';

    parse_url($url, PHP_URL_HOST); // will return 'search.google.com'

    $result = $extract->parse($url);
    $result->getFullHost(); // will return 'search.google.com'
    $result->getRegistrableDomain(); // will return 'google.com'

User · Answer

Combining the answers of worldofjr and Alix Axel into one small function that will handle most use-cases:

    function get_url_hostname($url) {

        $parse = parse_url($url);
        return str_ireplace('www.', '', $parse['host']);

    }

    get_url_hostname('http://www.google.com/example/path/file.html'); // google.com

User · Answer

    $domain = parse_url($url, PHP_URL_HOST);
    echo implode('.', array_slice(explode('.', $domain), -2, 2));
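Keeping only the last two labels works for hosts such as www.google.com, but not for the google.co.uk case from the question. The sketch below wraps the snippet above in a function to show both outcomes; last_two_labels is a hypothetical name used only for this illustration.

```php
<?php
// The two-label approach from the answer above, wrapped in a function
// (last_two_labels is a hypothetical name used only for this illustration).
function last_two_labels(string $url): string
{
    $domain = parse_url($url, PHP_URL_HOST);
    return implode('.', array_slice(explode('.', $domain), -2, 2));
}

echo last_two_labels('http://www.google.com/dhasjkdas/sadsdds/sdda/sdads.html'), "\n";   // google.com
echo last_two_labels('http://www.google.co.uk/dhasjkdas/sadsdds/sdda/sdads.html'), "\n"; // co.uk
```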

User · Answer

parse_url didn't work for me. It only returned the path. Switching to basics using PHP 5.3+:

    $url = str_replace('http://', '', strtolower($s->website));
    if (strpos($url, '/')) $url = strstr($url, '/', true);

User · Answer

Here is the code I made that 100% finds only the domain name, since it takes Mozilla's sub-TLDs into account. The only thing you have to check is how you cache that file, so you don't query Mozilla every time.

For some strange reason, domains like co.uk are not in the list, so you have to do some hacking and add them manually. It's not the cleanest solution, but I hope it helps someone.

    static function domain($url)
    {
        $slds = "";
        $url = strtolower($url);

        $address = 'http://mxr.mozilla.org/mozilla-central/source/netwerk/dns/effective_tld_names.dat?raw=1';
        if (!$subtlds = @kohana::cache('subtlds', null, 60))
        {
            $content = file($address);
            foreach ($content as $num => $line)
            {
                $line = trim($line);
                if ($line == '') continue;
                if (@substr($line[0], 0, 2) == '/') continue;
                $line = @preg_replace("/[^a-zA-Z0-9\.]/", '', $line);
                if ($line == '') continue;  //$line = '.'.$line;
                if (@$line[0] == '.') $line = substr($line, 1);
                if (!strstr($line, '.')) continue;
                $subtlds[] = $line;
                //echo "{$num}: '{$line}'"; echo "<br>";
            }
            $subtlds = array_merge(Array(
                'co.uk', 'me.uk', 'net.uk', 'org.uk', 'sch.uk', 'ac.uk',
                'gov.uk', 'nhs.uk', 'police.uk', 'mod.uk', 'asn.au', 'com.au',
                'net.au', 'id.au', 'org.au', 'edu.au', 'gov.au', 'csiro.au',
                ), $subtlds);

            $subtlds = array_unique($subtlds);
            //echo var_dump($subtlds);
            @kohana::cache('subtlds', $subtlds);
        }

        preg_match('/^(http:[\/]{2,})?([^\/]+)/i', $url, $matches);
        //preg_match("/^(http:\/\/|https:\/\/|)[a-zA-Z-]([^\/]+)/i", $url, $matches);
        $host = @$matches[2];
        //echo var_dump($matches);

        preg_match("/[^\.\/]+\.[^\.\/]+$/", $host, $matches);
        foreach ($subtlds as $sub)
        {
            if (preg_match("/{$sub}$/", $host, $xyz))
                preg_match("/[^\.\/]+\.[^\.\/]+\.[^\.\/]+$/", $host, $matches);
        }

        return @$matches[0];
    }

User · Answer

This will generally work very well if the input URL is not total junk. It removes the subdomain.

    $host = parse_url($Row->url, PHP_URL_HOST);
    $parts = explode('.', $host);
    $parts = array_reverse($parts);
    $domain = $parts[1].'.'.$parts[0];

Example

Input: http://www2.website.com:8080/some/file/structure?some=parameters

Output: website.com

User · Answer

Please consider replacing the accepted solution with the following:

parse_url() will always include any sub-domain(s), so this function doesn't parse domain names very well. Here are some examples:

    $url = 'http://www.google.com/dhasjkdas/sadsdds/sdda/sdads.html';
    $parse = parse_url($url);
    echo $parse['host']; // prints 'www.google.com'

    echo parse_url('https://subdomain.example.com/foo/bar', PHP_URL_HOST);
    // Output: subdomain.example.com

    echo parse_url('https://subdomain.example.co.uk/foo/bar', PHP_URL_HOST);
    // Output: subdomain.example.co.uk

Instead, you may consider this pragmatic solution. It will cover many, but not all, domain names; for instance, lower-level domains such as 'sos.state.oh.us' are not covered.

    function getDomain($url) {
        $host = parse_url($url, PHP_URL_HOST);

        if (filter_var($host, FILTER_VALIDATE_IP)) {
            // IP address returned as domain
            return $host; //* or replace with null if you don't want an IP back
        }

        $domain_array = explode(".", str_replace('www.', '', $host));
        $count = count($domain_array);
        if ($count >= 3 && strlen($domain_array[$count-2]) == 2) {
            // SLD (example.co.uk)
            return implode('.', array_splice($domain_array, $count-3, 3));
        } else if ($count >= 2) {
            // TLD (example.com)
            return implode('.', array_splice($domain_array, $count-2, 2));
        }
    }

    // Your domains
    echo getDomain('http://google.com/dhasjkdas/sadsdds/sdda/sdads.html'); // google.com
    echo getDomain('http://www.google.com/dhasjkdas/sadsdds/sdda/sdads.html'); // google.com
    echo getDomain('http://google.co.uk/dhasjkdas/sadsdds/sdda/sdads.html'); // google.co.uk

    // TLD
    echo getDomain('https://shop.example.com'); // example.com
    echo getDomain('https://foo.bar.example.com'); // example.com
    echo getDomain('https://www.example.com'); // example.com
    echo getDomain('https://example.com'); // example.com

    // SLD
    echo getDomain('https://more.news.bbc.co.uk'); // bbc.co.uk
    echo getDomain('https://www.bbc.co.uk'); // bbc.co.uk
    echo getDomain('https://bbc.co.uk'); // bbc.co.uk

    // IP
    echo getDomain('https://1.2.3.45'); // 1.2.3.45

Finally, Jeremy Kendall's PHP Domain Parser allows you to parse the domain name from a URL. League URI Hostname Parser will also do the job.

User · Answer

From http://us3.php.net/manual/en/function.parse-url.php#93983

    for some odd reason, parse_url() returns the host (ex. example.com) as
    the path when no scheme is provided in the input url. So I've written a
    quick function to get the real host:

    function getHost($Address) {
       $parseUrl = parse_url(trim($Address));
       return trim($parseUrl['host'] ? $parseUrl['host'] : array_shift(explode('/', $parseUrl['path'], 2)));
    }

    getHost("example.com"); // Gives example.com
    getHost("http://example.com"); // Gives example.com
    getHost("www.example.com"); // Gives www.example.com
    getHost("http://example.com/xyz"); // Gives example.com

User · Answer

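I have edited it for you:

    function getHost($Address) {
        $parseUrl = parse_url(trim($Address));
        $host = trim($parseUrl['host'] ? $parseUrl['host'] : array_shift(explode('/', $parseUrl['path'], 2)));

        $parts = explode('.', $host);
        $num_parts = count($parts);
        $h = ''; // initialise before appending to avoid an undefined-variable notice

        if ($parts[0] == "www") {
            // skip the leading "www" label
            for ($i = 1; $i < $num_parts; $i++) {
                $h .= $parts[$i] . '.';
            }
        } else {
            for ($i = 0; $i < $num_parts; $i++) {
                $h .= $parts[$i] . '.';
            }
        }
        return substr($h, 0, -1);
    }

All types of URL (www.domain.ltd, sub1.subn.domain.ltd) will result in: domain.ltd.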

User · Answer

I've found that @philfreo's solution (referenced from php.net) works pretty well, but in some cases it triggers PHP's "notice" and "Strict Standards" messages. Here is a fixed version of this code:

    function getHost($url) {
       $parseUrl = parse_url(trim($url));
       if (isset($parseUrl['host']))
       {
           $host = $parseUrl['host'];
       }
       else
       {
            $path = explode('/', $parseUrl['path']);
            $host = $path[0];
       }
       return trim($host);
    }

    echo getHost("http://example.com/anything.html");           // example.com
    echo getHost("http://www.example.net/directory/post.php");  // www.example.net
    echo getHost("https://example.co.uk");                      // example.co.uk
    echo getHost("www.example.net");                            // www.example.net
    echo getHost("subdomain.example.net/anything");             // subdomain.example.net
    echo getHost("example.net");                                // example.net
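Another way to handle scheme-less input, similar to the 'http://'.$url trick used in one of the other answers, is to prepend a scheme before calling parse_url(). This is a minimal sketch; host_with_default_scheme is a hypothetical helper name, and defaulting to http:// is an assumption.

```php
<?php
// Hypothetical helper: prepend a scheme when none is present so that
// parse_url() reports the host in the "host" component instead of the path.
function host_with_default_scheme(string $address): ?string
{
    $address = trim($address);
    if (!preg_match('#^[a-z][a-z0-9+.-]*://#i', $address)) {
        $address = 'http://' . $address; // assumed default scheme
    }
    $host = parse_url($address, PHP_URL_HOST);
    return is_string($host) ? $host : null;
}

echo host_with_default_scheme('example.net'), "\n";                    // example.net
echo host_with_default_scheme('subdomain.example.net/anything'), "\n"; // subdomain.example.net
echo host_with_default_scheme('https://example.co.uk'), "\n";          // example.co.uk
```

Like getHost above, this returns the full host including any subdomain; it does not reduce it to the registrable domain.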

User · Answer

Just use the following:

    <?php
       echo $_SERVER['SERVER_NAME'];
    ?>
