Remove style attribute from HTML tags

Question

I m not too good with regular expressions  but with PHP I m wanting to remove the style attribute from HTML tags in a string that s coming back from TinyMCE   So change  lt p style       gt Text lt  p gt  to just vanilla  lt p gt Test lt  p gt    How would I achieve this with something like the preg replace   function

User · Answer

You could handle it client side  the easiest would be with jQuery  Something like       tinyMce p   removeAttr  style

User · Answer

Something like this should work  untested code warning     lt  php   html     lt p style  asd  gt qwe lt  p gt  lt br   gt  lt p class  qwe  gt qweqweqwe lt  p gt      domd   new DOMDocument    libxml use internal errors true    domd- gt loadHTML  html   libxml use internal errors false     domx   new DOMXPath  domd    items    domx- gt query    p  style      foreach  items as  item       item- gt removeAttribute  style       echo  domd- gt saveHTML

User · Answer

I m using such thing to clean-up the style       section out of tags with keeping of other attributes at the moment    output   preg replace    lt     gt      sstyle   P lt stq gt            k lt stq gt      lt     gt  iUs     lt  1 5 gt     input

User · Answer

I commented on  Mayerln  s function  It does work but DOMDocument really stuffs with encoding  Here s my simplehtmldom version  function stripAttributes  html  attribs         dom   new simple html dom         dom- gt load  html       foreach  attribs as  attrib          foreach  dom- gt find     attrib    as  e               e- gt  attrib   null        dom- gt load  dom- gt save         return  dom- gt save

User · Answer

The pragmatic regex   lt    gt     style  quot     quot  will solve this problem in all reasonable cases  The part of the match that is not the first captured group should be removed  like this   output   preg replace     lt    gt     style  quot     quot  i     1    input    Match a  lt  followed by one or more  quot not  gt  quot  until we come to space and the style  quot     quot  part  The  i makes it work even with STYLE  quot     quot   Replace this match with  1  which is the captured group  It will leave the tag as is  if the tag doesn t include style  quot     quot

User · Answer

I use this   function strip word html  text   allowed tags     lt a gt  lt ul gt  lt li gt  lt b gt  lt i gt  lt sup gt  lt sub gt  lt em gt  lt strong gt  lt u gt  lt br gt  lt br  gt  lt br   gt  lt p gt  lt h2 gt  lt h3 gt  lt h4 gt  lt h5 gt  lt h6 gt          mb regex encoding  UTF-8          replace MS special characters first      search   array    amp lsquo  u      amp rsquo  u      amp ldquo  u      amp rdquo  u      amp mdash  u         replace   array                        -         text   preg replace  search   replace   text         make sure  all  html entities are converted to the plain ascii equivalents - it appears       in some MS headers  some html entities are encoded and some aren t        text   html entity decode  text  ENT QUOTES   UTF-8          try to strip out any C style comments first  since these  embedded in html comments  seem to       prevent strip tags from removing html comments  MS Word introduced combination      if mb stripos  text            FALSE            text   mb eregi replace             s        text   m                introduce a space into any arithmetic expressions that could be caught by strip tags so that they won t be         lt 1  becomes   lt  1  note  somewhat application specific       text   preg replace array    lt   0-9        array   lt   1     text        text   strip tags  text   allowed tags         eliminate extraneous whitespace from start and end of line  or anywhere there are two or more spaces  convert it to one      text   preg replace array     s s        s s         s s  u    array                text         strip out inline css and simplify style tags      search   array    lt  strong b    gt    gt       lt   strong b  gt  isu      lt  em i    gt    gt       lt   em i  gt  isu      lt u   gt    gt       lt  u gt  isu         replace   array   lt b gt  2 lt  b gt      lt i gt  2 lt  i gt      lt u gt  1 lt  u gt          text   preg replace  search   replace   text         on some of the  newer MS Word exports  where you get conditionals of the form  if gte mso 9   etc   it appears       that whatever is in one of the html comments prevents strip tags from eradicating the html comment that contains       some MS Style Definitions - this last bit gets rid of any leftover comments         num matches   preg match all     lt  -- u    text   matches       if  num matches            text   preg replace     lt  --    --  gt  isu        text              text   preg replace     lt    gt     style       i     1    text   return  text

User · Answer

Here you go    lt  php   html     lt p style  border  1px solid red   gt Test lt  p gt    echo preg replace    lt p style         gt       lt   p gt  i     lt p gt  2 lt  p gt     html      gt    By the way  as pointed out by others  regex are not suggested for this

User · Answer

html   preg replace    sstyle                 i        html     For replacing all style    with blank

User · Answer

In addition to Lorenzo Marcon s answer   Using preg replace to select everything except style attribute    html   preg replace     lt p    style        gt      i     1 2    html

[php] Remove style attribute from HTML tags

Examples related to php

Examples related to regex

Examples related to tinymce