Remove Sub String by using Python

Question

I already extract some information from a forum  It is the raw string I have now   string    i think mabe 124    lt font color  black  gt  lt font face  Times New Roman  gt but I don  t have a big experience it just how I see it in my eyes  lt font color  green  gt  lt font face  Arial  gt fun stuff    The thing I do not like is the sub string   lt font color  black  gt  lt font face  Times New Roman  gt   and   lt font color  green  gt  lt font face  Arial  gt    I do want to keep the other part of string except this  So the result should be like this  resultString    i think mabe 124   but I don t have a big experience it just how I see it in my eyes fun stuff    How could I do this  Actually I used beautiful soup to extract the string above from a forum  Now I may prefer regular expression to remove the  part

User · Answer

gt  gt  gt  import re  gt  gt  gt  st     i think mabe 124    lt font color   black   gt  lt font face   Times New Roman   gt but I don t have a big experience it just how I see it in my eyes  lt font color   green   gt  lt font face   Arial   gt fun stuff   gt  gt  gt  re sub   lt     gt      st    i think mabe 124   but I don t have a big experience it just how I see it in my eyes fun stuff   gt  gt  gt

User · Answer

BeautifulSoup text  features  quot html parser quot   text   For the people who were seeking deep info in my answer  sorry  I ll explain it  Beautifulsoup is a widely use python package that helps the user  developer  to interact with HTML within python  The above like just take all the HTML text  text  and cast it to Beautifulsoup object - that means behind the sense its parses everything up  Every HTML tag within the given text  Once done so  we just request all the text from within the HTML object

User · Answer

import re re sub   lt     gt        string   i think mabe 124   but I don t have a big experience it just how I see it in my eyes fun stuff    The re sub function takes a regular expresion and replace all the matches in the string with the second parameter  In this case  we are searching for all tags    lt     gt    and replacing them with nothing        The   is used in re for non-greedy searches   More about the re module

[python] Remove Sub String by using Python

Examples related to python

Examples related to regex

Examples related to string