How to find elements by class

Question

I m having trouble parsing HTML elements with  class  attribute using Beautifulsoup  The code looks like this  soup   BeautifulSoup sdata  mydivs   soup findAll  div   for div in mydivs       if  div  class       stylelistrow            print div   I get an error on the same line  after  the script finishes    File    beautifulcoding py   line 130  in getlanguage   if  div  class       stylelistrow    File   usr local lib python2 6 dist-packages BeautifulSoup py   line 599  in   getitem      return self  getAttrMap   key  KeyError   class    How do I get rid of this error

User · Accepted Answer

You can refine your search to only find those divs with a given class using BS3  mydivs   soup find all  quot div quot     quot class quot    quot stylelistrow quot

User · Answer

Concerning  Wernight s comment on the top answer about partial matching    You can partially match    lt div class  quot stylelistrow quot  gt  and  lt div class  quot stylelistrow button quot  gt   with gazpacho  from gazpacho import Soup  my divs   soup find  quot div quot     quot class quot    quot stylelistrow quot    partial True   Both will be captured and returned as a list of Soup objects

User · Answer

Use class   If you want to find element s  without stating the HTML tag  For single element  soup find class   my-class-name    For multiple elements  soup find all class   my-class-name

User · Answer

The following should work  soup find  span   attrs   class   totalcount      replace  totalcount  with your class name and  span  with tag you are looking for  Also  if your class contains multiple names with space  just choose one and use   P S  This finds the first element with given criteria  If you want to find all elements then replace  find  with  find all

User · Answer

CSS selectors  single class first match  soup select one   stylelistrow     list of matches  soup select   stylelistrow     compound class  i e  AND another class   soup select one   stylelistrow otherclassname   soup select   stylelistrow otherclassname     Spaces in compound class names e g  class   stylelistrow otherclassname are replaced with      You can continue to add classes    list of classes  OR - match whichever present  soup select one   stylelistrow   otherclassname   soup select   stylelistrow   otherclassname       bs4 4 7 1    Specific class whose innerText contains a string  soup select one   stylelistrow contains  some string     soup select   stylelistrow contains  some string       Specific class which has a certain child element e g  a tag  soup select one   stylelistrow has a    soup select   stylelistrow has a

User · Answer

This worked for me   for div in mydivs      try          clazz   div  class       except KeyError          clazz          if  clazz     stylelistrow            print div

User · Answer

Update  2016 In the latest version of beautifulsoup  the method  findAll  has been renamed to   find all   Link to official documentation    Hence the answer will be   soup find all  html element   class   your class name

User · Answer

Try to check if the div has a class attribute first  like this   soup   BeautifulSoup sdata  mydivs   soup findAll  div   for div in mydivs      if  class  in div          if  div  class     stylelistrow                print div

User · Answer

A straight forward way would be    soup   BeautifulSoup sdata  for each div in soup findAll  div    class   stylelist         print each div   Make sure you take of the casing of findAll  its not findall

User · Answer

This works for me to access the class attribute  on beautifulsoup 4  contrary to what the documentation says   The KeyError comes a list being returned not a dictionary   for hit in soup findAll name  span        print hit contents 1   class

User · Answer

From the documentation   As of Beautiful Soup 4 1 2  you can search by CSS class using the keyword argument class    soup find all  a   class   sister     Which in this case would be   soup find all  div   class   stylelistrow     It would also work for   soup find all  div   class   stylelistrowone stylelistrowtwo

User · Answer

the following worked for me  a tag   soup find all  div  class   full tabpublist

User · Answer

Alternatively we can use lxml  it support xpath and very fast   from lxml import html  etree   attr   html fromstring html text  passing the raw html handles   attr xpath    div  class  stylelistrow     xpath exresssion to find that specific class  for each in handles      print etree tostring each   printing the html as string

User · Answer

How to find elements by class I m having trouble parsing html elements with  quot class quot  attribute using Beautifulsoup   You can easily find by one class  but if you want to find by the intersection of two classes  it s a little more difficult  From the documentation  emphasis added    If you want to search for tags that match two or more CSS classes  you should use a CSS selector  css soup select  quot p strikeout body quot       lt p class  quot body strikeout quot  gt  lt  p gt     To be clear  this selects only the p tags that are both strikeout and body class  To find for the intersection of any in a set of classes  not the intersection  but the union   you can give a list to the class  keyword argument  as of 4 1 2   soup   BeautifulSoup sdata  class list     quot stylelistrow quot     can add any other classes to this list    will find any divs with any names in class list  mydivs   soup find all  div   class  class list    Also note that findAll has been renamed from the camelCase to the more Pythonic find all

User · Answer

Other answers did not work for me   In other answers the findAll is being used on the soup object itself  but I needed a way to do a find by class name on objects inside a specific element extracted from the object I obtained after doing findAll   If you are trying to do a search inside nested HTML elements to get objects by class name  try below -    parse html page soup   soup web page read     html parser      filter out items matching class name all songs   page soup findAll  li    song item      traverse through all songs for song in all songs         get text out of span element matching class  song name        doing a  find  by class name within a specific song element taken out of  all songs  collection     song find  span    song name   text   Points to note    I m not explicitly defining the search to be on  class  attribute findAll  li     class    song item     since it s the only attribute I m searching on and it will by default search for class attribute if you don t exclusively tell which attribute you want to find on   When you do a findAll or find  the resulting object is of class bs4 element ResultSet which is a subclass of list  You can utilize all methods of ResultSet  inside any number of nested elements  as long as they are of type ResultSet  to do a find or find all  My BS4 version - 4 9 1  Python version - 3 8 1

User · Answer

Specific to BeautifulSoup 3   soup findAll  div                  class   lambda x  x                         and  stylelistrow  in x split                                  Will find all of these    lt div class  stylelistrow  gt   lt div class  stylelistrow button  gt   lt div class  button stylelistrow  gt

User · Answer

As of BeautifulSoup 4     If you have a single class name   you can just pass the class name as parameter like    mydivs   soup find all  div    class name     Or if you have more than one class names   just pass the list of class names as parameter like    mydivs   soup find all  div     class1    class2

User · Answer

This should work   soup   BeautifulSoup sdata  mydivs   soup findAll  div   for div in mydivs       if  div find class      stylelistrow            print div

[python] How to find elements by class

The answer is

How to find elements by class

Examples related to python

Examples related to html

Examples related to web-scraping

Examples related to beautifulsoup

Tags