How to drop rows from pandas data frame that contains a particular string in a particular column

Question

I have a very large data frame in python and I want to drop all rows that have a particular string inside a particular column   For example  I want to drop all rows which have the string  XYZ  as a substring in the column C of the data frame   Can this be implemented in an efficient way using  drop   method

User · Answer

new df   df df C     XYZ     Reference  https   chrisalbon com python data wrangling pandas dropping column and rows

User · Answer

This will only work if you want to compare exact strings  It will not work in case you want to check if the column string contains any of the strings in the list   The right way to compare with a list would be     searchfor     john    doe   df   df  df col str contains     join searchfor

User · Answer

pandas has vectorized string operations  so you can just filter out the rows that contain the string you don t want   In  91   df   pd DataFrame dict A  5 3 5 6   C   foo   bar   fooXYZbar    bat      In  92   df Out 92      A          C 0  5        foo 1  3        bar 2  5  fooXYZbar 3  6        bat  In  93   df  df C str contains  XYZ    Out 93      A    C 0  5  foo 1  3  bar 3  6  bat

User · Answer

If your string constraint is not just one string you can drop those corresponding rows with   df   df  df  your column   isin   list of strings       The above will drop all rows containing elements of your list

User · Answer

The below code will give you list of all the rows -   df df  C       XYZ     To store the values from the above code into a dataframe  -  newdf   df df  C       XYZ

User · Answer

Slight modification to the code  Having na False will skip empty values  Otherwise you can get an error TypeError  bad operand type for unary    float  df  df C str contains  XYZ   na False     Source  TypeError  bad operand type for unary    float

User · Answer

if you do not want to delete all NaN  use  df  df C str contains  XYZ      True

[python] How to drop rows from pandas data frame that contains a particular string in a particular column?

Examples related to python

Examples related to pandas