List comprehension vs lambda filter

Question

I happened to find myself having a basic filtering need: I have a list and I have to filter it by an attribute of the items.

My code looked like this:

    my_list = [x for x in my_list if x.attribute == value]

But then I thought, wouldn't it be better to write it like this?

    my_list = filter(lambda x: x.attribute == value, my_list)

It's more readable, and if needed for performance the lambda could be taken out to gain something.

Question is: are there any caveats in using the second way? Any performance difference? Am I missing the Pythonic Way entirely and should do it in yet another way (such as using itemgetter instead of the lambda)?
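To make the options in the question concrete, here is a minimal sketch. The Item class and its sample data are hypothetical, invented only for illustration. operator.attrgetter is shown as the attribute counterpart of the itemgetter idea, since itemgetter indexes with [] and would not read x.attribute.

```python
from dataclasses import dataclass
from operator import attrgetter


@dataclass
class Item:  # hypothetical stand-in for the question's list items
    name: str
    attribute: int


my_list = [Item("a", 1), Item("b", 2), Item("c", 1)]
value = 1

# Option 1: list comprehension
by_comprehension = [x for x in my_list if x.attribute == value]

# Option 2: filter + lambda (wrapped in list() so the result is a list on Python 3)
by_filter = list(filter(lambda x: x.attribute == value, my_list))

# Option 3: attrgetter fetches the attribute, but the comparison still has to be written
get_attribute = attrgetter("attribute")
by_attrgetter = [x for x in my_list if get_attribute(x) == value]
```

All three produce the same items here; attrgetter only replaces the attribute lookup, so it does not remove the need for a lambda or comprehension to do the comparison.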

User · Answer

I thought I'd just add that in Python 3, filter() is actually an iterator object, so you'd have to pass your filter call to list() in order to build the filtered list. So in Python 2:

    lst_a = range(25)  # arbitrary list
    lst_b = [num for num in lst_a if num % 2 == 0]
    lst_c = filter(lambda num: num % 2 == 0, lst_a)

lists b and c have the same values, and were completed in about the same time, as filter() was equivalent to [x for x in y if z]. However, in 3, this same code would leave lst_c containing a filter object, not a filtered list. To produce the same values in 3:

    lst_a = range(25)  # arbitrary list
    lst_b = [num for num in lst_a if num % 2 == 0]
    lst_c = list(filter(lambda num: num % 2 == 0, lst_a))

The catch is that list() takes an iterable as its argument and creates a new list from that argument. The result is that using filter in this way in Python 3 can take up to twice as long as the [x for x in y if z] method, because list() has to consume the filter object, which in turn calls the lambda once for every element of the original list.
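A small sketch of the Python 3 behaviour described above, using the same even-number example. The variable names lazy_c and is_list are added here for illustration only.

```python
lst_a = range(25)  # arbitrary input
lst_b = [num for num in lst_a if num % 2 == 0]

# On Python 3, filter() returns a lazy filter object, not a list
lazy_c = filter(lambda num: num % 2 == 0, lst_a)
is_list = isinstance(lazy_c, list)

# Wrapping it in list() materializes the same values as the comprehension
lst_c = list(lazy_c)
```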

User · Answer

Filter is just that: it filters out the elements of a list. You can see the definition mentions the same in the official docs (link I mentioned before). A list comprehension, on the other hand, is something that produces a new list after acting upon the previous list. (Both filter and a list comprehension create a new list rather than performing the operation in place on the old list.) A new list here is something like a list with, say, an entirely new data type, like converting integers to strings, etc.

In your example, it is better to use filter than a list comprehension, as per the definition. However, if you want, say, other_attribute from the list elements to be retrieved as a new list, then you can use a list comprehension:

    return [item.other_attribute for item in my_list if item.attribute == value]

This is how I actually remember filter and list comprehension: to remove a few things within a list and keep the other elements intact, use filter. To apply some logic of your own to the elements and create a watered-down list suitable for some purpose, use a list comprehension.

User · Answer

It took me some time to get familiarized with the higher-order functions filter and map. So I got used to them, and I actually liked filter, as it was explicit that it filters by keeping whatever is truthy, and I felt cool that I knew some functional programming terms.

Then I read this passage (Fluent Python book):

    The map and filter functions are still builtins in Python 3, but since the
    introduction of list comprehensions and generator expressions, they are not
    as important. A listcomp or a genexp does the job of map and filter
    combined, but is more readable.

And now I think: why bother with the concept of filter / map if you can achieve it with already widespread idioms like list comprehensions? Furthermore, maps and filters are kind of functions. In this case I prefer using anonymous functions (lambdas).

Finally, just for the sake of having it tested, I've timed both methods (map and list comprehension) and I didn't see any relevant speed difference that would justify making arguments about it.

    from timeit import Timer

    timeMap = Timer(lambda: list(map(lambda x: x*x, range(10**7))))
    print(timeMap.timeit(number=100))

    # as posted, this builds a list of lambda objects; it never calls them
    timeListComp = Timer(lambda: [(lambda x: x*x) for x in range(10**7)])
    print(timeListComp.timeit(number=100))

    # Map:                 166.95695265199174
    # List Comprehension:  177.97208347299602

User · Answer

I find the second way more readable. It tells you exactly what the intention is: filter the list. PS: do not use "list" as a variable name.

User · Answer

It is strange how much beauty varies for different people. I find the list comprehension much clearer than filter + lambda, but use whichever you find easier.

There are two things that may slow down your use of filter.

The first is the function call overhead: as soon as you use a Python function (whether created by def or lambda) it is likely that filter will be slower than the list comprehension. It almost certainly is not enough to matter, and you shouldn't think much about performance until you've timed your code and found it to be a bottleneck, but the difference will be there.

The other overhead that might apply is that the lambda is being forced to access a scoped variable (value). That is slower than accessing a local variable, and in Python 2.x the list comprehension only accesses local variables. If you are using Python 3.x the list comprehension runs in a separate function, so it will also be accessing value through a closure and this difference won't apply.

The other option to consider is to use a generator instead of a list comprehension:

    def filterbyvalue(seq, value):
        for el in seq:
            if el.attribute == value:
                yield el

Then in your main code (which is where readability really matters) you've replaced both list comprehension and filter with a hopefully meaningful function name.
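A short usage sketch for the generator approach above. The sample items are hypothetical (built with types.SimpleNamespace purely so they have an attribute field), and the names matches, first and rest are illustrative.

```python
from types import SimpleNamespace


def filterbyvalue(seq, value):
    # Lazily yield only the elements whose attribute equals value
    for el in seq:
        if el.attribute == value:
            yield el


# Hypothetical sample data
items = [SimpleNamespace(attribute=v) for v in (1, 2, 1, 3)]

matches = filterbyvalue(items, 1)  # nothing is evaluated yet
first = next(matches)              # pulls the first match on demand
rest = list(matches)               # consumes whatever is left
```

Like filter on Python 3, the generator is lazy, so wrap the call in list() when an actual list is needed.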

User · Answer

Although filter may be the "faster way", the "Pythonic way" would be not to care about such things unless performance is absolutely critical (in which case you wouldn't be using Python!).

User · Answer

An important difference is that a list comprehension will return a list while filter returns a filter object, which you cannot manipulate like a list (i.e. call len on it, which does not work with the return value of filter).

My own self-learning brought me to a similar issue.

That being said, if there is a way to get the resulting list from a filter, a bit like you would do in .NET when you do lst.Where(i => i.something()).ToList(), I am curious to know it.

EDIT: This is the case for Python 3, not 2 (see discussion in comments).
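For the len() point above, a minimal sketch on Python 3. The names numbers, evens, len_worked and leftover are made up for the example. Passing the filter object to list() plays the role of .ToList().

```python
numbers = [1, 2, 3, 4, 5, 6]
evens = filter(lambda n: n % 2 == 0, numbers)

# A filter object has no length on Python 3
try:
    len(evens)
    len_worked = True
except TypeError:
    len_worked = False

# list() materializes the results, after which len() and indexing work
evens_list = list(evens)

# The filter object is a one-shot iterator: a second pass yields nothing
leftover = list(evens)
```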

User · Answer

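My take:

    def filter_list(items, key, value, limit=None):
        # keep the entries whose items[key] equals value, truncated to at most limit results
        return [i for i in items if i[key] == value][:limit]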

User · Answer

In addition to the accepted answer, there is a corner case when you should use filter instead of a list comprehension. If the list is unhashable you cannot directly process it with a list comprehension. A real-world example is if you use pyodbc to read results from a database. The fetchall() results from the cursor are an unhashable list. In this situation, to directly manipulate the returned results, filter should be used:

    cursor.execute("SELECT * FROM TABLE1;")
    data_from_db = cursor.fetchall()
    processed_data = filter(lambda s: 'abc' in s.field1 or s.StartTime >= start_date_time, data_from_db)

If you use a list comprehension here you will get the error:

    TypeError: unhashable type: 'list'

User · Answer

Summarizing other answers

Looking through the answers, we have seen a lot of back and forth about whether list comprehension or filter may be faster, or whether it is even important or Pythonic to care about such an issue. In the end, the answer is, as most times: it depends.

I just stumbled across this question while optimizing code where this exact question (albeit combined with an in expression, not ==) is very relevant: the filter + lambda expression is taking up a third of my computation time (of multiple minutes).

My case

In my case, the list comprehension is much faster (twice the speed). But I suspect that this varies strongly based on the filter expression as well as the Python interpreter used.

Test it for yourself

Here is a simple code snippet that should be easy to adapt. If you profile it (most IDEs can do that easily), you will be able to easily decide for your specific case which is the better option:

    whitelist = set(range(0, 100000000, 27))

    input_list = list(range(0, 100000000))

    proximal_list = list(filter(
            lambda x: x in whitelist,
            input_list
        ))

    proximal_list2 = [x for x in input_list if x in whitelist]

    print(len(proximal_list))
    print(len(proximal_list2))

If you do not have an IDE that lets you profile easily, try this instead (extracted from my codebase, so a bit more complicated). This code snippet will create a profile for you that you can easily visualize using e.g. snakeviz:

    import cProfile
    from time import time


    class BlockProfile:
        def __init__(self, profile_path):
            self.profile_path = profile_path
            self.profiler = None
            self.start_time = None

        def __enter__(self):
            self.profiler = cProfile.Profile()
            self.start_time = time()
            self.profiler.enable()

        def __exit__(self, *args):
            self.profiler.disable()
            exec_time = int((time() - self.start_time) * 1000)
            self.profiler.dump_stats(self.profile_path)


    whitelist = set(range(0, 100000000, 27))
    input_list = list(range(0, 100000000))

    with BlockProfile("/path/to/create/profile/in/profile.pstat"):
        proximal_list = list(filter(
                lambda x: x in whitelist,
                input_list
            ))

        proximal_list2 = [x for x in input_list if x in whitelist]

    print(len(proximal_list))
    print(len(proximal_list2))

User · Answer

Generally, filter is slightly faster if using a builtin function.

I would expect the list comprehension to be slightly faster in your case.
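One way to check this claim on your own interpreter is a small timeit sketch like the following. The word list and function names are invented for the example, and str.isdigit stands in for "a builtin function"; no particular result is implied, since timings vary by machine and Python version.

```python
from timeit import timeit

# Hypothetical sample data: a mix of digit and non-digit strings
words = ["12", "ab", "345", "c6", "7"] * 1000


def with_filter():
    # The predicate is implemented in C, so no Python-level call per element
    return list(filter(str.isdigit, words))


def with_comprehension():
    return [w for w in words if w.isdigit()]


filter_seconds = timeit(with_filter, number=50)
comprehension_seconds = timeit(with_comprehension, number=50)
print(f"filter: {filter_seconds:.4f}s  comprehension: {comprehension_seconds:.4f}s")
```

Swapping str.isdigit for a lambda in with_filter shows the other half of the claim, where the per-element Python function call is paid again.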

User · Answer

Curiously, on Python 3 I see filter performing faster than list comprehensions.

I always thought that list comprehensions would be more performant. Something like:

    [name for name in brand_names_db if name is not None]

The bytecode generated is a bit better.

    >>> def f1(seq):
    ...     return list(filter(None, seq))
    >>> def f2(seq):
    ...     return [i for i in seq if i is not None]
    >>> disassemble(f1.__code__)
    2         0 LOAD_GLOBAL              0 (list)
              2 LOAD_GLOBAL              1 (filter)
              4 LOAD_CONST               0 (None)
              6 LOAD_FAST                0 (seq)
              8 CALL_FUNCTION            2
             10 CALL_FUNCTION            1
             12 RETURN_VALUE
    >>> disassemble(f2.__code__)
    2           0 LOAD_CONST               1 (<code object <listcomp> at 0x10cfcaa50, file "<stdin>", line 2>)
                2 LOAD_CONST               2 ('f2.<locals>.<listcomp>')
                4 MAKE_FUNCTION            0
                6 LOAD_FAST                0 (seq)
                8 GET_ITER
               10 CALL_FUNCTION            1
               12 RETURN_VALUE

But list comprehensions are actually slower:

    >>> timeit(stmt="f1(range(1000))", setup="from __main__ import f1,f2")
    21.177661532000116
    >>> timeit(stmt="f2(range(1000))", setup="from __main__ import f1,f2")
    42.233950221000214
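One caveat worth checking before comparing these two functions: filter(None, seq) keeps only truthy elements, so it is not the same test as "is not None". A minimal sketch with made-up sample data:

```python
# Hypothetical input mixing None with other falsy values
seq = [0, 1, None, "", "a", [], False, 2]

# filter(None, ...) drops every falsy element, not just None
truthy_only = list(filter(None, seq))

# The comprehension from the answer drops only None
not_none = [i for i in seq if i is not None]

# An equivalent filter needs an explicit predicate
not_none_with_filter = list(filter(lambda i: i is not None, seq))
```

On range(1000), as used in the timing above, the two functions differ only in that f1 also drops 0, but on general data they can return different results.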

User · Answer

This is a somewhat religious issue in Python. Even though Guido considered removing map, filter and reduce from Python 3, there was enough of a backlash that in the end only reduce was moved from built-ins to functools.reduce.

Personally I find list comprehensions easier to read. It is more explicit what is happening from the expression [i for i in list if i.attribute == value], as all the behaviour is on the surface, not inside the filter function.

I would not worry too much about the performance difference between the two approaches, as it is marginal. I would really only optimise this if it proved to be the bottleneck in your application, which is unlikely.

Also, since the BDFL wanted filter gone from the language, then surely that automatically makes list comprehensions more Pythonic ;-)

User · Answer

Since any speed difference is bound to be minuscule, whether to use filters or list comprehensions comes down to a matter of taste. In general I'm inclined to use comprehensions (which seems to agree with most other answers here), but there is one case where I prefer filter.

A very frequent use case is pulling out the values of some iterable X subject to a predicate P(x):

    [x for x in X if P(x)]

but sometimes you want to apply some function to the values first:

    [f(x) for x in X if P(f(x))]

As a specific example, consider

    primes_cubed = [x*x*x for x in range(1000) if prime(x)]

I think this looks slightly better than using filter. But now consider

    prime_cubes = [x*x*x for x in range(1000) if prime(x*x*x)]

In this case we want to filter against the post-computed value. Besides the issue of computing the cube twice (imagine a more expensive calculation), there is the issue of writing the expression twice, violating the DRY aesthetic. In this case I'd be apt to use

    prime_cubes = filter(prime, [x*x*x for x in range(1000)])
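A runnable sketch of the pattern above. Since prime is not defined in the answer, this uses a hypothetical predicate, is_palindrome, in its place. It also feeds filter a generator expression rather than a list, which is an optional variation that avoids building the intermediate list.

```python
def is_palindrome(n):
    # Hypothetical predicate standing in for prime(): reads the same both ways
    s = str(n)
    return s == s[::-1]


# Comprehension form: the cube is written (and computed) twice
repeated = [x*x*x for x in range(1000) if is_palindrome(x*x*x)]

# filter form: the cube is computed once per x, then tested
with_filter = list(filter(is_palindrome, (x*x*x for x in range(1000))))
```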

User · Answer

Here's a short piece I use when I need to filter on something after the list comprehension. Just a combination of filter, lambda, and lists (otherwise known as the loyalty of a cat and the cleanliness of a dog).

In this case I'm reading a file, stripping out blank lines, commented-out lines, and anything after a comment on a line:

    # Throw out blank lines and comments
    with open('file.txt', 'r') as lines:
        # From the inside out:
        #    [s.partition('#')[0].strip() for s in lines]... Throws out comments
        #   filter(lambda x: x != '', [s.part...             Filters out blank lines
        #  y for y in filter...                              Converts filter object to list
        file_contents = [y for y in filter(lambda x: x != '', [s.partition('#')[0].strip() for s in lines])]
