How do I profile memory usage in Python

Question

I ve recently become interested in algorithms and have begun exploring them by writing a naive implementation and then optimizing it in various ways   I m already familiar with the standard Python module for profiling runtime  for most things I ve found the timeit magic function in IPython to be sufficient   but I m also interested in memory usage so I can explore those tradeoffs as well  e g  the cost of caching a table of previously computed values versus recomputing them as needed   Is there a module that will profile the memory usage of a given function for me

User · Accepted Answer

This one has been answered already here  Python memory profiler  Basically you do something like that  cited from Guppy-PE     gt  gt  gt  from guppy import hpy  h hpy    gt  gt  gt  h heap   Partition of a set of 48477 objects  Total size   3265516 bytes   Index  Count         Size     Cumulative    Kind  class   dict of class       0  25773  53  1612820  49   1612820  49 str      1  11699  24   483960  15   2096780  64 tuple      2    174   0   241584   7   2338364  72 dict of module      3   3478   7   222592   7   2560956  78 types CodeType      4   3296   7   184576   6   2745532  84 function      5    401   1   175112   5   2920644  89 dict of class      6    108   0    81888   3   3002532  92 dict  no owner       7    114   0    79632   2   3082164  94 dict of type      8    117   0    51336   2   3133500  96 type      9    667   1    24012   1   3157512  97   builtin   wrapper descriptor  lt 76 more rows  Type e g     more  to view  gt   gt  gt  gt  h iso 1        Partition of a set of 3 objects  Total size   176 bytes   Index  Count         Size     Cumulative    Kind  class   dict of class       0      1  33      136  77       136  77 dict  no owner       1      1  33       28  16       164  93 list      2      1  33       12   7       176 100 int  gt  gt  gt  x     gt  gt  gt  h iso x  sp  0  h Root i0 modules    main       dict    x    gt  gt  gt

User · Answer

Python 3 4 includes a new module  tracemalloc  It provides detailed statistics about which code is allocating the most memory  Here s an example that displays the top three lines allocating memory   from collections import Counter import linecache import os import tracemalloc  def display top snapshot  key type  lineno   limit 3       snapshot   snapshot filter traces           tracemalloc Filter False    lt frozen importlib  bootstrap gt             tracemalloc Filter False    lt unknown gt                top stats   snapshot statistics key type       print  Top  s lines    limit      for index  stat in enumerate top stats  limit   1           frame   stat traceback 0            replace   path to module file py  with  module file py          filename   os sep join frame filename split os sep  -2            print    s   s  s    1f KiB                   index  filename  frame lineno  stat size   1024           line   linecache getline frame filename  frame lineno  strip           if line              print       s    line       other   top stats limit       if other          size   sum stat size for stat in other          print   s other    1f KiB     len other   size   1024       total   sum stat size for stat in top stats      print  Total allocated size    1f KiB     total   1024     tracemalloc start    counts   Counter   fname     usr share dict american-english  with open fname  as words      words   list words      for word in words          prefix   word  3          counts prefix     1 print  Top prefixes    counts most common 3    snapshot   tracemalloc take snapshot   display top snapshot    And here are the results   Top prefixes     con   1220     dis   1002     pro   809   Top 3 lines  1  scratches memory test py 37  6527 1 KiB     words   list words   2  scratches memory test py 39  247 7 KiB     prefix   word  3   3  scratches memory test py 40  193 0 KiB     counts prefix     1 4 other  4 3 KiB Total allocated size  6972 1 KiB   When is a memory leak not a leak   That example is great when the memory is still being held at the end of the calculation  but sometimes you have code that allocates a lot of memory and then releases it all  It s not technically a memory leak  but it s using more memory than you think it should  How can you track memory usage when it all gets released  If it s your code  you can probably add some debugging code to take snapshots while it s running  If not  you can start a background thread to monitor memory usage while the main thread runs   Here s the previous example where the code has all been moved into the count prefixes   function  When that function returns  all the memory is released  I also added some sleep   calls to simulate a long-running calculation   from collections import Counter import linecache import os import tracemalloc from time import sleep   def count prefixes        sleep 2     Start up time      counts   Counter       fname     usr share dict american-english      with open fname  as words          words   list words          for word in words              prefix   word  3              counts prefix     1             sleep 0 0001      most common   counts most common 3      sleep 3     Shut down time      return most common   def main        tracemalloc start        most common   count prefixes       print  Top prefixes    most common       snapshot   tracemalloc take snapshot       display top snapshot    def display top snapshot  key type  lineno   limit 3       snapshot   snapshot filter traces           tracemalloc Filter False    lt frozen importlib  bootstrap gt             tracemalloc Filter False    lt unknown gt                top stats   snapshot statistics key type       print  Top  s lines    limit      for index  stat in enumerate top stats  limit   1           frame   stat traceback 0            replace   path to module file py  with  module file py          filename   os sep join frame filename split os sep  -2            print    s   s  s    1f KiB                   index  filename  frame lineno  stat size   1024           line   linecache getline frame filename  frame lineno  strip           if line              print       s    line       other   top stats limit       if other          size   sum stat size for stat in other          print   s other    1f KiB     len other   size   1024       total   sum stat size for stat in top stats      print  Total allocated size    1f KiB     total   1024     main     When I run that version  the memory usage has gone from 6MB down to 4KB  because the function released all its memory when it finished   Top prefixes     con   1220     dis   1002     pro   809   Top 3 lines  1  collections   init   py 537  0 7 KiB     self update  args    kwds   2  collections   init   py 555  0 6 KiB     return  heapq nlargest n  self items    key  itemgetter 1    3  python3 6 heapq py 569  0 5 KiB     result     key elem   i  elem  for i  elem in zip range 0  -n  -1   it   10 other  2 2 KiB Total allocated size  4 0 KiB   Now here s a version inspired by another answer that starts a second thread to monitor memory usage   from collections import Counter import linecache import os import tracemalloc from datetime import datetime from queue import Queue  Empty from resource import getrusage  RUSAGE SELF from threading import Thread from time import sleep  def memory monitor command queue  Queue  poll interval 1       tracemalloc start       old max   0     snapshot   None     while True          try              command queue get timeout poll interval              if snapshot is not None                  print datetime now                    display top snapshot               return         except Empty              max rss   getrusage RUSAGE SELF  ru maxrss             if max rss  gt  old max                  old max   max rss                 snapshot   tracemalloc take snapshot                   print datetime now     max RSS   max rss    def count prefixes        sleep 2     Start up time      counts   Counter       fname     usr share dict american-english      with open fname  as words          words   list words          for word in words              prefix   word  3              counts prefix     1             sleep 0 0001      most common   counts most common 3      sleep 3     Shut down time      return most common   def main        queue   Queue       poll interval   0 1     monitor thread   Thread target memory monitor  args  queue  poll interval       monitor thread start       try          most common   count prefixes           print  Top prefixes    most common      finally          queue put  stop           monitor thread join     def display top snapshot  key type  lineno   limit 3       snapshot   snapshot filter traces           tracemalloc Filter False    lt frozen importlib  bootstrap gt             tracemalloc Filter False    lt unknown gt                top stats   snapshot statistics key type       print  Top  s lines    limit      for index  stat in enumerate top stats  limit   1           frame   stat traceback 0            replace   path to module file py  with  module file py          filename   os sep join frame filename split os sep  -2            print    s   s  s    1f KiB                   index  filename  frame lineno  stat size   1024           line   linecache getline frame filename  frame lineno  strip           if line              print       s    line       other   top stats limit       if other          size   sum stat size for stat in other          print   s other    1f KiB     len other   size   1024       total   sum stat size for stat in top stats      print  Total allocated size    1f KiB     total   1024     main     The resource module lets you check the current memory usage  and save the snapshot from the peak memory usage  The queue lets the main thread tell the memory monitor thread when to print its report and shut down  When it runs  it shows the memory being used by the list   call   2018-05-29 10 34 34 441334 max RSS 10188 2018-05-29 10 34 36 475707 max RSS 23588 2018-05-29 10 34 36 616524 max RSS 38104 2018-05-29 10 34 36 772978 max RSS 45924 2018-05-29 10 34 36 929688 max RSS 46824 2018-05-29 10 34 37 087554 max RSS 46852 Top prefixes     con   1220     dis   1002     pro   809   2018-05-29 10 34 56 281262 Top 3 lines  1  scratches scratch py 36  6527 0 KiB     words   list words   2  scratches scratch py 38  16 4 KiB     prefix   word  3   3  scratches scratch py 39  10 1 KiB     counts prefix     1 19 other  10 8 KiB Total allocated size  6564 3 KiB   If you re on Linux  you may find  proc self statm more useful than the resource module

User · Answer

If you only want to look at the memory usage of an object   answer to other question      There is a module called Pympler which contains the asizeof   module       Use as follows   from pympler import asizeof asizeof asizeof my object        Unlike sys getsizeof  it works for your self-created objects    gt  gt  gt  asizeof asizeof tuple  bcd    200  gt  gt  gt  asizeof asizeof   foo    bar    baz    bar    400  gt  gt  gt  asizeof asizeof     280  gt  gt  gt  asizeof asizeof   foo   bar    360  gt  gt  gt  asizeof asizeof  foo   40  gt  gt  gt  asizeof asizeof Bar    352  gt  gt  gt  asizeof asizeof Bar     dict    280     gt  gt  gt  help asizeof asizeof  Help on function asizeof in module pympler asizeof   asizeof  objs    opts      Return the combined size in bytes of all objects passed as positional arguments

User · Answer

Since the accepted answer and also the next highest voted answer have  in my opinion  some problems  I d like to offer one more answer that is based closely on Ihor B  s answer with some small but important modifications   This solution allows you to run profiling on either by wrapping a function call with the profile function and calling it  or by decorating your function method with the  profile decorator   The first technique is useful when you want to profile some third-party code without messing with its source  whereas the second technique is a bit  cleaner  and works better when you are don t mind modifying the source of the function method you want to profile   I ve also modified the output  so that you get RSS  VMS  and shared memory  I don t care much about the  before  and  after  values  but only the delta  so I removed those  if you re comparing to Ihor B  s answer    Profiling code    profile py import time import os import psutil import inspect   def elapsed since start        return time strftime   H  M  S   time gmtime time time   - start       elapsed   time time   - start     if elapsed  lt  1          return str round elapsed 1000 2      ms      if elapsed  lt  60          return str round elapsed  2      s      if elapsed  lt  3600          return str round elapsed 60  2      min      else          return str round elapsed   3600  2      hrs    def get process memory        process   psutil Process os getpid        mi   process memory info       return mi rss  mi vms  mi shared   def format bytes bytes       if abs bytes   lt  1000          return str bytes   B      elif abs bytes   lt  1e6          return str round bytes 1e3 2      kB      elif abs bytes   lt  1e9          return str round bytes   1e6  2      MB      else          return str round bytes   1e9  2      GB    def profile func   args    kwargs       def wrapper  args    kwargs           rss before  vms before  shared before   get process memory           start   time time           result   func  args    kwargs          elapsed time   elapsed since start          rss after  vms after  shared after   get process memory           print  Profiling     gt 20   RSS     gt 8    VMS     gt 8    SHR                     gt 8    time     gt 8                format   lt     func   name       gt                        format bytes rss after - rss before                       format bytes vms after - vms before                       format bytes shared after - shared before                       elapsed time           return result     if inspect isfunction func           return wrapper     elif inspect ismethod func           return wrapper  args   kwargs    Example usage  assuming the above code is saved as profile py   from profile import profile from time import sleep from sklearn import datasets   Just an example of 3rd party function call     Method 1 run profiling   profile datasets load digits  data   run profiling      Method 2  profile def my function          do some stuff     a list          for i in range 1 100000           a list append i      return a list   res   my function     This should result in output similar to the below   Profiling          lt load digits gt   RSS    5 07MB   VMS    4 91MB   SHR  73 73kB   time   89 99ms Profiling          lt my function gt   RSS    1 06MB   VMS    1 35MB   SHR       0B   time    8 43ms   A couple of important final notes    Keep in mind  this method of profiling is only going to be approximate  since lots of other stuff might be happening on the machine  Due to garbage collection and other factors  the deltas might even be zero  For some unknown reason  very short function calls  e g  1 or 2 ms  show up with zero memory usage  I suspect this is some limitation of the hardware OS  tested on basic laptop with Linux  on how often memory statistics are updated   To keep the examples simple  I didn t use any function arguments  but they should work as one would expect  i e  profile my function  arg  to profile my function arg

User · Answer

Below is a simple function decorator which allows to track how much memory the process consumed before the function call  after the function call  and what is the difference  import time import os import psutil     def elapsed since start       return time strftime  quot  H  M  S quot   time gmtime time time   - start       def get process memory        process   psutil Process os getpid        mem info   process memory info       return mem info rss     def profile func       def wrapper  args    kwargs           mem before   get process memory           start   time time           result   func  args    kwargs          elapsed time   elapsed since start          mem after   get process memory           print  quot     memory before        after        consumed        exec time     quot  format              func   name                mem before  mem after  mem after - mem before              elapsed time           return result     return wrapper  Here is my blog which describes all the details   archived link

User · Answer

A simple example to calculate the memory usage of a block of codes   function using memory profile  while returning result of the function    import memory profiler as mp  def fun n       tmp          for i in range n           tmp extend list range i i        return  XXXXX       calculate memory usage before running the code then calculate max usage during the code        start mem   mp memory usage max usage True  res   mp memory usage proc  fun   100    max usage True  retval True   print  start mem   start mem  print  max mem   res 0  0   print  used mem   res 0  0 -start mem  print  fun output   res 1        calculate usage in sampling points while running function    res   mp memory usage  fun   100    interval  001  retval True  print  min mem   min res 0    print  max mem   max res 0    print  used mem   max res 0  -min res 0    print  fun output   res 1        Credits   skeept

User · Answer

Disclosure    Applicable on Linux only Reports memory used by the current process as a whole  not individual functions within   But nice because of its simplicity   import resource def using point          usage resource getrusage resource RUSAGE SELF      return     s  usertime  s systime  s mem  s mb                 point usage 0  usage 1                   usage 2  1024 0     Just insert using  Label   where you want to see what s going on   For example  print using  before    wrk     wasting mem     1000000 print using  after      gt  gt  gt  before  usertime 2 117053 systime 1 703466 mem 53 97265625 mb  gt  gt  gt  after  usertime 2 12023 systime 1 70708 mem 60 8828125 mb

User · Answer

maybe it help   lt see additional   pip install gprof2dot sudo apt-get install graphviz  gprof2dot -f pstats profile for func1 001   dot -Tpng -o profile png  def profileit name                profileit  profile for func1 001               def inner func           def wrapper  args    kwargs               prof   cProfile Profile               retval   prof runcall func   args    kwargs                Note use of name from outer scope             prof dump stats name              return retval         return wrapper     return inner   profileit  profile for func1 001   def func1

[python] How do I profile memory usage in Python?

Examples related to python

Examples related to memory

Examples related to profiling