Log all requests from the python-requests module

Question

I am using python Requests  I need to debug some OAuth activity  and for that I would like it to log all requests being performed  I could get this information with ngrep  but unfortunately it is not possible to grep https connections  which are needed for OAuth   How can I activate logging of all URLs    parameters  that Requests is accessing

User · Accepted Answer

The underlying urllib3 library logs all new connections and URLs with the logging module  but not POST bodies  For GET requests this should be enough   import logging  logging basicConfig level logging DEBUG    which gives you the most verbose logging option  see the logging HOWTO for more details on how to configure logging levels and destinations   Short demo    gt  gt  gt  import requests  gt  gt  gt  import logging  gt  gt  gt  logging basicConfig level logging DEBUG   gt  gt  gt  r   requests get  http   httpbin org get foo bar amp baz python   DEBUG urllib3 connectionpool Starting new HTTP connection  1   httpbin org 80 DEBUG urllib3 connectionpool http   httpbin org 80  GET  get foo bar amp baz python HTTP 1 1  200 366   Depending on the exact version of urllib3  the following messages are logged    INFO  Redirects WARN  Connection pool full  if this happens often increase the connection pool size  WARN  Failed to parse headers  response headers with invalid format  WARN  Retrying the connection WARN  Certificate did not match expected hostname WARN  Received response with both Content-Length and Transfer-Encoding  when processing a chunked response DEBUG  New connections  HTTP or HTTPS  DEBUG  Dropped connections DEBUG  Connection details  method  path  HTTP version  status code and response length DEBUG  Retry count increments   This doesn t include headers or bodies  urllib3 uses the http client HTTPConnection class to do the grunt-work  but that class doesn t support logging  it can normally only be configured to print to stdout  However  you can rig it to send all debug information to logging instead by introducing an alternative print name into that module   import logging import http client  httpclient logger   logging getLogger  http client    def httpclient logging patch level logging DEBUG          Enable HTTPConnection debug logging to the logging framework         def httpclient log  args           httpclient logger log level      join args          mask the print   built-in in the http client module to use       logging instead     http client print   httpclient log       enable debugging     http client HTTPConnection debuglevel   1   Calling httpclient logging patch   causes http client connections to output all debug information to a standard logger  and so are picked up by logging basicConfig      gt  gt  gt  httpclient logging patch    gt  gt  gt  r   requests get  http   httpbin org get foo bar amp baz python   DEBUG urllib3 connectionpool Starting new HTTP connection  1   httpbin org 80 DEBUG http client send  b GET  get foo bar amp baz python HTTP 1 1 r nHost  httpbin org r nUser-Agent  python-requests 2 22 0 r nAccept-Encoding  gzip  deflate r nAccept      r nConnection  keep-alive r n r n  DEBUG http client reply   HTTP 1 1 200 OK r n  DEBUG http client header  Date  Tue  04 Feb 2020 13 36 53 GMT DEBUG http client header  Content-Type  application json DEBUG http client header  Content-Length  366 DEBUG http client header  Connection  keep-alive DEBUG http client header  Server  gunicorn 19 9 0 DEBUG http client header  Access-Control-Allow-Origin    DEBUG http client header  Access-Control-Allow-Credentials  true DEBUG urllib3 connectionpool http   httpbin org 80  GET  get foo bar amp baz python HTTP 1 1  200 366

User · Answer

Just improving this answer This is how it worked for me  import logging import sys     import requests import textwrap      root   logging getLogger  httplogger     def logRoundtrip response   args    kwargs       extra     req   response request   res   response      root debug  HTTP roundtrip   extra extra        class HttpFormatter logging Formatter        def  formatHeaders self  d           return   n  join f  k    v   for k  v in d items         def formatMessage self  record           result   super   formatMessage record          if record name     httplogger               result    textwrap dedent                     ---------------- request ----------------                  req method   req url                   reqhdrs                    req body                  ---------------- response ----------------                  res status code   res reason   res url                   reshdrs                    res text                   format                  req record req                  res record res                  reqhdrs self  formatHeaders record req headers                   reshdrs self  formatHeaders record res headers                          return result  formatter   HttpFormatter   asctime   levelname   name   message    style      handler   logging StreamHandler sys stdout  handler setFormatter formatter  root addHandler handler  root setLevel logging DEBUG    session   requests Session   session hooks  response   append logRoundtrip  session get  http   httpbin org

User · Answer

For those using python 3   import requests import logging import http client  http client HTTPConnection debuglevel   1  logging basicConfig   logging getLogger   setLevel logging DEBUG  requests log   logging getLogger  requests packages urllib3   requests log setLevel logging DEBUG  requests log propagate   True

User · Answer

I m using python 3 4  requests 2 19 1    urllib3  is the logger to get now  no longer  requests packages urllib3     Basic logging will still happen without setting http client HTTPConnection debuglevel

User · Answer

You need to enable debugging at httplib level  requests  rarr  urllib3  rarr  httplib    Here s some functions to both toggle      on   and     off    or temporarily have it on   import logging import contextlib try      from http client import HTTPConnection   py3 except ImportError      from httplib import HTTPConnection   py2  def debug requests on           Switches on logging of the requests module         HTTPConnection debuglevel   1      logging basicConfig       logging getLogger   setLevel logging DEBUG      requests log   logging getLogger  requests packages urllib3       requests log setLevel logging DEBUG      requests log propagate   True  def debug requests off           Switches off logging of the requests module  might be some side-effects        HTTPConnection debuglevel   0      root logger   logging getLogger       root logger setLevel logging WARNING      root logger handlers          requests log   logging getLogger  requests packages urllib3       requests log setLevel logging WARNING      requests log propagate   False   contextlib contextmanager def debug requests           Use with  with          debug requests on       yield     debug requests off     Demo use    gt  gt  gt  requests get  http   httpbin org     lt Response  200  gt    gt  gt  gt  debug requests on    gt  gt  gt  requests get  http   httpbin org    INFO requests packages urllib3 connectionpool Starting new HTTP connection  1   httpbin org DEBUG requests packages urllib3 connectionpool  GET   HTTP 1 1  200 12150 send   GET   HTTP 1 1 r nHost  httpbin org r nConnection  keep-alive r nAccept- Encoding  gzip  deflate r nAccept      r nUser-Agent  python-requests 2 11 1 r n r n  reply   HTTP 1 1 200 OK r n  header  Server  nginx      lt Response  200  gt    gt  gt  gt  debug requests off    gt  gt  gt  requests get  http   httpbin org     lt Response  200  gt    gt  gt  gt  with debug requests            requests get  http   httpbin org    INFO requests packages urllib3 connectionpool Starting new HTTP connection  1   httpbin org      lt Response  200  gt    You will see the REQUEST  including HEADERS and DATA  and RESPONSE with HEADERS but without DATA  The only thing missing will be the response body which is not logged   Source

User · Answer

When trying to get the Python logging system  import logging  to emit low level debug log messages  it suprised me to discover that given   requests -- gt  urllib3 -- gt  http client HTTPConnection   that only urllib3 actually uses the Python logging system    requests no http client HTTPConnection no urllib3 yes   Sure  you can extract debug messages from HTTPConnection by setting   HTTPConnection debuglevel   1   but these outputs are merely emitted via the print statement  To prove this  simply grep the Python 3 7 client py source code and view the print statements yourself  thanks  Yohann    curl https   raw githubusercontent com python cpython 3 7 Lib http client py  grep -A1 debuglevel     Presumably redirecting stdout in some way might work to shoe-horn stdout into the logging system and potentially capture to e g  a log file    Choose the  urllib3  logger not  requests packages urllib3   To capture urllib3 debug information through the Python 3 logging system   contrary to much advice on the internet  and as  MikeSmith points out  you won   t have much luck intercepting   log   logging getLogger  requests packages urllib3     instead you need to   log   logging getLogger  urllib3     Debugging urllib3 to a log file  Here is some code which logs urllib3 workings to a log file using the Python logging system   import requests import logging from http client import HTTPConnection    py3    log   logging getLogger  requests packages urllib3      useless log   logging getLogger  urllib3      works  log setLevel logging DEBUG     needed fh   logging FileHandler  requests log   log addHandler fh   requests get  http   httpbin org      the result   Starting new HTTP connection  1   httpbin org 80 http   httpbin org 80  GET   HTTP 1 1  200 3168   Enabling the HTTPConnection debuglevel print   statements  If you set HTTPConnection debuglevel   1   from http client import HTTPConnection    py3 HTTPConnection debuglevel   1 requests get  http   httpbin org      you ll get the print statement output of additional juicy low level info   send  b GET   HTTP 1 1 r nHost  httpbin org r nUser-Agent  python-  requests 2 22 0 r nAccept-Encoding  gzip  deflate r nAccept      r nConnection  keep-alive r n r n  reply   HTTP 1 1 200 OK r n  header  Access-Control-Allow-Credentials header  Access-Control-Allow-Origin  header  Content-Encoding header  Content-Type header  Date header        Remember this output uses print and not the Python logging system  and thus cannot be captured using a traditional logging stream or file handler  though it may be possible to capture output to a file by redirecting stdout    Combine the two above - maximise all possible logging to console  To maximise all possible logging  you must settle for console stdout output with this   import requests import logging from http client import HTTPConnection    py3  log   logging getLogger  urllib3   log setLevel logging DEBUG     logging from urllib3 to console ch   logging StreamHandler   ch setLevel logging DEBUG  log addHandler ch     print statements from  http client HTTPConnection  to console stdout HTTPConnection debuglevel   1  requests get  http   httpbin org      giving the full range of output   Starting new HTTP connection  1   httpbin org 80 send  b GET   HTTP 1 1 r nHost  httpbin org r nUser-Agent  python-requests 2 22 0 r nAccept-Encoding  gzip  deflate r nAccept      r nConnection  keep-alive r n r n  reply   HTTP 1 1 200 OK r n  http   httpbin org 80  GET   HTTP 1 1  200 3168 header  Access-Control-Allow-Credentials header  Access-Control-Allow-Origin  header  Content-Encoding header

User · Answer

Having a script or even a subsystem of an application for a network protocol debugging  it s desired to see what request-response pairs are exactly  including effective URLs  headers  payloads and the status  And it s typically impractical to instrument individual requests all over the place  At the same time there are performance considerations that suggest using single  or few specialised  requests Session  so the following assumes that the suggestion is followed   requests supports so called event hooks  as of 2 23 there s actually only response hook   It s basically an event listener  and the event is emitted before returning control from requests request  At this moment both request and response are fully defined  hence can be logged   import logging  import requests   logger   logging getLogger  httplogger    def logRoundtrip response   args    kwargs       extra     req   response request   res   response      logger debug  HTTP roundtrip   extra extra   session   requests Session   session hooks  response   append logRoundtrip    That s basically how to log all HTTP round-trips of a session    Formatting HTTP round-trip log records  For the logging above to be useful there can be specialised logging formatter that understands req and res extras on logging records  It can look like this   import textwrap  class HttpFormatter logging Formatter           def  formatHeaders self  d           return   n  join f  k    v   for k  v in d items         def formatMessage self  record           result   super   formatMessage record          if record name     httplogger               result    textwrap dedent                     ---------------- request ----------------                  req method   req url                   reqhdrs                    req body                  ---------------- response ----------------                  res status code   res reason   res url                   reshdrs                    res text                   format                  req record req                  res record res                  reqhdrs self  formatHeaders record req headers                   reshdrs self  formatHeaders record res headers                          return result  formatter   HttpFormatter   asctime   levelname   name   message    style      handler   logging StreamHandler   handler setFormatter formatter  logging basicConfig level logging DEBUG  handlers  handler     Now if you do some requests using the session  like   session get  https   httpbin org user-agent   session get  https   httpbin org status 200     The output to stderr will look as follows   2020-05-14 22 10 13 224 DEBUG urllib3 connectionpool Starting new HTTPS connection  1   httpbin org 443 2020-05-14 22 10 13 695 DEBUG urllib3 connectionpool https   httpbin org 443  GET  user-agent HTTP 1 1  200 45 2020-05-14 22 10 13 698 DEBUG httplogger HTTP roundtrip ---------------- request ---------------- GET https   httpbin org user-agent User-Agent  python-requests 2 23 0 Accept-Encoding  gzip  deflate Accept      Connection  keep-alive  None ---------------- response ---------------- 200 OK https   httpbin org user-agent Date  Thu  14 May 2020 20 10 13 GMT Content-Type  application json Content-Length  45 Connection  keep-alive Server  gunicorn 19 9 0 Access-Control-Allow-Origin    Access-Control-Allow-Credentials  true       user-agent    python-requests 2 23 0      2020-05-14 22 10 13 814 DEBUG urllib3 connectionpool https   httpbin org 443  GET  status 200 HTTP 1 1  200 0 2020-05-14 22 10 13 818 DEBUG httplogger HTTP roundtrip ---------------- request ---------------- GET https   httpbin org status 200 User-Agent  python-requests 2 23 0 Accept-Encoding  gzip  deflate Accept      Connection  keep-alive  None ---------------- response ---------------- 200 OK https   httpbin org status 200 Date  Thu  14 May 2020 20 10 13 GMT Content-Type  text html  charset utf-8 Content-Length  0 Connection  keep-alive Server  gunicorn 19 9 0 Access-Control-Allow-Origin    Access-Control-Allow-Credentials  true   A GUI way  When you have a lot of queries  having a simple UI and a way to filter records comes at handy  I ll show to use Chronologer for that  which I m the author of    First  the hook has be rewritten to produce records that logging can serialise when sending over the wire  It can look like this   def logRoundtrip response   args    kwargs        extra              req                  method   response request method               url   response request url               headers   response request headers               body   response request body                       res                  code   response status code               reason   response reason               url   response url               headers   response headers               body   response text                      logger debug  HTTP roundtrip   extra extra   session   requests Session   session hooks  response   append logRoundtrip    Second  logging configuration has to be adapted to use logging handlers HTTPHandler  which Chronologer understands    import logging handlers  chrono   logging handlers HTTPHandler     localhost 8080     api v1 record    POST   credentials   logger        handlers    logging StreamHandler    chrono  logging basicConfig level logging DEBUG  handlers handlers    Finally  run Chronologer instance  e g  using Docker   docker run --rm -it -p 8080 8080 -v  tmp db       -e CHRONOLOGER STORAGE DSN sqlite     tmp db chrono sqlite       -e CHRONOLOGER SECRET example       -e CHRONOLOGER ROLES  basic-reader query-reader writer        saaj chronologer       python -m chronologer -e production serve -u www-data -g www-data -m   And run the requests again   session get  https   httpbin org user-agent   session get  https   httpbin org status 200     The stream handler will produce   DEBUG urllib3 connectionpool Starting new HTTPS connection  1   httpbin org 443 DEBUG urllib3 connectionpool https   httpbin org 443  GET  user-agent HTTP 1 1  200 45 DEBUG httplogger HTTP roundtrip DEBUG urllib3 connectionpool https   httpbin org 443  GET  status 200 HTTP 1 1  200 0 DEBUG httplogger HTTP roundtrip   Now if you open http   localhost 8080   use  logger  for username and empty password for the basic auth popup  and click  Open  button  you should see something like

User · Answer

I m using a logger config yaml file to configure my logging  and to get those logs to show up  all I had to do was to add a disable existing loggers  False to the end of it  My logging setup is rather extensive and confusing  so I don t even know a good way to explain it here  but if someone s also using a YAML file to configure their logging  this might help  https   docs python org 3 howto logging html configuring-logging

[python] Log all requests from the python-requests module

Examples related to python

Examples related to logging

Examples related to python-requests