Parsing HTTP Response in Python

Question

I want to manipulate the information at THIS url   I can successfully open it and read its contents   But what I really want to do is throw out all the stuff I don t want  and to manipulate the stuff I want to keep   Is there a way to convert the string into a dict so I can iterate over it   Or do I just have to parse it as is  str type    from urllib request import urlopen  url    http   www quandl com api v1 datasets FRED GDP json  response   urlopen url   print response read      returns string with info

User · Answer

TL amp DR  When you typically get data from a server  it is sent in bytes   The rationale is that these bytes will need to be  decoded  by the recipient  who should know how to use the data    You should decode the binary upon arrival to not get  b   bytes  but instead a string   Use case   import requests     def get data from url url           response   requests get url to visit          response data split by line   response content decode  utf-8   splitlines           return response data split by line   In this example  I decode the content that I received into UTF-8   For my purposes  I then split it by line  so I can loop through each line with a for loop

User · Answer

You can also use python s requests library instead   import requests  url    http   www quandl com api v1 datasets FRED GDP json      response   requests get url      dict   response json     Now you can manipulate the  dict  like a python dictionary

User · Answer

json works with Unicode text in Python 3  JSON format itself is defined only in terms of Unicode text  and therefore you need to decode bytes received in HTTP response  r headers get content charset  utf-8   gets your the character encoding      usr bin env python3 import io import json from urllib request import urlopen  with urlopen  https   httpbin org get   as r         io TextIOWrapper r  encoding r headers get content charset  utf-8    as file      result   json load file  print result  headers    User-Agent      It is not necessary to use io TextIOWrapper here      usr bin env python3 import json from urllib request import urlopen  with urlopen  https   httpbin org get   as r      result   json loads r read   decode r headers get content charset  utf-8     print result  headers    User-Agent

User · Answer

I guess things have changed in python 3 4  This worked for me   print  resp     json dumps resp json

User · Answer

When I printed response read   I noticed that b was preprended to the string  e g  b   a  1       The  b  stands for bytes and serves as a declaration for the type of the object you re handling   Since  I knew that a string could be converted to a dict by using json loads  string    I just had to convert the byte type to a string type   I did this by decoding the response to utf-8 decode  utf-8     Once it was in a string type my problem was solved and I was easily able to iterate over the dict   I don t know if this is the fastest or most  pythonic  way of writing this but it works and theres always time later of optimization and improvement   Full code for my solution   from urllib request import urlopen import json    Get the dataset url    http   www quandl com api v1 datasets FRED GDP json  response   urlopen url     Convert bytes to string type and string type to dict string   response read   decode  utf-8   json obj   json loads string   print json obj  source name      prints the string with  source name  key

[json] Parsing HTTP Response in Python

Examples related to json

Examples related to api

Examples related to python-3.x

Examples related to dictionary

Examples related to urlopen