How to import a csv file using python with headers intact where first column is a non-numerical

Question

This is an elaboration of a previous question, but as I delve deeper into python, I just get more confused as to how python handles csv files.

I have a csv file, and it must stay that way (e.g., cannot convert it to text file). It is the equivalent of a 5 rows by 11 columns array or matrix, or vector.

I have been attempting to read in the csv using various methods I have found here and other places (e.g. python.org) so that it preserves the relationship between columns and rows, where the first row and the first column = non-numerical values. The rest are float values, and contain a mixture of positive and negative floats.

What I wish to do is import the csv and compile it in python so that if I were to reference a column header, it would return its associated values stored in the rows. For example:

>>> workers, constant, age
>>> workers
    w0
    w1
    w2
    w3
    constant
    7.334
    5.235
    3.225
    0
    age
    -1.406
    -4.936
    -1.478
    0

And so forth...

I am looking for techniques for handling this kind of data structure. I am very new to python.

User · Answer

You can use pandas library and reference the rows and columns like this   import pandas as pd  input   pd read csv  path to file      for accessing ith row  input iloc i    for accessing column named X input X   for accessing ith row and column named X input iloc i  X

User · Answer

Python s csv module handles data row-wise  which is the usual way of looking at such data  You seem to want a column-wise approach  Here s one way of doing it   Assuming your file is named myclone csv and contains  workers constant age w0 7 334 -1 406 w1 5 235 -4 936 w2 3 2225 -1 478 w3 0 0   this code should give you an idea or two    gt  gt  gt  import csv  gt  gt  gt  f   open  myclone csv    rb    gt  gt  gt  reader   csv reader f   gt  gt  gt  headers   next reader  None   gt  gt  gt  headers   workers    constant    age    gt  gt  gt  column       gt  gt  gt  for h in headers         column h            gt  gt  gt  column   workers        constant        age        gt  gt  gt  for row in reader        for h  v in zip headers  row           column h  append v       gt  gt  gt  column   workers     w0    w1    w2    w3     constant     7 334    5 235    3 2225    0     age     -1 406    -4 936    -1 478    0     gt  gt  gt  column  workers     w0    w1    w2    w3    gt  gt  gt  column  constant     7 334    5 235    3 2225    0    gt  gt  gt  column  age     -1 406    -4 936    -1 478    0    gt  gt  gt    To get your numeric values into floats  add this  converters    str strip     float     len headers  - 1    up front  and do this  for h  v  conv in zip headers  row  converters     column h  append conv v     for each row instead of the similar two lines above

User · Answer

For Python 3  Remove the rb argument and use either r or don t pass argument  default read mode     with open   lt path-to-file gt    r    as theFile      reader   csv DictReader theFile      for line in reader            line is    workers    w0    constant   7 334   age   -1 406                  e g  print  line   workers      yields  w0          print line    For Python 2  import csv with open   lt path-to-file gt    rb    as theFile      reader   csv DictReader  theFile       for line in reader            line is    workers    w0    constant   7 334   age   -1 406                  e g  print  line   workers      yields  w0    Python has a powerful built-in CSV handler  In fact  most things are already built in to the standard library

[python] How to import a csv file using python with headers intact, where first column is a non-numerical

The answer is

Examples related to python

Examples related to csv

Tags