Add an index numeric ID column to large data frame

Question

I have a read large csv file into a data frame  Data in the csv file are from multiple web sites representing user information  For example here is the structure of the data frame   user id  number of logins  number of images  web 001  34  3  aa com 002  4  4  aa com 034  3  3  aa com 001  12  4  bb com 002  1  3  bb com 034  2  2  cc com   as you can see once I bring the data into the data frame user id is no longer a unique id and this causes all the analysis  I am trying to add another columns prior to user id which is something like  generated uid  and pretty much use the index of the data frame to be filled by that column  What s the best way to accomplish this

User · Answer

Well  if I understand you correctly  You can do something like the following   To show it  I first create a data frame with your example   df  lt -  scan what   character    sep        text    001  34  3  aa com 002  4  4  aa com 034  3  3  aa com 001  12  4  bb com 002  1  3  bb com 034  2  2  cc com    df  lt - as data frame matrix df  6  4  byrow   TRUE   colnames df   lt - c  user id    number of logins    number of images    web       You can then run one of the following lines to add a column  at the end of the data frame  with the row number as the generated user id  The second lines simply adds leading zeros   df generated uid   lt - 1 nrow df  df generated uid2  lt - sprintf   03d   1 nrow df     If you absolutely want the generated user id to be the first column  you can add the column like so   df  lt - cbind  generated uid3    sprintf   03d   1 nrow df    df    or simply rearrage the columns

User · Answer

Using alternative dplyr package   library  dplyr     or library  tidyverse    df  lt - df   gt   mutate id   row number

User · Answer

If your data frame is a data table  you can use special symbol  I   data   ID     I

User · Answer

You can add a sequence of numbers very easily with data ID  lt - seq int nrow data    If you are already using library tidyverse   you can use data  lt - tibble  rowid to column data   quot ID quot

[r] Add an index (numeric ID) column to large data frame

Examples related to r

Examples related to dataframe