Removing empty rows of a data file in R

Question

I have a dataset with empty rows  I would like to remove them   myData lt -myData -which apply myData 1 function x all is na x         It works OK  But now I would like to add a column in my data and initialize the first value   myData newCol 1   lt - -999  Error in    lt - data frame    tmp     newCol   value   -999       replacement has 1 rows  data has 0   Unfortunately it doesn t work and I don t really understand why and I can t solve this  It worked when I removed one line at a time using   TgData   TgData 2 nrow TgData      Or anything similar   It also works when I used only the first 13 000 rows   But it doesn t work with my actual data  with 32 000 rows   What did I do wrong  It seems to make no sense to me

User · Answer

Alternative solution for rows of NAs using janitor package   myData   gt   remove empty  rows

User · Answer

I assume you want to remove rows that are all NAs  Then  you can do the following    data  lt - rbind c 1 2 3   c 1  NA  4   c 4 6 7   c NA  NA  NA   c 4  8  NA     sample data data        1    2    3   1      1    2    3  2      1   NA    4  3      4    6    7  4     NA   NA   NA  5      4    8   NA  data rowSums is na data      ncol data           1    2    3   1      1    2    3  2      1   NA    4  3      4    6    7  4      4    8   NA   If you want to remove rows that have at least one NA  just change the condition    data rowSums is na data      0          1    2    3   1      1    2    3  2      4    6    7

User · Answer

Here are some dplyr options     sample data df  lt - data frame a   c  1   NA   3   NA   b   c  a    b    c   NA   c   c  e    f    g   NA    library dplyr     remove rows where all values are NA  df   gt   filter all any vars  is na      df   gt   filter all any vars complete cases            remove rows where only some values are NA  df   gt   filter all all vars  is na      df   gt   filter all all vars complete cases           or more succinctly  df   gt   filter complete cases       df   gt   na omit    dplyr and tidyr  library tidyr  df   gt   drop na

User · Answer

If you have empty rows  not NAs  you can do   data  apply data        1  all      To remove both  NAs and empty    data  lt - data  apply is na data    data        1  all

User · Answer

This is similar to some of the above answers  but with this  you can specify if you want to remove rows with a percentage of missing values greater-than or equal-to a given percent  with the argument pct   drop rows all na  lt - function x  pct 1  x  rowSums is na x    gt   ncol x  pct     Where x is a dataframe and pct is the threshold of NA-filled data you want to get rid of   pct   1 means remove rows that have 100  of its values NA  pct    5 means remome rows that have at least half its values NA

[r] Removing empty rows of a data file in R

Examples related to r