Using Linq to get the last N elements of a collection

Question

Given a collection  is there a way to get the last N elements of that collection   If there isn t a method in the framework  what would be the best way to write an extension method to do this

User · Answer

Here s my solution   public static class EnumerationExtensions       public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  input  int count                if  count  lt   0              yield break           var inputList   input as IList lt T gt            if  inputList    null                        int last   inputList Count              int first   last - count               if  first  lt  0                  first   0               for  int i   first  i  lt  last  i                    yield return inputList i                     else                          Use a ring buffer  We have to enumerate the input  and we don t know in advance how many elements it will contain              T   buffer   new T count                int index   0               count   0               foreach  T item in input                                buffer index    item                   index    index   1    buffer Length                  count                                  The index variable now points at the next buffer entry that would be filled  If the buffer isn t completely                full  then there are  count  elements preceding index  If the buffer  is  full  then index is pointing at                the oldest entry  which is the first one to return                                If the buffer isn t full  which means that the enumeration has fewer than  count  elements  we ll fix up                 index  to point at the first entry to return  That s easy to do  if the buffer isn t full  then the oldest                entry is the first one   -                                We ll also set  count  to the number of elements to be returned  It only needs adjustment if we ve wrapped                past the end of the buffer and have enumerated more than the original count value               if  count  lt  buffer Length                  index   0              else                 count   buffer Length                  Return the values in the correct order              while  count  gt  0                                yield return buffer index                    index    index   1    buffer Length                  count--                                     public static IEnumerable lt T gt  SkipLast lt T gt  this IEnumerable lt T gt  input  int count                if  count  lt   0              return input          else             return input SkipLastIter count              private static IEnumerable lt T gt  SkipLastIter lt T gt  this IEnumerable lt T gt  input  int count                var inputList   input as IList lt T gt            if  inputList    null                        int first   0              int last   inputList Count - count               if  last  lt  0                  last   0               for  int i   first  i  lt  last  i                    yield return inputList i                     else                          Aim to leave  count  items in the queue  If the input has fewer than  count                 items  then the queue won t ever fill and we return nothing               Queue lt T gt  elements   new Queue lt T gt                  foreach  T item in input                                elements Enqueue item                    if  elements Count  gt  count                      yield return elements Dequeue                                      The code is a bit chunky  but as a drop-in reusable component  it should perform as well as it can in most scenarios  and it ll keep the code that s using it nice and concise   -   My TakeLast for non-IList 1 is based on the same ring buffer algorithm as that in the answers by  Mark Byers and  MackieChan further up  It s interesting how similar they are -- I wrote mine completely independently  Guess there s really just one way to do a ring buffer properly   -   Looking at  kbrimington s answer  an additional check could be added to this for IQuerable lt T gt  to fall back to the approach that works well with Entity Framework -- assuming that what I have at this point does not

User · Answer

coll Reverse   Take N  Reverse   ToList      public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  coll  int N        return coll Reverse   Take N  Reverse        UPDATE  To address clintp s problem   a  Using the TakeLast   method I defined above solves the problem  but if you really want the do it without the extra method  then you just have to recognize that while Enumerable Reverse   can be used as an extension method  you aren t required to use it that way   List lt string gt  mystring   new List lt string gt       one    two    three      mystring   Enumerable Reverse mystring  Take 2  Reverse   ToList

User · Answer

Use EnumerableEx TakeLast in RX s System Interactive assembly  It s an O N  implementation like  Mark s  but it uses a queue rather than a ring-buffer construct  and dequeues items when it reaches buffer capacity     NB  This is the IEnumerable version - not the IObservable version  though the implementation of the two is pretty much identical

User · Answer

If using a third-party library is an option  MoreLinq defines TakeLast   which does exactly this

User · Answer

Honestly I m not super proud of the answer  but for small collections you could use the following  var lastN   collection Reverse   Take n  Reverse     A bit hacky but it does the job

User · Answer

Little different implementation with usage of circular buffer  The benchmarks show that the method is circa two times faster than ones using Queue  implementation of TakeLast in System Linq   however not without a cost - it needs a buffer which grows along with the requested number of elements  even if you have a small collection you can get huge memory allocation    public IEnumerable lt T gt  TakeLast lt T gt  IEnumerable lt T gt  source  int count        int i   0       if  count  lt  1          yield break       if  source is IList lt T gt  listSource                if  listSource Count  lt  1              yield break           for  i   listSource Count  lt  count   0   listSource Count - count  i  lt  listSource Count  i                yield return listSource i              else               bool move   true          bool filled   false          T   result   new T count            using  var enumerator   source GetEnumerator                while  move                                for  i   0   move   enumerator MoveNext     amp  amp  i  lt  count  i                        result i    enumerator Current                   filled    move                         if  filled              for  int j   i  j  lt  count  j                    yield return result j            for  int j   0  j  lt  i  j                yield return result j

User · Answer

I am surprised that no one has mentioned it  but SkipWhile does have a method that uses the element s index    public static IEnumerable lt T gt  TakeLastN lt T gt  this IEnumerable lt T gt  source  int n        if  source    null          throw new ArgumentNullException  Source cannot be null         int goldenIndex   source Count   - n      return source SkipWhile  val  index    gt  index  lt  goldenIndex        Or if you like them one-liners  in the spirit of the current accepted answer     However  this is most likely impractical due to the repeated calculations collection SkipWhile  val  index    gt  index  lt  collection Count   - N    The only perceivable benefit that this solution presents over others is that you can have the option to add in a predicate to make a more powerful and efficient LINQ query  instead of having two separate operations that traverse the IEnumerable twice   public static IEnumerable lt T gt  FilterLastN lt T gt  this IEnumerable lt T gt  source  int n  Predicate lt T gt  pred        int goldenIndex   source Count   - n      return source SkipWhile  val  index    gt  index  lt  goldenIndex  amp  amp  pred val

User · Answer

I tried to combine efficiency and simplicity and end up with this     public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  source  int count        if  source    null    throw new ArgumentNullException  source           Queue lt T gt  lastElements   new Queue lt T gt         foreach  T element in source                lastElements Enqueue element           if  lastElements Count  gt  count                        lastElements Dequeue                         return lastElements      About  performance   In C   Queue lt T gt  is implemented using a circular buffer so there is no object instantiation done each loop  only when the queue is growing up   I did not set queue capacity  using dedicated constructor  because someone might call this extension with count   int MaxValue   For extra performance you might check if source implement IList lt T gt  and if yes  directly extract the last values using array indexes

User · Answer

NET Core 2 0  provides the LINQ method TakeLast     https   docs microsoft com en-us dotnet api system linq enumerable takelast  example   Enumerable      Range 1  10       TakeLast 3      lt --- takes last 3 items      ToList        ForEach i   gt  System Console WriteLine i       outputs     8    9    10

User · Answer

Here s a method that works on any enumerable but uses only O N  temporary storage   public static class TakeLastExtension       public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  source  int takeCount                if  source    null    throw new ArgumentNullException  source              if  takeCount  lt  0    throw new ArgumentOutOfRangeException  takeCount    must not be negative              if  takeCount    0    yield break             T   result   new T takeCount           int i   0           int sourceCount   0          foreach  T element in source                        result i    element              i    i   1    takeCount              sourceCount                       if  sourceCount  lt  takeCount                        takeCount   sourceCount              i   0                     for  int j   0  j  lt  takeCount    j                        yield return result  i   j    takeCount                       Usage   List lt int gt  l   new List lt int gt   4  6  3  6  2  5  7   List lt int gt  lastElements   l TakeLast 3  ToList      It works by using a ring buffer of size N to store the elements as it sees them  overwriting old elements with new ones  When the end of the enumerable is reached the ring buffer contains the last N elements

User · Answer

Note  I missed your question title which said Using Linq  so my answer does not in fact use Linq   If you want to avoid caching a non-lazy copy of the entire collection  you could write a simple method that does it using a linked list   The following method will add each value it finds in the original collection into a linked list  and trim the linked list down to the number of items required  Since it keeps the linked list trimmed to this number of items the entire time through iterating through the collection  it will only keep a copy of at most N items from the original collection   It does not require you to know the number of items in the original collection  nor iterate over it more than once   Usage   IEnumerable lt int gt  sequence   Enumerable Range 1  10000   IEnumerable lt int gt  last10   sequence TakeLast 10         Extension method   public static class Extensions       public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  collection          int n                if  collection    null              throw new ArgumentNullException nameof collection            if  n  lt  0              throw new ArgumentOutOfRangeException nameof n      nameof n   must be 0 or greater             LinkedList lt T gt  temp   new LinkedList lt T gt              foreach  var value in collection                        temp AddLast value               if  temp Count  gt  n                  temp RemoveFirst                       return temp

User · Answer

I know it s to late to answer this question  But if you are working with collection of type IList lt   and you don t care about an order of the returned collection  then this method is working faster  I ve used Mark Byers answer and made a little changes  So now method TakeLast is   public static IEnumerable lt T gt  TakeLast lt T gt  IList lt T gt  source  int takeCount        if  source    null    throw new ArgumentNullException  source          if  takeCount  lt  0    throw new ArgumentOutOfRangeException  takeCount    must not be negative          if  takeCount    0    yield break         if  source Count  gt  takeCount                for  int z   source Count - 1  takeCount  gt  0  z--                        takeCount--              yield return source z                       else               for int i   0  i  lt  source Count  i                          yield return source i                       For test I have used Mark Byers method and kbrimington s andswer  This is test   IList lt int gt  test   new List lt int gt     for int i   0  i lt 1000000  i          test Add i      Stopwatch stopwatch   new Stopwatch    stopwatch Start     IList lt int gt  result   TakeLast test  10  ToList     stopwatch Stop     Stopwatch stopwatch1   new Stopwatch    stopwatch1 Start     IList lt int gt  result1   TakeLast2 test  10  ToList     stopwatch1 Stop     Stopwatch stopwatch2   new Stopwatch    stopwatch2 Start     IList lt int gt  result2   test Skip Math Max 0  test Count - 10   Take 10  ToList     stopwatch2 Stop      And here are results for taking 10 elements     and for taking 1000001 elements results are

User · Answer

Using This Method To Get All Range Without Error   public List lt T gt  GetTsRate  List lt T gt  AllT int Index int Count                        List lt T gt  Ts   null              try                               Ts   AllT ToList   GetRange Index  Count                             catch  Exception ex                                Ts   AllT Skip Index  ToList                              return Ts

User · Answer

If you are dealing with a collection with a key  e g  entries from a database  a quick  i e  faster than the selected answer  solution would be  collection OrderByDescending c   gt  c Key  Take 3  OrderBy c   gt  c Key

User · Answer

It is a little inefficient to take the last N of a collection using LINQ as all the above solutions require iterating across the collection  TakeLast int n  in System Interactive also has this problem   If you have a list a more efficient thing to do is slice it using the following method      Select from start to end exclusive of end using the same semantics     as python slice       lt param name  list  gt  the list to slice lt  param gt       lt param name  start  gt The starting index lt  param gt       lt param name  end  gt The ending index  The result does not include this index lt  param gt  public static List lt T gt  Slice lt T gt   this IReadOnlyList lt T gt  list  int start  int  end   null        if  end    null                end   list Count               if  start  lt  0                start   list Count   start             if  start  gt   0  amp  amp  end Value  gt  0  amp  amp  end Value  gt  start                return list GetRange start  end Value - start              if  end  lt  0                return list GetRange start   list Count     end Value  - start              if  end    start                return new List lt T gt                throw new IndexOutOfRangeException           count       list Count                start       start             end       end       with  public static List lt T gt  GetRange lt T gt   this IReadOnlyList lt T gt  list  int index  int count         List lt T gt  r   new List lt T gt  count       for   int i   0  i  lt  count  i                   int j i   index          if   j  gt   list Count                         break                    r Add list j              return r      and some test cases   Fact  public void GetRange         IReadOnlyList lt int gt  l   new List lt int gt      0  10  20  30  40  50  60         l          GetRange 2  3           ShouldAllBeEquivalentTo new     20  30  40          l          GetRange 5  10           ShouldAllBeEquivalentTo new     50  60          Fact  void SliceMethodShouldWork         var list   new List lt int gt      1  3  5  7  9  11        list Slice 1  4  ShouldBeEquivalentTo new     3  5  7         list Slice 1  -2  ShouldBeEquivalentTo new     3  5  7         list Slice 1  null  ShouldBeEquivalentTo new     3  5  7  9  11         list Slice -2           Should            BeEquivalentTo new    9  11         list Slice -2 -1            Should            BeEquivalentTo new    9

User · Answer

If you don t mind dipping into Rx as part of the monad  you can use TakeLast   IEnumerable lt int gt  source   Enumerable Range 1  10000    IEnumerable lt int gt  lastThree   source AsObservable   TakeLast 3  AsEnumerable

User · Answer

Below the real example how to take last 3 elements from a collection  array       split address by spaces into array string   adrParts   adr Split new string           StringSplitOptions RemoveEmptyEntries      take only 3 last items in array adrParts   adrParts SkipWhile  value  index    gt    return adrParts Length - index  gt  3     ToArray

User · Answer

collection Skip Math Max 0  collection Count   - N      This approach preserves item order without a dependency on any sorting  and has broad compatibility across several LINQ providers   It is important to take care not to call Skip with a negative number  Some providers  such as the Entity Framework  will produce an ArgumentException when presented with a negative argument  The call to Math Max avoids this neatly   The class below has all of the essentials for extension methods  which are  a static class  a static method  and use of the this keyword   public static class MiscExtensions          Ex  collection TakeLast 5       public static IEnumerable lt T gt  TakeLast lt T gt  this IEnumerable lt T gt  source  int N                return source Skip Math Max 0  source Count   - N              A brief note on performance   Because the call to Count   can cause enumeration of certain data structures  this approach has the risk of causing two passes over the data  This isn t really a problem with most enumerables  in fact  optimizations exist already for Lists  Arrays  and even EF queries to evaluate the Count   operation in O 1  time   If  however  you must use a forward-only enumerable and would like to avoid making two passes  consider a one-pass algorithm like Lasse V  Karlsen or Mark Byers describe  Both of these approaches use a temporary buffer to hold items while enumerating  which are yielded once the end of the collection is found

User · Answer

detailed code for the problem   suppose we have a enumerable collection  collection  var lastIndexOfCollection collection Count-1   var nthIndexFromLast  lastIndexOfCollection- N   var desiredCollection collection GetRange nthIndexFromLast  N   ---------------------------------------------------------------------      use this one liner var desiredCollection collection GetRange  collection Count- 1 N    N

[c#] Using Linq to get the last N elements of a collection?

Examples related to c#

Examples related to linq