Remove duplicates from a list of objects based on property in Java 8

Question

I am trying to remove duplicates from a List of objects based on some property.

Can we do it in a simple way using Java 8?

List<Employee> employee

Can we remove duplicates from it based on the id property of Employee? I have seen posts about removing duplicate strings from an ArrayList of String.

User · Answer

If order does not matter and it is more performant to run in parallel, collect to a Map and then get its values:

    employee.stream().collect(Collectors.toConcurrentMap(Employee::getId, Function.identity(), (p, q) -> p)).values()
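Here is a minimal runnable sketch of the map-based approach. It assumes a hypothetical Employee class with getId() and getName(), since the question does not show the class. The ordered() method is a variation that is not part of the answer: it swaps toConcurrentMap for toMap with a LinkedHashMap, which should keep the first occurrence of each id in encounter order on a sequential stream.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

// Hypothetical Employee class; the question does not show its definition.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getId() { return id; }
    String getName() { return name; }
}

class MapDedup {
    // The approach from the answer: one employee per id, result order unspecified.
    static Collection<Employee> unordered(List<Employee> employees) {
        return employees.stream()
                .collect(Collectors.toConcurrentMap(Employee::getId, Function.identity(), (p, q) -> p))
                .values();
    }

    // Variation (an assumption, not from the answer): a LinkedHashMap keeps
    // the first employee seen for each id, in encounter order.
    static List<Employee> ordered(List<Employee> employees) {
        return new ArrayList<>(employees.stream()
                .collect(Collectors.toMap(Employee::getId, Function.identity(), (p, q) -> p, LinkedHashMap::new))
                .values());
    }

    public static void main(String[] args) {
        List<Employee> employees = Arrays.asList(
                new Employee(2, "Alice"), new Employee(1, "John"), new Employee(1, "Bob"));
        for (Employee e : ordered(employees)) {
            System.out.println(e.getId() + " " + e.getName()); // prints "2 Alice" then "1 John"
        }
    }
}
```

The merge function (p, q) -> p decides which duplicate survives; swapping it for (p, q) -> q would keep the last one instead.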

User · Answer

You can get a stream from the List and put it in a TreeSet, for which you provide a custom comparator that compares ids uniquely. Then, if you really need a list, you can put this collection back into an ArrayList.

    import static java.util.Comparator.comparingInt;
    import static java.util.stream.Collectors.collectingAndThen;
    import static java.util.stream.Collectors.toCollection;

    List<Employee> unique = employee.stream()
                                    .collect(collectingAndThen(toCollection(() -> new TreeSet<>(comparingInt(Employee::getId))),
                                                               ArrayList::new));

Given the example:

    List<Employee> employee = Arrays.asList(new Employee(1, "John"), new Employee(1, "Bob"), new Employee(2, "Alice"));

It will output:

    [Employee{id=1, name='John'}, Employee{id=2, name='Alice'}]

Another idea could be to use a wrapper that wraps an employee and has its equals and hashCode methods based on the employee's id:

    class WrapperEmployee {
        private Employee e;

        public WrapperEmployee(Employee e) {
            this.e = e;
        }

        public Employee unwrap() {
            return this.e;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (o == null || getClass() != o.getClass()) return false;
            WrapperEmployee that = (WrapperEmployee) o;
            return Objects.equals(e.getId(), that.e.getId());
        }

        @Override
        public int hashCode() {
            return Objects.hash(e.getId());
        }
    }

Then you wrap each instance, call distinct(), unwrap them and collect the result in a list:

    List<Employee> unique = employee.stream()
                                    .map(WrapperEmployee::new)
                                    .distinct()
                                    .map(WrapperEmployee::unwrap)
                                    .collect(Collectors.toList());

In fact, I think you can make this wrapper generic by providing a function that will do the comparison:

    public class Wrapper<T, U> {
        private T t;
        private Function<T, U> equalityFunction;

        public Wrapper(T t, Function<T, U> equalityFunction) {
            this.t = t;
            this.equalityFunction = equalityFunction;
        }

        public T unwrap() {
            return this.t;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (o == null || getClass() != o.getClass()) return false;
            @SuppressWarnings("unchecked")
            Wrapper<T, U> that = (Wrapper<T, U>) o;
            return Objects.equals(equalityFunction.apply(this.t), that.equalityFunction.apply(that.t));
        }

        @Override
        public int hashCode() {
            return Objects.hash(equalityFunction.apply(this.t));
        }
    }

and the mapping will be:

    .map(e -> new Wrapper<>(e, Employee::getId))
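For anyone who wants to try the TreeSet variant end to end, here is a small self-contained sketch. The Employee class is hypothetical (the question does not show it) and is assumed to have an int id.

```java
import static java.util.Comparator.comparingInt;
import static java.util.stream.Collectors.collectingAndThen;
import static java.util.stream.Collectors.toCollection;

import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

// Hypothetical Employee class; the question does not show its definition.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getId() { return id; }
    String getName() { return name; }
}

class TreeSetDedup {
    // Collects into a TreeSet keyed by id, then copies the set into an ArrayList.
    static List<Employee> uniqueById(List<Employee> employees) {
        return employees.stream()
                .collect(collectingAndThen(
                        toCollection(() -> new TreeSet<>(comparingInt(Employee::getId))),
                        ArrayList::new));
    }

    public static void main(String[] args) {
        List<Employee> employees = Arrays.asList(
                new Employee(2, "Alice"), new Employee(1, "John"), new Employee(1, "Bob"));
        for (Employee e : uniqueById(employees)) {
            System.out.println(e.getId() + " " + e.getName()); // prints "1 John" then "2 Alice"
        }
    }
}
```

Because a TreeSet orders its elements with the comparator, the result comes back sorted by id rather than in the original list order.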

User · Answer

The easiest way to do it directly in the list is:

    HashSet<Object> seen = new HashSet<>();
    employee.removeIf(e -> !seen.add(e.getID()));

removeIf removes an element if it meets the specified criterion.
Set.add returns false if it did not modify the Set, i.e. if the Set already contains the value.
Combining these two, it removes all elements (employees) whose id has been encountered before.

Of course, it only works if the list supports removal of elements.
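To illustrate the caveat about removal support, here is a minimal sketch with a hypothetical Employee class (the getter is spelled getID() only to match the snippet above). It copies a fixed-size list from Arrays.asList into an ArrayList before removing in place.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical Employee class; the question does not show its definition.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getID() { return id; }
    String getName() { return name; }
}

class InPlaceDedup {
    // Removes every employee whose id was already seen earlier in the list.
    // The list must support element removal.
    static void dedupInPlace(List<Employee> employees) {
        Set<Integer> seen = new HashSet<>();
        employees.removeIf(e -> !seen.add(e.getID()));
    }

    public static void main(String[] args) {
        List<Employee> fixedSize = Arrays.asList(
                new Employee(1, "John"), new Employee(1, "Bob"), new Employee(2, "Alice"));

        // Arrays.asList returns a fixed-size list, so copy it before removing.
        List<Employee> mutable = new ArrayList<>(fixedSize);
        dedupInPlace(mutable);
        System.out.println(mutable.size()); // prints 2

        try {
            dedupInPlace(fixedSize); // has a duplicate to remove, so removal is attempted
        } catch (UnsupportedOperationException ex) {
            System.out.println("fixed-size list does not support removal");
        }
    }
}
```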

User · Answer

Another solution is to use a Predicate, which you can then use in any filter:

    public static <T> Predicate<T> distinctBy(Function<? super T, ?> f) {
        Set<Object> objects = ConcurrentHashMap.newKeySet();
        return t -> objects.add(f.apply(t));
    }

Then simply reuse the predicate anywhere:

    employees.stream().filter(distinctBy(e -> e.getId()));

Note: the JavaDoc of filter says it takes a stateless Predicate. In practice, this works fine even if the stream is parallel.

About other solutions:

1) Using .collect(Collectors.toConcurrentMap(..)).values() is a good solution, but it's annoying if you want to sort and keep the order.

2) employee.removeIf(e -> !seen.add(e.getID())); is also another very good solution. But we need to make sure the collection implements removeIf; for example, it will throw an exception if we construct the collection using Arrays.asList(..).
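Here is a runnable sketch of the predicate approach, under a couple of assumptions: the Employee class is hypothetical, and the thread-safe set is created with ConcurrentHashMap.newKeySet(), since the JDK itself has no ConcurrentHashSet class.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;
import java.util.function.Predicate;
import java.util.stream.Collectors;

// Hypothetical Employee class; the question does not show its definition.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getId() { return id; }
    String getName() { return name; }
}

class DistinctBy {
    // Returns a stateful predicate that accepts an element only the first
    // time its extracted key is seen.
    static <T> Predicate<T> distinctBy(Function<? super T, ?> keyExtractor) {
        Set<Object> seen = ConcurrentHashMap.newKeySet(); // thread-safe set from the JDK
        return t -> seen.add(keyExtractor.apply(t));
    }

    public static void main(String[] args) {
        List<Employee> employees = Arrays.asList(
                new Employee(1, "John"), new Employee(1, "Bob"), new Employee(2, "Alice"));

        List<String> names = employees.stream()
                .filter(distinctBy(Employee::getId))
                .map(Employee::getName)
                .collect(Collectors.toList());

        System.out.println(names); // prints [John, Alice]
    }
}
```

Since the predicate carries state, create a fresh one for each stream rather than storing it in a field. On a parallel stream it still yields one element per key, but which of the duplicates survives is not deterministic, and a null key would be rejected by the concurrent set.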

User · Answer

If you can make use of equals, then filter the list by using distinct within a stream (see answers above). If you cannot or don't want to override the equals method, you can filter the stream in the following way for any property, e.g. for the property Name (the same for the property Id etc.):

    Set<String> nameSet = new HashSet<>();
    List<Employee> employeesDistinctByName = employees.stream()
            .filter(e -> nameSet.add(e.getName()))
            .collect(Collectors.toList());

User · Answer

Another version which is simple:

    BiFunction<TreeSet<Employee>, List<Employee>, TreeSet<Employee>> appendTree = (y, x) -> (y.addAll(x)) ? y : y;

    TreeSet<Employee> outputList = appendTree.apply(new TreeSet<Employee>(Comparator.comparing(p -> p.getId())), personList);

User · Answer

Try this code:

    Collection<Employee> nonDuplicatedEmployees = employees.stream()
            .<Map<Integer, Employee>>collect(HashMap::new, (m, e) -> m.put(e.getId(), e), Map::putAll)
            .values();

User · Answer

This worked for me:

    list.stream().distinct().collect(Collectors.toList());

You need to implement equals (and hashCode consistently with it), of course.
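As a sketch of what that looks like, here is a hypothetical Employee whose equality is defined by id alone (an assumption made for illustration; the question does not show the class). hashCode is overridden together with equals so that the two stay consistent.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical Employee class: two employees are equal when their ids match.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getId() { return id; }
    String getName() { return name; }

    @Override
    public boolean equals(Object o) {
        if (this == o) return true;
        if (!(o instanceof Employee)) return false;
        return id == ((Employee) o).id;
    }

    @Override
    public int hashCode() {
        return Integer.hashCode(id); // must agree with equals
    }
}

class DistinctDemo {
    static List<Employee> unique(List<Employee> list) {
        return list.stream().distinct().collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Employee> list = Arrays.asList(
                new Employee(1, "John"), new Employee(1, "Bob"), new Employee(2, "Alice"));
        for (Employee e : unique(list)) {
            System.out.println(e.getId() + " " + e.getName()); // prints "1 John" then "2 Alice"
        }
    }
}
```

The trade-off is that this changes what equality means for Employee everywhere, not only for this one deduplication.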

User · Answer

There are a lot of good answers here, but I didn't find one that uses the reduce method. For your case, you can apply it in the following way:

    List<Employee> employeeList = employees.stream()
            .reduce(new ArrayList<>(), (List<Employee> accumulator, Employee employee) -> {
                if (accumulator.stream().noneMatch(emp -> emp.getId().equals(employee.getId()))) {
                    accumulator.add(employee);
                }
                return accumulator;
            }, (acc1, acc2) -> {
                acc1.addAll(acc2);
                return acc1;
            });
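One caveat with the reduce version: it mutates the ArrayList passed as the identity, which is only safe on a sequential stream, and the noneMatch scan makes it quadratic in the list size. Below is a hedged alternative sketch that swaps in a different technique, the three-argument collect (mutable reduction) with a LinkedHashMap. It is not the reduce call from the answer, and the Employee class is hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical Employee class; the question does not show its definition.
class Employee {
    private final int id;
    private final String name;
    Employee(int id, String name) { this.id = id; this.name = name; }
    int getId() { return id; }
    String getName() { return name; }
}

class CollectDedup {
    // Keeps the first employee seen for each id, in encounter order.
    static List<Employee> dedup(List<Employee> employees) {
        Map<Integer, Employee> byId = employees.stream()
                .collect(LinkedHashMap::new,                                  // fresh container per reduction
                         (map, e) -> map.putIfAbsent(e.getId(), e),           // add only unseen ids
                         (left, right) -> right.forEach(left::putIfAbsent));  // merge partial results
        return new ArrayList<>(byId.values());
    }

    public static void main(String[] args) {
        List<Employee> employees = Arrays.asList(
                new Employee(2, "Alice"), new Employee(1, "John"), new Employee(1, "Bob"));
        for (Employee e : dedup(employees)) {
            System.out.println(e.getId() + " " + e.getName()); // prints "2 Alice" then "1 John"
        }
    }
}
```

Because collect asks the supplier for a fresh container each time it needs one, no shared list is mutated, and the map lookup replaces the repeated noneMatch scan.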
