I have been given the task of highlighting all duplicate emails in a pandas DataFrame. Is there a function for this, or a way to drop all the NON-duplicates so that I am left with a nice list of all the duplicates in the dataset?
The table consists of six columns:
Email, FirstName, LastName, C_ID, A_ID, CreatedDate
[email protected], Bill, Schneider, 123, 321, 20190502
[email protected], Damian, Schneider, 124, 231, 20190502
[email protected], Bill, Schneider, 164, 313, 20190503
I want to get rid of the last row, as the last email is NOT a duplicate.
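
For reference, here is a minimal sketch of the kind of result I am after, using `DataFrame.duplicated` with `keep=False`. The email addresses are placeholders I made up (the real ones are not shown here), and I am assuming the first two rows share an address while the third does not.

```python
import pandas as pd

# Placeholder addresses (assumed): the first two rows share an email,
# the third has a different one.
df = pd.DataFrame(
    {
        "Email": ["bill@example.com", "bill@example.com", "other@example.com"],
        "FirstName": ["Bill", "Damian", "Bill"],
        "LastName": ["Schneider", "Schneider", "Schneider"],
        "C_ID": [123, 124, 164],
        "A_ID": [321, 231, 313],
        "CreatedDate": [20190502, 20190502, 20190503],
    }
)

# keep=False flags every row of a duplicate group, not only the repeats.
is_dup = df.duplicated(subset="Email", keep=False)

# Keep only the rows whose Email occurs more than once.
duplicates = df[is_dup]
print(duplicates)
```

If the goal is to highlight rather than drop, the boolean mask `is_dup` could presumably be stored as an extra column instead of being used as a filter.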