Solved

Remove duplicates, merge events / activity feed

  • 22 January 2020
  • 2 replies
  • 365 views

We have duplicate profiles due to switching of identifier. I would like to merge the duplicate profiles and also merge the activity feed.

I got the API working and by calling "deduplicate_people(prop_to_match='$email',merge_props=True,case_sensitive=False,backup=True,backup_file=None)" duplicates are removed, but the activity feed is not merged. So we loose many events.

Is there a way to remove duplicates and merging events / activity feed at the same time?

icon

Best answer by Cassandra 25 March 2020, 07:30

Hi @hperrey

We just released an update called “ID Merge” last week that would allow you to merge the activity feeds without duplicating event data.

ID merge improves the behavior of identify() so you can associate anonymous events pre-login with authenticated users. It also adds a new method called Merge().

  • Merge allows you to use multiple identities from different sources to refer to one user, which would fix the problem you outlined.
  • One nuance worth mentioning: if multiple user profile exist when distinct ids that are merged, Mixpanel will hide all but one of the user profiles. User property values will not be updated or added when profiles are merged.
View original

2 replies

Userlevel 5
Badge +4

@hperrey  unfortunately,  as events are immutable you are unable to edit them. 

The only method to merge the events will be to do an ETL (extract, transform, load) with the API, in which you will export the data, manually correct the distinct_id and then re-import the corrected events with the correct mapping. This can be sent to a new project or the same project.

If you would like to use the same project, the benefit of this would be that you will be able to see all events of users in one project. However, the downfall of this would be that this can inflate your numbers when doing analysis in Insights since there will be double counts. I would suggest speaking with your CSM or RM to see is which method is best for your use case. 

Userlevel 1

Hi @hperrey

We just released an update called “ID Merge” last week that would allow you to merge the activity feeds without duplicating event data.

ID merge improves the behavior of identify() so you can associate anonymous events pre-login with authenticated users. It also adds a new method called Merge().

  • Merge allows you to use multiple identities from different sources to refer to one user, which would fix the problem you outlined.
  • One nuance worth mentioning: if multiple user profile exist when distinct ids that are merged, Mixpanel will hide all but one of the user profiles. User property values will not be updated or added when profiles are merged.

Reply