I'm syncing data from Snowflake to Mixpanel and for some reason the events I have are not received fully, In snowflake I see 124 events while mixpanel shows 111. Some does not have a user in the mixpanel system but for some reason some events does not added to the user already exsists
Hi Tal, Please stand by as I search the Mixpanel knowledge base for an AI-powered answer. I normally take about 15 seconds or so. In case AI cannot answer your question, feel free to reach out to Mixpanel support at mixpanel.com/get-support. If you have follow-up questions for this thread, please tag me again with your question: @kapa.ai <follow-up question> đź‘‹
Feel free to ask a follow-up question once I've answered by mentioning me with @kapa.ai
I understand you're experiencing a discrepancy between the number of events in your Snowflake database (124) and what's showing up in Mixpanel (111). This is a common issue when syncing data between warehouses and Mixpanel.There are several potential reasons for this discrepancy: 1. Data Sync Timing: There can be up to a 24-hour delay for data to fully sync between systems. Late-arriving data might take additional time to appear in Mixpanel.2. Missing User Identifiers: For events to be properly associated with users in Mixpanel, they need proper user identification. Events without proper user IDs might not be correctly attributed.3. Sync Mode Configuration: The sync mode you're using affects how data is transferred. If you're using "Append" mode rather than "Mirror" mode, modifications to existing rows won't be reflected.4. Hidden Events: Some events might be hidden in the Mixpanel UI via Lexicon but still exist in your data.To troubleshoot this issue: - Check if Events Data Sync is enabled for your pipeline
- Verify your Sync Mode is set to "Mirror" if you want complete synchronization
- Examine if the missing events have proper user identification
- Check for any hidden events in LexiconYou can also run a query in Snowflake to count events by day to compare with what's in Mixpanel: LBKSSELECT TO_DATE(CONVERT_TIMEZONE('UTC','<YOUR_PROJECT_TIMEZONE> ', time)) as ttime, count(*) FROM <YOUR_DB_NAME> .<YOUR_SCHEMA_NAME> .MP_MASTER_EVENT WHERE ttime> =TO_DATE('2021-12-03') AND ttime<=TO_DATE('2024-09-01') GROUP BY ttime ORDER BY ttime;LBKS
Replace the placeholders with your actual project timezone, database name, and schema name.For more detailed troubleshooting on data discrepancies, you can refer to the <https://docs.mixpanel.com/docs/tracking-best-practices/debugging#data-discrepancies|Debugging Data Discrepancies> documentation.
kapa.ai is still learning and improving, please let me know how I did by reacting below.
ℹ️ Tip: Mention @kapa.ai in the thread for follow-ups or DM the app for 1-to-1 help.