I setup a AWS S3 data pipeline, and when I go to AWS Console and download the exported file (gunzipped JSON file), it's a binary file and not a text JSON. Is this normal? How do I fix this?
Hi Hyun C., Please stand by as I search the Mixpanel knowledge base for an AI-powered answer. I normally take about 15 seconds or so. In case AI cannot answer your question, feel free to reach out to Mixpanel support at mixpanel.com/get-support. If you have follow-up questions for this thread, please tag me again with your question: @kapa.ai <follow-up question> 👋
Feel free to ask a follow-up question once I've answered by mentioning me with @kapa.ai
Yes, this is normal behavior. When you export data from Mixpanel to AWS S3 using Data Pipelines, the files are typically compressed using gzip for efficient storage and transfer. This means the files you download will be in a binary gzip format (with a .gz extension), not plain text JSON. If you try to open them directly, they will appear as binary.How to fix/view the data: To read the exported data, you need to decompress (gunzip) the file first. After decompression, you will get a text file containing JSON lines (each line is a JSON object representing an event or record).Example command to decompress:Why this happens: Mixpanel exports data in gzip-compressed format by default to optimize for storage and transfer speed. This is documented in the setup and export guides for AWS S3 pipelines, which mention that files are written in this format and should be decompressed before use Mixpanel AWS S3 Integration Guide.If you need to automate processing, you can use standard tools or scripts (e.g., Python’s gzip module) to read and process these files programmatically.Summary: The binary file is expected—just decompress it to access the JSON data inside. No changes to your pipeline setup are needed.
kapa.ai is still learning and improving, please let me know how I did by reacting below.
ℹ️ Tip: Mention @kapa.ai in the thread for follow-ups or DM the app for 1-to-1 help.
