I will set up a self hosted event analytics pipeline with raw data load into bigquery
About this Gig
If you want ownership of your analytics data without depending entirely on third-party tracking platforms, I can set up a self-hosted event pipeline for you.
This project is designed for teams that want a clean, practical analytics foundation:
- custom event tracking
- raw event ownership
- portable storage
- BigQuery-ready querying
- support for web, backend, app, or games
What I will deliver
- deployment of an event ingestion API
- API key-based event collection
- event buffering through NATS JetStream
- raw event storage as partitioned Parquet files in Google Cloud Storage
- BigQuery external table setup for querying raw data
- configuration for your initial event producer setup
- Open source version of Metabase (optional)
Typical use cases
- product analytics for web/mobile apps, browser extensions
- custom backend event tracking system
- raw event foundation before dashboards or BI
- game analytics
Important scope note
This project sets up the event ingestion and raw analytics backbone.
It does not include:
- a finished analytics dashboard
- advanced BI implementation
- a full product analytics UI
Project requirements
- subdomain
- GCP project, VM that I'll help to setup
Destination Platform:
Google BigQuery
•
Google Cloud Storage
Tools & Platforms:
Other
FAQ
What DataQuery is (current project offer) ?
DataQueryEvent is a self-hosted in-app event tracking, ingestion, and raw-storage pipeline for product, application, and game analytics. It is an open source project developed and maintained by me.
Is DataQuery a SaaS analytics platform?
No. DataQuery is a self-hosted analytics pipeline deployed into your infrastructure.
Who hosts the system?
You do. The default supported setup uses your own VM and Google Cloud services where needed.
What happens as event volume grows?
The default setup starts simple, usually on one VM, and can be scaled later as your traffic increases. Cloud VM is easy to scale, usually computing performance can be increased just by updating settings in the interface.
Where is raw data stored?
Raw events are written to the blob storage in Parquet format (Google Cloud Storage by default).
What kind of products can use this?
Web apps, websites, mobile apps, Chrome extensions, games, and backend services.
Do I own the data?
Yes. The system is designed around raw data ownership and portability.
Can you help define event schema?
Yes. I can help you structure event names, parameters, and producer boundaries.
Does this include dashboards?
No, this project focuses on ingestion and raw analytics infrastructure, but I can deploy an open source version of Metabase for your BI needs. Dashboards development can be added separately.
