“SourceMedium has future-proofed our data infrastructure. Their comprehensive solution not only supports our current needs but is also scalable as we grow. With their managed BigQuery instance, we have full control over our data, creating custom metrics and insights rapidly. This has empowered our team to answer complex business questions and focus on strategic growth initiatives.”
The industry-standard warehouse. Yours to keep.
Your ecommerce data lives in Google BigQuery, the industry-standard data warehouse. We manage the foundation, you own every table. Query it with any tool, extend it with your own models, and take everything with you if you leave.
Integrated AI-to-BI stack on Google Cloud
The tables you actually get in BigQuery
20+ core tables across commerce, marketing, and executive reporting.
Commerce core
Track revenue, customer behavior, and order economics from one foundation.
- Orders : Revenue trends, valid order counts, and daily performance.
- Order lines : SKU-level units, product mix, and margin analysis.
- Customers : New vs repeat behavior, cohorts, and customer value.
- Products / variants : Stable product attributes for joins and segmentation.
- Refunds : Return rates and refund impact on net revenue.
- Discounts, shipping, taxes : Checkout components that shape contribution margin.
Marketing + funnel
Measure acquisition efficiency and where conversion breaks down.
- Funnel event history : Customer pathing and touchpoint history across sources.
- Ad performance (daily) : Spend, clicks, and ROAS by platform and campaign.
- Funnel events (hourly) : Near-real-time conversion monitoring and drop-off diagnostics.
- Outbound message performance (daily) : Email/SMS campaign and flow performance at the channel level.
Exec rollups
Give leadership a fast read on daily KPIs and long-term value.
- Executive summary (daily) : Board-ready daily KPI snapshot across growth and finance metrics.
- Cohort LTV : Lifetime value by first purchase source, channel, and campaign.
Start with the business-ready tables for dashboards and reporting. Go deeper with the building-block tables if your data team wants to customize.
Explore the full schema + table docsWhy BigQuery
Not a default. A deliberate choice.
Most ecommerce data tools lock your data inside their platform. We chose BigQuery so you keep full ownership, native Google integrations, and an ecosystem of tools that already speak its language.
- 1 Scales automatically, no engineering required. BigQuery handles capacity automatically. No warehouse resizing, no credit management, no infrastructure babysitting.
- 2 Native Google integrations built in. Your highest-volume data (GA4, Google Ads) flows into BigQuery through Google's own pipelines. Often no third-party connectors needed.
- 3 Store everything, only pay when you query. Keep years of historical data without worrying about storage costs. You only pay for compute when you actually run queries.
- 4 Free dashboards for your whole team. No per-seat BI licenses. Looker Studio connects natively to BigQuery, so everyone gets fast dashboards at no extra cost. Prefer Tableau, Hex, or Mode? They connect natively too.
- 5 AI that runs on your actual data. Our AI Analyst answers questions in Slack with the SQL behind every answer, running directly on your BigQuery tables. No data exports, no separate system, no black box.
What your team does with it.
When the warehouse works, every team moves faster. Here's what that looks like by role.
Data teams
- Run custom SQL on your own data. No export workarounds, no API limits
- Build dbt models on a documented, stable schema
- Connect Tableau, Python, Hex, Mode, or any BI tool natively
- Run custom transformations with included compute
- Use the schema as a foundation, extend it without breaking it
Growth teams
- Pre-built dashboards read directly from BigQuery, one shared foundation
- AI Analyst answers questions in Slack with SQL you can verify in BigQuery
- Attribution, LTV, cohort analysis: all queryable, all auditable
- No new interface to learn if you just use dashboards + Slack
Executives
- Numbers reconcile back to Shopify and your ad platforms. No more "which revenue is right?"
- Board-ready reporting from the same warehouse your data team queries
- If you ever hire a data team or bring in an analyst, they inherit a documented, governed foundation, not a mess
Agencies + consultants
- Plug into your existing workflow: Hex, Mode, dbt, Tableau, any BigQuery-compatible tool
- No data export rituals, no proprietary UI dependency
- Serve multiple clients from a consistent, documented schema
- Build custom analyses without learning a proprietary interface
“SourceMedium's managed BigQuery instance has been a revelation for our team. It has democratized access to data across our entire organization, empowering everyone to create custom metrics and transformations without relying on a dedicated BI team.”
Nick Osborn
Head of Growth, Catalina Crunch
Trust layer
A schema you can build on.
The reason data teams prefer working with us.
- Documented and consistently named. Every table and column follows a published naming convention so new team members can read it without a decoder ring.
- Designed to stay stable. When we evolve the schema, we manage the transition so your dashboards and reports don't break.
- 2,500+ automated quality checks daily. Run multiple times per day to catch problems before they reach your reports.
2,500+
daily quality checks
180+
metric catalog
Consistent
Best-practice naming
Native
dbt-compatible
Example: orders table
Explore documentation| Column | Type |
|---|---|
| sm_order_key | STRING |
| order_processed_at | TIMESTAMP |
| order_net_revenue | FLOAT |
| sm_channel | STRING |
| sm_customer_key | STRING |
| is_order_sm_valid | BOOLEAN |
Want the details on cleaning, transformation, and enrichment? See Data Transformation .
Extensibility
Your BigQuery warehouse isn't limited to ecommerce.
Your data lives in BigQuery, not a proprietary vendor system, so you can centralize anything. Finance data, ops data, custom sources, third-party APIs. Standard BigQuery storage, no additional vendor fees for bringing in more data.
SourceMedium manages the ecommerce foundation. Your team extends the warehouse however you need.
Costco + grocery store data
Catalina Crunch integrated retail POS data alongside their ecommerce data and built a real-time P&L dashboard in 3 weeks.
Custom fulfillment data
CPAP set up webhooks to pull fulfillment-date revenue recognition into BigQuery for finance-grade reporting.
Third-party data via Google Sheets
Brands that need non-automated data (SPINS, wholesale, marketplace) upload via Google Sheets on a cadence, cast to the schema, and it becomes part of the same governed foundation.
If you leave, do you keep everything?
- Full data export & transfer
- Custom SQL queries & models
- Looker Studio dashboards
- Schema & metric documentation
Yes. You keep your data.
Your data, your control
Built to stay, free to leave.
We don't hold your data hostage. If you leave, you keep the warehouse tables, dashboards, and SQL. No rebuild. Move it to a BigQuery project you control. Our internal dbt models and SQL transformation logic remain SourceMedium IP. Your data, schema, and any custom models you built are yours to keep.
Ready to stop debating the numbers?
Get started
Tell us a bit about your brand and stack—we’ll follow up shortly.
You're all set