Built on BigQuery

The industry-standard warehouse. Yours to keep.

Your ecommerce data lives in Google BigQuery, the industry-standard data warehouse. We manage the foundation, you own every table. Query it with any tool, extend it with your own models, and take everything with you if you leave.

Integrated AI-to-BI stack on Google Cloud

SourceMedium
integrates
BI Layer
BigQuery
+
AI Layer
Vertex AI
built on
Google Cloud Platform

The tables you actually get in BigQuery

20+ core tables across commerce, marketing, and executive reporting.

Commerce core

Track revenue, customer behavior, and order economics from one foundation.

  • Orders : Revenue trends, valid order counts, and daily performance.
  • Order lines : SKU-level units, product mix, and margin analysis.
  • Customers : New vs repeat behavior, cohorts, and customer value.
  • Products / variants : Stable product attributes for joins and segmentation.
  • Refunds : Return rates and refund impact on net revenue.
  • Discounts, shipping, taxes : Checkout components that shape contribution margin.

Marketing + funnel

Measure acquisition efficiency and where conversion breaks down.

  • Funnel event history : Customer pathing and touchpoint history across sources.
  • Ad performance (daily) : Spend, clicks, and ROAS by platform and campaign.
  • Funnel events (hourly) : Near-real-time conversion monitoring and drop-off diagnostics.
  • Outbound message performance (daily) : Email/SMS campaign and flow performance at the channel level.

Exec rollups

Give leadership a fast read on daily KPIs and long-term value.

  • Executive summary (daily) : Board-ready daily KPI snapshot across growth and finance metrics.
  • Cohort LTV : Lifetime value by first purchase source, channel, and campaign.

Start with the business-ready tables for dashboards and reporting. Go deeper with the building-block tables if your data team wants to customize.

Explore the full schema + table docs

Why BigQuery

Not a default. A deliberate choice.

Most ecommerce data tools lock your data inside their platform. We chose BigQuery so you keep full ownership, native Google integrations, and an ecosystem of tools that already speak its language.

  1. 1 Scales automatically, no engineering required. BigQuery handles capacity automatically. No warehouse resizing, no credit management, no infrastructure babysitting.
  2. 2 Native Google integrations built in. Your highest-volume data (GA4, Google Ads) flows into BigQuery through Google's own pipelines. Often no third-party connectors needed.
  3. 3 Store everything, only pay when you query. Keep years of historical data without worrying about storage costs. You only pay for compute when you actually run queries.
  4. 4 Free dashboards for your whole team. No per-seat BI licenses. Looker Studio connects natively to BigQuery, so everyone gets fast dashboards at no extra cost. Prefer Tableau, Hex, or Mode? They connect natively too.
  5. 5 AI that runs on your actual data. Our AI Analyst answers questions in Slack with the SQL behind every answer, running directly on your BigQuery tables. No data exports, no separate system, no black box.

What your team does with it.

When the warehouse works, every team moves faster. Here's what that looks like by role.

Data teams

  • Run custom SQL on your own data. No export workarounds, no API limits
  • Build dbt models on a documented, stable schema
  • Connect Tableau, Python, Hex, Mode, or any BI tool natively
  • Run custom transformations with included compute
  • Use the schema as a foundation, extend it without breaking it

Growth teams

  • Pre-built dashboards read directly from BigQuery, one shared foundation
  • AI Analyst answers questions in Slack with SQL you can verify in BigQuery
  • Attribution, LTV, cohort analysis: all queryable, all auditable
  • No new interface to learn if you just use dashboards + Slack

Executives

  • Numbers reconcile back to Shopify and your ad platforms. No more "which revenue is right?"
  • Board-ready reporting from the same warehouse your data team queries
  • If you ever hire a data team or bring in an analyst, they inherit a documented, governed foundation, not a mess

Agencies + consultants

  • Plug into your existing workflow: Hex, Mode, dbt, Tableau, any BigQuery-compatible tool
  • No data export rituals, no proprietary UI dependency
  • Serve multiple clients from a consistent, documented schema
  • Build custom analyses without learning a proprietary interface
“SourceMedium's managed BigQuery instance has been a revelation for our team. It has democratized access to data across our entire organization, empowering everyone to create custom metrics and transformations without relying on a dedicated BI team.”
Nick Osborn

Nick Osborn

Head of Growth, Catalina Crunch

Trust layer

A schema you can build on.

The reason data teams prefer working with us.

  • Documented and consistently named. Every table and column follows a published naming convention so new team members can read it without a decoder ring.
  • Designed to stay stable. When we evolve the schema, we manage the transition so your dashboards and reports don't break.
  • 2,500+ automated quality checks daily. Run multiple times per day to catch problems before they reach your reports.

2,500+

daily quality checks

180+

metric catalog

Consistent

Best-practice naming

Native

dbt-compatible

Example: orders table

Explore documentation
Column Type
sm_order_key STRING
order_processed_at TIMESTAMP
order_net_revenue FLOAT
sm_channel STRING
sm_customer_key STRING
is_order_sm_valid BOOLEAN

Want the details on cleaning, transformation, and enrichment? See Data Transformation .

Extensibility

Your BigQuery warehouse isn't limited to ecommerce.

Your data lives in BigQuery, not a proprietary vendor system, so you can centralize anything. Finance data, ops data, custom sources, third-party APIs. Standard BigQuery storage, no additional vendor fees for bringing in more data.

SourceMedium manages the ecommerce foundation. Your team extends the warehouse however you need.

Costco + grocery store data

Catalina Crunch integrated retail POS data alongside their ecommerce data and built a real-time P&L dashboard in 3 weeks.

Custom fulfillment data

CPAP set up webhooks to pull fulfillment-date revenue recognition into BigQuery for finance-grade reporting.

Third-party data via Google Sheets

Brands that need non-automated data (SPINS, wholesale, marketplace) upload via Google Sheets on a cadence, cast to the schema, and it becomes part of the same governed foundation.

If you leave, do you keep everything?

  • Full data export & transfer
  • Custom SQL queries & models
  • Looker Studio dashboards
  • Schema & metric documentation

Yes. You keep your data.

Your data, your control

Built to stay, free to leave.

We don't hold your data hostage. If you leave, you keep the warehouse tables, dashboards, and SQL. No rebuild. Move it to a BigQuery project you control. Our internal dbt models and SQL transformation logic remain SourceMedium IP. Your data, schema, and any custom models you built are yours to keep.

Warehouse customers

“We have full control over our data.”

“SourceMedium has future-proofed our data infrastructure. Their comprehensive solution not only supports our current needs but is also scalable as we grow. With their managed BigQuery instance, we have full control over our data, creating custom metrics and insights rapidly. This has empowered our team to answer complex business questions and focus on strategic growth initiatives.”

“Acquiring confidence in your data is literally money. We've nearly doubled revenue in the short time we've used SourceMedium. They are the layer through which all our data flows and gets transformed into a unified, coherent and reliable language that my entire team leverages regardless of analytical capability.”

“SourceMedium is our command center; it helps us identify problems before they get bigger. It's easy enough for everyone on the team to use, and it empowers all people.”

“We've spent months looking for an analytics solution that would present accurate data we 100% trust in a digestible format that's easily understandable by any team member. We found that in SourceMedium.”

“SourceMedium is a game-changer for our business. We have a single, reliable source of truth with their data infrastructure that allows us to make data driven decisions against the short and long term impact of our marketing and customer experience efforts.”

“The real win wasn't the cost savings (though those were nice). It's that my team stopped arguing about which dashboard was right. One source of truth for reporting AND attribution means we actually build things now instead of reconciling data.”

Ready to stop debating the numbers?

Get started

Tell us a bit about your brand and stack—we’ll follow up shortly.