Sales data across all channels
Multi-channel reporting without Excel Frankenstein. One source of truth instead of manual data stitching.
Background
Which levers make multi-channel sales data consolidatable?
Why the data is fragmented in the first place
Every sales channel delivers data in its own format: Shopify has one schema, Amazon another, the POS system its own again, the field sales team reports via Excel. Even if the contents are similar (product, quantity, price, date), column names, time zones, currencies and tax logic differ. Without a unified data model, every report becomes its own truth — and discrepancies can no longer be explained.
What makes a central data model
A good multi-channel data model has one table for all sales events — regardless of the source. Every record has the same fields: channel, location, timestamp (in a unified time zone), product ID, quantity, gross and net revenue, tax amount. Platform-specific quirks are normalized in the pipeline, not in the dashboard. This way, any question can be asked later without having to build report logic per channel.
Marketplace reports and their quirks
Marketplaces like Amazon, Otto or Zalando often deliver reports with a delay and with their own aggregation logic: returns are partly booked back up to 30 days later, advertising costs delivered in a separate report with different granularity, payment receipt offset differently again. Whoever doesn't model these quirks does see numbers in the dashboard — but not the actual contribution margin per channel.
Why manual consolidation almost always produces errors
Studies on data quality in Excel-based reports (Panko 1998, KPMG, PwC) show: in 88% of all spreadsheets with non-trivial complexity, material errors are found. With manual multi-channel consolidation over a year — with daily updates from 5+ sources — the probability that at least one error above 1% revenue deviation exists at year-end is effectively 100%.
The main causes aren't incompetence but structural: schema drift on API updates, time-zone confusion, manual copy-paste errors, forgotten returns bookings, different tax logic per marketplace.
A central data architecture banishes the risk not through better Excel — but through automated validation: schema tests on every import, reconciliation checks between sources, alerting on deviations above defined thresholds.
Frequently asked questions
Common questions about multi-channel sales data
How many data sources can sensibly be integrated?
Technically there's no hard limit — but in practice a staggered approach pays off.
Phase 1 (typically 3–5 sources): the most important sales channels plus one CRM source. With that you usually cover 80–90% of revenue and can base the biggest decisions on it. Setup effort: 6–10 weeks.
Phase 2 (expansion to 8–15 sources): marketing data (Meta, Google Ads), email platforms, warehouse and logistics systems. Here the focus shifts from “measuring sales” to “steering the business”. Per source 1–2 weeks setup effort.
What determines the limit: not the tool choice, but the data maintenance. More than 20 active sources rarely make sense without a dedicated data team.
Real-time or daily aggregation — what's enough?
The answer depends on which decisions you make on the data — real-time is more expensive and complex and not sensible for every use case.
Cost difference: real-time pipelines typically cost 3–5× more than daily batches — both infrastructure and setup.
Practical advice: start with daily aggregation. If concrete real-time use cases show up after 6 months, switch over individual streams selectively — not everything at once.
- Daily aggregation is enough: operational reports (CFO, marketing, branch managers), weekly sales reviews, monthly reporting routines, strategic cohort analyses
- Real-time is worth it: operational out-of-stock warnings, real-time marketplace pricing, live dashboards for promotions, fraud monitoring
What does BigQuery realistically cost monthly?
BigQuery has a pay-per-use logic with two main cost drivers: storage and query costs.
Comparison to alternatives: a classic BI SaaS tool for comparable functionality often costs €500–2,000 monthly per seat — at 5 users that's quickly €2,500–10,000. BigQuery is almost always cheaper over total cost of ownership.
- Small (1–5M sales events/month, 3–5 sources): typically €30–80 storage + query
- Medium (5–30M events/month, 5–10 sources): typically €80–250
- Large (50M+ events/month, 10+ sources, many dashboards): €300–800
- Enterprise (real-time + many concurrent users): from €1,000 — reserved-capacity contracts pay off here
Cloud tools or self-build — what fits me?
One of the most important decisions — and it depends more on your team than on the data volume.
Hybrid is often the best solution: standard sources via cloud tools (e.g. Airbyte for Shopify, Stripe), custom sources via self-build (e.g. Lightspeed API, marketplace reports). dbt as the transformation layer is the same in both cases.
Rule of thumb for the cost trade-off: cloud tools often cost €100–500 monthly per source — at 10 sources that's €1,000–5,000 per month. Self-build has setup costs of 4–8 hours per source, then ongoing costs of €0. From 6+ sources, break-even is usually reached after 12 months.
- Cloud tools are worth it: no internal data engineers, mostly standard APIs, fast time-to-value more important than optimized costs
- Self-build is worth it: proprietary sources, data volume exponential for SaaS, special transformation logic, compliance requires full control
Do I need different dashboards for CFO and marketing?
Yes — and that's one of the most important insights you internalize after the first multi-channel projects. Building a single view for all stakeholders is almost always a mistake.
Important: all views are based on the same raw data in the data warehouse. Only the aggregation and visualization layer is tailored per audience. Otherwise the data conflicts between departments — which we actually wanted to avoid — arise again.
- CFO view: gross revenue, net revenue, tax per channel & location, returns-adjusted figures, forecast comparison, margin/contribution
- Marketing view: performance per campaign/channel/creative, cohort analyses, CLV, repeat rate, channel attribution, real-time during promotions
- Branch/sales view: daily revenue per location and employee, top/flop products, comparison to the previous period and to other branches
Can I also reconstruct historical data retroactively?
Yes, in most cases historical data can be built up retroactively — with a few concrete limitations per source.
Typical approach: an initial backfill of the last 12–24 months during setup, after which the pipeline continues on ongoing data. For larger data volumes, the backfill is done in weekly chunks.
Important: reconstruction is a one-time investment with clearly estimable effort. Ongoing data quality is more important long-term than historical depth — if you had to choose between 5 years retroactively and a clean pipeline for the future, choose the pipeline.
- Reconstructable well: Shopify (2 years retroactive via API), marketplaces like Amazon/Otto/Zalando (1–2 years), POS systems (mostly locally available), CRM data (complete)
- Not, or only partly, reconstructable: data from shut-down sources, manual Excel corrections without an audit trail, aggregated reports from tools without raw-data export
How does an intro call for such a project typically go?
A typical intro call for multi-channel sales data lasts about 30–45 minutes and is deliberately designed to classify your case — not to sell a finished package.
What it isn't: a sales call with pressure. If your case is too small, too large or topically unsuitable for me, I say so openly — and name suitable alternatives if needed.
- What we discuss: which sales sources exist today and which are missing, where the biggest data pain sits, what data knowledge exists in the team, which tools are already used
- What you have afterwards: an assessment of whether a tracking audit or a data-architecture project makes more sense, a rough order of magnitude (effort, duration, cost), concrete recommendations for action even without further collaboration
Who this is relevant for
Do you recognize yourself in any of these points?
If even just one of these applies to you, you should check your sales-data architecture.
When you sell across multiple channels and have no consolidated view
Shop, Amazon, marketplaces, field sales, POS — each source delivers its own numbers, but nobody knows the total. Strategic decisions are made based on gut feeling, not data.
When every marketplace has its own reporting
Amazon, Otto, Zalando, Shopify — each with its own reporting logic, its own returns bookings and its own ad-cost reports. The data doesn't reconcile itself over weeks, without a central model.
When your field sales sends Excel files that no one maintains centrally
Manually created reports arrive late, in different formats and without clear data standards. Consolidation happens by copy-paste — and is immediately outdated again.
When you can't compare conversion rate per location
Branch A in Munich, branch B in Hamburg, online shop, marketplaces — all with different conversion conditions, but without a unified comparison data model. Best and worst performers stay invisible.
When your CFO names different numbers than your marketing
That's the classic symptom of fragmented sales data: each department uses its own sources with its own logic. Discussions then revolve around number definitions, not strategic decisions.
My approach
5 phases that consolidate your sales data
My answer to fragmented multi-channel data: a 5-phase framework that turns 5 reports into a clear view of revenue — from the inventory to ongoing reporting.
PHASE 01
Data inventory
Complete capture of all sales sources: shop, POS, marketplaces, field-sales Excel, CRM. Schemas, time zones, tax logic documented.
PHASE 02
Data-model design
A unified schema for all sales events. Normalization logic per source. Time zones, currencies, returns clearly defined.
PHASE 03
Pipeline build
Source connectors, dbt transformation layer, BigQuery as the central data warehouse. Daily or hourly updates.
PHASE 04
Dashboards
Looker Studio or Metabase. One dedicated view layer per audience: CFO, marketing, sales locations.
PHASE 05
Reporting routines
Automated daily and weekly reports to stakeholders. Data-quality checks in the background. Alerts on deviations.
My stack
Which tools I work with
Data sources
Shopify · Lightspeed · marketplaces
Shop, POS and marketplace APIs together.
Connectors
Airbyte · Fivetran
Cloud connectors for standard sources.
Data warehouse
BigQuery · Snowflake
A central data warehouse with pay-per-use logic.
Transformation
dbt · GitHub Actions
Versioned schema with automated tests.
Dashboards
Looker Studio · Metabase
Stakeholder-specific view layers.
Quality & alerts
Great Expectations · Slack
Data-quality tests + alert on drift.
Timeline
Express track: prioritized setup with the 2 most important channels in 3 weeks — the others follow iteratively.
Free intro call
Let's talk
Multi-channel sales data is rarely a standard project. Before I sell you an audit or an architecture project, I want to understand where your actual bottleneck sits — and whether a targeted tracking audit, a data-architecture workshop or a full pipeline project makes sense for you.
Included
- ✓An assessment of your sales-data situation in 30–45 minutes
- ✓Concrete recommendations for action — even without further collaboration
- ✓An order-of-magnitude estimate (effort, duration, cost)
- ✓A recommendation of suitable alternatives if your case is topically unsuitable
Related content
Related content

GEO vs. SEO — what comes after Google
Generative Engine Optimization: How LLMs like ChatGPT and Perplexity select content, and what changes for brands.
Read
Attribution After the Cookie's Death
First-click and last-click misallocate budgets — a framework for data-driven attribution using first-party data, server-side signals, and incremental measurement.
Read
The Conversion Velocity Matrix
How your martech stack architecture determines how quickly prospects become customers — a strategic framework for B2B decision-makers.
Read