Glossary
Data Blending
Most teams don’t work from one clean dataset.
Sales lives in one system. Marketing in another. Customer data somewhere else. None of it lines up by default.
Data blending lets you combine what you need, when you need it, without moving everything into a warehouse or setting up a full pipeline.
You connect sources, define the link, and the tool brings them together in a single view.
It’s not about building the perfect dataset. It’s about answering questions with what you already have.
What Is Data Blending?
Data blending is a way to combine data from different sources into one analysis. You don’t merge the data in storage. You don’t create a new table.
Each source is queried separately. The results are then combined in the view based on a shared field like date, region, or product.
One source becomes the primary. Others are secondary. The primary defines what rows appear. The secondary fills in extra details where there’s a match.
Blending is useful when:
- You’re working across different platforms
- The data sources have different structures or levels of detail
- You need results quickly and can’t wait for a full integration
How Data Blending Works
Blending happens at the view level, not in the backend.
You connect to two or more sources. The first field you drag into the view becomes the primary. When you drag a field from another source, it becomes the secondary.
If a shared field exists with the same name, the tool links them automatically. If not, you create the link manually.
Important rules to remember:
- Each source runs its own query
- Data is aggregated before blending
- You can’t mix raw and aggregated fields in the same formula
Because of this, secondary data is always aggregated. It only returns values that match something in the primary source.
For example, if the primary has only three months of data and the secondary has twelve, only the three shared months will appear in the view.
When to Use Blending Instead of Joins
Joins combine tables before the analysis begins. They work when the data lives in one place and follows a consistent structure.
Blending works when that structure is missing.
Use blending when you:
- Need to work across separate systems
- Want to use published sources
- Can’t or don’t want to change the data model
Key Differences
- Joins happen before aggregation. Blends happen after.
- Joins create a new table. Blends don’t.
- Joins can include unmatched rows. Blends only show matches from the primary.
Use joins when:
- Everything lives in the same database
- You need row-level operations
- You plan to reuse the joined dataset
Use blends when:
- Data comes from different platforms
- You’re dealing with inconsistent structure
- The analysis is short-term or specific
Choosing the Primary and Secondary Sources
Your choice of primary and secondary matters. The primary defines which rows appear.
The secondary only adds values where there’s a match.
What to Watch For
- If a value is missing in the primary, the secondary won’t show it
- If the primary has three months and the secondary has twelve, you’ll only see three
- Switching the primary can reveal more data
Be intentional.
- Choose the primary based on your goal
- Know what might get excluded
- Test both directions to confirm results
Working Across Multiple Data Sources
When you mix data sources, your options change.
Each calculated field must live in one source. If you reference a secondary field, it must be aggregated.
What This Means
- Fields from a secondary source are always aggregated
- You can’t mix raw fields and aggregated fields
- You use dot notation to reference fields from another source (for example,
[Sales].[Target]
)
Limits to Expect
- You can’t sort by secondary fields
- Some filters won’t behave the same
- You may see an asterisk when the data doesn’t match cleanly
To reduce issues:
- Keep most logic in one source
- Match field names where possible
- Use manual links when automatic ones fail
Setting Up a Data Blend
Blending works per view, not across an entire dashboard or project.\
Steps to Blend
- Connect to your sources. They must appear separately in the data pane.
- Add a field from one source. This defines the primary.
- Add a field from another source. This becomes the secondary.
- Confirm the relationship. If no link appears, define it manually.
If fields have different names, rename them or create a custom mapping.
You can blend on one field or several. But more links make the blend stricter and may hide rows.
Blending doesn’t create a new dataset. It builds a temporary link to support one view, for one question, using only the fields needed.
Data Blending and Aggregation Rules
Blending always aggregates the secondary data. That’s how it works.
What You Need to Know
- You can’t get row-level detail from the secondary
- Every field from it must be aggregated
- Even dimensions are wrapped in ATTR() by default
If there are multiple values for a field, Tableau uses an asterisk to flag the conflict.
Example
You want to compare actual sales (primary) with quotas (secondary).
To get the ratio, you write:
SUM([Sales]) / SUM([Quota])
If you forget to aggregate both sides, the formula won’t run.
This matters most when combining real-time data with static summaries. You can only show totals or averages, not every transaction.
FAQ
What is data blending?
Data blending combines data from different sources into one view. Each source is queried on its own and merged at analysis time. No data is moved or stored together.
How is data blending different from joining?
Joins happen before analysis and build one dataset. Blends keep sources separate, aggregate each one, and align them in the view.
When should I use blending?
Use it when:
- Data lives in different systems
- You need a quick answer
- You don’t want to change your data model
- You’re working with published or real-time sources
What’s the difference between primary and secondary?
The primary source defines which rows appear. The secondary only contributes where there’s a match in the primary.
Why do values go missing from the secondary?
Only matches appear. If the primary is missing a value that exists in the secondary, it won’t show up unless you switch the primary.
Can I blend on more than one field?
Yes. But more linking fields mean stricter rules. You could filter out more than intended.
What if field names don’t match?
Rename one to match or manually map the relationship. What matters is that the values line up, not the names.
Can I calculate using both sources?
Yes. But any field from the secondary must be aggregated.
Why do I see an asterisk in the view?
There are multiple matching values in the secondary. Tableau doesn’t know which one to use, so it shows an asterisk.
Can I save or publish a blend?
No. Blends only exist inside the view. For reuse, join the data or combine it elsewhere.
What are the limits?
Blending can’t:
- Return row-level detail from the secondary
- Support all filters
- Handle non-additive metrics like MEDIAN or COUNTD consistently
Will it slow down performance?
It might. Each source is queried separately. Use fewer fields or summary data to keep things fast.
Which tools support blending?
Data blending is supported in Tableau, Looker Studio, Power BI (limited), and Alteryx. Each one handles it a little differently.
Summary
Data blending lets you work across tools, sources, and formats without changing your entire pipeline. You keep data where it lives, query it separately, and blend results in one view.
It’s perfect for:
- Quick comparisons
- Published datasets
- Disconnected systems
It has limits. The secondary data is always aggregated. Some values might not appear. And you can’t save the blend for reuse.
But when speed matters, and structure isn’t perfect, data blending gives analysts a way to keep moving. It’s the shortcut that works when full integration isn’t worth the time.
A wide array of use-cases
Discover how we can help your data into your most valuable asset.
We help businesses boost revenue, save time, and make smarter decisions with Data and AI