A unified data governance solution that helps manage, protect, and discover data across your organization
Hey @Anna Llop — this is a good catch, and it definitely sounds blocking.
Based on the provided docs, Data Quality for on-premises data sources (preview) does include “SQL Server” as a supported on-premises data source, alongside Oracle. The connection you’re creating in Unified Catalog → Health management → Data quality → … → Manage → Connection is also described as being created by selecting a Source type and then setting up the connection using a data integration runtime registered with Microsoft Purview (and on-premises scanning is hosted on customer infrastructure using Kubernetes + the registered runtime).
1) Is on-premises SQL Server supported for Data Quality?
Yes (preview) — the “Data quality for on-premises data sources (preview)” doc explicitly lists SQL Server as supported on-premises data sources.
2) Why might SQL Server not appear in the “Source type” dropdown?
The provided information doesn’t include a general, user-facing explanation for why the dropdown would omit SQL Server. However, there is a relevant supportability note (from the provided “Issue: On-premises SQL Server not available for Purview connection” snippet) indicating a backend feature-flag can be missing/not enabled for the account/tenant, which results in SQL Server not showing as an available source type even when the customer has the required SHIR + DIR infrastructure.
So in your scenario, the most likely explanation (per the provided material) is:
- A backend feature flag for Unified Catalog Data Quality on-prem SQL Server isn’t enabled for the Purview account/tenant.
3) Additional prerequisites / feature flags / tenant settings / regions / licenses
From the on-premises Data Quality prerequisites doc, the user-facing prerequisites are:
- Set up a Kubernetes cluster
- Download the data integration runtime
- Register it with Microsoft Purview
- Also: before running data quality, you must scan the assets through a Data Map scan (using a self-hosted integration runtime)
From the dropdown-missing case, additionally:
- A backend feature flag may need to be enabled (per the provided “Issue…” support note)
The provided docs don’t specify region/ licensing / tenant settings beyond the above, so I can’t confirm anything else from the supplied content.
4) Is the setup different from Oracle on-premises?
The connection workflow is broadly the same (Unified Catalog → Health management → Data quality → Manage → Connections → New). But the connection configuration model has an Oracle-specific nuance:
- For SQL Server connections: a connection accepts database information but not schema (single database can contain multiple schemas; you can scan schemas within that database).
- For Oracle: you can create connections at schema level or database level, with rules about precedence.
So yes, configuration details differ, even though the high-level workflow is the same.
5) Should SQL Server on-premises be configured through a different workflow?
The on-premises Data Quality doc says to set up connections the same way you create connections for other data source types in Unified Catalog (Health management → Data quality). It does not indicate a different workflow for SQL Server versus Oracle—rather, it indicates SQL Server should be selectable as a supported on-premises data source.
Given that SQL Server should be supported, if it’s missing from the dropdown, the “feature flag not enabled” scenario (above) is the most consistent explanation from the provided materials.
https://learn.microsoft.com/purview/unified-catalog-data-quality-on-premises-data-sources
https://learn.microsoft.com/purview/unified-catalog-data-quality-supported-sources-connection
https://learn.microsoft.com/purview/unified-catalog-data-quality-troubleshooting
https://learn.microsoft.com/en-us/purview/data-map-integration-runtime-kubernetes
https://learn.microsoft.com/en-us/purview/data-map-integration-runtime-choose#supported-data-sources