What is the difference between Azure Data Factory and Data Factory in Microsoft Fabric?

  • Data Factory in Microsoft Fabric: The next generation of Azure Data Factory, providing cloud-scale data movement and transformation services. It is designed to handle complex ETL scenarios while remaining easy to use.
  • Comparison article: This article covers the differences between Azure Data Factory (ADF) and Data Factory in Microsoft Fabric.

Difference between Azure Data Factory and Data Factory in Microsoft Fabric

The table below maps Azure Data Factory features and concepts to their counterparts in Data Factory in Fabric.

Azure Data Factory | Data Factory in Fabric | Description
Pipeline | Data pipeline | Data pipelines in Fabric are better integrated with the unified data platform, including Lakehouse, Data Warehouse, and more.
Mapping dataflow | Dataflow Gen2 | Dataflow Gen2 provides an easier experience for building transformations. Support for more mapping dataflow functions in Dataflow Gen2 is in progress.
Activities | Activities | Support for more ADF activities in Data Factory in Fabric is in progress. Data Factory in Fabric also adds new activities such as the Office 365 Outlook activity.
Dataset | Not applicable | Data Factory in Fabric has no dataset concept. A connection is used to connect to each data source and pull data.
Linked service | Connections | Connections work much like linked services, but they are more intuitive to create in Fabric.
Triggers | Schedules (other triggers are in progress) | Fabric can run a pipeline automatically on a schedule. Support for more ADF trigger types in Fabric is being added.
Publish | Save, Run | A Fabric pipeline does not need to be published to save its content. Use the Save button to save directly. Clicking Run saves the content before the pipeline runs.
Autoresolve and Azure integration runtime | Not applicable | Fabric has no concept of an integration runtime.
Self-hosted integration runtimes | On-premises data gateway (in design) | This capability in Fabric is still being designed.
Azure-SSIS integration runtimes | To be determined | The roadmap and design for this capability in Fabric have not been confirmed.
Managed VNet (MVNet) and private endpoint | To be determined | The roadmap and design for this capability in Fabric have not been confirmed.
Expression language | Expression language | The expression language is similar in ADF and Fabric.
Authentication type in linked service | Authentication kind in connection | Authentication kinds in Fabric pipelines already cover the popular authentication types in ADF, and more will be added.
CI/CD | CI/CD | CI/CD capability in Fabric Data Factory is coming soon.
Export and import ARM | Save as | Save as is available in Fabric pipelines to duplicate a pipeline.
Monitoring | Monitoring, Run history | The monitoring hub in Fabric offers more advanced functions and a modern experience, such as monitoring across different workspaces for better insights.

Data pipelines in Data Factory in Microsoft Fabric

Data pipelines in Data Factory in Microsoft Fabric introduce several notable features.

Lakehouse/Data Warehouse integration

  • In a Fabric pipeline, you can use Lakehouse and Data Warehouse as both sources and destinations.
  • This makes it straightforward to build projects that read from or write to Lakehouse and Data Warehouse without leaving the pipeline.

Office 365 Outlook activity

  • Office 365 Outlook activity: A pipeline activity in Fabric that sends email through Office 365 Outlook as part of a pipeline run.
  • Customized email notifications: The emails can be personalized to include information from your pipeline and its activities.
  • Easy configuration: The activity takes little effort to set up.

Get data experience

  • Modern Get Data experience: You can quickly find and connect to the data you need.
  • Part of Data Factory in Fabric: The experience is built into Data Factory in Microsoft Fabric.
  • Fast copy pipeline setup: You can create a pipeline that copies your data in a few steps.
  • New connections: You can create a connection to the data source you want to copy from directly in the same flow.

Data pipeline monitoring experience

  • The monitoring hub gives a complete view of all Data Factory tasks, including dataflows and pipelines.
  • You can drill into any individual task within Data Factory.
  • The monitoring hub also lets you review runs across different workspaces.

Activity continuity between Azure Data Factory (ADF) and Data Factory in Fabric

Activity | ADF | Data Factory in Fabric
Append Variable | Y | Y
Copy | Y | Y
Delete | Y | Y
Execute/Invoke Pipeline | Y | Y
Fail | Y | Y
Filter | Y | Y
For Each | Y | Y
GetMetadata | Y | Y
If | Y | Y
Lookup | Y | Y
Script | Y | Y
Set Variable | Y | Y
Sproc | Y | Y
Switch | Y | Y
Until | Y | Y
Wait | Y | Y
Web | Y | Y
Webhook | Y | Y
Synapse Notebook | Y | Y
Azure Function/Functions | Y | Y
ADX/KQL | Y | Y
Azure Machine Learning | Y | Y
Azure Batch | Y | Y
Azure Databricks (3 activities in ADF) | Y | Y
Validation | Y | N
HDInsight (5 activities in ADF) | Y | N
SSIS | Y | N
Mapping Dataflow | Y | N
Dataflow Gen2 | N/A | Y
Office 365 Outlook | N/A | Y
Teams | N/A | Y
Dataset Refresh | N/A | N
Azure ML Batch Execution | Deprecated | N/A
Azure ML Update Resource | Deprecated | N/A
Power Query (ADF only – Wrangling Dataflow) | Deprecated | N/A
USQL | Deprecated | N/A
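
When planning a migration, the activity table can be turned into a quick compatibility check. The following is a minimal Python sketch, not an official tool: the dictionary keys are the display names from the table (not the activity type identifiers used in pipeline JSON), and `check_pipeline` is a hypothetical helper introduced here for illustration.

```python
# Fabric availability for activities that exist in ADF, transcribed from the
# table above. Keys are the table's display names (an assumption made for this
# sketch), not the type identifiers found in pipeline definitions.
FABRIC_SUPPORT = {
    "Append Variable": True, "Copy": True, "Delete": True,
    "Execute/Invoke Pipeline": True, "Fail": True, "Filter": True,
    "For Each": True, "GetMetadata": True, "If": True, "Lookup": True,
    "Script": True, "Set Variable": True, "Sproc": True, "Switch": True,
    "Until": True, "Wait": True, "Web": True, "Webhook": True,
    "Synapse Notebook": True, "Azure Function/Functions": True,
    "ADX/KQL": True, "Azure Machine Learning": True, "Azure Batch": True,
    "Azure Databricks": True,
    "Validation": False, "HDInsight": False, "SSIS": False,
    "Mapping Dataflow": False,
}


def check_pipeline(activities):
    """Split a list of activity names into (missing_in_fabric, unknown).

    missing_in_fabric: activities the table marks as N for Fabric.
    unknown: names that do not appear in the table at all.
    """
    missing = [a for a in activities if FABRIC_SUPPORT.get(a) is False]
    unknown = [a for a in activities if a not in FABRIC_SUPPORT]
    return missing, unknown


if __name__ == "__main__":
    # Hypothetical ADF pipeline, listed by activity display name.
    pipeline = ["Lookup", "For Each", "Copy", "Validation", "SSIS"]
    missing, unknown = check_pipeline(pipeline)
    print("Not yet available in Fabric:", missing)
    print("Not listed in the table:", unknown)
```

Because the table reflects a point in time, the dictionary would need to be refreshed as more activities become available in Fabric.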

New activities in Fabric Data Factory

In addition to maintaining activity continuity, Data Factory in Fabric introduces some new activities to meet your richer orchestration needs. These new activities are:

  1. Outlook: Available in Fabric Data Factory to facilitate integration with Outlook services.
  2. Teams: Available in Fabric Data Factory to enable orchestration of Microsoft Teams activities.
  3. Dataset Refresh: In progress in Fabric Data Factory to enhance dataset refresh capabilities.
  4. Dataflow Gen2: Available in Fabric Data Factory to empower data orchestration with advanced dataflow capabilities.

Connector continuity between Azure Data Factory (ADF) and Data Factory in Fabric

Connector Type | ADF (Source/Sink) | Fabric Data Factory (Source/Sink)

Azure & Fabric
Blob Storage | ✓/✓ | ✓/✓
Cognitive Search Index | -/✓ | N
Cosmos DB – SQL API | ✓/✓ | ✓/✓
Cosmos DB – MongoDB API | ✓/✓ | N
Azure Data Explorer | ✓/✓ | ✓/✓
ADLS Gen1 | ✓/✓ | ✓/✓
ADLS Gen2 | ✓/✓ | ✓/✓
Database for MariaDB | ✓/- | N
Database for MySQL | ✓/✓ | N
Databricks Delta Lake | ✓/✓ | N
Azure File Storage | ✓/✓ | N
Database for PostgreSQL | ✓/✓ | ✓/✓
Azure Cosmos DB for PostgreSQL | ✓/✓ | N
Azure Table Storage | ✓/✓ | ✓/✓
SQL Database | ✓/✓ | ✓/✓
SQL Database MI | ✓/✓ | ✓/✓
Synapse Analytics | ✓/✓ | ✓/✓
Fabric Lakehouse | ✓/✓ | ✓/✓
Fabric DW | N | ✓/✓
Fabric KQL | N | ✓/✓

Database & DW
Amazon Redshift | ✓/- | ✓/-
DB2 | ✓/- | N
Drill | ✓/- | N
Google Big Query | ✓/- | N
Greenplum | ✓/- | N
HBase | ✓/- | N
Hive | ✓/- | N
Impala | ✓/- | N
Informix | ✓/✓ | N
MariaDB | ✓/- | N
Microsoft Access | ✓/✓ | N
MySQL | ✓/- | N
Netezza | ✓/- | N
Oracle | ✓/✓ | N
Amazon RDS for Oracle | ✓/- | N
Phoenix | ✓/- | N
PostgreSQL | ✓/- | ✓/-
Presto | ✓/- | N
SAP BW Open Hub | ✓/- | N
SAP BW MDX | ✓/- | N
SAP HANA | ✓/✓ | N
SAP Table | ✓/- | N
Snowflake | ✓/✓ | ✓/✓
Amazon RDS for SQL Server | ✓/- | ✓/-
SQL Server | ✓/✓ | ✓/✓
Spark | ✓/- | N
Sybase | ✓/- | N
Teradata | ✓/- | N
Vertica | ✓/- | N

File Storage
Amazon S3 | ✓/- | ✓/-
Amazon S3 Compatible | ✓/- | ✓/-
File System | ✓/✓ | N
FTP | ✓/- | ✓/-
Google Cloud Storage | ✓/- | ✓/-
HDFS | ✓/- | N
Oracle Cloud Storage | ✓/- | N
SFTP | ✓/✓ | ✓/✓

File Formats
AVRO | ✓/✓ | ✓/✓
Binary | ✓/✓ | ✓/✓
Delimited Text | ✓/✓ | ✓/✓
Excel | ✓/- | ✓/-
JSON | ✓/✓ | ✓/✓
ORC | ✓/✓ | ✓/✓
Parquet | ✓/✓ | ✓/✓
XML | ✓/- | ✓/-

NoSQL
Cassandra | ✓/- | N
Couchbase | ✓/- | N
MongoDB | ✓/✓ | ✓/✓
MongoDB Atlas | ✓/✓ | ✓/✓

Services & Apps
Amazon MWS | ✓/- | N
Concur | ✓/- | N
Dataverse | ✓/✓ | ✓/✓
Dynamics 365 | ✓/✓ | N
Dynamics AX | ✓/- | N
Dynamics CRM | ✓/✓ | ✓/✓
Google AdWords | ✓/- | N
HubSpot | ✓/- | N
Jira | ✓/- | N
Magento | ✓/- | N
Marketo | ✓/- | N
Microsoft 365 | ✓/- | ✓/-
Oracle Eloqua | ✓/- | N
Oracle Responsys | ✓/- | N
Zoho | ✓/- | N
Oracle Service Cloud | ✓/- | N
PayPal | ✓/- | N
QuickBooks | ✓/- | N
Salesforce | ✓/✓ | N
SF Service Cloud | ✓/✓ | N
SF Marketing Cloud | ✓/- | N
SAP C4C | ✓/✓ | N
SAP ECC | ✓/- | N
ServiceNow | ✓/- | N
SharePoint Online List | ✓/- | ✓/-
Shopify | ✓/- | N
Square | ✓/- | N
Web Table | ✓/- | N
Xero | ✓/- | N

Generic
HTTP | ✓/- | ✓/-
OData | ✓/- | ✓/-
ODBC | ✓/✓ | N
REST | ✓/✓ | ✓/✓
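
Each cell in the connector table encodes source and sink support as a pair such as ✓/-, with N meaning the connector is not available at all. As a rough illustration of how to read those cells, the Python sketch below parses them and checks whether a given copy is possible in Fabric. The function names `parse_support` and `can_copy` are hypothetical, and `FABRIC_CONNECTORS` holds only a handful of rows copied from the table.

```python
# A few Fabric Data Factory entries copied from the connector table above.
# "✓/-" means usable as a source but not as a sink; "N" means not available.
FABRIC_CONNECTORS = {
    "Blob Storage": "✓/✓",
    "Amazon S3": "✓/-",
    "Oracle": "N",
    "Snowflake": "✓/✓",
    "Fabric Lakehouse": "✓/✓",
}


def parse_support(cell):
    """Convert a table cell into a (can_read, can_write) pair of booleans."""
    if cell == "N":
        return (False, False)
    source, sink = cell.split("/")
    return (source == "✓", sink == "✓")


def can_copy(source, sink):
    """Return True if the table allows reading from source and writing to sink."""
    can_read, _ = parse_support(FABRIC_CONNECTORS[source])
    _, can_write = parse_support(FABRIC_CONNECTORS[sink])
    return can_read and can_write


if __name__ == "__main__":
    print(can_copy("Amazon S3", "Fabric Lakehouse"))  # S3 is readable, Lakehouse is writable
    print(can_copy("Blob Storage", "Amazon S3"))      # S3 is not supported as a sink
    print(can_copy("Oracle", "Fabric Lakehouse"))     # Oracle is marked N for Fabric
```

The same parsing applies to the ADF column, so the two columns can be compared row by row to list connectors that a migration would lose.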