XML File Connector for Azure Data Factory (Pipeline)

XML File Connector can be used to extract and output XML data stored in local XML files or direct XML String (variables or DB columns). XML Connector also supports Path expression to extract data from any level. This Connector is optimized to work with very large files.

In this article you will learn how to quickly and efficiently integrate XML File data in Azure Data Factory (Pipeline) without coding. We will use high-performance XML File Connector to easily connect to XML File and then access the data inside Azure Data Factory (Pipeline).

Let's follow the steps below to see how we can accomplish that!

Download Documentation

Create ODBC Data Source (DSN) based on ZappySys XML Driver

Step-by-step instructions

To get data from XML File using Azure Data Factory (Pipeline) we first need to create a DSN (Data Source) which will access data from XML File. We will later be able to read data using Azure Data Factory (Pipeline). Perform these steps:

  1. Download and install ODBC PowerPack.

  2. Open ODBC Data Sources (x64):

    Open ODBC Data Source
  3. Create a User data source (User DSN) based on ZappySys XML Driver

    ZappySys XML Driver
    Create new User DSN for ZappySys XML Driver
    • Create and use User DSN if the client application is run under a User Account. This is an ideal option in design-time, when developing a solution, e.g. in Visual Studio 2019. Use it for both type of applications - 64-bit and 32-bit.
    • Create and use System DSN if the client application is launched under a System Account, e.g. as a Windows Service. Usually, this is an ideal option to use in a production environment. Use ODBC Data Source Administrator (32-bit), instead of 64-bit version, if Windows Service is a 32-bit application.
    Azure Data Factory (Pipeline) uses a Service Account, when a solution is deployed to production environment, therefore for production environment you have to create and use a System DSN.
  4. You can use pass single file or multiple file path using wildcard pattern in path and you can use select single file by clicking [...] path button or multiple file using wildcard pattern in path.

    Note: If you want to operation with multiple files then use wild card pattern as below 
    (when you use wild card pattern in source path then system will treat target path as folder regardless you end with slash)
    
    C:\SSIS\Test\reponse.xml (will read only single reponse.xml file)
    C:\SSIS\Test\j*.xml (all files starting with file name j)
    C:\SSIS\Test\*.xml (all files with .xml Extension and located under folder subfolder)
    

  5. Now enter Path expression in Array Filter textbox to extract only specific part of XML file as below ($.feed.entry[*] will get content of entry attribute from XML document. Entry attribute is array of XML documents so we have to use [*] to indicate we want all records of that array)

    NOTE: Here, We are using our desired filter, but you need to select your desired filter based on your requirement.

    Click on Test Connection button to view whether the Test Connection is SUCCESSFUL or Not.

    $.feed.entry[*]
    ZappySys ODBC Driver - Configure XML Driver
  6. Once you configured a data source, you can preview data. Hit Preview tab, and use similar settings to preview data:
    ZappySys ODBC Driver - Preview XML Driver

  7. Click OK to finish creating the data source.

  8. That's it; we are done. In a few clicks we configured the call to XML File using ZappySys XML File Connector.

Video Tutorial

Read data in Azure Data Factory (ADF) from ODBC datasource (XML File)

  1. To start press New button:

    Create new Self-Hosted integration runtime
  2. Select "Azure, Self-Hosted" option:

    Create new Self-Hosted integration runtime
  3. Select "Self-Hosted" option:

    Create new Self-Hosted integration runtime
  4. Set a name, we will use "OnPremisesRuntime":

    Set a name for IR
  5. Download and install Microsoft Integration Runtime.

  6. Launch Integration Runtime and copy/paste Authentication Key from Integration Runtime configuration in Azure Portal:

    Copy/paste Authentication Key
  7. After finishing registering the Integration Runtime node, you should see a similar view:

    Check Integration Runtime node status
  8. Go back to Azure Portal and finish adding new Integration Runtime. You should see it was successfully added:

    Integration Runtime status
  9. Go to Linked services section and create a new Linked service based on ODBC:

    Add new Linked service
  10. Select "ODBC" service:

    Add new ODBC service
  11. Configure new ODBC service. Use the same DSN name we used in the previous step and copy it to Connection string box:

    XmlFileDSN
    DSN=XmlFileDSN
    Configure new ODBC service
  12. For created ODBC service create ODBC-based dataset:

    Add new ODBC dataset
  13. Go to your pipeline and add Copy data connector into the flow. In Source section use OdbcDataset we created as a source dataset:

    Set source in Copy data
  14. Then go to Sink section and select a destination/sink dataset. In this example we use precreated AzureBlobStorageDataset which saves data into an Azure Blob:

    Set sink in Copy data
  15. Finally, run the pipeline and see data being transferred from OdbcDataset to your destination dataset:

    Run the flow

Conclusion

In this article we showed you how to connect to XML File in Azure Data Factory (Pipeline) and integrate data without any coding, saving you time and effort. It's worth noting that ZappySys XML Driver allows you to connect not only to XML File, but to any Java application that supports JDBC (just use a different JDBC driver and configure it appropriately).

We encourage you to download XML File Connector for Azure Data Factory (Pipeline) and see how easy it is to use it for yourself or your team.

If you have any questions, feel free to contact ZappySys support team. You can also open a live chat immediately by clicking on the chat icon below.

Download XML File Connector for Azure Data Factory (Pipeline) Documentation

More integrations

Other connectors for Azure Data Factory (Pipeline)

All
Big Data & NoSQL
Database
CRM & ERP
Marketing
Collaboration
Cloud Storage
Reporting
Commerce
API & Files

Other application integration scenarios for XML File

All
Data Integration
Database
BI & Reporting
Productivity
Programming Languages
Automation & Scripting
ODBC applications

  • How to connect XML File in Azure Data Factory (Pipeline)?

  • How to get XML File data in Azure Data Factory (Pipeline)?

  • How to read XML File data in Azure Data Factory (Pipeline)?

  • How to load XML File data in Azure Data Factory (Pipeline)?

  • How to import XML File data in Azure Data Factory (Pipeline)?

  • How to pull XML File data in Azure Data Factory (Pipeline)?

  • How to push data to XML File in Azure Data Factory (Pipeline)?

  • How to write data to XML File in Azure Data Factory (Pipeline)?

  • How to POST data to XML File in Azure Data Factory (Pipeline)?

  • Call XML File API in Azure Data Factory (Pipeline)

  • Consume XML File API in Azure Data Factory (Pipeline)

  • XML File Azure Data Factory (Pipeline) Automate

  • XML File Azure Data Factory (Pipeline) Integration

  • Integration XML File in Azure Data Factory (Pipeline)

  • Consume real-time XML File data in Azure Data Factory (Pipeline)

  • Consume real-time XML File API data in Azure Data Factory (Pipeline)

  • XML File ODBC Driver | ODBC Driver for XML File | ODBC XML File Driver | SSIS XML File Source | SSIS XML File Destination

  • Connect XML File in Azure Data Factory (Pipeline)

  • Load XML File in Azure Data Factory (Pipeline)

  • Load XML File data in Azure Data Factory (Pipeline)

  • Read XML File data in Azure Data Factory (Pipeline)

  • XML File API Call in Azure Data Factory (Pipeline)