<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Redshift Archives | ZappySys Blog</title>
	<atom:link href="https://zappysys.com/blog/category/cloud-computing/aws-amazon-web-services/redshift/feed/" rel="self" type="application/rss+xml" />
	<link>https://zappysys.com/blog/category/cloud-computing/aws-amazon-web-services/redshift/</link>
	<description>SSIS / ODBC Drivers / API Connectors for JSON, XML, Azure, Amazon AWS, Salesforce, MongoDB and more</description>
	<lastBuildDate>Mon, 14 Oct 2019 20:57:35 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.4.4</generator>

<image>
	<url>https://zappysys.com/blog/wp-content/uploads/2023/01/cropped-zappysys-symbol-large-32x32.png</url>
	<title>Redshift Archives | ZappySys Blog</title>
	<link>https://zappysys.com/blog/category/cloud-computing/aws-amazon-web-services/redshift/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Merge / Upsert data in Amazon Redshift using SSIS</title>
		<link>https://zappysys.com/blog/ssis-amazon-redshift-upsert-update-insert-delete/</link>
		
		<dc:creator><![CDATA[ZappySys]]></dc:creator>
		<pubDate>Fri, 06 Sep 2019 09:02:51 +0000</pubDate>
				<category><![CDATA[Redshift]]></category>
		<category><![CDATA[SSIS Upsert Destination]]></category>
		<category><![CDATA[access]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[redshift]]></category>
		<category><![CDATA[ssis]]></category>
		<category><![CDATA[upsert]]></category>
		<guid isPermaLink="false">https://zappysys.com/blog/?p=7903</guid>

					<description><![CDATA[<p>Introduction In our previous blog we saw how update / insert data into SQL Server using SSIS Upsert Destination. In this post we will look at specific example on Data migration from Access to Amazon Redshift using SSIS Upsert Destination (Insert, Update, Delete), along with few other topics such as how to create table using [&#8230;]</p>
<p>The post <a href="https://zappysys.com/blog/ssis-amazon-redshift-upsert-update-insert-delete/">Merge / Upsert data in Amazon Redshift using SSIS</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<h2><span id="Introduction">Introduction</span></h2>
<p><a href="https://zappysys.com/blog/wp-content/uploads/2019/09/Access-to-AmazonRedshift.png" target="_blank" rel="noopener"><img decoding="async" class="wp-image-7910 size-thumbnail alignleft" src="https://zappysys.com/blog/wp-content/uploads/2019/09/Access-to-AmazonRedshift-150x150.png" alt="Access-to-AmazonRedshift" width="150" height="150" /></a>In our previous blog we saw <a href="https://zappysys.com/blog/data-migration-access-sql-server/" target="_blank" rel="noopener">how update / insert data into SQL Server using SSIS Upsert Destination</a>. In this post we will look at specific example on <strong>Data migration from Access to Amazon Redshift using SSIS Upsert Destination (Insert, Update, Delete)</strong>, along with few other topics such as how to create table using <a href="https://zappysys.com/products/ssis-powerpack/ssis-upsert-destination/" target="_blank" rel="noopener">Upsert Destination</a>. how to read all Customers data from Ms Access Table and Merge it in the Amazon Redshift.</p>
<p>We will go through the steps to read data from Access and Load into Amazon Redshift.</p>
<p>In nutshell, this post will focus on how to read access table data in SSIS.</p>
<p>So let’s get started.</p>
<h2><span id="Requirements">Requirements</span></h2>
<ol>
<li>First, you will need to have SSIS installed</li>
<li>Secondly, make sure to have SSDT</li>
<li>Thirdly, do not forget to install ZappySys <a href="https://zappysys.com/products/ssis-powerpack/" target="_blank" rel="noopener">SSIS PowerPack</a></li>
<li>Finally, Make sure that Microsoft Access installed.</li>
</ol>
<h2>How to Read MS Access table data and migrate that data in Amazon Redshift table.</h2>
<p>Let´s start with an example. In this article we will see Data migration from Access to Amazon Redshift.</p>
<ol>
<li>First of All, Drag and drop Data Flow Task from SSIS Toolbox and double click it to edit.
<div id="attachment_7934" style="width: 470px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-drag-drop-data-flow-task.png" target="_blank" rel="noopener"><img fetchpriority="high" decoding="async" aria-describedby="caption-attachment-7934" class="wp-image-7934 size-full" src="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-drag-drop-data-flow-task.png" alt="Drag and Drop SSIS Data Flow Task from SSIS Toolbox" width="460" height="155" srcset="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-drag-drop-data-flow-task.png 460w, https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-drag-drop-data-flow-task-300x101.png 300w" sizes="(max-width: 460px) 100vw, 460px" /></a><p id="caption-attachment-7934" class="wp-caption-text">Drag and Drop : SSIS Data Flow Task from SSIS Toolbox</p></div></li>
<li>Furthermore, drag and drop the OLE DB Source.
<div id="attachment_7289" style="width: 515px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/06/oledb-source-drag-and-drop.png" target="_blank" rel="noopener"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-7289" class="wp-image-7289 size-full" src="https://zappysys.com/blog/wp-content/uploads/2019/06/oledb-source-drag-and-drop.png" alt="OLE DB Source - Drag and Drop" width="505" height="190" srcset="https://zappysys.com/blog/wp-content/uploads/2019/06/oledb-source-drag-and-drop.png 505w, https://zappysys.com/blog/wp-content/uploads/2019/06/oledb-source-drag-and-drop-300x113.png 300w" sizes="(max-width: 505px) 100vw, 505px" /></a><p id="caption-attachment-7289" class="wp-caption-text">OLE DB Source &#8211; Drag and Drop</p></div></li>
<li>Double click on OLE DB Source for configure it and click on New Connection and configure connection as below to connect access database and click on OK.
<div id="attachment_7851" style="width: 722px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-access-connection.png" target="_blank" rel="noopener"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-7851" class="wp-image-7851 size-full" src="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-access-connection.png" alt="OLE DB : Access Connection" width="712" height="618" srcset="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-access-connection.png 712w, https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-access-connection-300x260.png 300w" sizes="(max-width: 712px) 100vw, 712px" /></a><p id="caption-attachment-7851" class="wp-caption-text">OLE DB : Access Connection</p></div></li>
<li>Now in OLE DB Source Select the mode as Table or View and select Preview to view the access table data.
<div id="attachment_7852" style="width: 730px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview.png" target="_blank" rel="noopener"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-7852" class="wp-image-7852 size-medium_large" src="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview-768x472.png" alt="OLE DB Source Preview" width="720" height="443" srcset="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview-768x472.png 768w, https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview-300x184.png 300w, https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview-1024x629.png 1024w, https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-ole-db-source-preview.png 1165w" sizes="(max-width: 720px) 100vw, 720px" /></a><p id="caption-attachment-7852" class="wp-caption-text">OLE DB Source Preview</p></div></li>
<li>Now drag and drop Upsert Destination (Insert, Update, Delete) and create connection with Amazon Redshift Database.
<div id="attachment_8093" style="width: 690px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-aws-redshift-connection.png"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-8093" class="size-full wp-image-8093" src="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-aws-redshift-connection.png" alt="Upsert Destination : Amazon Redshift Connection" width="680" height="762" srcset="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-aws-redshift-connection.png 680w, https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-aws-redshift-connection-268x300.png 268w" sizes="(max-width: 680px) 100vw, 680px" /></a><p id="caption-attachment-8093" class="wp-caption-text">Upsert Destination : Amazon Redshift Connection</p></div></li>
<li>Now select Action as Sync and check all the checkboxes Insert, Update and Delete from target if not found in source. Select the table and Map all the columns and select the Key field(s) and click on OK.
<div id="attachment_7901" style="width: 730px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-destination-select-table-and-key.png" target="_blank" rel="noopener"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-7901" class="wp-image-7901 size-medium_large" src="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-destination-select-table-and-key-768x572.png" alt="Upsert Destination Configuration" width="720" height="536" srcset="https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-destination-select-table-and-key-768x572.png 768w, https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-destination-select-table-and-key-300x224.png 300w, https://zappysys.com/blog/wp-content/uploads/2019/09/ssis-upsert-destination-select-table-and-key.png 950w" sizes="(max-width: 720px) 100vw, 720px" /></a><p id="caption-attachment-7901" class="wp-caption-text">Upsert Destination Configuration</p></div></li>
<li>That&#8217;s it we are ready to migrate MS access table data into Amazon Redshift Table. Execute the package and it will migrate the data.
<div id="attachment_7860" style="width: 420px" class="wp-caption aligncenter"><a href="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-oledb-source-and-upsert-destination.png" target="_blank" rel="noopener"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-7860" class="wp-image-7860 size-full" src="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-oledb-source-and-upsert-destination.png" alt="Upsert Destination (Insert, Update, Delete)" width="410" height="225" srcset="https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-oledb-source-and-upsert-destination.png 410w, https://zappysys.com/blog/wp-content/uploads/2019/08/ssis-oledb-source-and-upsert-destination-300x165.png 300w" sizes="(max-width: 410px) 100vw, 410px" /></a><p id="caption-attachment-7860" class="wp-caption-text">Upsert Destination (Insert, Update, Delete)</p></div></li>
</ol>
<h2>Bulk Update data in Amazon Redshift</h2>
<p>So in previous example we saw bulk update or insert (Upsert) in Redshift Table. Now let&#8217;s look at how to update data in target table if record exists.</p>
<p>Here is how you can perform bulk update in Amazon Redshift using Upsert Destination.</p>
<ol>
<li>Double click on Upsert Destination for configure it.</li>
<li>Set Action Bulk Update =&gt; based on matching records on target. Select Connection and Target Table. Click on Map All to Mappings all columns and check on Only Primary Key columns.<br />
<img decoding="async" class="figureimage" src="https://zappysys.com/onlinehelp/ssis-powerpack/scr/images/upsert-destination/ssis-bulk-update.png" alt="SSIS Bulk Update rows in SQL Table" /></li>
<li>Thats all, Click on OK to save Upsert Destination settings UI.</li>
</ol>
<h2>Bulk Delete data in Amazon Redshift</h2>
<p>Here is how you can bulk delete data in Amazon Redshift.</p>
<ol>
<li>Double click on Upsert Destination for configure it.</li>
<li>Set Action Bulk Delete =&gt; based on matching records on target. Select Connection and Target Table. Click on Map All to Mappings all columns and check on Only Primary Key columns.<br />
<img decoding="async" class="figureimage" src="https://zappysys.com/onlinehelp/ssis-powerpack/scr/images/upsert-destination/ssis-bulk-delete.png" alt="SSIS Bulk delete rows in SQL Table" /></li>
<li>Thats all, Click on OK to save Upsert Destination settings UI.</li>
</ol>
<p>&nbsp;</p>
<h2><span id="Conclusion">Conclusion</span></h2>
<p>In this article, we show how to read MS Access table data and migrate the data in Amazon Redshift table using SSIS. We show how to do connect access MS Access using OLE DB Source. Also, we show how to write Sync Insert, Update and Delete in target if not found in Source Using <a href="https://zappysys.com/products/ssis-powerpack/ssis-upsert-destination/" target="_blank" rel="noopener">ZS Upsert Destination</a>. If you liked this article and you want to try, you can download the <a href="https://zappysys.com/products/ssis-powerpack/">SSIS PowerPack from here (includes 70+ Components)</a>.</p>
<h2><span id="References">References</span></h2>
<ul>
<li><a href="https://zappysys.com/products/ssis-powerpack/download/" target="_blank" rel="noopener">ZappySys SSIS installer.</a></li>
<li><strong>Help File: </strong><a href="https://zappysys.com/onlinehelp/ssis-powerpack/index.htm#page=ssis-upsert-destination.htm" target="_blank" rel="noopener">Upsert Destination</a></li>
</ul>
<p>The post <a href="https://zappysys.com/blog/ssis-amazon-redshift-upsert-update-insert-delete/">Merge / Upsert data in Amazon Redshift using SSIS</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Amazon Redshift data load in Informatica PowerCenter</title>
		<link>https://zappysys.com/blog/amazon-redshift-data-load-in-informatica-powercenter/</link>
		
		<dc:creator><![CDATA[ZappySys]]></dc:creator>
		<pubDate>Wed, 17 Feb 2016 23:42:41 +0000</pubDate>
				<category><![CDATA[Redshift]]></category>
		<category><![CDATA[Amazon Redshift]]></category>
		<category><![CDATA[Amazon S3]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[Informatica]]></category>
		<category><![CDATA[PowerCenter]]></category>
		<category><![CDATA[redshift]]></category>
		<category><![CDATA[ZappyShell]]></category>
		<guid isPermaLink="false">http://zappysys.com/blog/?p=360</guid>

					<description><![CDATA[<p>Introduction In our previous post you learned how to load data into Redshift using SSIS. Now in this post you will learn how to load data into Redshift using Informatica PowerCenter. For PowerCenter we will use ZappyShell Command line for Redshift Data Load. This small powerful command line utility can handle load of several millions [&#8230;]</p>
<p>The post <a href="https://zappysys.com/blog/amazon-redshift-data-load-in-informatica-powercenter/">Amazon Redshift data load in Informatica PowerCenter</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<h2>Introduction</h2>
<p>In our <a href="https://zappysys.com/posts/sql-server-to-redshift-data-load-using-ssis/#Command_line_approach_for_SQL_Server_to_Redshift_data_load" target="_blank">previous post</a> you learned how to load data into Redshift using SSIS. Now in this post you will learn how to load data into Redshift using <strong>Informatica PowerCenter</strong>. For PowerCenter we will use <a href="//zappysys.com/products/zappyshell/amazon-redshift-command-line-tools/" target="_blank">ZappyShell Command line for Redshift Data Load</a>. This small powerful command line utility can handle load of several millions or billions records in few minutes. It uses parallel execution engine which takes care most tedious steps of loading data into Amazon Redshift.</p>
<h2>Using informatica command task to load data into redshift</h2>
<p>Informatica PowerCenter has simple task to execute command lines. you can use this task to execute ZappyShell command line. Perform following steps to load data.</p>
<ol>
<li>Download and install <a href="//zappysys.com/products/zappyshell/amazon-redshift-command-line-tools/" target="_blank">ZappyShell for Amazon Redshift from here</a></li>
<li>Now you ready to data into Amazon Redshift</li>
<li>Drag new command task into your Informatica workflow</li>
<li>Enter below command to load data from SQL Server to Redshift. If you have ODBC connectivity then use ODBC DSN to read your data.</li>
<li>You can use script file approach too where each command line parameter can be nicely wrapped in new line.<br />
Here is sample command you can try to load data from SQL Server to Redshift. if you have ODBC connectivity then use ODBC DSN to read your data.<br />
<pre class="crayon-plain-tag">c:\zappyshell\aws.exe import Db 
	--source-driver ADONET_MSSQL 
	--source-query "select ROW_NUMBER()Over(order by a.CustomerID) Id, a.*,b.*,c.OrderID,c.OrderDate,c.Freight  from customers a,products b,orders c" 
	--source-archivemethod None 
	--source-stage-archivemethod Delete 
	--target-stage-archivemethod Delete 
	--source-stagepath "c:\redshift\stage" 
	--target-table "customerdata" 
	--target-stagepath "bw-rstest/cmdstage" 
	--target-truncate 
	--logfile "c:\redshift\log.txt" 
	--maxrows-perfile 100000 
	--region us-east-1 
	--accesskey "AKIA*****************" 
	--secretkey "lPi+XQ************************"  
	--source-connstr "Data Source=localhost;Initial Catalog=Northwind;Integrated Security=SSPI;"  
	--target-connstr "Host=mytestcluster-1.csu********.us-east-1.redshift.amazonaws.com;Port=5439;Database=dev;UserName=masteruser;Password=*********;EnableSsl=true;Timeout=30;CommandTimeout=3600;"</pre>
</li>
<li>For complete help on import command check help file //zappysys.com/onlinehelp/zappyshell/scr/aws/aws-redshift-import.htm</li>
</ol>
<div id="attachment_405" style="width: 746px" class="wp-caption alignnone"><a href="//zappysys.com/blog/wp-content/uploads/2016/02/amazon-redshift-import-command-line-tools-copy-s3-aws.png"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-405" class="size-full wp-image-405" src="//zappysys.com/blog/wp-content/uploads/2016/02/amazon-redshift-import-command-line-tools-copy-s3-aws.png" alt="Informatica PowerCenter Redshift Data Load - Amazon Redshift Import Command line tools (COPY, S3, AWS)" width="736" height="571" srcset="https://zappysys.com/blog/wp-content/uploads/2016/02/amazon-redshift-import-command-line-tools-copy-s3-aws.png 736w, https://zappysys.com/blog/wp-content/uploads/2016/02/amazon-redshift-import-command-line-tools-copy-s3-aws-300x233.png 300w" sizes="(max-width: 736px) 100vw, 736px" /></a><p id="caption-attachment-405" class="wp-caption-text">Informatica PowerCenter Redshift Data Load &#8211; Amazon Redshift Import Command line tools (COPY, S3, AWS)</p></div>
<h2>ZappyShell Command Line for Redshift</h2>
<p>Below are few feature for Command line for Amazon Redshift</p>
<p>◾Import data to AWS Redshift database from files or relational source (e.g. MySQL, Oracle, SQL Server)<br />
◾Import huge amount of data (millions of rows) in few minutes with parallel load techniques<br />
◾Load local flat files to Redshift in just single line command (option to compress data files to *.gz to speed up transfer)<br />
◾Support for Client side encryption using AES 256 Key<br />
◾Load data from any data source (ODBC,ADO.net or OLEDB) using sql query<br />
◾Import compressed data files (*.gz) to redshift<br />
◾Archive files, error reporting, file splitting, many other features</p>
<a href="https://zappysys.com/blog/amazon-redshift-data-load-in-informatica-powercenter/"><img decoding="async" src="https://zappysys.com/blog/wp-content/plugins/wp-youtube-lyte/lyteCache.php?origThumbUrl=%2F%2Fi.ytimg.com%2Fvi%2FOz5Rfztzo0U%2Fhqdefault.jpg" alt="YouTube Video"></a><br /><br /></p>
<p>The post <a href="https://zappysys.com/blog/amazon-redshift-data-load-in-informatica-powercenter/">Amazon Redshift data load in Informatica PowerCenter</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Fix Redshift Copy ERROR: XX000: Mandatory url is not present in manifest file</title>
		<link>https://zappysys.com/blog/fix-redshift-copy-error-xx000-mandatory-url-is-not-present-in-manifest-file/</link>
		
		<dc:creator><![CDATA[ZappySys]]></dc:creator>
		<pubDate>Fri, 22 Jan 2016 19:56:55 +0000</pubDate>
				<category><![CDATA[Redshift]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[redshift]]></category>
		<category><![CDATA[Redshift Data Transfer Task]]></category>
		<category><![CDATA[ssis]]></category>
		<category><![CDATA[SSIS PowerPack]]></category>
		<guid isPermaLink="false">http://zappysys.com/blog/?p=328</guid>

					<description><![CDATA[<p>If you are doing data transfer using SSIS Redshift Data Transfer task and if you receive following error then most likely you have not specified folder in your target location. Error: An error occurred with the following error message: "ERROR: XX000: Mandatory url is not present in manifest file.". Solution: To fix this error change [&#8230;]</p>
<p>The post <a href="https://zappysys.com/blog/fix-redshift-copy-error-xx000-mandatory-url-is-not-present-in-manifest-file/">Fix Redshift Copy ERROR: XX000: Mandatory url is not present in manifest file</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<p>If you are doing data transfer using <a href="https://zappysys.com/products/ssis-powerpack/ssis-amazon-redshift-data-transfer-task/">SSIS Redshift Data Transfer task</a><br />
 and if you receive following error then most likely you have not specified folder in your target location.</p>
<p><code><br />
Error: An error occurred with the following error message: "ERROR: XX000: Mandatory url is not present in manifest file.".<br />
</code><br />
<strong>Solution:</strong> To fix this error change target folder location (On Target tab -> Staging Area Panel -> Stage folder) such a way so it includes bucket and some folder name after that too. </p>
<p>Invalid Path: mybucket/<br />
Valid Path: mybucket/somefolder</p>
<div id="attachment_329" style="width: 271px" class="wp-caption alignnone"><a href="//zappysys.com/blog/wp-content/uploads/2016/01/redshift-error-xx000-mandatory-url-is-not-present-in-manifest-file.png"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-329" src="//zappysys.com/blog/wp-content/uploads/2016/01/redshift-error-xx000-mandatory-url-is-not-present-in-manifest-file.png" alt="How to fix Redshift Copy Error for SSIS Redshift Data Transfer task: &quot;ERROR: XX000: Mandatory url is not present in manifest file.&quot;." width="261" height="448" class="size-full wp-image-329" srcset="https://zappysys.com/blog/wp-content/uploads/2016/01/redshift-error-xx000-mandatory-url-is-not-present-in-manifest-file.png 261w, https://zappysys.com/blog/wp-content/uploads/2016/01/redshift-error-xx000-mandatory-url-is-not-present-in-manifest-file-175x300.png 175w" sizes="(max-width: 261px) 100vw, 261px" /></a><p id="caption-attachment-329" class="wp-caption-text">How to fix Redshift Copy Error for SSIS Redshift Data Transfer task: &#8220;ERROR: XX000: Mandatory url is not present in manifest file.&#8221;.<br /></p></div>
<p>The post <a href="https://zappysys.com/blog/fix-redshift-copy-error-xx000-mandatory-url-is-not-present-in-manifest-file/">Fix Redshift Copy ERROR: XX000: Mandatory url is not present in manifest file</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>How to setup Amazon Redshift Cluster in few clicks</title>
		<link>https://zappysys.com/blog/how-to-setup-amazon-redshift-cluster-for-outside-data-access/</link>
		
		<dc:creator><![CDATA[ZappySys]]></dc:creator>
		<pubDate>Tue, 17 Nov 2015 14:47:38 +0000</pubDate>
				<category><![CDATA[AWS (Amazon Web Services)]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Redshift]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[redshift]]></category>
		<category><![CDATA[Redshift Data Transfer Task]]></category>
		<category><![CDATA[ssis]]></category>
		<category><![CDATA[SSIS PowerPack]]></category>
		<guid isPermaLink="false">http://zappysys.com/blog/?p=169</guid>

					<description><![CDATA[<p>Introduction In this article you will learn how to Setup Amazon Redshift Cluster in few clicks. You will also learn how to set Inbound and Outbound Firewall Rules so you can access Redshift Cluster from outside of AWS Network (e.g. from your corporate network or your home). By default Redshift Cluster cannot be access from outside [&#8230;]</p>
<p>The post <a href="https://zappysys.com/blog/how-to-setup-amazon-redshift-cluster-for-outside-data-access/">How to setup Amazon Redshift Cluster in few clicks</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<h2>Introduction</h2>
<p>In this article you will learn <em>how to Setup Amazon Redshift Cluster</em> in few clicks. You will also learn how to set Inbound and Outbound Firewall Rules so you can access Redshift Cluster from outside of AWS Network (e.g. from your corporate network or your home). By default Redshift Cluster cannot be access from outside of your AWS Virtual Network (referred as VPC &#8211; Virtual Private Cloud)</p>
<p>Once Redshift Cluster is setup you can follow <a href="//zappysys.com/posts/sql-server-to-redshift-data-load-using-ssis/">these steps to Load data into Redshift</a> (Using <a href="//zappysys.com/products/ssis-powerpack/ssis-amazon-redshift-data-transfer-task/">SSIS Redshift Data Transfer Task</a> or <a href="//zappysys.com/products/zappyshell/amazon-redshift-command-line-tools/">Command line for Redshift</a>)</p>
<h2>What is Amazon Redshift</h2>
<p><em>Amazon Redshift</em> is a fully managed, petabyte-scale <em>data warehouse</em> service in the <em>cloud</em>. You can start with just a few hundred gigabytes of data and scale to a petabyte or more. This enables you to use your data to acquire new insights for your business and customers.</p>
<p>The first step to create a data warehouse is to launch a set of nodes, called an <em>Amazon Redshift cluster</em>. After you provision your cluster, you can upload your data set and then perform data analysis queries. Regardless of the size of the data set, Amazon Redshift offers fast query performance using the same SQL-based tools and business intelligence applications that you use today.</p>
<h2><span id="Setup_your_Amazon_Redshift_Cluster">Setup Amazon Redshift Cluster</span></h2>
<div>
<p><em>NOTE: Skip this step if you already setup you Redshift Cluster</em></p>
<ol>
<li>Login to your AWS Console and Click on Redshift icon. Or <a href="https://console.aws.amazon.com/redshift/home" target="_blank"><span style="color: #248cc8;">click here</span></a> to land directly to redshift</li>
<li>Click on Launch Cluster</li>
<li>On Cluster Detail Page specify Cluster Identifier, Database Name, Port, Master User and Password. Click Continue to go to next page
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-1-create-database.png"><img loading="lazy" decoding="async" title="Configure Redshift Cluster Identifier, Database Name, Port , UserID and Password" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-1-create-database.png" alt="Configure Redshift Cluster Identifier, Database Name, Port , UserID and Password" width="381" height="365" /></a></p>
<p class="wp-caption-text">Configure Redshift Cluster Identifier, Database Name, Port , UserID and Password</p>
</div>
</li>
<li>On Node Configuration Page specify Node Type (This is VM Type), Cluster Type and Number of Node. If you are trying under Free Tire then select smallest Node possible (in this case it was dw2.large). Click Continue to go to next page
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-2-specify-node-type.png"><img loading="lazy" decoding="async" title="Configure Redshift Node Type and Cluster Type" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-2-specify-node-type.png" alt="Configure Redshift Node Type and Cluster Type" width="386" height="366" /></a></p>
<p class="wp-caption-text">Configure Redshift Node Type and Cluster Type</p>
</div>
</li>
<li>On Additional Configuration Page you can pick VPC (virtual private connection), Security group for Cluster and other options for Encryption. For demo purpose select as below screenshot . Click Continue to review your settings and click Create Cluster
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-3-configuration.png"><img loading="lazy" decoding="async" title="Configure Redshift Cluster Encryption, VPC and Additional Detail" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-3-configuration.png" alt="Configure Redshift Cluster Encryption, VPC and Additional Detail" width="545" height="452" /></a></p>
<p class="wp-caption-text">Configure Redshift Cluster Encryption, VPC and Additional Detail</p>
</div>
</li>
<li>Give it few mins while your cluster is being created. After few minutes (5-10 mins) you can go back to same page and review cluster Status and other properties as below. Copy Cluster Endpoint to somewhere because we will need it later.
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-6-properties.png"><img loading="lazy" decoding="async" title="Check Redshift Cluster Status , Endpoint and Other Properties" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-6-properties.png" alt="Check Redshift Cluster Status , Endpoint and Other Properties" width="526" height="337" /></a></p>
<p class="wp-caption-text">Check Redshift Cluster Status , Endpoint and Other Properties</p>
</div>
</li>
</ol>
</div>
<h2><span id="Add_inbound_rule_for_Redshift_Cluster">Add inbound rule for Redshift Cluster</span></h2>
<div><em>NOTE: Skip this step if you have already added your IP to inbound exclusion rule.</em><br />
By default you cannot connect to Amazon Redshift cluster from outside AWS Network (e.g. from your On-Premises Machine). If you wish to connect then you must add inbound exception rule to allow your request to redshift cluster on specific port.</div>
<div>
<p>To add create new inbound rule perform following steps</p>
<ol>
<li>Under Redshift home page click [Security] tab. You may see following Notice depending on which region you are. Click on [Go to the EC2 Console] link or you can direct go to EC2 by clicking Services -&gt; EC2 menu at the top
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-7-security-groups.png"><img loading="lazy" decoding="async" title="Configure Security Group and Inbound Filter Firewall Rule to allow Local Connection" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-7-security-groups.png" alt="Configure Security Group and Inbound Filter Firewall Rule to allow Local Connection" width="502" height="218" /></a></p>
<p class="wp-caption-text">Configure Security Group and Inbound Filter Firewall Rule to allow Local Connection</p>
</div>
</li>
<li>On EC2 Security Groups Page select Security group attached with your Redshift Cluster and then in the bottom pane click on Inbound Tab
<div class="wp-caption alignnone">
<p><a href="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-9-security-group-inbound-rule.png"><img loading="lazy" decoding="async" title="Security Group Screen - Add or Edit Inbound Firewall Rule to allow Local Connection" src="//zappysys.com/onlinehelp/ssis-powerpack/scr/images/amazon-redshift-datatransfer-task/amazon-redshift-cluster-setup-9-security-group-inbound-rule.png" alt="Security Group Screen - Add or Edit Inbound Firewall Rule to allow Local Connection" width="662" height="394" /></a></p>
<p class="wp-caption-text">Security Group Screen – Add or Edit Inbound Firewall Rule to allow Local Connection</p>
</div>
</li>
<li>On Inbound Tab click Edit option to modify default entry or you can add new Rule. Notice how IP Range is specified.. 0.0.0.0/0 means all IP. If you wish to add range then you have to set something like this&#8230; 50.34.234.10/250  .. this will cover 50.34.234.10 to 50.34.234.250 IP range. Make sure your port range covers Port you specified for Redshift cluster.</li>
<li>Click on Add rule if you wish to add new entry else edit as below and click save</li>
</ol>
</div>
<h2>Automate Redshift Cluster Creation</h2>
<p>If you have need to automate Redshift Cluster Creation or any of the following things automatically then check <a href="//zappysys.com/products/ssis-powerpack/ssis-amazon-redshift-cluster-management-task/">Redshift Cluster management Task</a></p>
<ul>
<li>Automate Amazon Redshift Cluster Create Action in few clicks. You can also add Access Security Rule.</li>
<li>Automate Amazon Redshift Cluster Delete Action</li>
<li>Fetch Amazon Redshift Cluster Property to SSIS Variable (e.g. Fetch Cluster Status)</li>
<li>Fetch all cluster and their properties as DataTable (Use ForEach Loop and iterate through all clusters)</li>
<li>Automate Redshift Cluster Snapshot Creation</li>
<li>Automate Redshift Cluster Snapshot Delete Action</li>
<li>Support for Wait until Cluster operation is done</li>
</ul>
<p>The post <a href="https://zappysys.com/blog/how-to-setup-amazon-redshift-cluster-for-outside-data-access/">How to setup Amazon Redshift Cluster in few clicks</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Extract / Unload Redshift data into SQL Server using SSIS</title>
		<link>https://zappysys.com/blog/extract-unload-redshift-data-sql-server-using-ssis/</link>
		
		<dc:creator><![CDATA[ZappySys]]></dc:creator>
		<pubDate>Tue, 27 Oct 2015 19:55:36 +0000</pubDate>
				<category><![CDATA[AWS (Amazon Web Services)]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Redshift]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[etl]]></category>
		<category><![CDATA[extract]]></category>
		<category><![CDATA[redshift]]></category>
		<category><![CDATA[s3]]></category>
		<category><![CDATA[ssis]]></category>
		<category><![CDATA[SSIS PowerPack]]></category>
		<guid isPermaLink="false">http://zappysys.com/blog/?p=128</guid>

					<description><![CDATA[<p>Introduction In our previous article we saw how to load data into Redshift using SSIS or load data into Redshift using ZappyShell Redshift Command Line In this article we will walk through various steps to Extract/UNLOAD Redshift Data into SQL Server using Amazon S3 Storage Task and ExecuteSQL Task for Amazon Redshift. Below is the [&#8230;]</p>
<p>The post <a href="https://zappysys.com/blog/extract-unload-redshift-data-sql-server-using-ssis/">Extract / Unload Redshift data into SQL Server using SSIS</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></description>
										<content:encoded><![CDATA[<h2>Introduction</h2>
<p>In our previous article we saw <a href="//zappysys.com/posts/sql-server-to-redshift-data-load-using-ssis/">how to load data into Redshift using SSIS</a> or load data into Redshift using ZappyShell <a href="//zappysys.com/products/zappyshell/amazon-redshift-command-line-tools/">Redshift Command Line</a></p>
<p>In this article we will walk through various steps to <em>Extract/UNLOAD Redshift Data into SQL Server</em> using <a href="//zappysys.com/products/ssis-powerpack/ssis-amazon-s3-task/">Amazon S3 Storage Task</a> and <a href="//zappysys.com/products/ssis-powerpack/ssis-redshift-execute-sql-task/">ExecuteSQL Task for Amazon Redshift</a>. Below is the screenshot of actual SSIS Package to <em>Extract Redshift Data and Load into SQL Server</em></p>
<div id="attachment_164" style="width: 664px" class="wp-caption alignnone"><a href="//zappysys.com/blog/wp-content/uploads/2015/10/extract-unload-redshift-data-load-to-sql-server-ssis.png"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-164" class="size-full wp-image-164" src="//zappysys.com/blog/wp-content/uploads/2015/10/extract-unload-redshift-data-load-to-sql-server-ssis.png" alt="Extract/Unload Redshift Data using SSIS and Load into SQL Server" width="654" height="565" srcset="https://zappysys.com/blog/wp-content/uploads/2015/10/extract-unload-redshift-data-load-to-sql-server-ssis.png 654w, https://zappysys.com/blog/wp-content/uploads/2015/10/extract-unload-redshift-data-load-to-sql-server-ssis-300x259.png 300w" sizes="(max-width: 654px) 100vw, 654px" /></a><p id="caption-attachment-164" class="wp-caption-text">Extract/Unload Redshift Data using SSIS and Load into SQL Server</p></div>
<h2>Requirements for Extract Redshift Data using SSIS</h2>
<p>Before you <a href="http://docs.aws.amazon.com/redshift/latest/dg/r_UNLOAD.html">UNLOAD</a> data from Redshift, you have to make sure few things.</p>
<ol>
<li>Setup your Redshift cluster (Follow these instructions <a href="//zappysys.com/blog/how-to-setup-amazon-redshift-cluster-for-outside-data-access/">to setup redshift cluster</a>)</li>
<li>Load some sample data to Redshift (Red more here: <a href="//zappysys.com/posts/sql-server-to-redshift-data-load-using-ssis/">How to load data to Redshift</a>)</li>
<li>Make sure you have correct connection settings to connect to Redshift cluster (Host name, Port, UserId, Password, DB name etc). You can get host name from AWS Console.</li>
<li>Make sure you have Access to S3 Bucket where files will be dumped from Redshift. You will need AccessKey and SecretKey to fetch files from S3</li>
</ol>
<h2>Step-1: Execute Redshift UNLOAD Command</h2>
<p>Very first step would be to unload redshift data as GZip file using <a href="//zappysys.com/products/ssis-powerpack/ssis-redshift-execute-sql-task/">ExecuteSQL Task for Amazon Redshift</a><br />
Below is SQL Command you can use to <em>extract data from Redshift</em>. Notice how we used variable placeholders in SQL Command. These placeholders are replaced at runtime with actual value stored in specified variable.</p><pre class="crayon-plain-tag">unload ('select * from (select * from customerdata limit 1000)')
to 's3://bw-rstest/stage/custdata'
credentials 'aws_access_key_id={{User::S3Accesskey}};aws_secret_access_key={{User::S3SecretKey}}'
ALLOWOVERWRITE</pre><p>
<b>Export as GZip files (Compressed files)</b></p>
<p>If you exporting data as compressed files to save data transfer cost then use GZIP option as below.</p>
<p><strong>NOTE:</strong> Make sure there are no spaces before and after AccessKey and SecretKey otherwise you may get error.</p><pre class="crayon-plain-tag">unload ('select * from (select * from customerdata limit 1000)')
to 's3://bw-rstest/stage/custdata_file_'
credentials 'aws_access_key_id={{User::S3Accesskey}};aws_secret_access_key={{User::S3SecretKey}}'
ALLOWOVERWRITE
GZIP</pre><p>
<h3>Common Errors / Troubleshooting</h3>
<p><strong>UNLOAD command issue with Region mismatch (S3 bucket vs Redshift Cluster)</strong></p>
<p>If your S3 bucket is in different region than Redshift cluster then above command may fail with &#8220;<em>301 permanent redirect error</em>&#8221; in that case you have to change your S3 bucket region. Region can be changed in AWS console (See S3 bucket properties and change location to match region with Redshift cluster region. Both regions must be same.</p>
<blockquote><p>ERROR: XX000: S3ServiceException:The bucket you are attempting to access must be addressed using the specified endpoint. Please send all future requests to this endpoint.,Status 301,Error PermanentRedirect</p></blockquote>
<p><strong>UNLOAD command issue with accesskey and secret key</strong></p>
<p>If you specify invalid accesskey or secretkey &#8211;or&#8211; you have misspelled keywords related to credentials &#8212; or &#8212; you have spaces before or after accesskey or secret key then you may get following error.</p>
<blockquote><p>ERROR: XX000: Invalid credentials. Must be of the format: credentials &#8216;aws_iam_role=&#8230;&#8217; or &#8216;aws_access_key_id=&#8230;;aws_secret_access_key=&#8230;[;token=&#8230;].</p></blockquote>
<h2>Step-2: Download data files from Amazon S3 Bucket to local machine</h2>
<p>Once files are exported to S3 bucket we can download then to local machine using <a href="//zappysys.com/products/ssis-powerpack/ssis-amazon-s3-task/">Amazon S3 Storage Task</a></p>
<h2>Step-3: Un-compress downloaded files</h2>
<p>If you have exported Redshift data as compressed files (using GZIP option) then you can use <a href="https://zappysys.com/products/ssis-powerpack/ssis-zip-file-task/" target="_blank">ZappySys Zip File task</a> to un-compress multiple files.</p>
<p>Or you can write Script to un-compress those files (see below code). You can skip this step if files are not compressed (not used GZIP option in command).</p>
<p>Here is sample <strong>C# code</strong> to un-compress <strong>GZip</strong> files</p><pre class="crayon-plain-tag">public void Main()
{
	System.IO.DirectoryInfo directorySelected = new System.IO.DirectoryInfo(@"C:\amazon\archive");

	foreach (System.IO.FileInfo fileToDecompress in directorySelected.GetFiles("custdata*_part_*"))
	{
		Decompress(fileToDecompress);
	}

	Dts.TaskResult = (int)ScriptResults.Success;
}
private static void Decompress(System.IO.FileInfo fileToDecompress)
{
	using (System.IO.FileStream originalFileStream = fileToDecompress.OpenRead())
	{
		string currentFileName = fileToDecompress.FullName;
		string newFileName = currentFileName.Remove(currentFileName.Length - fileToDecompress.Extension.Length);

		using (System.IO.FileStream decompressedFileStream = System.IO.File.Create(newFileName))
		{
			using (System.IO.Compression.GZipStream decompressionStream = new System.IO.Compression.GZipStream(originalFileStream, System.IO.Compression.CompressionMode.Decompress))
			{
				decompressionStream.CopyTo(decompressedFileStream);
				//Console.WriteLine("Decompressed: {0}", fileToDecompress.Name);
			}
		}
	}
}</pre><p>
&nbsp;</p>
<h2>Step-4: Loop through files using ForEachLoop Container</h2>
<p>Once files downloaded from S3 bucket we can now loop through files using SSIS ForEach Loop Task and load into SQL Server (One file in each iteration)</p>
<div id="attachment_165" style="width: 705px" class="wp-caption alignnone"><a href="//zappysys.com/blog/wp-content/uploads/2015/10/ssis-loop-amazon-s3-files.png"><img loading="lazy" decoding="async" aria-describedby="caption-attachment-165" class="size-full wp-image-165" src="//zappysys.com/blog/wp-content/uploads/2015/10/ssis-loop-amazon-s3-files.png" alt="Loop through files downloaded from Amazon S3 (Exported using Redshift UNLOAD Command)" width="695" height="384" srcset="https://zappysys.com/blog/wp-content/uploads/2015/10/ssis-loop-amazon-s3-files.png 695w, https://zappysys.com/blog/wp-content/uploads/2015/10/ssis-loop-amazon-s3-files-300x166.png 300w" sizes="(max-width: 695px) 100vw, 695px" /></a><p id="caption-attachment-165" class="wp-caption-text">Loop through files downloaded from Amazon S3 (Exported using Redshift UNLOAD Command)</p></div>
<h2>Step-5: Data Flow &#8211; Load Redshift Data Files to SQL Server</h2>
<p>Inside data flow you can use Flat File source and OLEDB Destination for SQL Server. Just map correct File columns to SQL Server fields and you should be good. If needed convert Unicode/Non-unicode columns using Data Conversion Transform (This is not needed if source is DT_STR and target also DT_STR.. or source is DT_WSTR and target is DT_WSTR i.e. Unicode).</p>
<h2>Downloads</h2>
<p>To download above SSIS Package click on the below links. In order to test below package you first have to <a href="//zappysys.com/products/ssis-powerpack/">download SSIS PowerPack</a><br />
<a href="//zappysys.com/blog/wp-content/uploads/2015/11/RedshiftExtractDemo_2008.zip">Download Demo SSIS Package &#8211; SSIS 2008</a><br />
<a href="//zappysys.com/blog/wp-content/uploads/2015/11/RedshiftExtractDemo_2012.zip">Download Demo SSIS Package &#8211; SSIS 2012/2014</a></p>
<h2>Conclusion</h2>
<p>amazon Redshift is great way to start your data warehouse projects with very minimum investment in a very simple pay as you go model but loading or unloading data from redshift can be challenging task. Using <a href="//zappysys.com/products/ssis-powerpack/">SSIS PowerPack</a> you can perform Redshift data load or unload in few clicks.</p>
<p>The post <a href="https://zappysys.com/blog/extract-unload-redshift-data-sql-server-using-ssis/">Extract / Unload Redshift data into SQL Server using SSIS</a> appeared first on <a href="https://zappysys.com/blog">ZappySys Blog</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
