In this article, we will give a brief introduction of Hadoop and how it is integrated with SQL Server. Then, we will illustrate how to connect to the Hadoop cluster on-premises using the SSIS Hadoop connection manager and the related tasks.
Importing and Exporting data using SSIS Hadoop components
SQL Shack
In this article, we will briefly explain the Avro and ORC Big Data file formats. Then, we will be talking about Hadoop data flow task components and how to use them to import and export data into the Hadoop cluster. Then we will compare those Hadoop components with the Hadoop...
[Read More]
Installing Apache Pig 0.17.0 on Windows 10
Towards Data Science
Installing Apache Hive 3.1.2 on Windows 10
Towards Data Science
While working on a project two years ago, I wrote a step-by-step guide to install Hadoop 3.1.0 on Ubuntu 16.04 operating system. Since we are currently working on a new project where we need to install a Hadoop cluster on Windows 10, I decided to write a guide for this...
[Read More]
SQL Server JSON functions, a bridge between NoSQL and relational worlds
SQL Shack
Installing Hadoop 3.2.1 Single node cluster on Windows 10
Towards Data Science
Created a Nuget package for SchemaMapper class library
GitHub
An overview of the DTExec utility in SSIS
SQL Shack
DTExec is a tool used to configure and execute SQL Server Integration Services (SSIS) packages. Many developers and database administrators use this tool, especially when automating the package execution using third-party applications or non-supported programming languages. This article will introduce this tool to you, also help you understand how and...
[Read More]
Biml alternatives: Building SSIS packages programmatically using EzAPI
SQL Shack
Biml alternatives: Building SSIS packages programmatically using ManagedDTS
SQL Shack
An overview of the SSIS FTP Task
SQL Shack
Using external C# script files within Biml
SQL Shack
Monitoring activities using sp_WhoIsActive in SQL Server
SQL Shack
Extending Biml with C# scripts
SQL Shack
Converting SSIS packages to Biml scripts
SQL Shack
Using Biml scripts to generate SSIS packages
SQL Shack
SSIS: Execute T-SQL Statement Task vs Execute SQL Task
SQL Shack
How to get GraphLayout XML from programmatically generated SSIS package?
Stack Overflow
Getting started with Biml
SQL Shack
Manipulating SSAS OLAP cubes using SSIS
SQL Shack
SQL Server functions for converting a String to a Date
SQL Shack
A SQL join on multiple tables: overview and implementation
SQL Shack
SQL join multiple tables is one of the most popular types of statements executed while handling relational databases. As known, there are five types of join operations: Inner, Left, Right, Full and Cross joins. In this article, we will explain the meaning of Joins in SQL, we will describe each...
[Read More]
An overview of the SQL cursor @@FETCH_STATUS function
SQL Shack
SQL cursor is one of the most popular database objects. It is used to retrieve data from the result set of an SQL query one row at a time. Even if the cursor is not recommended from a performance perspective, they are still widely used especially when handling a small...
[Read More]
Yet another bunch of SQL string functions
SQL Shack
Linking relational databases with OLAP cubes
SQL Shack
One of the most asked questions is how to link an OLAP cube with a relational database, or in another way how to join the result of an MDX query with a table stored in a relational database. In this article, we will illustrate how to link a SQL Server...
[Read More]