Search Suggest

Creating BIML Script Component Transformation (rownumber)

Case
I want to add a Script Component transformation to my bimlscript to add a rownumber functionality to my packages.

Solution
For this example I will continue with an existing BIML example. Note the target in this example is an OLE DB destination that supports an identity column. Use your own destination like Excel, Flat File or PDW that doesn't supports identity columns.
Script Component Transformation Rownumber


















Above the <packages>-tag we are adding a <ScriptProjects>-tag where we define the Script Component code, including references, variables, input columns and output columns. In the <Transformations>-tag (Data Flow Task) we only reference to this Script Project.

The script code within the BIML script is aligned to the left to get a neat Script Component script layout. Otherwise you get a lot of ugly white space.


<Biml xmlns="http://schemas.varigence.com/biml.xsd">
<Annotations>
<Annotation>
File: Script Component Transformation RowNumber.biml
Description: Example of using the Script Component as
a transformation to add a rownumber to the destination.
Note: Example has an OLE DB Destination that supports
an identity column. Use your own Flat File, Excel or
PDW destination that doesn't supports an identity.
VS2012 BIDS Helper 1.6.6.0
By Joost van Rossum http://server.hoit.asia
</Annotation>
</Annotations>

<!--Package connection managers-->
<Connections>
<OleDbConnection
Name="Source"
ConnectionString="Data Source=.;Initial Catalog=ssisjoostS;Provider=SQLNCLI11.1;Integrated Security=SSPI;Auto Translate=False;">
</OleDbConnection>
<OleDbConnection
Name="Destination"
ConnectionString="Data Source=.;Initial Catalog=ssisjoostD;Provider=SQLNCLI11.1;Integrated Security=SSPI;Auto Translate=False;">
</OleDbConnection>
</Connections>

<ScriptProjects>
<ScriptComponentProject ProjectCoreName="sc_c253bef215bf4d6b85dbe3919c35c167.csproj" Name="SCR - Rownumber">
<AssemblyReferences>
<AssemblyReference AssemblyPath="Microsoft.SqlServer.DTSPipelineWrap" />
<AssemblyReference AssemblyPath="Microsoft.SqlServer.DTSRuntimeWrap" />
<AssemblyReference AssemblyPath="Microsoft.SqlServer.PipelineHost" />
<AssemblyReference AssemblyPath="Microsoft.SqlServer.TxScript" />
<AssemblyReference AssemblyPath="System.dll" />
<AssemblyReference AssemblyPath="System.AddIn.dll" />
<AssemblyReference AssemblyPath="System.Data.dll" />
<AssemblyReference AssemblyPath="System.Xml.dll" />
</AssemblyReferences>
<ReadOnlyVariables>
<Variable VariableName="maxrownumber" Namespace="User" DataType="Int32"></Variable>
</ReadOnlyVariables>
<Files>
<!-- Left alignment of .Net script to get a neat layout in package-->
<File Path="AssemblyInfo.cs">
using System.Reflection;
using System.Runtime.CompilerServices;

//
// General Information about an assembly is controlled through the following
// set of attributes. Change these attribute values to modify the information
// associated with an assembly.
//
[assembly: AssemblyTitle("SC_977e21e288ea4faaaa4e6b2ad2cd125d")]
[assembly: AssemblyDescription("")]
[assembly: AssemblyConfiguration("")]
[assembly: AssemblyCompany("SSISJoost")]
[assembly: AssemblyProduct("SC_977e21e288ea4faaaa4e6b2ad2cd125d")]
[assembly: AssemblyCopyright("Copyright @ SSISJoost 2015")]
[assembly: AssemblyTrademark("")]
[assembly: AssemblyCulture("")]
//
// Version information for an assembly consists of the following four values:
//
// Major Version
// Minor Version
// Build Number
// Revision
//
// You can specify all the values or you can default the Revision and Build Numbers
// by using the '*' as shown below:

[assembly: AssemblyVersion("1.0.*")]
</File>
<!-- Replaced greater/less than by &gt; and &lt; -->
<File Path="main.cs">#region Namespaces
using System;
using System.Data;
using Microsoft.SqlServer.Dts.Pipeline.Wrapper;
using Microsoft.SqlServer.Dts.Runtime.Wrapper;
#endregion

/// &lt;summary&gt;
/// Rownumber transformation to create an identity column
/// &lt;/summary&gt;
[Microsoft.SqlServer.Dts.Pipeline.SSISScriptComponentEntryPointAttribute]
public class ScriptMain : UserComponent
{
int rownumber = 0;

/// &lt;summary&gt;
/// Get max rownumber from variable
/// &lt;/summary&gt;
public override void PreExecute()
{
rownumber = this.Variables.maxrownumber;
}

/// &lt;summary&gt;
/// Increase rownumber and fill rownumber column
/// &lt;/summary&gt;
/// &lt;param name="Row"&gt;The row that is currently passing through the component&lt;/param&gt;
public override void Input0_ProcessInputRow(Input0Buffer Row)
{
rownumber++;
Row.rownumber = rownumber;
}
}
</File>
</Files>
<InputBuffer Name="Input0">
<Columns>
</Columns>
</InputBuffer>
<OutputBuffers>
<OutputBuffer Name="Output0">
<Columns>
<Column Name="rownumber" DataType="Int32"></Column>
</Columns>
</OutputBuffer>
</OutputBuffers>
</ScriptComponentProject>
</ScriptProjects>

<Packages>
<!--A query to get all tables from a certain database and loop through that collection-->
<# string sConn = @"Provider=SQLNCLI11.1;Server=.;Initial Catalog=ssisjoostS;Integrated Security=SSPI;";#>
<# string sSQL = "SELECT name as TableName FROM dbo.sysobjects where xtype = 'U' and category = 0 ORDER BY name";#>
<# DataTable tblAllTables = ExternalDataAccess.GetDataTable(sConn,sSQL);#>
<# foreach (DataRow row in tblAllTables.Rows) { #>

<!--Create a package for each table and use the tablename in the packagename-->
<Package ProtectionLevel="DontSaveSensitive" ConstraintMode="Parallel" AutoCreateConfigurationsType="None" Name="ssisjoost_<#=row["TableName"]#>">
<Variables>
<Variable Name="maxrownumber" DataType="Int32">0</Variable>
</Variables>

<!--The tasks of my control flow: get max rownumber and a data flow task-->
<Tasks>
<!--Execute SQL Task to get max rownumber from destination-->
<ExecuteSQL
Name="SQL - Get max rownumber <#=row["TableName"]#>"
ConnectionName="Destination"
ResultSet="SingleRow">
<DirectInput>SELECT ISNULL(max([rownumber]),0) as maxrownumber FROM <#=row["TableName"]#></DirectInput>
<Results>
<Result Name="0" VariableName="User.maxrownumber" />
</Results>
</ExecuteSQL>

<!--Data Flow Task to fill the destination table-->
<Dataflow Name="DFT - Process <#=row["TableName"]#>">
<!--Connect it to the preceding Execute SQL Task-->
<PrecedenceConstraints>
<Inputs>
<Input OutputPathName="SQL - Get max rownumber <#=row["TableName"]#>.Output"></Input>
</Inputs>
</PrecedenceConstraints>

<Transformations>
<!--My source with dynamic, but ugly * which could be replace by some .NET/SQL code retrieving the columnnames-->
<OleDbSource Name="OLE_SRC - <#=row["TableName"]#>" ConnectionName="Source">
<DirectInput>SELECT * FROM <#=row["TableName"]#></DirectInput>
</OleDbSource>

<ScriptComponentTransformation Name="SCR - Rownumber">
<ScriptComponentProjectReference ScriptComponentProjectName="SCR - Rownumber" />
</ScriptComponentTransformation>

<!--My destination with no column mapping because all source columns exist in destination table-->
<OleDbDestination Name="OLE_DST - <#=row["TableName"]#>" ConnectionName="Destination">
<ExternalTableOutput Table="<#=row["TableName"]#>"></ExternalTableOutput>
</OleDbDestination>
</Transformations>
</Dataflow>
</Tasks>
</Package>
<# } #>
</Packages>
</Biml>

<!--Includes/Imports for C#-->
<#@ template language="C#" hostspecific="true"#>
<#@ import namespace="System.Data"#>
<#@ import namespace="System.Data.SqlClient"#>


The result
After generating the package with the Script Component we have a neat script for adding the rownumber.
Row number script

Post a Comment