Monday, December 7, 2015

Data Flow Components

Data Flow Task is made up of different set of sources, transformations and destinations as available in the Toolbox / SSIS Toolbox. In addition to these, Data Flow Task has paths to connect different components. 


In the following articles we will be describing with examples different sources, followed by different transformations, finally different destinations and their properties. We will also discuss about Data Viewers.


Data flow sources


ADO NET Source, Excel Source, Flat File Source, OLE DB Source, Raw File Source, XML Source. CDC Source and ODBC Source are new Data Flow Source added in SSDT.


Data flow transformations: Transformations can be further categorized based on the usage:


a. Business Intelligence Transformations: Perform business intelligence operations such as cleaning data, mining text, and running data mining prediction queries.    


Slowly Changing Dimension, Fuzzy Grouping, Fuzzy Lookup, Term Extraction, Term Lookup, Data Mining Query. DQS Cleansing is new addition to SSDT.


b. Row Transformations: Update column values and create new columns. The transformation is applied to each row in the transformation input.


Character Map, Copy Column, Data Conversion, Derived Column, Export Column,  Import Column, Script Component, OLE DB Command


c. Rowset Transformations: Create new rowsets. The rowset can include aggregate and sorted values, sample rowsets, or pivoted and unpivoted rowsets.


Aggregate, Sort, Percentage Sampling, Row Sampling, Pivot, Unpivot


d. Split and Join Transformations: Distribute rows to different outputs, create copies of the transformation inputs, join multiple inputs into one output, and perform lookup operations.


Conditional SplitLookup, Merge, Merge Join, Multicast, Cache Transform, Union All. CDC Splitter is new addition to SSDT.


e. Auditing Transformations: Transformations to add audit information and count rows.


Audit, Row Count


Data flow destinations


ADO NET Destination, Data Mining Model Training, DataReader Destination, Dimension Processing, Excel Destination, Flat File Destination, OLE DB Destination, Partition Processing,  Raw File Destination, Recordset Destination, SQL Server Compact Destination, SQL Server Destination. ODBC Destination is new addition to SSDT.


Click for NEXT article in the sequence.


Please feel free to correct me by commenting your feedback and suggestions.

No comments:

Post a Comment