Lesson 1: Creating the Project and Basic Package

 

In this lesson, you will create a simple ETL package that extracts data from a single flat file source, transforms the data using two lookup transformation components, and writes that data to the FactCurrency fact table in AdventureWorksDW2012. As part of this lesson, you will learn how to create new packages, add and configure data source and destination connections, and work with new control flow and data flow components.

Important


This tutorial requires the AdventureWorksDW2012 sample database. For more information on installing and deploying AdventureWorksDW2012, see Reporting Services Product Samples on CodePlex.

This tutorial requires Microsoft SQL Server Data Tools.

For more information on installing SQL Server Data Tools, see SQL Server Data Tools Download.

Before creating a package, you need a good understanding of the formatting used in both the source data and the destination. Once you understand both of these data formats, you will be ready to define the transformations necessary to map the source data to the destination.

Looking at the Source

For this tutorial, the source data is a set of historical currency data contained in the flat file, SampleCurrencyData.txt. The source data has the following four columns: the average rate of the currency, a currency key, a date key, and the end-of-day rate.

Here is an example of the source data contained in the SampleCurrencyData.txt file:

1.00070049    USD    9/3/05 0:00     1.001201442
1.00020004    USD    9/4/05 0:00     1
1.00020004    USD    9/5/05 0:00     1.001201442
1.00020004    USD    9/6/05 0:00     1
1.00020004    USD    9/7/05 0:00     1.00070049
1.00070049    USD    9/8/05 0:00     0.99980004
1.00070049    USD    9/9/05 0:00     1.001502253
1.00070049    USD    9/10/05 0:00    0.99990001
1.00020004    USD    9/11/05 0:00    1.001101211
1.00020004    USD    9/12/05 0:00    0.99970009

When working with flat file source data, it is important to understand how the Flat File connection manager interprets the flat file data. If the flat file source is Unicode, the Flat File connection manager defines all columns as [DT_WSTR] with a default column width of 50. If the flat file source is ANSI-encoded, the columns are defined as [DT_STR] with a column width of 50. You will probably have to change these defaults to make the column types more appropriate for your data. To do this, look at the data type of each destination column that the data will be written to, and then choose the matching type in the Flat File connection manager.
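The idea that every flat file field starts out as text and has to be given a more specific type can be illustrated outside of Integration Services. The following Python sketch is not part of the tutorial or the package; it assumes the columns are tab-delimited (the lesson text does not state the delimiter), and the field names average_rate, currency_code, currency_date, and end_of_day_rate are hypothetical labels for the four source columns.

```python
from datetime import datetime

# Assumption: fields are separated by tabs; the lesson does not state the delimiter.
SAMPLE_ROW = "1.00070049\tUSD\t9/3/05 0:00\t1.001201442"


def parse_row(line, delimiter="\t"):
    """Split one source row and convert each text field to a typed value."""
    # Every field is a plain string at this point, much like the default
    # string column types described above.
    average_rate, currency_code, currency_date, end_of_day_rate = (
        line.rstrip("\n").split(delimiter)
    )
    return {
        "average_rate": float(average_rate),        # destination type: float
        "currency_code": currency_code,             # three-character code, stays text
        "currency_date": datetime.strptime(currency_date, "%m/%d/%y %H:%M"),
        "end_of_day_rate": float(end_of_day_rate),  # destination type: float
    }


row = parse_row(SAMPLE_ROW)
```

In the package itself, the equivalent step is choosing column data types in the Flat File connection manager rather than converting values in code.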

Looking at the Destination

The ultimate destination for the source data is the FactCurrency fact table in AdventureWorksDW2012. The FactCurrency fact table has four columns and has relationships to two dimension tables, as shown in the following table.

Column Name    Data Type   Lookup Table   Lookup Column
AverageRate    float       None           None
CurrencyKey    int (FK)    DimCurrency    CurrencyKey (PK)
DateKey        int (FK)    DimDate        DateKey (PK)
EndOfDayRate   float       None           None

Mapping Source Data to be Compatible with the Destination

Analysis of the source and destination data formats indicates that lookups will be necessary for the CurrencyKey and DateKey values. The transformations that perform these lookups obtain the CurrencyKey and DateKey values by using the alternate keys from the DimCurrency and DimDate dimension tables. The following table shows how each flat file column maps to a table and column.

Flat File Column   Table Name     Column Name            Data Type
0                  FactCurrency   AverageRate            float
1                  DimCurrency    CurrencyAlternateKey   nchar (3)
2                  DimDate        FullDateAlternateKey   date
3                  FactCurrency   EndOfDayRate           float
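To make the mapping concrete, the following Python sketch mimics what the two lookups do conceptually: each alternate key from the source row is exchanged for the key stored in the corresponding dimension table. It is an illustration only, not how the Lookup transformation is implemented. The dictionaries stand in for DimCurrency and DimDate, and the key values 100 and 20050903 are invented for the example rather than taken from AdventureWorksDW2012.

```python
from datetime import date

# Hypothetical stand-ins for the dimension tables; the key values are made up.
dim_currency = {"USD": 100}                # CurrencyAlternateKey -> CurrencyKey
dim_date = {date(2005, 9, 3): 20050903}    # FullDateAlternateKey -> DateKey


def to_fact_row(average_rate, currency_code, full_date, end_of_day_rate):
    """Build a FactCurrency-shaped row by resolving both alternate keys."""
    return {
        "AverageRate": average_rate,
        "CurrencyKey": dim_currency[currency_code],  # lookup against DimCurrency
        "DateKey": dim_date[full_date],              # lookup against DimDate
        "EndOfDayRate": end_of_day_rate,
    }


fact = to_fact_row(1.00070049, "USD", date(2005, 9, 3), 1.001201442)
```

In this sketch a source value with no matching dimension entry raises a KeyError; how the package should treat unmatched rows is outside the scope of this overview.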

This lesson contains the following tasks:

Step 1: Creating a New Integration Services Project
