Solved

SSIS: Working with Complex Flat File

Posted on 2009-07-08
5
817 Views
Last Modified: 2013-11-10
I am very new in working with SSIS.  I would like to create a package that would take a series of flat files and convert some of the information into a new delimited flat file.  Flat file to flat file; I don't need to import into or extract out of a database.

Below is a small sample of what the source file looks like.  As you can see, this file is broken down into sections; there are header records (noted by '[' and ']'), and detail rows underneath the header record.  However, some header records contain data that corresponds with the header record itself.  For example, the 'ECONV' header record contains a 'GLLoadEnt' record with detail following it.  The rows beneath this header record correspond with the 'GLLoadEnt' record.  Theoretically, there could be multiple 'ECONV' header records, and the rows following each header record all correspond with the preceding header.

Columns are delimited using the '!' character.

I have written VB.NET code that takes this flat file and can import the information into a database, but I am researching the possibility of using SSIS.  I would appreciate any advice as to my approach.  If this is going to be too complex for SSIS, I would want to know that.

Any and all help would be greatly appreciated!
[ORGANIZATION]
Geographic          !Geographic Consolidation                !MAXIMUM             !Geographical        
Function            !Function Consolidation                  !MAXIMUM             !Function            
LC                  !Local Currency                          !MAXIMUM             !LCTop               
 
 
[ECONV=GLLoadEnt           !GL Load for Entities                    !MAXIMUM             !L]
 
100                                                           !EastSales.Input                          
110                                                           !EastProduction.Input                     
120                                                           !EastAdmin.Input                          
200                                                           !WestSales.Input                          
210                                                           !WestProduction.Input                     
220                                                           !WestAdmin.Input                          
 
 
[ENTITYLIST=AllEnt              !All Entitities w/o Subs                 !MAXIMUM             !D]
 
ORG=<ALL>
ENTITY=<ALL>
ALLDEP=Y
IMMDEP=Y
PARENT=Y
BASE=Y
JOURNAL=N
ELIM=N
INTERCO=N
DUPLICATE=N
CODE=<ALL>
CURRENCY=<ALL>
SUBSTRUCTURE=<NONE>
SUBENTITY=<ALL>
CHART=<ALL>
CONSOL=<ALL>
TRANS=<ALL>

Open in new window

0
Comment
Question by:Donovan Moore
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
  • 2
5 Comments
 
LVL 75

Expert Comment

by:Anthony Perkins
ID: 24810627
>>If this is going to be too complex for SSIS, I would want to know that.<<
Yes it is.  You would have to virtually duplicate the code you have in your VB.NET code and there would be no advantage to that.
0
 
LVL 15

Accepted Solution

by:
rob_farley earned 500 total points
ID: 24820149
You'd need to have a data flow which pulled the data out in a long string per row, and then have a Script Component containing your transformation like what you have in VB.Net now.

Although - this could be okay, because you'd end up not needing your VB.Net. Because you'd use the Script component, most of your logic would just go straight across, only changing the way that you output the rows from the component (ie, SomeOutputBuffer.AddRow() instead of pushing it into a database). You can have a large number of different outputs too, which may suit your needs if required.

There are few things you would want to consider... such as telling the data that it's sorted (even though it's not). That will stop SSIS from using parallelism. You should also make sure that the Output of your Script component isn't sync'd to the Input, so that you can control the Output from your component better.

So I would consider SSIS a nicer option than using VB.Net to populate a database before just pulling it straight out.

Rob
0
 

Author Closing Comment

by:Donovan Moore
ID: 31601333
Thanks for the advice.  I think I might go in another direction with this...I'm going to open a new question that outlines a new approach, and I will need honest feedback.
0
 
LVL 15

Expert Comment

by:rob_farley
ID: 24854929
Sure. When you do, can you post the question link in here so that I find it?

Rob
0
 
LVL 75

Expert Comment

by:Anthony Perkins
ID: 24855049
0

Featured Post

Complete VMware vSphere® ESX(i) & Hyper-V Backup

Capture your entire system, including the host, with patented disk imaging integrated with VMware VADP / Microsoft VSS and RCT. RTOs is as low as 15 seconds with Acronis Active Restore™. You can enjoy unlimited P2V/V2V migrations from any source (even from a different hypervisor)

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In this article we will get to know that how can we recover deleted data if it happens accidently. We really can recover deleted rows if we know the time when data is deleted by using the transaction log.
For both online and offline retail, the cross-channel business is the most recent pattern in the B2C trade space.
Via a live example combined with referencing Books Online, show some of the information that can be extracted from the Catalog Views in SQL Server.
Viewers will learn how to use the SELECT statement in SQL and will be exposed to the many uses the SELECT statement has.

732 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question