Solved

SSIS Data Flow Task

Posted on 2009-04-07
Last Modified: 2013-11-10
Hi experts,

I have an SSIS package which takes rows from an Excel sheet and inserts them into a SQL DB.
The table I want to insert into has a primary key which is calculated by a complex formula. This formula does not always create a unique value, so I get an error in my SSIS package.
Unfortunately, I can't change that formula. So my only option is to retry the insert until no error is thrown (that is, until the generated value is unique).

But how do I implement this process in SSIS?

Can anyone help?
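
The retry idea described above can be sketched outside SSIS. This is only a minimal illustration in Python, with an in-memory SQLite table standing in for the real SQL Server table. The table name `target`, the function `insert_with_retry` and the tiny key generator are all hypothetical and not part of the original package.

```python
import random
import sqlite3


def insert_with_retry(conn, payload, make_key, max_attempts=100):
    """Insert one row, regenerating the key until the insert succeeds."""
    for _ in range(max_attempts):
        key = make_key()
        try:
            with conn:  # commits on success, rolls back on error
                conn.execute(
                    "INSERT INTO target (id, payload) VALUES (?, ?)",
                    (key, payload),
                )
            return key
        except sqlite3.IntegrityError:
            continue  # duplicate key: try again with a fresh value
    raise RuntimeError(f"no unique key found after {max_attempts} attempts")


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, payload TEXT)")

# Hypothetical key generator with a deliberately tiny range to force collisions.
rng = random.Random(0)
keys = [
    insert_with_retry(conn, f"row {i}", lambda: rng.randrange(20))
    for i in range(10)
]
```

The attempt limit is a design choice: without it, a key generator that keeps returning the same value would loop forever.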

Thanks a lot
Question by:arthrex
14 Comments
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24085410
Try using a Lookup transformation that checks whether the PK already exists in the database. If it is not found, insert the row; otherwise ignore the row or try to create another key.
Did that help?
regards,
Pedro
www.pedrocgd.blogspot.com
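
The lookup idea can be sketched roughly as follows. This is a Python illustration against an in-memory SQLite table, not SSIS itself. The table `target` and the function `split_by_lookup` are hypothetical names. Tracking keys already seen inside the same batch is an extra step added in this sketch and is not something the comment above describes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, payload TEXT)")
conn.execute("INSERT INTO target VALUES (1, 'already there')")


def split_by_lookup(conn, rows):
    """Split incoming (id, payload) rows into new rows and rows whose id exists."""
    existing = {r[0] for r in conn.execute("SELECT id FROM target")}
    new_rows, matched = [], []
    for key, payload in rows:
        if key in existing:
            matched.append((key, payload))
        else:
            new_rows.append((key, payload))
            existing.add(key)  # also catches duplicates inside the batch itself
    return new_rows, matched


incoming = [(1, "a"), (2, "b"), (2, "c"), (3, "d")]
new_rows, matched = split_by_lookup(conn, incoming)
conn.executemany("INSERT INTO target VALUES (?, ?)", new_rows)
conn.commit()
```

Rows in `matched` are the ones that would be ignored or sent back for a new key.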
 

Author Comment

by:arthrex
ID: 24085699
Sounds good.
This is the formula that is used to create the PK:
((1000000000)*rand((datepart(month,getdate())*(100000)+datepart(second,getdate())*(1000))+datepart(millisecond,getdate())))
So I tried to use this expression in the "Derived Column Transformation", but I can't use the random function there.
How would you do it?

Thanks for your help
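
One thing worth noting about the formula above: the seed passed to rand() uses only the month, second and millisecond, so two timestamps that differ only in day, hour or minute produce the same seed. In SQL Server, repeated calls of RAND with the same seed return the same result, so such timestamps also produce the same key. A small Python sketch of just the seed arithmetic (the function name `seed_from` is made up for illustration):

```python
from datetime import datetime


def seed_from(ts: datetime) -> int:
    """Mirror the seed expression: month*100000 + second*1000 + millisecond."""
    return ts.month * 100000 + ts.second * 1000 + ts.microsecond // 1000


a = datetime(2009, 4, 7, 10, 15, 30, 123000)
b = datetime(2009, 4, 9, 11, 47, 30, 123000)  # other day, hour and minute
same_seed = seed_from(a) == seed_from(b)

# Upper bound on distinct seeds: 12 months * 60 seconds * 1000 milliseconds.
max_distinct_seeds = 12 * 60 * 1000
```

Here both timestamps give the seed 430123, and the formula can never produce more than 720000 different seeds in total.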
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24086215
But you never create a duplicate PK with your expression... If you use year, month, day, minute, second and millisecond, you always have unique values. Use a format with 2 digits for each part of the date and 4 for the year:

20090207_08103918
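
A key in the format shown above could be built like this. This is a Python sketch only. Reading the last two digits as hundredths of a second is an assumption, and `timestamp_key` is a hypothetical name.

```python
from datetime import datetime


def timestamp_key(ts: datetime) -> str:
    """Format as yyyyMMdd_HHmmss plus two digits of hundredths of a second."""
    return ts.strftime("%Y%m%d_%H%M%S") + f"{ts.microsecond // 10000:02d}"


key = timestamp_key(datetime(2009, 2, 7, 8, 10, 39, 180000))

# Two rows stamped within the same hundredth of a second still share a key.
k1 = timestamp_key(datetime(2009, 2, 7, 8, 10, 39, 181000))
k2 = timestamp_key(datetime(2009, 2, 7, 8, 10, 39, 189000))
```

Under that assumption `key` is "20090207_08103918", and `k1` equals `k2`, so a key like this is only unique as long as rows arrive further apart than its resolution.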
 

Author Comment

by:arthrex
ID: 24097121
Hello Pedro,

I think I know what the problem is now. Check the attachment to see my data flow task.
The excel source rows are pulled through the orchestration all at the same time. I can see that in the data viewer.
So I guess the OLE DB destination task tries to do a bulk insert and creates just one PK for all the rows.
I tried to limit the "Rows per batch" to 1 and the "Maximum insert commit size" to 1 but it didn't help.

I still get the "duplicate PK" Error message.

Do you have any suggestions?

Thanks
dataflowtask.png
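
The guess above (one key for the whole batch) can be illustrated with a short simulation. This is a Python sketch under the assumption that the timestamp is read once for the batch. `random.Random(seed)` only stands in for T-SQL RAND(seed), so the actual numbers differ from what SQL Server would return, and `key_from` is a hypothetical name.

```python
import random
from datetime import datetime


def key_from(ts: datetime) -> int:
    """Seeded key in the style of the PK formula (stand-in for RAND(seed))."""
    seed = ts.month * 100000 + ts.second * 1000 + ts.microsecond // 1000
    return int(1_000_000_000 * random.Random(seed).random())


batch_time = datetime(2009, 4, 8, 9, 30, 15, 250000)
rows = ["row 1", "row 2", "row 3"]

# Timestamp read once for the whole batch: every row gets the same key.
batch_keys = [key_from(batch_time) for _ in rows]
```

All entries of `batch_keys` are identical, which would explain the "duplicate PK" error as soon as the second row is inserted.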
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24097350
Could you send me the package and the Excel file?
regards,
Pedro
 
LVL 30

Expert Comment

by:nmcdermaid
ID: 24104016
Why aren't you using an IDENTITY or GUID for your primary key? You should never use a timestamp to try to create a unique key - hardware will always catch up with you eventually.
 

Author Comment

by:arthrex
ID: 24104448
As I already said, I didn't create the DB and I cannot just change the primary key calculation.
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24106206
arthrex,
What you need is to create a unique value for the primary key during the data flow, correct?
Cheers
Pedro
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24106224
Check attached image
SSIS-Interface.JPG
 

Author Comment

by:arthrex
ID: 24106465
Hi Pedro,

well, that's one option.
Originally, I wanted to leave the primary key out and let the database table deal with it.
Since this didn't work, I tried to generate a key with this formula
((1000000000)*rand((datepart(month,getdate())*(100000)+datepart(second,getdate())*(1000))+datepart(millisecond,getdate())))
inside a script component and save the calculated value to a column or variable (That's why I had this primaryKey row there). But it didn't work with the script component.

Do you think you can create the script component for me?
That'd be awesome.

Thanks for the great support!
 
LVL 17

Expert Comment

by:HoggZilla
ID: 24107284
Did you try using an Error Output on the insert, so that if the insert fails the row is just written to an error table? That would keep your package from ending in error.
 

Author Comment

by:arthrex
ID: 24107304
Hi HoggZilla,

The problem is that the error occurs in the destination DB shape.
And, as its name says, it's the destination.
If an error occurs there, you can't redirect to another shape.

 
LVL 17

Expert Comment

by:HoggZilla
ID: 24107652
From an OLE DB Destination you can connect the red arrow to an error output. What am I missing?
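
The error-output idea, where failed rows are redirected instead of failing the whole load, looks roughly like this outside SSIS. This is a Python sketch with in-memory SQLite tables. The names `target`, `load_errors` and `load_with_error_output` are hypothetical, and inserting row by row is an assumption of the sketch.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, payload TEXT)")
conn.execute("CREATE TABLE load_errors (id INTEGER, payload TEXT, reason TEXT)")


def load_with_error_output(conn, rows):
    """Insert rows one by one; rows that violate a constraint go to load_errors."""
    for key, payload in rows:
        try:
            with conn:  # commits on success, rolls back on error
                conn.execute("INSERT INTO target VALUES (?, ?)", (key, payload))
        except sqlite3.IntegrityError as exc:
            with conn:
                conn.execute(
                    "INSERT INTO load_errors VALUES (?, ?, ?)",
                    (key, payload, str(exc)),
                )


load_with_error_output(conn, [(1, "a"), (1, "b"), (2, "c")])
loaded = conn.execute("SELECT COUNT(*) FROM target").fetchone()[0]
rejected = conn.execute("SELECT id, payload FROM load_errors").fetchall()
```

The rows collected in `load_errors` could then be given a new key and retried in a second pass.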
 

Accepted Solution

by:arthrex
ID: 24867601
I couldn't solve it in SSIS.
So I just wrote a program that did all that.