Solved

SSIS Data Flow Task

Posted on 2009-04-07
14
612 Views
Last Modified: 2013-11-10
Hi experts,

I have an SSIS package which takes rows from an excel sheet and inserts them into a SQL DB.
The table where I want to do the inserts has a primary key which is calculated by a complex formula. This formula not always creates a unique value therefore I get an error in my SSIS package.
Unfortunately, I can't change that formula. So my only chance is to try the insert for so long until no error is thrown (the created value is unique).

But how do I implement this process in SSI?

Can anyone help?

Thanks a lot
0
Comment
Question by:arthrex
  • 6
  • 5
  • 2
  • +1
14 Comments
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24085410
Try using a lookup transform that checks for existence of the PK in database... if not found insert, else ignore or try to create another key...
helped?
regards,
Pedro
www.pedrocgd.blogspot.com
0
 

Author Comment

by:arthrex
ID: 24085699
Sounds good.
This is the formula that is used to create the PK:
((1000000000)*rand((datepart(month,getdate())*(100000)+datepart(second,getdate())*(1000))+datepart(millisecond,getdate())))
So I tried to use this expression in the "Derived Column Transformation", but I can't use the random function there.
How would you do it?

Thanks for your help
0
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24086215
but you never create duplicate PK with your expression... if you use year, month day, minute, second and milisec you always have unique values... use the format with 2 values for each part of date and 4 for year...

20090207_08103918
0
PRTG Network Monitor: Intuitive Network Monitoring

Network Monitoring is essential to ensure that computer systems and network devices are running. Use PRTG to monitor LANs, servers, websites, applications and devices, bandwidth, virtual environments, remote systems, IoT, and many more. PRTG is easy to set up & use.

 

Author Comment

by:arthrex
ID: 24097121
Hello Pedro,

I think I know what the problem is now. Check the attachment to see my data flow task.
The excel source rows are pulled through the orchestration all at the same time. I can see that in the data viewer.
So I guess the OLE DB destination task tries to do a bulk insert and creates just one PK for all the rows.
I tried to limit the "Rows per batch" to 1 and the "Maximum insert commit size" to 1 but it didn't help.

I still get the "duplicate PK" Error message.

Do have any suggestions?

Thanks
dataflowtask.png
0
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24097350
could you send me the package and excel file?!
regards,
Pedro
0
 
LVL 30

Expert Comment

by:nmcdermaid
ID: 24104016
Why aren't you using a IDENTITY or GUID for your primary key? You should never use a timestamp to try and create a unique key - hardware will always catch up with you eventually.
0
 

Author Comment

by:arthrex
ID: 24104448
As I already said,
I didn't create the DB and I cannot just change the primary key calculation
0
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24106206
arthrex,
What you need is to create a unique with during the data flow for primery key, correct?
Cheers
Pedro
0
 
LVL 22

Expert Comment

by:PedroCGD
ID: 24106224
Check attached image
SSIS-Interface.JPG
0
 

Author Comment

by:arthrex
ID: 24106465
Hi Pedro,

well, that's one option.
Originally, I wanted to leave the primary key out and let the database table deal with it.
Since this didn't work, I tried to generate a key with this formula
((1000000000)*rand((datepart(month,getdate())*(100000)+datepart(second,getdate())*(1000))+datepart(millisecond,getdate())))
inside a script component and save the calculated value to a column or variable (That's why I had this primaryKey row there). But it didn't work with the script component.

Do you think you can create the script component for me?
That'd be awesome.

Thanks for the great support!
0
 
LVL 17

Expert Comment

by:HoggZilla
ID: 24107284
Did you try using an Error Output on the Insert. If the Insert fails to just write the row to the Error table? That would keep your package from ending in error.
0
 

Author Comment

by:arthrex
ID: 24107304
Hi HoggZilla,

the problem is the error occurs in the destination DB shape.
And, as it's name says, it's the destination.
If an error occurs there you can't redirect to another shape.

0
 
LVL 17

Expert Comment

by:HoggZilla
ID: 24107652
From an OLE DB Destination you can connect the Red Arrow to an Error output.  What am I missing.
0
 

Accepted Solution

by:
arthrex earned 0 total points
ID: 24867601
I couldn't solve it in SSIS.
So I just wrote a program that did all that.
0

Featured Post

PRTG Network Monitor: Intuitive Network Monitoring

Network Monitoring is essential to ensure that computer systems and network devices are running. Use PRTG to monitor LANs, servers, websites, applications and devices, bandwidth, virtual environments, remote systems, IoT, and many more. PRTG is easy to set up & use.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

JSON is being used more and more, besides XML, and you surely wanted to parse the data out into SQL instead of doing it in some Javascript. The below function in SQL Server can do the job for you, returning a quick table with the parsed data.
Ever needed a SQL 2008 Database replicated/mirrored/log shipped on another server but you can't take the downtime inflicted by initial snapshot or disconnect while T-logs are restored or mirror applied? You can use SQL Server Initialize from Backup…
Via a live example, show how to extract information from SQL Server on Database, Connection and Server properties
Using examples as well as descriptions, and references to Books Online, show the different Recovery Models available in SQL Server and explain, as well as show how full, differential and transaction log backups are performed

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question