Solved

Best way to compare Data in tables

Posted on 2011-09-14
7
209 Views
Last Modified: 2013-12-17
Hi Experts.

I need to compare Data in a table with new information downloaded from a server.

This means the follwoing:

1. I download data from the webserver.
2. Read the data.
3. Before inserting, compare first. If the data is equal, then skip, if there is a change, then update and store the old value in a different table.

I have about 90 Columns in each table, this means that i dont want to make 90 if else statements.

I am using Linq To SQL

I need a way to do this dynamically.

Best Regards,

John Johnson.
0
Comment
Question by:databoks
  • 3
  • 2
  • 2
7 Comments
 
LVL 42

Expert Comment

by:EugeneZ
ID: 36535324
what is your sql server version/edition/sp?

try to use EXISTS
http://msdn.microsoft.com/en-us/library/ms188336.aspx

if you are in sqlserver 2005 and up
try EXCEPT
http://msdn.microsoft.com/en-us/library/ms188055.aspx
0
 
LVL 4

Expert Comment

by:rbride
ID: 36535536
Step 3 is the biggie here.

3. Before inserting, compare first. If the data is equal, then skip, if there is a change, then update and store the old value in a different table.

OK So I am assuming that you have the ability to identify your incoming data, i.e. you have some kind of a key. If that is the case then you can simply use the CHECKSUM function.

drop table t
go
drop table t2
go

create table t
(i int, v varchar(10), v2 varchar(10))
go
insert t values (1, 'abc', 'xyz')
insert t values (2, 'abcd', 'ghi')
go

create table t2 (i int, v varchar(10), v2 varchar(10), )
go

insert t2 values (1, 'abc', 'xyz')
insert t2 values (2, 'abd', 'ghi')
insert t2 values (3, 'edcd', 'xx')
go

ALTER TABLE t ADD CHK AS CHECKSUM(v,v2)
go
ALTER TABLE t2 ADD CHK AS CHECKSUM(v,v2)
go

select *
from t join t2 on t.i = t2.i
where t.chk <> t2.chk
go

However see [http://msdn.microsoft.com/en-us/library/ms189788.aspx]
If one of the values in the expression list changes, the checksum of the list also generally changes. However, there is a small chance that the checksum will not change

The other way to do it is to create the comparison statement using dynamic sql. First of all you need to build you query by getting meta information about your columns. For nullable columns, you can't just check with not equals, thus it is a little bit complicated:

select
'(' +
CASE
WHEN IS_NULLABLE = 'YES'
THEN '(t1.' + COLUMN_NAME + ' IS NULL AND t2.' + COLUMN_NAME + ' IS NOT NULL)'
     + ' OR ' +
     '(t2.' + COLUMN_NAME + ' IS NULL AND t1.' + COLUMN_NAME + ' IS NOT NULL)'
     + ' OR '
ELSE ''
END + '(t1.' + COLUMN_NAME + ' <> ' + 't2.' + COLUMN_NAME + ')'
from INFORMATION_SCHEMA.COLUMNS
where TABLE_NAME = 't'


This will produce a list like this which you have to build together into a string. Write a loop and build up your query that way.

((t1.i IS NULL AND t2.i IS NOT NULL) OR (t2.i IS NULL AND t1.i IS NOT NULL) OR (t1.i <> t2.i)
((t1.v IS NULL AND t2.v IS NOT NULL) OR (t2.v IS NULL AND t1.v IS NOT NULL) OR (t1.v <> t2.v)
((t1.v2 IS NULL AND t2.v2 IS NOT NULL) OR (t2.v2 IS NULL AND t1.v2 IS NOT NULL) OR (t1.v2 <> t2.v2)
0
 
LVL 4

Expert Comment

by:rbride
ID: 36535571
Um I forgot to add:
of course, if your table structure is static, just use the dynamic SQL above and
edit it all together as:

--> run this
select
CASE
WHEN IS_NULLABLE = 'YES'
THEN '(t1.' + COLUMN_NAME + ' IS NULL AND t2.' + COLUMN_NAME + ' IS NOT NULL)'
     + ' OR ' +
     '(t2.' + COLUMN_NAME + ' IS NULL AND t1.' + COLUMN_NAME + ' IS NOT NULL)'
     + ' OR '
ELSE ''
END + '(t1.' + COLUMN_NAME + ' <> ' + 't2.' + COLUMN_NAME + ') OR'
from INFORMATION_SCHEMA.COLUMNS
where TABLE_NAME = 't'

Copy the results put together something like this:

select t2.*
from t as t1 join t2 as t2 on t1.i = t2.i
where
(t1.v IS NULL AND t2.v IS NOT NULL) OR (t2.v IS NULL AND t1.v IS NOT NULL) OR (t1.v <> t2.v) OR
(t1.v2 IS NULL AND t2.v2 IS NOT NULL) OR (t2.v2 IS NULL AND t1.v2 IS NOT NULL) OR (t1.v2 <> t2.v2)

And that is your final statement to run. Of course you can always do this on the fly using the exec function but you need to program a lot more around it to get everything working.
0
Windows Server 2016: All you need to know

Learn about Hyper-V features that increase functionality and usability of Microsoft Windows Server 2016. Also, throughout this eBook, you’ll find some basic PowerShell examples that will help you leverage the scripts in your environments!

 
LVL 8

Author Comment

by:databoks
ID: 36536396
Thanks guy.

I forgot to mention that i download a XML file containig the data. I search through the XML for changes..

I use c#. I need c# code if possible.
0
 
LVL 42

Expert Comment

by:EugeneZ
ID: 36538118
OK
CHECK
compare 2 xml files with csharp

http://www.daniweb.com/software-development/csharp/threads/46345
0
 
LVL 8

Accepted Solution

by:
databoks earned 0 total points
ID: 36945511
I did this by using if else statements..
0
 
LVL 8

Author Closing Comment

by:databoks
ID: 36972733
i fixed this by using if else statements.
0

Featured Post

Complete VMware vSphere® ESX(i) & Hyper-V Backup

Capture your entire system, including the host, with patented disk imaging integrated with VMware VADP / Microsoft VSS and RCT. RTOs is as low as 15 seconds with Acronis Active Restore™. You can enjoy unlimited P2V/V2V migrations from any source (even from a different hypervisor)

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Everyone has problem when going to load data into Data warehouse (EDW). They all need to confirm that data quality is good but they don't no how to proceed. Microsoft has provided new task within SSIS 2008 called "Data Profiler Task". It solve th…
International Data Corporation (IDC) prognosticates that before the current the year gets over disbursing on IT framework products to be sent in cloud environs will be $37.1B.
This video shows how to set up a shell script to accept a positional parameter when called, pass that to a SQL script, accept the output from the statement back and then manipulate it in the Shell.
This videos aims to give the viewer a basic demonstration of how a user can query current session information by using the SYS_CONTEXT function

776 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question