Solved

Query to delete duplicate records but leave one

Posted on 2006-07-17
Last Modified: 2008-02-01
I have a table that contains a number of duplicate records for each product, and I need to delete the duplicates.
At the moment I'm running a "Find Duplicates" query, but I don't know how to delete the duplicates while leaving one record of each product. I can delete them all, but that's no good to me.

SELECT B0101.TRADING_NAME, B0101.AGENT_NO, B0101.RUN_DETAIL_1, B0101.SUPPLY
FROM B0101
WHERE (((B0101.TRADING_NAME) In (SELECT [TRADING_NAME] FROM [B0101] As Tmp GROUP BY [TRADING_NAME],[AGENT_NO] HAVING Count(*)>1  And [AGENT_NO] = [B0101].[AGENT_NO])))
ORDER BY B0101.TRADING_NAME, B0101.AGENT_NO;

I'm about to go to work so I won't reply for about 12 hours
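
To see why turning the Find Duplicates query straight into a delete removes every copy, it may help to run the same logic on a toy table. The sketch below uses Python's built-in sqlite3 module as a stand-in for the real database (an assumption, since the post does not name one), with made-up sample rows and hypothetical helper names `make_sample_db` and `find_duplicates`. The correlation on AGENT_NO is moved from HAVING into WHERE, which should select the same rows.

```python
import sqlite3

# Same logic as the Find Duplicates query, adapted to SQLite syntax.
FIND_DUPLICATES = """
SELECT trading_name, agent_no, run_detail_1, supply
FROM b0101
WHERE trading_name IN (
    SELECT tmp.trading_name
    FROM b0101 AS tmp
    WHERE tmp.agent_no = b0101.agent_no
    GROUP BY tmp.trading_name, tmp.agent_no
    HAVING COUNT(*) > 1)
ORDER BY trading_name, agent_no
"""


def make_sample_db():
    """Build an in-memory table with invented rows (not real data)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE b0101 "
        "(trading_name TEXT, agent_no INTEGER, run_detail_1 TEXT, supply TEXT)"
    )
    conn.executemany(
        "INSERT INTO b0101 VALUES (?, ?, ?, ?)",
        [
            ("Acme", 1, "R1", "gas"),
            ("Acme", 1, "R1", "gas"),
            ("Acme", 1, "R1", "gas"),
            ("Bolt", 2, "R2", "water"),
        ],
    )
    return conn


def find_duplicates(conn):
    """Return every row that belongs to a duplicated (name, agent) group."""
    return conn.execute(FIND_DUPLICATES).fetchall()
```

With the sample rows, the query returns all three "Acme" rows, not just the two surplus ones, so a delete built on the same WHERE clause would leave no "Acme" record at all.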
Question by:smidgen
6 Comments
 
LVL 11

Expert Comment

by:pootle_flump
ID: 17120367
Hi

This is a great article with more duplicate deleting options than you could ever need
http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=6256

The specific strategy depends on your needs (e.g. is this a 24/7 table with huge volumes of data, or something much smaller that can be inaccessible for a period of time).

HTH
 
LVL 65

Expert Comment

by:rockiroads
ID: 17120486
If you're running a Find Duplicates query, can I assume this is Access?
If so, open this query in query design, select Query from the main menu and change it to a Delete Query.
This converts your find duplicates query into a delete duplicates query.

 
LVL 50

Expert Comment

by:Lowfatspread
ID: 17120609
which database system is this for?
how would you decide which row to keep?

you could always...

select distinct ...
   into a temp table

delete everything in the current table

copy back from the temp table...
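
The three steps above can be sketched as follows. This is only an illustration under assumptions: it uses Python's built-in sqlite3 module rather than the asker's database, the scratch table name `b0101_distinct` and the helper name `dedupe_via_temp_table` are made up, and `SELECT DISTINCT *` only collapses rows that are identical in every column. If duplicates differ in some column, you would still have to decide which row to keep.

```python
import sqlite3


def dedupe_via_temp_table(conn):
    """Rebuild b0101 from its distinct rows using a scratch table."""
    # The context manager commits on success; if a step raises,
    # the pending DELETE/INSERT are rolled back.
    with conn:
        # 1. select distinct ... into a temp table
        conn.execute(
            "CREATE TEMP TABLE b0101_distinct AS SELECT DISTINCT * FROM b0101"
        )
        # 2. delete everything in the current table
        conn.execute("DELETE FROM b0101")
        # 3. copy back from the temp table
        conn.execute("INSERT INTO b0101 SELECT * FROM b0101_distinct")
        conn.execute("DROP TABLE b0101_distinct")
```

The table is empty between steps 2 and 3, so on a busy table this approach needs a quiet period or a lock.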
 
LVL 1

Expert Comment

by:no001855
ID: 17122027
The following is a method I have used in Oracle:
=================================================================
- Find one row for each id which I want to keep
- Delete the others with the same id

DELETE FROM table t WHERE EXISTS
(SELECT 'OK'
         FROM (SELECT id,rowidForOneOccurrence FROM table WHERE more than one occurrence) sq
         WHERE t.id=sq.id and t.ROWID<>sq.rowidForOneOccurrence)
;

Example for a dummy table testtrigg (aliased t) with one id field, id1. Which record is kept is "random" within each id.
================================================================
DELETE FROM testtrigg t WHERE EXISTS (SELECT 'OK' FROM
(SELECT DISTINCT id1 id1,Max(ROWID) over (PARTITION BY id1) mrowid FROM testtrigg t1 WHERE EXISTS (SELECT id1,Count(*) FROM testtrigg t2 GROUP BY id1 HAVING Count(1) > 1)) sq
WHERE sq.id1=t.id1 AND t.ROWID<>sq.mrowid)
;
 
LVL 1

Expert Comment

by:no001855
ID: 17122053
Statement should have been:

DELETE FROM testtrigg t WHERE EXISTS (SELECT 'OK' FROM
    (SELECT DISTINCT id1 id1,Max(ROWID) over (PARTITION BY id1) mrowid FROM testtrigg t1
      WHERE EXISTS (SELECT id1,Count(*) FROM testtrigg t2 WHERE t2.id1=t1.id1 GROUP BY id1 HAVING Count(1) > 1)) sq
WHERE sq.id1=t.id1 AND t.ROWID<>sq.mrowid)
;
 
LVL 1

Accepted Solution

by:
no001855 earned 250 total points
ID: 17122522
Or rather, you may simplify it, since the two inner queries can be combined.

DELETE FROM testtrigg t WHERE EXISTS
  (SELECT 'OK' FROM
    (SELECT id1,Max(ROWID) mrowid FROM testtrigg t1 GROUP BY id1 HAVING Count(1) > 1 ) sq
   WHERE sq.id1=t.id1 AND sq.mrowid<>t.ROWID)
;
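
For anyone who wants to try the simplified statement without an Oracle instance, here is a minimal sketch using Python's built-in sqlite3 module. It assumes SQLite's implicit `rowid` can play the role of Oracle's ROWID for this purpose; the `payload` column, the sample rows and the helper name `delete_duplicates` are invented for the example. The table alias on the DELETE target is dropped and the table name is used directly.

```python
import sqlite3

# For each id1 that occurs more than once, keep the row with the
# highest rowid and delete the rest.
DELETE_ALL_BUT_LAST = """
DELETE FROM testtrigg
WHERE EXISTS (
    SELECT 1
    FROM (SELECT id1, MAX(rowid) AS mrowid
          FROM testtrigg
          GROUP BY id1
          HAVING COUNT(*) > 1) AS sq
    WHERE sq.id1 = testtrigg.id1
      AND sq.mrowid <> testtrigg.rowid)
"""


def delete_duplicates(conn):
    """Run the delete and return the number of rows removed."""
    with conn:  # commit on success, roll back on error
        return conn.execute(DELETE_ALL_BUT_LAST).rowcount


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE testtrigg (id1 TEXT, payload INTEGER)")
    conn.executemany(
        "INSERT INTO testtrigg VALUES (?, ?)",
        [("A", 1), ("A", 2), ("A", 3), ("B", 4), ("C", 5), ("C", 6)],
    )
    print(delete_duplicates(conn))
    print(conn.execute("SELECT * FROM testtrigg ORDER BY id1").fetchall())
```

Which row survives depends only on the row identifier, so if a particular copy matters (for example the newest by a date column), the MAX(...) would need to be taken over that column instead.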