Solved

Exclude Duplicates based on column value

Posted on 2014-02-17
Last Modified: 2014-02-20
I have a view, SSG_CATALOG, that exports data for our catalog with the columns ITEMNMBR, ITEMDESC, DEALERPRICE, and PRCLEVEL. My problem is that because we use different price levels for each item, the view returns one row per price level for every item. For example, item ACCP2015BR1 has price levels A, AAA, and B, but the DEALERPRICE is the same for all three levels, so I want to return only the A price level row when the DEALERPRICE for AAA and B equals the DEALERPRICE for A.

Price Levels
Question by:skull52
 
LVL 35

Expert Comment

by:David Todd
ID: 39865936
Hi,

My thought is a GROUP BY with the MIN of PRCLEVEL, i.e.:

select
    c.itemnmbr
    , c.itemdesc
    , c.dealerprice
    , min( c.prclevel ) as PRCLEVEL
from dbo.SSG_CATALOG c
where
    somewhereclause
group by
    c.itemnmbr
    , c.itemdesc
    , c.dealerprice
order by
    c.itemnmbr
    , c.itemdesc
    , c.dealerprice
;
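
To see what the GROUP BY approach returns, here is a minimal, self-contained sketch that uses a table variable in place of the SSG_CATALOG view. The sample rows, prices, data types, and the second item number are hypothetical, made up purely for illustration; only ACCP2015BR1 and its price levels A, AAA, and B come from the question.

```sql
-- Hypothetical stand-in for the SSG_CATALOG view; data types are assumptions.
DECLARE @catalog TABLE (
    ITEMNMBR    varchar(31),
    ITEMDESC    varchar(101),
    DEALERPRICE numeric(19, 5),
    PRCLEVEL    varchar(11)
);

-- Made-up rows: one item with the same price at every level,
-- and one item whose price differs between levels.
INSERT INTO @catalog (ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL)
VALUES
    ('ACCP2015BR1',   'Sample item', 10.00, 'A'),
    ('ACCP2015BR1',   'Sample item', 10.00, 'AAA'),
    ('ACCP2015BR1',   'Sample item', 10.00, 'B'),
    ('HYPOTHETICAL1', 'Other item',   5.00, 'A'),
    ('HYPOTHETICAL1', 'Other item',   6.00, 'B');

-- One row per item and price, keeping the alphabetically first price level.
SELECT
    c.ITEMNMBR
    , c.ITEMDESC
    , c.DEALERPRICE
    , MIN(c.PRCLEVEL) AS PRCLEVEL
FROM @catalog c
GROUP BY
    c.ITEMNMBR
    , c.ITEMDESC
    , c.DEALERPRICE
ORDER BY
    c.ITEMNMBR
    , c.DEALERPRICE
;
```

On this sample, the three ACCP2015BR1 rows collapse into a single row with PRCLEVEL A, while the second item keeps one row per distinct DEALERPRICE. MIN on a character column compares strings, so A sorts before AAA, which sorts before B.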

HTH
  David
 
LVL 65

Expert Comment

by:Jim Horn
ID: 39865938
One way would be to use RANK() to order the PRCLEVEL values alphabetically within each ITEMNMBR and DEALERPRICE group (assuming A is better than AAA, which is better than B), then keep only the rows that sort first.
SELECT a.ITEMNMBR, a.ITEMDESC, a.DEALERPRICE, a.PRCLEVEL
FROM (
   SELECT ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL,
      RANK() OVER (PARTITION BY ITEMNMBR, DEALERPRICE ORDER BY PRCLEVEL) as rank_order) a
WHERE a.rank_order = 1

 
LVL 35

Expert Comment

by:David Todd
ID: 39865951
Hi,

I'd be interested in the execution plan costs - if there are major differences between these two approaches.
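
One way to compare the two approaches, sketched here under the assumption that the view lives in the dbo schema and uses the column names from the question, is to run both queries with I/O and timing statistics switched on and read the Messages output. Turning on "Include Actual Execution Plan" in Management Studio before running the batch also shows the relative cost of each statement.

```sql
-- Report logical reads and CPU/elapsed time for each statement.
SET STATISTICS IO ON;
SET STATISTICS TIME ON;

-- Approach 1: GROUP BY with MIN of the price level.
SELECT c.ITEMNMBR, c.ITEMDESC, c.DEALERPRICE, MIN(c.PRCLEVEL) AS PRCLEVEL
FROM dbo.SSG_CATALOG c
GROUP BY c.ITEMNMBR, c.ITEMDESC, c.DEALERPRICE;

-- Approach 2: RANK() within each item and price, keeping rank 1.
SELECT a.ITEMNMBR, a.ITEMDESC, a.DEALERPRICE, a.PRCLEVEL
FROM (
    SELECT ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL,
           RANK() OVER (PARTITION BY ITEMNMBR, DEALERPRICE
                        ORDER BY PRCLEVEL) AS rank_order
    FROM dbo.SSG_CATALOG
) a
WHERE a.rank_order = 1;

SET STATISTICS IO OFF;
SET STATISTICS TIME OFF;
```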

Regards
  David
 

Author Comment

by:skull52
ID: 39866014
Thanks for the responses

David,
I get the following error from your suggestion:
Msg 4145, Level 15, State 1, Line 9
An expression of non-boolean type specified in a context where a condition is expected, near 'group'.

Jim,
I can't even get yours to run; I think the reference to SSG_CATALOG is missing.
 
LVL 65

Accepted Solution

by:
Jim Horn earned 500 total points
ID: 39866032
<correction to my above code.  You may have to check the column names, as the image was hard to read>
SELECT a.ITEMNMBR, a.ITEMDESC, a.DEALERPRICE, a.PRCLEVEL
FROM (
   SELECT ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL,
      RANK() OVER (PARTITION BY ITEMNMBR, DEALERPRICE ORDER BY PRCLEVEL) as rank_order
   FROM SSG_CATALOG) a
WHERE a.rank_order = 1
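
One thing to be aware of: RANK() gives tied rows the same rank, so if two rows in the same ITEMNMBR and DEALERPRICE group share a PRCLEVEL, both come back with rank 1. If exactly one row per group is wanted, a possible variation (not the accepted code above) is to swap in ROW_NUMBER(). This sketch assumes the column names from the question and the dbo schema.

```sql
-- Variation: ROW_NUMBER() numbers rows 1, 2, 3... within each partition,
-- so filtering on 1 returns exactly one row per item and price.
SELECT a.ITEMNMBR, a.ITEMDESC, a.DEALERPRICE, a.PRCLEVEL
FROM (
    SELECT ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL,
           ROW_NUMBER() OVER (
               PARTITION BY ITEMNMBR, DEALERPRICE
               ORDER BY PRCLEVEL
           ) AS row_order
    FROM dbo.SSG_CATALOG
) a
WHERE a.row_order = 1;
```

When rows tie on PRCLEVEL, which of them gets row number 1 is not deterministic unless further columns are added to the ORDER BY as tie-breakers.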

 
LVL 35

Expert Comment

by:David Todd
ID: 39866036
Hi,

Did you just cut and paste my answer, or did you read it? You'll need to edit the where clause (or delete it!)

Regards
  David
 

Author Comment

by:skull52
ID: 39867902
David, my bad, I did miss the WHERE clause.

Jim, thanks for fixing that. I knew the reference to the table was missing, so I added it.

OK, so using Jim's solution I get 25071 rows; with David's I get 24974, a difference of 97 rows.
 
LVL 65

Expert Comment

by:Jim Horn
ID: 39867942
>OK, so with using Jim's solution I get 25071 rows With David's I get 24974 a difference of 97 rows
Since we don't have access to your data set, you will have to identify the 97-row difference and troubleshoot. Perhaps duplicate rows are being returned, in which case you can replace SELECT with SELECT DISTINCT.
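
A rough sketch of how the difference could be tracked down, assuming the column names from the question and the dbo schema: the first query lists the item and price groups where the RANK() query returns more than one row, and the second uses EXCEPT to list rows the RANK() query returns that the GROUP BY query does not.

```sql
-- 1) Groups where the RANK() approach returns more than one row (ties at rank 1).
SELECT a.ITEMNMBR, a.DEALERPRICE, COUNT(*) AS rows_returned
FROM (
    SELECT ITEMNMBR, DEALERPRICE,
           RANK() OVER (PARTITION BY ITEMNMBR, DEALERPRICE
                        ORDER BY PRCLEVEL) AS rank_order
    FROM dbo.SSG_CATALOG
) a
WHERE a.rank_order = 1
GROUP BY a.ITEMNMBR, a.DEALERPRICE
HAVING COUNT(*) > 1;

-- 2) Rows in the RANK() result that are missing from the GROUP BY result.
SELECT a.ITEMNMBR, a.ITEMDESC, a.DEALERPRICE, a.PRCLEVEL
FROM (
    SELECT ITEMNMBR, ITEMDESC, DEALERPRICE, PRCLEVEL,
           RANK() OVER (PARTITION BY ITEMNMBR, DEALERPRICE
                        ORDER BY PRCLEVEL) AS rank_order
    FROM dbo.SSG_CATALOG
) a
WHERE a.rank_order = 1
EXCEPT
SELECT c.ITEMNMBR, c.ITEMDESC, c.DEALERPRICE, MIN(c.PRCLEVEL)
FROM dbo.SSG_CATALOG c
GROUP BY c.ITEMNMBR, c.ITEMDESC, c.DEALERPRICE;
```

EXCEPT compares distinct rows, so exact duplicates will not show up in the second result; the first query is the one that surfaces those.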
 

Author Comment

by:skull52
ID: 39874557
Jim,
I used DISTINCT, but it returned only one row fewer. I examined the results and they look good, so I am going with your solution. Thanks to David for his solution as well.
