Solved

manipulate duplicate data

Posted on 2014-11-22
Last Modified: 2014-11-22
My DOCUMENT table has duplicate entries when I group by name and size.

The result of this query is:

SELECT name, COUNT(*), id, filename FROM DOCUMENT GROUP BY name, size HAVING COUNT(*) > 1

name          COUNT(*)      id      filename
--------------------------------------------------------------
docu1             45               33     fname1
docu2             85               59     fname2
docu3             43               33     fname5

I want to change the "filename" of all recurring entries to the filename of the first entry of the group. (That means the first record of each group stays unchanged.)

Can anybody help me?
Thank you so much.
Question by:myyis
15 Comments
 
LVL 58

Expert Comment

by:Gary
ID: 40459837
So where, for example, the ID is 33, all entries should be fname1 (not fname5, etc.)?
 
LVL 1

Author Comment

by:myyis
ID: 40459844
Like this

name      size        id       filename
---------------------------------------------
docu1      10         33      fname1
docu1      10         34      fname4    change  to  fname1
docu1      10         35      fname6    change  to  fname1
docu2      20         36      fname7
docu2      20         36      fname8    change  to  fname7

I also need the list of the old (changed) "filename" values ("fname4", "fname6", "fname8").

Thank you
 
LVL 58

Accepted Solution

by:
Gary earned 500 total points
ID: 40459861
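Try
-- Assumes the "first" entry of each group is the row with the lowest id
UPDATE table1 a
INNER JOIN
(SELECT name, size, MIN(id) AS first_id FROM table1 GROUP BY name, size) g
ON a.name = g.name AND a.size = g.size
INNER JOIN table1 f
ON f.id = g.first_id AND f.name = g.name AND f.size = g.size
SET a.filename = f.filename
WHERE a.id <> g.first_id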

Re: "list of the old (changed) filename values"
A list where?
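
For readers who want to try the idea on a scratch copy first, here is a minimal sketch of the intended update: every row takes the filename of the first row in its (name, size) group. It uses Python's built-in sqlite3 module as a stand-in for MySQL, so the SQL is written as a correlated subquery rather than an UPDATE ... JOIN. The table name `document`, the assumption that "first" means lowest id, and the id 37 for the last row (the sample in the thread shows 36 twice) are illustrative assumptions, not part of the original question.

```python
import sqlite3

# In-memory scratch database with the asker's sample rows.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE document (id INTEGER, name TEXT, size INTEGER, filename TEXT)"
)
conn.executemany(
    "INSERT INTO document VALUES (?, ?, ?, ?)",
    [
        (33, "docu1", 10, "fname1"),
        (34, "docu1", 10, "fname4"),
        (35, "docu1", 10, "fname6"),
        (36, "docu2", 20, "fname7"),
        (37, "docu2", 20, "fname8"),  # assumed id; the thread's sample repeats 36
    ],
)

# Give every row the filename of the lowest-id row in its (name, size) group.
# The first row of each group is set to its own value, so it stays unchanged.
conn.execute(
    """
    UPDATE document
    SET filename = (
        SELECT f.filename
        FROM document AS f
        WHERE f.name = document.name AND f.size = document.size
        ORDER BY f.id
        LIMIT 1
    )
    """
)

rows = conn.execute("SELECT id, filename FROM document ORDER BY id").fetchall()
print(rows)
```

Note that MySQL itself generally rejects a subquery that reads the table being updated, which is why a join against a derived table is the usual form there.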

 
LVL 1

Author Comment

by:myyis
ID: 40459872
If possible, using a SELECT somewhere.
 
LVL 58

Expert Comment

by:Gary
ID: 40459877
Yes, but what do you want to do with it?
You cannot return a recordset from a SELECT while doing an UPDATE - it's one or the other.
 
LVL 1

Author Comment

by:myyis
ID: 40459880
I will use the list to delete the repeated documents.
 
LVL 1

Author Comment

by:myyis
ID: 40459884
I mean delete them from the directory. So maybe I can run a SELECT first, then a second query to make the change.
 
LVL 58

Expert Comment

by:Gary
ID: 40459889
Delete from what directory?
This question is in the MySQL zone.
 
LVL 1

Author Comment

by:myyis
ID: 40459900
Yeah, I know.
If you can also provide the SELECT query, I can use its result set, which is the records that will be changed.
Thank you.
 
LVL 58

Expert Comment

by:Gary
ID: 40459915
You can use this, which gives a comma-separated field called dupes containing all the grouped filenames:

SELECT GROUP_CONCAT(filename) AS dupes FROM table1 GROUP BY name
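
If only the groups that actually contain more than one row are of interest, a HAVING clause can filter out the rest. The sketch below shows this with Python's built-in sqlite3 module standing in for MySQL (both provide GROUP_CONCAT). The table name `document`, the extra non-duplicated `docu3` row, and the id values are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE document (id INTEGER, name TEXT, size INTEGER, filename TEXT)"
)
conn.executemany(
    "INSERT INTO document VALUES (?, ?, ?, ?)",
    [
        (33, "docu1", 10, "fname1"),
        (34, "docu1", 10, "fname4"),
        (35, "docu1", 10, "fname6"),
        (36, "docu2", 20, "fname7"),
        (37, "docu2", 20, "fname8"),  # assumed id
        (38, "docu3", 30, "fname9"),  # hypothetical row with no duplicates
    ],
)

# One row per (name, size) group, keeping only groups with more than one member.
dupes = conn.execute(
    """
    SELECT name, GROUP_CONCAT(filename) AS dupes
    FROM document
    GROUP BY name, size
    HAVING COUNT(*) > 1
    ORDER BY name
    """
).fetchall()

for name, filenames in dupes:
    print(name, filenames.split(","))
```

Each group's list still includes the filename of its first row, so it is not yet the list of filenames that would change.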

 
LVL 1

Author Comment

by:myyis
ID: 40459980
Thank you for the SELECT, but it also returns the entries that are unique.
I need something like this: ("fname4", "fname6", "fname8"). Please check above.
 
LVL 58

Expert Comment

by:Gary
ID: 40460000
Is ID a unique auto increment field?
 
LVL 1

Author Comment

by:myyis
ID: 40460007
No, the PK is (ID,ORID)
 
LVL 58

Assisted Solution

by:Gary
Gary earned 500 total points
ID: 40460031
SELECT filename
FROM table1 a
WHERE filename NOT IN
(SELECT filename FROM (SELECT name, filename FROM table1 GROUP BY name) b)
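
A variant that does not depend on which filename the server happens to pick for each group is to find the first id per group explicitly and return every other row. The sketch below does that with Python's built-in sqlite3 module standing in for MySQL. The table name `document`, the assumption that the "first" row is the one with the lowest id, and the id 37 for the last row are illustrative assumptions. It would need to run before the update, since afterwards the old filenames are gone.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE document (id INTEGER, name TEXT, size INTEGER, filename TEXT)"
)
conn.executemany(
    "INSERT INTO document VALUES (?, ?, ?, ?)",
    [
        (33, "docu1", 10, "fname1"),
        (34, "docu1", 10, "fname4"),
        (35, "docu1", 10, "fname6"),
        (36, "docu2", 20, "fname7"),
        (37, "docu2", 20, "fname8"),  # assumed id; the thread's sample repeats 36
    ],
)

# Rows that are NOT the first (lowest-id) row of their (name, size) group,
# i.e. the rows whose filename the update would overwrite.
to_change = [
    row[0]
    for row in conn.execute(
        """
        SELECT d.filename
        FROM document AS d
        JOIN (
            SELECT name, size, MIN(id) AS first_id
            FROM document
            GROUP BY name, size
        ) AS g
          ON d.name = g.name AND d.size = g.size
        WHERE d.id <> g.first_id
        ORDER BY d.id
        """
    )
]
print(to_change)
```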

 
LVL 1

Author Closing Comment

by:myyis
ID: 40460036
Great!
