Solved

How to create a duplicate finder Application

Posted on 2016-09-08
9
132 Views
Last Modified: 2016-09-25
We have huge data which consists of 50-60 thousand rows and we receive about 600 to 700 rows of information everyday (excel)

What I am looking for is
1: when the daily data is received if i upload the current data to access or the application it should provide the combination of duplicates as mentioned below in different tabs
2: Able to customize the combinations
3: click on delete button of the combination to be able to delete specific rows from the master data

Customer Name (C)
Customer Number (N)
Invoice Amount (A)
Invoice Number (I)
Invoice Date (D)

some of the experts might have already done this, but I have never got an opportunity to work with access
MOCK_DATA.xlsx
0
Comment
Question by:Nirvana
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
9 Comments
 
LVL 18

Expert Comment

by:xtermie
ID: 41789155
Perhaps, you can use Excel's remove duplicates prior to importing to access
Open your file in Excel
  1. Click on Data tab
  2. In the Data tools, click Remove Duplicates
  3. Select all columns
  4. Click on Remove Duplicates Button

Not sure of what (2) and (3) means.  Can you please explain a bit more with an example?
0
 

Author Comment

by:Nirvana
ID: 41789300
Number 2 mean I should be able customize the combinations from the columns available for duplicates

attaching a sample view of the interface that I am looking for
sample.pptx
0
 
LVL 48

Expert Comment

by:Dale Fye (Access MVP)
ID: 41789667
Generally, when I import from Excel, I start by linking the Excel file to my database.  Then I import the data from the Excel file into a staging table which contains all of the fields I need from Excel, plus other fields that I use for my error checking process.

Then I run a series of queries that make sure that fields are not duplicates (if they are, I use these additional fields to annotate errors.  These errors might be duplicates based on a single field or multiple fields.  They also might be based on a column which requires a value that exists in a lookup field and does not match any of those values.  

I generally display these records in a form for the user to review and correct or mark mark them for import.  Then, once that process is complete, I import the acceptable records into my production table.  I find that it is easier to prevent these duplicates from the start than it is to import into the production table and then have to find them.
1
Back Up Your Microsoft Windows Server®

Back up all your Microsoft Windows Server – on-premises, in remote locations, in private and hybrid clouds. Your entire Windows Server will be backed up in one easy step with patented, block-level disk imaging. We achieve RTOs (recovery time objectives) as low as 15 seconds.

 

Author Comment

by:Nirvana
ID: 41790057
Thanks Dale.I will follow the steps and see the fact is I have never worked in access so it might take a little time. And how can I create a user interface for end user
0
 
LVL 48

Expert Comment

by:Dale Fye (Access MVP)
ID: 41790299
Where are you storing your data, Access?
1
 

Author Comment

by:Nirvana
ID: 41790636
Excel. I have my data in excel and then will be uploaded or imported to access
0
 
LVL 48

Accepted Solution

by:
Dale Fye (Access MVP) earned 500 total points
ID: 41808484
Nirvana,

Are you still working on this?

If so, the first step is to create a process which will link a file you select (ideally it would have the same file name and path each day). into your database.  The code might look something like:

Private sub cmd_Link_Excel_Click

     docmd.transferspreadsheet acLink, acSpreadsheetTypeExcel12, "ExcelLinked", _
                  "C:\yourpath\yourfilename.xlsx", true, "Sheet1$"

End Sub

This would link "Sheet1" (dont for get to add the $ after the worksheet name) from your file into your Access database.  You could add some code before that to select the file manually if you wanted to,  to do that, search on "vba file dialog" here in EE for code sample.
1
 
LVL 45

Expert Comment

by:aikimark
ID: 41808521
The easiest way to eliminate duplicates is to put a unique index on your destination table.  When you append your rows, any duplicates will not be inserted.
0

Featured Post

Use Case: Protecting a Hybrid Cloud Infrastructure

Microsoft Azure is rapidly becoming the norm in dynamic IT environments. This document describes the challenges that organizations face when protecting data in a hybrid cloud IT environment and presents a use case to demonstrate how Acronis Backup protects all data.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Did you know that more than 4 billion data records have been recorded as lost or stolen since 2013? It was a staggering number brought to our attention during last week’s ManageEngine webinar, where attendees received a comprehensive look at the ma…
This article describes a method of delivering Word templates for use in merging Access data to Word documents, that requires no computer knowledge on the part of the recipient -- the templates are saved in table fields, and are extracted and install…
This Micro Tutorial will demonstrate in Google Sheets how to use the HYPERLINK function to create live links inside your spreadsheet.
Excel styles will make formatting consistent and let you apply and change formatting faster. In this tutorial, you'll learn how to use Excel's built-in styles, how to modify styles, and how to create your own. You'll also learn how to use your custo…

696 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question