Import Data Techniques - Best Practice

I'm on the precipice of a pretty important design consideration and thought I'd ask some advice.  I am designing a web application that will contain a table that holds "people".  The users of my site will want their site to be filled with their people.  The hard way to do this is to give them an interface on my site that allows them to view, add, edit, delete - all things I will likely do.

But, the "easy" way (at least from a customer's perspective) is to give them an importing tool.  I have seen many ways of doing this over the years, and could likely figure each one out.  But here are my assumptions about my design:
  1.  Not all people have Excel, making a software-dependent upload tool a bad idea
  2.  Comma-separation is pretty good, but not if the data has commas in it and the user doesn't properly plan for this
  3.  XML is nice and structured, but is there an easy way for non-tech types to get data into my schema?

What I would like is this approach:  A user wants a list of 10,000 people in their company to be available on my web site.  They go to their HR and get that list in *SOME FORMAT*.  They bring that file to my website and upload it.  My program inspects the file and reports back anything bad that might happen before committing the changes.  The user accepts/rejects, and the data is synchronized.

Where I'm really getting hung up is the SOME FORMAT part.  I realize that a web interface may not be the only way to get this done (i.e. the user could stick it on an FTP site and I could consume from there), but let's assume I want that method.  Can someone point me in the right direction.  I'd really like some simple examples of schema or approaches that have worked well for others.
LVL 16
Steve KrileAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

In order to get data in "SOME" format you are going to need "SOME" software.  If Office products are out, then you know most people will have a notepad editor.  if that is the case, then a delimited or fixed width file is going to be pretty much your only option.  I wouldn't have non-technical types try to create XML, especially in Notepad.  Your choice of delimiting character is up to you.  I can be a comma, tab, or even something like the vertical bar ( | ).  If you use comma delimiting, you can wrap text in quotes so that the commas inside the quotes won't matter.  Also, with a delimited format, someone who is using EXCEL can generate that format easily.  The problem will be mistakes.  If data is typed in by hand, the format may be off so you will really need some good validation on the other end.  With your parameters, your choices are limited.  As far as the mechanism, you could just use an Upload control to get the file to the web server.  From there, the server code can parse it, put it in temp storage and display for syncronization.  Sorry I couldn't be more help.

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
Steve KrileAuthor Commented:
You are right...I will need some type of software on the client side.  I guess that's the bit that's scaring me.  And, no, I was not expecting non-technical types to edit XML by hand.
Steve KrileAuthor Commented:
So, after much search and kvetching, I've decided to go with a .csv approach.  This makes j s kelly correct.

Here is the code I'm using.

The key bits for me were:
  1.  FileIO.TextFieldParser - didn't know this exists and it works with a normal IIS/Server, you don't need to have Excel installed for this to work
  2.  using System.IO.Stream to read the posted file directly rather than saving the file to some folder somewhere then opening it and reading.
            Dim ds As DataSet = New DataSet()
            Dim dt As DataTable = ds.Tables.Add("TheData")
            Dim myStream As System.IO.Stream
            myStream = FileUpload1.FileContent
            Dim afile As FileIO.TextFieldParser = New FileIO.TextFieldParser(myStream)
            Dim CurrentRecord As String() ' this array will hold each line of data
            afile.TextFieldType = FileIO.FieldType.Delimited
            afile.Delimiters = New String() {","}
            afile.HasFieldsEnclosedInQuotes = True
            Dim i As Integer = 0
            ' parse the actual file
            Do While Not afile.EndOfData
                    CurrentRecord = afile.ReadFields
                    If i = 0 Then
                        For x As Integer = 0 To CurrentRecord.GetUpperBound(0)
                        Dim dr As DataRow = dt.NewRow()
                        For x As Integer = 0 To CurrentRecord.GetUpperBound(0)
                            dr(x) = CurrentRecord(x).Trim()
                        Next x
                    End If
                Catch ex As FileIO.MalformedLineException
                End Try
                i = i + 1
            grvResults.DataSource = ds
        Catch ex As Exception
            ErrMsg = ex.Message
        End Try

Open in new window

Steve KrileAuthor Commented:
I had to a lot of digging to find element of the code to make the approach you suggest work.  That's OK, and not an insult...just the facts.
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today

From novice to tech pro — start learning today.