Data Modeling

This is not an academic assignment.

I am designing a database that will be used to track the emails that are contained in PDFs and make comments about them. Numerous emails are printed to a single PDF file. Each PDF contains numerous emails. The PDFs are not going to be stored in a database. Only the file path and file name will be tracked.  There will be only one comment per file. This will be a single user database.

Please help me create the data model.

Here is a sample report:
Comments: File contents: 2013 tax preparation.

Here is some more detail:

Entity: Files
Description: Numerous emails are printed to a single PDF file. Each PDF contains numerous emails. The PDFs are not going to be stored in a database.
Attributes: Comments, File name

Entity: Path
Description: The location of the PDF. Example: c:\email\file1.pdf
Attributes: ?
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Brian CroweDatabase AdministratorCommented:
Didn't you get an answer to this already?

What more are you looking for?
Mark01Author Commented:
Those comments did not address logical modeling; they addressed the physical modeling.
Scott PletcherSenior DBACommented:
Here's a few thoughts.  I trust that actually writing out a definition was actually very helpful in identifying areas that needed clarification and in making sure everyone's expectations were aligned.

Entities can be either singular or plural (that debate won't ever be settled), but be consistent: all singular or all plural.

 Entity: File
 Description: A standard-format PDF file used to hold related(?) emails.
     Numerous emails are printed to a single PDF file.  
     All emails in a single PDF must be related? by ?[date?; subject?; ??].
     PDFs are currently not stored in a database; they are stored as standard PDF files.
     Files may have zero to many associated comments.
     Path Id --identifier that specifies the drive and path to the file
     File Id --arbitrary unique sequential number assigned to every file name.  File names alone may not be unique, since the same file name could be stored in different paths.
     File Name --physical name of the file, excluding file extension.
     File Extension --
     File Type Id --physical file type; typically this will correlate to the file extension, but that's not actually required.  For example, a '[.]txt' file could actually [.]csv-format data and vice-versa.
     ?Date Created --
     ?Created By --
     Description --what this file is, its use, etc..

 Entity: File Comment
 Descrption: User-entered comments / notes about a particular file.
     File Id --the file to which the comments apply
     Comments --
     Date Created --
     Created By --

 Entity: File Type
 Description: Type of file details.  Will include the default extension and a brief description of the file type.
     File Type Id --arbitrary unique sequential number assigned to each unique File Type
     Default Extension --default extension for this type of file: 'PDF' for pdf files; 'xls';'xlsx' for Microsoft Excel files; etc..
     File Type --standardized description of the the file type: 'PDF'; 'Excel 2007-'; 'Excel 2010'.    

 Entity: Path
 Description: The path of a file. Example: c:\email\.  This does NOT include the file name, which is part of the file data.
     Path Id --arbitrary unique sequential number assigned to each unique path
     Drive --the "drive" name; may be a single letter, a share, a volume mount point, etc.
     Path --the remainder of the physical path to the file
     Description --purpose, use, etc., of this specific path
 Sample data:
     Holds email extract/archive PDF files. These files can be for any department in the company. This folder is accessible to all employees.
     Holds email only for the HR department where additional security is required.  Only HR members can access this folder.

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
Ultimate Tool Kit for Technology Solution Provider

Broken down into practical pointers and step-by-step instructions, the IT Service Excellence Tool Kit delivers expert advice for technology solution providers. Get your free copy now.

Mark01Author Commented:
Yes. Writing out the definitions is very helpful.

I am currently using the following book to learn database design: 

Do you agree that the next step should be a preliminary field list and preliminary table list like the book recommends? The entities become tables and the attributes become the fields.
Scott PletcherSenior DBACommented:
Once you've completed the attribute list and have all the critical relationships and rules, yes.  You've got a bit still to go there, but if you're with someone who knows the data, it won't take long, maybe 30-45 minutes.

I'm not familiar with that particular author so I can't comment specifically on that book.
Mark01Author Commented:
Thank you, Scott Pletcher.
Scott PletcherSenior DBACommented:
You're very welcome.  Good luck with the project!
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Microsoft SQL Server

From novice to tech pro — start learning today.

Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.