[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 397
  • Last Modified:

Data Modeling

This is not an academic assignment.

I am designing a database that will be used to track the emails that are contained in PDFs and make comments about them. Numerous emails are printed to a single PDF file. Each PDF contains numerous emails. The PDFs are not going to be stored in a database. Only the file path and file name will be tracked.  There will be only one comment per file. This will be a single user database.

Please help me create the data model.

Here is a sample report:
Comments: File contents: 2013 tax preparation.

Here is some more detail:

Entity: Files
Description: Numerous emails are printed to a single PDF file. Each PDF contains numerous emails. The PDFs are not going to be stored in a database.
Attributes: Comments, File name

Entity: Path
Description: The location of the PDF. Example: c:\email\file1.pdf
Attributes: ?
  • 3
  • 3
1 Solution
Brian CroweCommented:
Didn't you get an answer to this already?


What more are you looking for?
Mark01Author Commented:
Those comments did not address logical modeling; they addressed the physical modeling.
Scott PletcherSenior DBACommented:
Here's a few thoughts.  I trust that actually writing out a definition was actually very helpful in identifying areas that needed clarification and in making sure everyone's expectations were aligned.

Entities can be either singular or plural (that debate won't ever be settled), but be consistent: all singular or all plural.

 Entity: File
 Description: A standard-format PDF file used to hold related(?) emails.
     Numerous emails are printed to a single PDF file.  
     All emails in a single PDF must be related? by ?[date?; subject?; ??].
     PDFs are currently not stored in a database; they are stored as standard PDF files.
     Files may have zero to many associated comments.
     Path Id --identifier that specifies the drive and path to the file
     File Id --arbitrary unique sequential number assigned to every file name.  File names alone may not be unique, since the same file name could be stored in different paths.
     File Name --physical name of the file, excluding file extension.
     File Extension --
     File Type Id --physical file type; typically this will correlate to the file extension, but that's not actually required.  For example, a '[.]txt' file could actually [.]csv-format data and vice-versa.
     ?Date Created --
     ?Created By --
     Description --what this file is, its use, etc..

 Entity: File Comment
 Descrption: User-entered comments / notes about a particular file.
     File Id --the file to which the comments apply
     Comments --
     Date Created --
     Created By --

 Entity: File Type
 Description: Type of file details.  Will include the default extension and a brief description of the file type.
     File Type Id --arbitrary unique sequential number assigned to each unique File Type
     Default Extension --default extension for this type of file: 'PDF' for pdf files; 'xls';'xlsx' for Microsoft Excel files; etc..
     File Type --standardized description of the the file type: 'PDF'; 'Excel 2007-'; 'Excel 2010'.    

 Entity: Path
 Description: The path of a file. Example: c:\email\.  This does NOT include the file name, which is part of the file data.
     Path Id --arbitrary unique sequential number assigned to each unique path
     Drive --the "drive" name; may be a single letter, a share, a volume mount point, etc.
     Path --the remainder of the physical path to the file
     Description --purpose, use, etc., of this specific path
 Sample data:
     Holds email extract/archive PDF files. These files can be for any department in the company. This folder is accessible to all employees.
     Holds email only for the HR department where additional security is required.  Only HR members can access this folder.
Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Mark01Author Commented:
Yes. Writing out the definitions is very helpful.

I am currently using the following book to learn database design:

Do you agree that the next step should be a preliminary field list and preliminary table list like the book recommends? The entities become tables and the attributes become the fields.
Scott PletcherSenior DBACommented:
Once you've completed the attribute list and have all the critical relationships and rules, yes.  You've got a bit still to go there, but if you're with someone who knows the data, it won't take long, maybe 30-45 minutes.

I'm not familiar with that particular author so I can't comment specifically on that book.
Mark01Author Commented:
Thank you, Scott Pletcher.
Scott PletcherSenior DBACommented:
You're very welcome.  Good luck with the project!

Featured Post

Free Tool: Site Down Detector

Helpful to verify reports of your own downtime, or to double check a downed website you are trying to access.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

  • 3
  • 3
Tackle projects and never again get stuck behind a technical roadblock.
Join Now