razimalliv
asked on
Modeling a Base of Knowledge
I'm involved in a project of creating a base of knowledge for human resources in projects. When doing the ER model of a little part of the problem a doubt assails me: which is the preferred way of modeling it, considering efficiency, good design practices and database normalization. The application will be developed in .NET, but It could be queried using php, for example. We have 4 levels of entities which represents our knowledge base:
1 Supertopics
1.1 Topics
1.1.1 Subtopics
1.1.1.1 knowledge
I'm among two possible relational implementations:
If using weak entities, each table will have a composed key: a unique Id plus the keys of the foreign tables of previous levels. It seems to be better to make complex reports with this model because of the replication of keys form all previous levels.
- supertopic (IDSupertopic,...)
- topic (IDSupertopic, IDTopic,...)
- subtopic (IDSupertopic, IDTopic, IDSubtopic,...)
- knowledge(IDSupertopic, IDTopic, IDSubtopic, IDKnowledge...)
If using strong entities, each table will have a simple unique key, and a foreign key from the previous level object (table). It seems easier to manage this kind of model because of the simplification of table keys.
- supertopic (ConsecutiveKey,...)
- Topic (ConsecutiveKey, IDSupertopic, ...)
- subtopic(ConsecutiveKey, IDTopic...)
- knowledge(ConsecutiveKey, IDSubtopic...).
Can anyone give me some advise related with advantages and disadvantages of each possibility?
1 Supertopics
1.1 Topics
1.1.1 Subtopics
1.1.1.1 knowledge
I'm among two possible relational implementations:
If using weak entities, each table will have a composed key: a unique Id plus the keys of the foreign tables of previous levels. It seems to be better to make complex reports with this model because of the replication of keys form all previous levels.
- supertopic (IDSupertopic,...)
- topic (IDSupertopic, IDTopic,...)
- subtopic (IDSupertopic, IDTopic, IDSubtopic,...)
- knowledge(IDSupertopic, IDTopic, IDSubtopic, IDKnowledge...)
If using strong entities, each table will have a simple unique key, and a foreign key from the previous level object (table). It seems easier to manage this kind of model because of the simplification of table keys.
- supertopic (ConsecutiveKey,...)
- Topic (ConsecutiveKey, IDSupertopic, ...)
- subtopic(ConsecutiveKey, IDTopic...)
- knowledge(ConsecutiveKey, IDSubtopic...).
Can anyone give me some advise related with advantages and disadvantages of each possibility?
ASKER
Your solution is a very good one. Additionally I would add the attribute "level" (LONG) to the tblTopicStructure that will help for some reports. I would also set the primary key to TopicID, SubTopicID and delete the key TopicStructureID. You give me a very good idea for implementing this kind of structures similar to BOM structure.
I this particular problem--I'm sorry I was not clear--, one element of level N is associated with 1 and only 1 element of level N-1 (is the real case I'm solving). So, reading a bit about normalization, option 1 is at least in 2NF, while option 2 is not.
Considering this restriction (one element of level N is associated with 1 and only 1 element N-1 level), your solution with my little modifications will be also in 2NF.
I this particular problem--I'm sorry I was not clear--, one element of level N is associated with 1 and only 1 element of level N-1 (is the real case I'm solving). So, reading a bit about normalization, option 1 is at least in 2NF, while option 2 is not.
Considering this restriction (one element of level N is associated with 1 and only 1 element N-1 level), your solution with my little modifications will be also in 2NF.
<<Additionally I would add the attribute "level" (LONG) to the tblTopicStructure that will help for some reports.>>
You generate that on the fly.
<< I would also set the primary key to TopicID, SubTopicID and delete the key TopicStructureID. >>
That depends on if your a die hard meaningless key person or not. Personally, I'd agree with you, but many would not.
One thing you might want to add there is a display order field (or sequence number), so subtopics are given in a specific order. That would be another candidate key.
Jim.
You generate that on the fly.
<< I would also set the primary key to TopicID, SubTopicID and delete the key TopicStructureID. >>
That depends on if your a die hard meaningless key person or not. Personally, I'd agree with you, but many would not.
One thing you might want to add there is a display order field (or sequence number), so subtopics are given in a specific order. That would be another candidate key.
Jim.
ASKER
<< That depends on if your a die hard meaningless key person or not. Personally, I'd agree with you, but many would not >>
To close this topic, please give me a few word about normalization in this case. From my point of view, setting the primary key to TopicID, SubTopicID and deleting the key TopicStructureID, it's not just a pleasure but an issue about normalization. Setting this composed key for tblTopicStructure really models this table as a many-to-many self-relationship of tblTopics. Now, a N to M relationship in the ER model resolve to a weak relationship (entity) with no own keys; in my opinion, this implementation will assure the integrity of the database.
To close this topic, please give me a few word about normalization in this case. From my point of view, setting the primary key to TopicID, SubTopicID and deleting the key TopicStructureID, it's not just a pleasure but an issue about normalization. Setting this composed key for tblTopicStructure really models this table as a many-to-many self-relationship of tblTopics. Now, a N to M relationship in the ER model resolve to a weak relationship (entity) with no own keys; in my opinion, this implementation will assure the integrity of the database.
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
As a start:
tblTopics - One record per topic
TopicID - AN - PK
Title - Text - CK1 - Brief title
Synopsis - Text - Overall description of the topic
Knowledge - Text
now:
tblTopicStructure - One record per Topic/SubTopic pair
TopicStructureID - AN - PK
TopicID - LONG - CK1A - FK to tblTopics
SubTopicID - LONG - CK1B - FK to tblTopics
With this, you can have any number of subtopics for a given level, any number of levels to a topic structure, with knowledge at each level (or none), and topics can be shared between structures.
For example, the same topic "poisonous frogs" might appear under a topic of "poisons" and "frogs"
You might also want to add:
tblTopicTags - One record per topic/tag
TopicTagID - AN - PK
TopicID - LONG - CK1A - FK to tblTopics
Tag - Text - CK1B
In order to include tag based searching of the knowledge base.
Jim.