Link to home
Start Free TrialLog in
Avatar of razimalliv
razimalliv

asked on

Modeling a Base of Knowledge

I'm involved in a project of creating a base of knowledge for human resources in projects. When doing the ER model of a little part of the problem a doubt assails me: which is the preferred way of modeling it, considering  efficiency, good design practices and database normalization. The application will be developed in .NET, but It could be queried using php, for example. We have 4 levels of entities which represents our knowledge base:

1   Supertopics
1.1   Topics
1.1.1   Subtopics
1.1.1.1   knowledge
     
I'm among two possible relational implementations:
If using weak entities, each table will have a composed key: a unique Id plus the keys of the foreign tables of previous levels. It seems to be better to make complex reports with this model because of the replication of keys form all previous levels.
- supertopic (IDSupertopic,...)
- topic (IDSupertopic, IDTopic,...)
- subtopic (IDSupertopic, IDTopic, IDSubtopic,...)
- knowledge(IDSupertopic, IDTopic, IDSubtopic, IDKnowledge...)

If using strong entities, each table will have a simple unique key, and a foreign key from the previous level object (table). It seems easier to manage this kind of model because of the simplification of table keys.
- supertopic (ConsecutiveKey,...)
- Topic (ConsecutiveKey, IDSupertopic, ...)
- subtopic(ConsecutiveKey, IDTopic...)
- knowledge(ConsecutiveKey, IDSubtopic...).

Can anyone give me some advise related with advantages and disadvantages of each possibility?
Avatar of Jim Dettman (EE MVE)
Jim Dettman (EE MVE)
Flag of United States of America image

I'm not sure I'd attack it either way.  I've never done a knowledge base, but your outline is really no different then a BOM structure in MFG.

As a start:

tblTopics - One record per topic
TopicID - AN - PK
Title - Text - CK1 - Brief title
Synopsis - Text - Overall description of the topic
Knowledge - Text

now:

tblTopicStructure - One record per Topic/SubTopic pair
TopicStructureID - AN - PK
TopicID - LONG - CK1A - FK to tblTopics
SubTopicID - LONG - CK1B - FK to tblTopics

 With this, you can have any number of subtopics for a given level, any number of levels to a topic structure, with knowledge at each level (or none), and topics can be shared between structures.

 For example, the same topic "poisonous frogs" might appear under a topic of "poisons" and "frogs"

 You might also want to add:

tblTopicTags - One record per topic/tag
TopicTagID - AN - PK
TopicID - LONG - CK1A - FK to tblTopics
Tag - Text - CK1B

 In order to include tag based searching of the knowledge base.

Jim.
Avatar of razimalliv
razimalliv

ASKER

Your solution is a very good one.  Additionally I would add the attribute "level" (LONG) to the tblTopicStructure that will help for some reports. I would also set the primary key to TopicID, SubTopicID  and delete the key TopicStructureID.  You give me a very good idea for implementing this kind of structures similar to  BOM structure.

I this particular problem--I'm sorry I was not clear--, one element of level N is associated with 1 and only 1 element of level N-1 (is the real case I'm solving). So, reading a bit about normalization, option 1 is at least in 2NF, while option 2 is not.

Considering this restriction (one element of level N is associated with 1 and only 1 element N-1 level),  your solution with my little modifications will be also in 2NF.
<<Additionally I would add the attribute "level" (LONG) to the tblTopicStructure that will help for some reports.>>

  You generate that on the fly.

<< I would also set the primary key to TopicID, SubTopicID  and delete the key TopicStructureID. >>

That depends on if your a die hard meaningless key person or not.  Personally, I'd agree with you, but many would not.

 One thing you might want to add there is a display order field (or sequence number), so subtopics are given in a specific order.   That would be another candidate key.

Jim.
<< That depends on if your a die hard meaningless key person or not.  Personally, I'd agree with you, but many would not >>
To close this topic, please give me a few word about normalization in this case.  From my point of view, setting the primary key to TopicID, SubTopicID and deleting the key TopicStructureID, it's not just a pleasure but an issue about normalization. Setting this composed key for tblTopicStructure really models this table as a many-to-many self-relationship   of tblTopics. Now, a N to M relationship in the ER model resolve to a weak relationship (entity) with no own keys; in my opinion, this implementation will assure the integrity of the database.
ASKER CERTIFIED SOLUTION
Avatar of Jim Dettman (EE MVE)
Jim Dettman (EE MVE)
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial