Link to home
Start Free TrialLog in
Avatar of nsxlane
nsxlane

asked on

How can I get a list of documents that don't have renditions?

Software:  EMC Documentum 5.3 SP3 running on Microsoft Windows Server 2003 and Oracle 10g.

Within Documentum, how can I query to find documents that don't have renditions.

When I do a dump on a document object (dump,c,r_object_id), I don't see any attribute that relates to a rendition. I dumped before and after a given object had a rendition, and saw no changes in the attributes (except for r_modify_date).

Using DQL, I've also reviewed the dmr_content table, and have created a nested query that gets me pretty close.  (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition <> 2)

The last part of that DQL query, the "<> 2" implies give me all documents that don't have a PDF rendition (or at least that's what I thought it should do).

If I do "=2" it returns documents that do have PDF renditions.  However, if I do "<> 2" it gives me all documents with and without PDF renditions.

I may not be understanding the rendition attribute found in the dmr_content table and how it's used, which is why my query is failing.

Ultimately, I would like to be able to auto-queue a random group of documents that I know to be missing renditions due to a bug.

I would like to query to find the object IDs of all documents without PDF Renditions, and then queue these documents.  Can anyone help me isolate this query?

Does the queuing part need to be done using DFC, or can it be in a more lightweight fashion (perhaps DQL?)

Thanks,

Lane
Avatar of michelmanias
michelmanias
Flag of Switzerland image

Hi,
 If you just want to find all objects with more than 1 rendition and without pdf rendition, you may want to use that query.
select r_object_id, object_name, a_content_type, r_object_type from dm_document where r_object_id in  (select parent_id from dmr_content where full_format<>'pdf' and rendition>0) ;
Otherwise, just remove rendition>0 in that query but you will get all object without pdf rendition
Regards
Michel
Avatar of nsxlane
nsxlane

ASKER

After toying with this, I also found another approach.  The following query returns the results I'm looking for.  I don't know why this required the double nested query.  When I tried a single nest and added the "r_object_id not in" I got zero results.  But when I did the double nested query as shown here, it returns exacly what I'm looking for.

select r_object_id from dm_document where r_object_id not in (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition = 2))

Just out of curiousity, if ou remove the 'rendition>0' from your query, how does that return docs without PDF renditions.  Wouldn't that return docs without renditions of any type?
because in your query you have to keep : full_format<>'pdf'
rendition value description are :
0, for original content
1, for a rendition generated by the server
2, for a rendition generated by the client
3, meaning keep the rendition when the content with which it is associated is updated or removed from the document or repository
Avatar of nsxlane

ASKER

That is excellent information.  I looked through the documentation and could not find what the possible values/meanings were for the rendition attribute in the dmr_content table.

Can you also explain to me  what the full_format value is used for?  Is it telling me in what format a document will open in?

If, for example, I have a Microsoft Word document that does have a PDF rendition, will the Full_Format field be Word when there is no rendition and PDF when there is?

Thanks for all your help.
ASKER CERTIFIED SOLUTION
Avatar of michelmanias
michelmanias
Flag of Switzerland image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial