
How can I get a list of documents that don't have renditions?

Posted on 2008-10-07
Last Modified: 2013-11-15
Software:  EMC Documentum 5.3 SP3 running on Microsoft Windows Server 2003 and Oracle 10g.

Within Documentum, how can I query to find documents that don't have renditions?

When I do a dump on a document object (dump,c,r_object_id), I don't see any attribute that relates to a rendition. I dumped before and after a given object had a rendition, and saw no changes in the attributes (except for r_modify_date).

Using DQL, I've also reviewed the dmr_content table and created a nested query that gets me pretty close: select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition <> 2)

The last part of that DQL query, the "<> 2", should give me all documents that don't have a PDF rendition (or at least that's what I thought it would do).

If I do "=2" it returns documents that do have PDF renditions.  However, if I do "<> 2" it gives me all documents with and without PDF renditions.

I may not be understanding the rendition attribute found in the dmr_content table and how it's used, which is why my query is failing.

Ultimately, I would like to be able to auto-queue a random group of documents that I know to be missing renditions due to a bug.

I would like to query to find the object IDs of all documents without PDF Renditions, and then queue these documents.  Can anyone help me isolate this query?

Does the queuing part need to be done using DFC, or can it be in a more lightweight fashion (perhaps DQL?)

Thanks,

Lane
Question by:nsxlane
 

Expert Comment

by:michelmanias
ID: 22678935
Hi,
If you just want to find all objects that have at least one rendition but no PDF rendition, you may want to use this query:
select r_object_id, object_name, a_content_type, r_object_type from dm_document where r_object_id in  (select parent_id from dmr_content where full_format<>'pdf' and rendition>0) ;
Otherwise, just remove rendition>0 from that query, but then you will get all objects without a PDF rendition.
Regards
Michel
 

Author Comment

by:nsxlane
ID: 22680832
After toying with this, I also found another approach.  The following query returns the results I'm looking for.  I don't know why this required the double nested query.  When I tried a single nest and added the "r_object_id not in" I got zero results.  But when I did the double nested query as shown here, it returned exactly what I'm looking for.

select r_object_id from dm_document where r_object_id not in (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition = 2))

Just out of curiosity, if you remove the 'rendition>0' from your query, how does that return docs without PDF renditions?  Wouldn't that return docs without renditions of any type?
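
The difference between the "<> 2" query from the question and the "not in ... = 2" query above can be sketched outside Documentum. The following is a minimal illustration only, using Python's sqlite3 as a stand-in for the repository. The two tables are simplified assumptions: real dmr_content.parent_id is a repeating attribute, and doc_a and doc_b are hypothetical IDs.

```python
import sqlite3

# Simplified stand-in for the repository (assumption: one row per content
# object per parent; real dmr_content.parent_id is a repeating attribute).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dm_document (r_object_id TEXT PRIMARY KEY);
    CREATE TABLE dmr_content (parent_id TEXT, rendition INTEGER);
    INSERT INTO dm_document VALUES ('doc_a'), ('doc_b');
    -- doc_a: primary content (0) plus a rendition flagged 2
    INSERT INTO dmr_content VALUES ('doc_a', 0), ('doc_a', 2);
    -- doc_b: primary content only
    INSERT INTO dmr_content VALUES ('doc_b', 0);
""")

def ids(sql):
    """Run a query and return the first column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))

# Row-level filter: any content row with rendition <> 2 qualifies its parent.
# doc_a still matches through its primary-content row.
row_filter = ids("""
    SELECT r_object_id FROM dm_document WHERE r_object_id IN
        (SELECT DISTINCT parent_id FROM dmr_content WHERE rendition <> 2)
""")

# Set-level exclusion: drop every parent that has at least one rendition = 2 row.
set_exclusion = ids("""
    SELECT r_object_id FROM dm_document WHERE r_object_id NOT IN
        (SELECT DISTINCT parent_id FROM dmr_content WHERE rendition = 2)
""")

print(row_filter)     # ['doc_a', 'doc_b']
print(set_exclusion)  # ['doc_b']
```

In this model, every document has a primary-content row with rendition 0, so the "<> 2" filter matches every document, while the "not in" form removes the documents that own a matching rendition row.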
 

Expert Comment

by:michelmanias
ID: 22684685
Because in your query you have to keep full_format<>'pdf'.
The rendition values are:
0, for original content
1, for a rendition generated by the server
2, for a rendition generated by the client
3, meaning keep the rendition when the content with which it is associated is updated or removed from the document or repository
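
To see how full_format and rendition interact, here is a small sketch, again using Python's sqlite3 as a simplified stand-in rather than a real repository. The table layout, the document IDs, and the format names 'word' and 'html' are placeholders chosen for illustration. It compares an "in ... full_format<>'pdf'" filter with a "not in ... full_format='pdf'" filter on the same data.

```python
import sqlite3

# Toy model (assumption): one dmr_content row per content object per parent.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dm_document (r_object_id TEXT PRIMARY KEY);
    CREATE TABLE dmr_content (parent_id TEXT, rendition INTEGER, full_format TEXT);
    INSERT INTO dm_document VALUES ('doc_a'), ('doc_b'), ('doc_c'), ('doc_d');
    -- every document has primary content (rendition = 0)
    INSERT INTO dmr_content VALUES
        ('doc_a', 0, 'word'), ('doc_b', 0, 'word'),
        ('doc_c', 0, 'word'), ('doc_d', 0, 'word');
    -- renditions (rendition > 0)
    INSERT INTO dmr_content VALUES
        ('doc_a', 1, 'pdf'),                        -- PDF rendition only
        ('doc_b', 2, 'html'),                       -- HTML rendition only
        ('doc_d', 2, 'html'), ('doc_d', 1, 'pdf');  -- HTML and PDF renditions
    -- doc_c has no renditions at all
""")

def ids(sql):
    """Run a query and return the first column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))

# Documents that own at least one non-PDF rendition.
has_non_pdf_rendition = ids("""
    SELECT r_object_id FROM dm_document WHERE r_object_id IN
        (SELECT parent_id FROM dmr_content
         WHERE full_format <> 'pdf' AND rendition > 0)
""")

# Documents that own no PDF rendition at all.
lacks_pdf_rendition = ids("""
    SELECT r_object_id FROM dm_document WHERE r_object_id NOT IN
        (SELECT parent_id FROM dmr_content
         WHERE full_format = 'pdf' AND rendition > 0)
""")

print(has_non_pdf_rendition)  # ['doc_b', 'doc_d']
print(lacks_pdf_rendition)    # ['doc_b', 'doc_c']
```

In this toy data the two forms answer different questions: the first includes doc_d, which does have a PDF rendition, and omits doc_c, which has no renditions at all. Whether the same holds in a given repository depends on its actual content rows.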
 

Author Comment

by:nsxlane
ID: 22686234
That is excellent information.  I looked through the documentation and could not find what the possible values/meanings were for the rendition attribute in the dmr_content table.

Can you also explain to me what the full_format value is used for?  Is it telling me which format a document will open in?

If, for example, I have a Microsoft Word document that does have a PDF rendition, will the full_format field be Word when there is no rendition and PDF when there is?

Thanks for all your help.
 

Accepted Solution

by:michelmanias
ID: 22687028
To understand how it works, maybe you can try the query written another way:

select d.r_object_id, d.object_name, d.a_content_type, d.r_object_type,c.rendition,c.full_format from dm_document d,dmr_content c where any c.parent_id=d.r_object_id and c.rendition>0 ;

If you have one Word document with two renditions, it will display a_content_type=msw.. and full_format=pdf if the second rendition is a PDF rendition.

full_format is the full format specification of that content object.
Hope it helps you
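
The join above can be mimicked on toy data to show what each column reports. This is only a sketch using Python's sqlite3. The table layout, the ID doc_a, the name 'report', and the format name 'word' are placeholders, and it assumes the primary content row carries the same format as a_content_type.

```python
import sqlite3

# Toy model (assumption): one Word-type document with primary content
# plus one PDF rendition.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dm_document (r_object_id TEXT, object_name TEXT, a_content_type TEXT);
    CREATE TABLE dmr_content (parent_id TEXT, rendition INTEGER, full_format TEXT);
    INSERT INTO dm_document VALUES ('doc_a', 'report', 'word');
    INSERT INTO dmr_content VALUES ('doc_a', 0, 'word'), ('doc_a', 1, 'pdf');
""")

# One output row per rendition (rendition > 0), joined to its parent document.
rows = conn.execute("""
    SELECT d.r_object_id, d.a_content_type, c.rendition, c.full_format
    FROM dm_document d
    JOIN dmr_content c ON c.parent_id = d.r_object_id
    WHERE c.rendition > 0
""").fetchall()

print(rows)  # [('doc_a', 'word', 1, 'pdf')]
```

Here a_content_type stays 'word' on the document, while full_format describes the individual content row, so the PDF rendition appears as its own row with full_format 'pdf'.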