Solved

How can I get a list of documents that don't have renditions?

Posted on 2008-10-07
Last Modified: 2013-11-15
Software:  EMC Documentum 5.3 SP3 running on Microsoft Windows Server 2003 and Oracle 10g.

Within Documentum, how can I query for documents that don't have renditions?

When I do a dump on a document object (dump,c,r_object_id), I don't see any attribute that relates to a rendition. I dumped before and after a given object had a rendition, and saw no changes in the attributes (except for r_modify_date).

Using DQL, I've also reviewed the dmr_content table and created a nested query that gets me pretty close: select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition <> 2)

The last part of that DQL query, the "<> 2", should give me all documents that don't have a PDF rendition (or at least that's what I thought it would do).

If I do "=2" it returns documents that do have PDF renditions.  However, if I do "<> 2" it gives me all documents with and without PDF renditions.

I may not be understanding the rendition attribute found in the dmr_content table and how it's used, which is why my query is failing.
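
A likely reason, sketched here on a toy model rather than a real repository: each document has one content row for its primary content (rendition = 0) plus one row per rendition, so a row filter such as "rendition <> 2" still matches every document through its primary-content row. The Python/sqlite3 snippet below uses hypothetical, simplified `doc` and `content` tables (single-valued `parent_id`, illustrative format name `msw8`) to contrast filtering rows with excluding documents. It is an illustration of the logic, not the actual Documentum schema.

```python
import sqlite3

# Toy stand-in for dm_document / dmr_content; the table layout is a
# simplified assumption, not the real Documentum schema.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE doc (r_object_id TEXT PRIMARY KEY);
    CREATE TABLE content (parent_id TEXT, rendition INTEGER, full_format TEXT);
    INSERT INTO doc VALUES ('doc_a'), ('doc_b');
    -- doc_a: primary Word content plus a PDF rendition
    INSERT INTO content VALUES ('doc_a', 0, 'msw8'), ('doc_a', 2, 'pdf');
    -- doc_b: primary Word content only
    INSERT INTO content VALUES ('doc_b', 0, 'msw8');
""")

def ids(sql):
    """Run a query and return the first column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))

# Row filter: every document still matches through its rendition = 0 row.
row_filter = ids("""
    SELECT r_object_id FROM doc
    WHERE r_object_id IN (SELECT parent_id FROM content WHERE rendition <> 2)
""")

# Exclusion: keep only documents for which no rendition = 2 row exists.
exclusion = ids("""
    SELECT r_object_id FROM doc d
    WHERE NOT EXISTS (SELECT 1 FROM content c
                      WHERE c.parent_id = d.r_object_id AND c.rendition = 2)
""")

print(row_filter)  # ['doc_a', 'doc_b']
print(exclusion)   # ['doc_b']
```

The row filter returns both documents, with and without the PDF rendition, which matches the behaviour described above. Only the exclusion form isolates the document that lacks one.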

Ultimately, I would like to be able to auto-queue a random group of documents that I know to be missing renditions due to a bug.

I would like to query to find the object IDs of all documents without PDF Renditions, and then queue these documents.  Can anyone help me isolate this query?

Does the queuing part need to be done using DFC, or can it be done in a more lightweight fashion (perhaps DQL)?

Thanks,

Lane
Question by:nsxlane

Expert Comment

by:michelmanias
ID: 22678935
Hi,
If you just want to find all objects that have more than one rendition and no PDF rendition, you may want to use this query:
select r_object_id, object_name, a_content_type, r_object_type from dm_document where r_object_id in (select parent_id from dmr_content where full_format<>'pdf' and rendition>0);
Otherwise, just remove rendition>0 from that query, but then you will get all objects without a PDF rendition.
Regards
Michel
 

Author Comment

by:nsxlane
ID: 22680832
After toying with this, I also found another approach. The following query returns the results I'm looking for. I don't know why this required the double nested query. When I tried a single nest and added the "r_object_id not in", I got zero results. But when I did the double nested query as shown here, it returned exactly what I'm looking for.

select r_object_id from dm_document where r_object_id not in (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition = 2))
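
The thread does not establish why the single-level "not in" returned nothing. One common cause in SQL-style engines, offered here only as an assumption, is a NULL among the subquery results: `x NOT IN (..., NULL)` never evaluates to true, so no rows come back. The sketch below reproduces that effect in Python/sqlite3 on hypothetical, simplified `doc` and `content` tables. It does not prove that this is what happened in the Documentum repository.

```python
import sqlite3

# Toy tables; names and layout are illustrative assumptions only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE doc (r_object_id TEXT PRIMARY KEY);
    CREATE TABLE content (parent_id TEXT, rendition INTEGER);
    INSERT INTO doc VALUES ('doc_a'), ('doc_b');
    INSERT INTO content VALUES ('doc_a', 2);   -- doc_a has a rendition = 2 row
    INSERT INTO content VALUES (NULL, 2);      -- hypothetical row with no parent
""")

def ids(sql):
    """Run a query and return the first column as a sorted list."""
    return sorted(row[0] for row in conn.execute(sql))

# Single nest: the NULL in the subquery makes NOT IN unknown for every row.
single_nest = ids("""
    SELECT r_object_id FROM doc
    WHERE r_object_id NOT IN (SELECT parent_id FROM content WHERE rendition = 2)
""")

# Double nest: the inner IN against doc only yields real, non-NULL ids.
double_nest = ids("""
    SELECT r_object_id FROM doc
    WHERE r_object_id NOT IN (
        SELECT r_object_id FROM doc
        WHERE r_object_id IN (SELECT parent_id FROM content WHERE rendition = 2))
""")

print(single_nest)  # []
print(double_nest)  # ['doc_b']
```

Under this assumption, the double nesting works because the middle query can only return ids of actual documents, so the outer "not in" never sees a NULL.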

Just out of curiosity, if you remove the 'rendition>0' from your query, how does that return docs without PDF renditions? Wouldn't that return docs without renditions of any type?
 

Expert Comment

by:michelmanias
ID: 22684685
Because in your query you have to keep full_format<>'pdf'.
The rendition values are:
0, for original content
1, for a rendition generated by the server
2, for a rendition generated by the client
3, meaning keep the rendition when the content with which it is associated is updated or removed from the document or repository
 

Author Comment

by:nsxlane
ID: 22686234
That is excellent information.  I looked through the documentation and could not find what the possible values/meanings were for the rendition attribute in the dmr_content table.

Can you also explain to me what the full_format value is used for? Is it telling me which format a document will open in?

If, for example, I have a Microsoft Word document that does have a PDF rendition, will the full_format field be Word when there is no rendition and PDF when there is?

Thanks for all your help.
 

Accepted Solution

by:
michelmanias earned 250 total points
ID: 22687028
To understand how it works, you could try the query written in another way:

select d.r_object_id, d.object_name, d.a_content_type, d.r_object_type,c.rendition,c.full_format from dm_document d,dmr_content c where any c.parent_id=d.r_object_id and c.rendition>0 ;

If you have one Word document with two renditions, it will display a_content_type=msw.. and full_format=pdf if that second rendition is a PDF rendition.

full_format is the full format specification for the content.
Hope it helps you
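
To make the distinction concrete, here is a minimal Python/sqlite3 sketch of the same kind of join on hypothetical, simplified tables (the document name `report` and format name `msw8` are made up for illustration). It shows that a_content_type belongs to the document and stays the same, while full_format is recorded per content row, so the primary content and the PDF rendition each carry their own value.

```python
import sqlite3

# Toy tables standing in for dm_document / dmr_content (assumed layout).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE doc (r_object_id TEXT, object_name TEXT, a_content_type TEXT);
    CREATE TABLE content (parent_id TEXT, rendition INTEGER, full_format TEXT);
    INSERT INTO doc VALUES ('doc_a', 'report', 'msw8');
    -- one primary content row and one PDF rendition row for the same document
    INSERT INTO content VALUES ('doc_a', 0, 'msw8'), ('doc_a', 2, 'pdf');
""")

# One output row per content row; the document attributes repeat on each.
rows = conn.execute("""
    SELECT d.object_name, d.a_content_type, c.rendition, c.full_format
    FROM doc d JOIN content c ON c.parent_id = d.r_object_id
    ORDER BY c.rendition
""").fetchall()

for row in rows:
    print(row)
# ('report', 'msw8', 0, 'msw8')
# ('report', 'msw8', 2, 'pdf')
```

In this toy model, adding a PDF rendition does not change the existing row's full_format. It adds a second row whose full_format is pdf.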
