Solved

How can I get a list of documents that don't have renditions?

Posted on 2008-10-07
6
2,723 Views
Last Modified: 2013-11-15
Software:  EMC Documentum 5.3 SP3 running on Microsoft Windows Server 2003 and Oracle 10g.

Within Documentum, how can I query to find documents that don't have renditions.

When I do a dump on a document object (dump,c,r_object_id), I don't see any attribute that relates to a rendition. I dumped before and after a given object had a rendition, and saw no changes in the attributes (except for r_modify_date).

Using DQL, I've also reviewed the dmr_content table, and have created a nested query that gets me pretty close.  (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition <> 2)

The last part of that DQL query, the "<> 2" implies give me all documents that don't have a PDF rendition (or at least that's what I thought it should do).

If I do "=2" it returns documents that do have PDF renditions.  However, if I do "<> 2" it gives me all documents with and without PDF renditions.

I may not be understanding the rendition attribute found in the dmr_content table and how it's used, which is why my query is failing.

Ultimately, I would like to be able to auto-queue a random group of documents that I know to be missing renditions due to a bug.

I would like to query to find the object IDs of all documents without PDF Renditions, and then queue these documents.  Can anyone help me isolate this query?

Does the queuing part need to be done using DFC, or can it be in a more lightweight fashion (perhaps DQL?)

Thanks,

Lane
0
Comment
Question by:nsxlane
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 2
6 Comments
 

Expert Comment

by:michelmanias
ID: 22678935
Hi,
 If you just want to find all objects with more than 1 rendition and without pdf rendition, you may want to use that query.
select r_object_id, object_name, a_content_type, r_object_type from dm_document where r_object_id in  (select parent_id from dmr_content where full_format<>'pdf' and rendition>0) ;
Otherwise, just remove rendition>0 in that query but you will get all object without pdf rendition
Regards
Michel
0
 

Author Comment

by:nsxlane
ID: 22680832
After toying with this, I also found another approach.  The following query returns the results I'm looking for.  I don't know why this required the double nested query.  When I tried a single nest and added the "r_object_id not in" I got zero results.  But when I did the double nested query as shown here, it returns exacly what I'm looking for.

select r_object_id from dm_document where r_object_id not in (select r_object_id from dm_document where r_object_id in (select distinct parent_id from dmr_content where rendition = 2))

Just out of curiousity, if ou remove the 'rendition>0' from your query, how does that return docs without PDF renditions.  Wouldn't that return docs without renditions of any type?
0
 

Expert Comment

by:michelmanias
ID: 22684685
because in your query you have to keep : full_format<>'pdf'
rendition value description are :
0, for original content
1, for a rendition generated by the server
2, for a rendition generated by the client
3, meaning keep the rendition when the content with which it is associated is updated or removed from the document or repository
0
 

Author Comment

by:nsxlane
ID: 22686234
That is excellent information.  I looked through the documentation and could not find what the possible values/meanings were for the rendition attribute in the dmr_content table.

Can you also explain to me  what the full_format value is used for?  Is it telling me in what format a document will open in?

If, for example, I have a Microsoft Word document that does have a PDF rendition, will the Full_Format field be Word when there is no rendition and PDF when there is?

Thanks for all your help.
0
 

Accepted Solution

by:
michelmanias earned 250 total points
ID: 22687028
In order to understant how it works, may be you can try that query written in another way

select d.r_object_id, d.object_name, d.a_content_type, d.r_object_type,c.rendition,c.full_format from dm_document d,dmr_content c where any c.parent_id=d.r_object_id and c.rendition>0 ;

If you have one word document with two renditions, it will display a_content_type=msw.. and full_format=pdf if that second rendition is a PDF rendition

full_format is the Full format specification for the content
Hope it helps you
0

Featured Post

SharePoint Admin?

Enable Your Employees To Focus On The Core With Intuitive Onscreen Guidance That is With You At The Moment of Need.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article provides the solution to a question (http://www.experts-exchange.com/Software/Photos_Graphics/Images_and_Photos/Q_28674207.html) posed here at Experts Exchange. The asker of the question has many JPG images in many folders, and all of t…
This script checks a path to see if a folder exists. If the folder does exist you will get output "The folder has previously been created. No action taken" If not it will create the folder. Then adds one user modify permission to the folder. It …
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …
In this sixth video of the Xpdf series, we discuss and demonstrate the PDFtoPNG utility, which converts a multi-page PDF file to separate color, grayscale, or monochrome PNG files, creating one PNG file for each page in the PDF. It does this via a c…

707 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question