Solved

I need to identify and count unique words in a group of documents

Posted on 2010-08-25
2
396 Views
Last Modified: 2012-05-10
I have a group of documents that I am trying to assess the most frequently used words in.
I am looking for a tool (similar to a wordpress tag cloud) that will run through a series of docs (Office and PDF) to create a list of the words used and their frequency across all docs.
I am happy to do it one at a time as the group is not overly large, but do need to be able to do both word and pdf files.
0
Comment
Question by:Barry Gill
2 Comments
 
LVL 4

Accepted Solution

by:
tzwimfam earned 500 total points
ID: 33520187
it is kind of manual but this works.... http://tagcrowd.com/
0
 
LVL 9

Author Closing Comment

by:Barry Gill
ID: 33530298
Thanks for this, hopefully I can automate it a bit :)
0

Featured Post

Free Tool: Postgres Monitoring System

A PHP and Perl based system to collect and display usage statistics from PostgreSQL databases.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Update 21-May-2015: I temporarily removed the source code to make major changes to the program. Regards, Joe INTRODUCTION This article presents a solution to a question (http://www.experts-exchange.com/Programming/Installation/Q_28396542.html)…
This article focuses on how to remove password security from multiple PDF files by Adobe Acrobat program. Sometimes it is essential to access the stored data items and to print, edit as well as copy content from Portable Document Format files in abs…
In this fifth video of the Xpdf series, we discuss and demonstrate the PDFdetach utility, which is able to list and, more importantly, extract attachments that are embedded in PDF files. It does this via a command line interface, making it suitable …
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…

828 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question