[Okta Webinar] Learn how to a build a cloud-first strategyRegister Now

x
?
Solved

I need to identify and count unique words in a group of documents

Posted on 2010-08-25
2
Medium Priority
?
406 Views
Last Modified: 2012-05-10
I have a group of documents that I am trying to assess the most frequently used words in.
I am looking for a tool (similar to a wordpress tag cloud) that will run through a series of docs (Office and PDF) to create a list of the words used and their frequency across all docs.
I am happy to do it one at a time as the group is not overly large, but do need to be able to do both word and pdf files.
0
Comment
Question by:Barry Gill
2 Comments
 
LVL 4

Accepted Solution

by:
tzwimfam earned 2000 total points
ID: 33520187
it is kind of manual but this works.... http://tagcrowd.com/
0
 
LVL 9

Author Closing Comment

by:Barry Gill
ID: 33530298
Thanks for this, hopefully I can automate it a bit :)
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

*Adobe Acrobat 9 was used for this article.  Particular steps may vary depending on software versions. Adobe Acrobat has many, many variables that my be utilized to customize your forms for clarity and ease of use. The Form Editing Tool will be y…
Microsoft Word is a program we have all encountered at some point, but very few of us have dug deep into its full scope of features, let alone customized it to suit our needs. Luckily making the ribbon (aka toolbar, first introduced in Word 2007) wo…
This Micro Tutorial well show you how to find and replace special characters in Microsoft Word. This is similar to carriage returns to convert columns of values from Microsoft Excel into comma separated lists.
We often encounter PDF files that are pure images, that is, they do not have text characters, but instead contain only raster graphics. The most common causes of this are document scanning software and faxing software/services that create image-only…
Suggested Courses
Course of the Month19 days, 13 hours left to enroll

872 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question