Solved

I need to identify and count unique words in a group of documents

Posted on 2010-08-25
2
398 Views
Last Modified: 2012-05-10
I have a group of documents that I am trying to assess the most frequently used words in.
I am looking for a tool (similar to a wordpress tag cloud) that will run through a series of docs (Office and PDF) to create a list of the words used and their frequency across all docs.
I am happy to do it one at a time as the group is not overly large, but do need to be able to do both word and pdf files.
0
Comment
Question by:Barry Gill
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
2 Comments
 
LVL 4

Accepted Solution

by:
tzwimfam earned 500 total points
ID: 33520187
it is kind of manual but this works.... http://tagcrowd.com/
0
 
LVL 9

Author Closing Comment

by:Barry Gill
ID: 33530298
Thanks for this, hopefully I can automate it a bit :)
0

Featured Post

Free Tool: Path Explorer

An intuitive utility to help find the CSS path to UI elements on a webpage. These paths are used frequently in a variety of front-end development and QA automation tasks.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Preface: When I started this series, I used the term CommandBars because that is the Office Object class that it discusses. Unfortunately, when Microsoft introduced Office 2007, they replaced the standard Commandbar menus with "The Ribbon" and rem…
This article describes how to use the Send to Mail Recipient command. The instructions apply generally to Office 2007 and later versions, but Microsoft® Word 2013 was used for the specific steps and figures.  What is Send to Mail Recipient? Send…
This video shows and describes the main difference between both orientations in Microsoft Word. Viewers will understand when to use each orientation and how to get the most out of them.
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…

738 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question