Solved

Parsing Word Documents to a database field

Posted on 2008-10-17
3
285 Views
Last Modified: 2009-01-02
Hi there, I have a classic ASP VBScript site using MS SQL 2000 the database doesn't have Full text search enabled as it's a remote database.

What I would like to do is create some code ideally on the database that searches through the profile table, for all records where cvparse = n and CV not null, then for each record parse all of the information from the CV doc (doc, pdf) and stores it in CVdetail and then updates the cvparse from n  to y.

I'd like it to run automatically daily?

Is this possible?

thank you
0
Comment
Question by:garethtnash
3 Comments
 
LVL 51

Accepted Solution

by:
Mark Wills earned 500 total points
ID: 22747597
yes it is possible to an extent, but really need to consider doing it at all. Best to all several "key tags" to be maintained and then search on those - the database then has a full qualified path name to the original document. PDF's can be challenging to parse, so again, when "submitting" a document to "file", then categorise of write a summary prior to commiting. Now, if trying to do in unattended mode, then it will become difficult. Again the PDF will be the challenge. These types of document can very quickly choke any chance of performance if held inside the database. Now parsing document for searching criteria, then you will have to create noise words / thesaurus to make sure that entire documents are not part of key criteria and indexed lookups. Is this an automatic or operator invoked task, and are the files biliographic in nature where you can aut generate several attributes... (e.g. known content such as law documents / specification sheets etc).
0
 

Author Comment

by:garethtnash
ID: 22752673
You've completely lost me, but, I can change the upload to only accept .doc or .docx... and the documents are cvs??

Any advice?

Thank you
0

Featured Post

What is SQL Server and how does it work?

The purpose of this paper is to provide you background on SQL Server. It’s your self-study guide for learning fundamentals. It includes both the history of SQL and its technical basics. Concepts and definitions will form the solid foundation of your future DBA expertise.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Slowly Changing Dimension Transformation component in data task flow is very useful for us to manage and control how data changes in SSIS.
This article shows gives you an overview on SQL Server 2016 row level security. You will also get to know the usages of row-level-security and how it works
Using examples as well as descriptions, and references to Books Online, show the different Recovery Models available in SQL Server and explain, as well as show how full, differential and transaction log backups are performed
Viewers will learn how to use the SELECT statement in SQL to return specific rows and columns, with various degrees of sorting and limits in place.

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question