Solved

Parsing Word Documents to a database field

Posted on 2008-10-17
Last Modified: 2009-01-02
Hi there, I have a classic ASP VBScript site using MS SQL 2000. The database doesn't have full-text search enabled, as it's a remote database.

What I would like to do is create some code, ideally on the database, that searches through the profile table for all records where cvparse = n and CV is not null, then for each record parses all of the information from the CV document (doc, pdf), stores it in CVdetail, and then updates cvparse from n to y.

I'd like it to run automatically every day.

Is this possible?

thank you
Question by:garethtnash

Accepted Solution

by:
Mark Wills earned 500 total points
ID: 22747597
Yes, it is possible to an extent, but you really need to consider whether to do it at all. It is best to allow several "key tags" to be maintained and then search on those; the database then holds a fully qualified path name to the original document. PDFs can be challenging to parse, so again, when "submitting" a document to "file", categorise it or write a summary prior to committing. If you are trying to do this in unattended mode, it will become difficult, and again the PDFs will be the challenge. These types of document can very quickly choke any chance of performance if held inside the database. If you parse documents for search criteria, you will have to create noise words / a thesaurus to make sure that entire documents are not part of the key criteria and indexed lookups. Is this an automatic or operator-invoked task, and are the files bibliographic in nature, where you can auto-generate several attributes (e.g. known content such as law documents, specification sheets, etc.)?
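As a rough illustration of the "key tags plus noise words" idea applied to the batch described in the question, here is a minimal Python sketch. It is not tied to MS SQL 2000 or classic ASP: it uses an in-memory SQLite database as a stand-in, and the `id` column, the `NOISE_WORDS` list, and the `extract_text` callback are all hypothetical names introduced only for the example. Only `profile`, `CV`, `CVdetail` and `cvparse` come from the question.

```python
import re
import sqlite3

# Hypothetical noise-word list; a real one would be far longer.
NOISE_WORDS = {"a", "an", "and", "the", "of", "to", "in", "for", "with", "on", "at", "is"}


def key_tags(text, limit=10):
    """Return up to `limit` of the most frequent non-noise words in `text`."""
    counts = {}
    for word in re.findall(r"[a-z]+", text.lower()):
        if word not in NOISE_WORDS and len(word) > 2:
            counts[word] = counts.get(word, 0) + 1
    # Most frequent first; ties broken alphabetically so the result is stable.
    ranked = sorted(counts, key=lambda w: (-counts[w], w))
    return ranked[:limit]


def parse_pending(conn, extract_text):
    """Tag every row with cvparse = 'n' and a CV path, then flag it 'y'.

    `extract_text` is a caller-supplied function (assumed here) that takes
    the stored CV path and returns the document's plain text.
    """
    rows = conn.execute(
        "SELECT id, CV FROM profile WHERE cvparse = 'n' AND CV IS NOT NULL"
    ).fetchall()
    done = 0
    for row_id, cv_path in rows:
        try:
            text = extract_text(cv_path)
        except Exception:
            # Leave cvparse = 'n' so the next scheduled run retries this row.
            continue
        conn.execute(
            "UPDATE profile SET CVdetail = ?, cvparse = 'y' WHERE id = ?",
            (" ".join(key_tags(text)), row_id),
        )
        done += 1
    conn.commit()
    return done
```

Storing only the tags keeps the searchable column small, which is the point of the advice above; storing the full extracted text instead would be a one-line change in the `UPDATE`. Running this daily would be a matter of calling `parse_pending` from whatever scheduler is available on the web or database server.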

Author Comment

by:garethtnash
ID: 22752673
You've completely lost me, but I can change the upload to only accept .doc or .docx, and the documents are CVs.

Any advice?

Thank you
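
If uploads were restricted to .docx, pulling the text out is fairly simple, because a .docx file is a ZIP archive whose main body is stored as XML in `word/document.xml`. The Python sketch below shows one way to do that with only the standard library; the function name `docx_to_text` is made up for the example. It does not handle the older binary .doc format, which would need Word automation or a dedicated converter.

```python
import zipfile
import xml.etree.ElementTree as ET

# XML namespace used by WordprocessingML elements such as <w:p> and <w:t>.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_to_text(source):
    """Return the text of a .docx file, one paragraph per line.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's text is split across one or more <w:t> runs.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text:
            paragraphs.append(text)
    return "\n".join(paragraphs)
```

The returned string is plain text, so it could be written to CVdetail as-is or reduced to a few key terms first. Headers, footers and footnotes live in other parts of the archive and are ignored here.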