Free text search

Hi,

I would like to do free text searching on text/imagecolumns holding different documents (word,rtf,html etc) in SQL 6.5. Does anyone know of any good 3rd party search engine that would index text/imagecolumns and let me search them through APIs? Also, I think I read something about this being included in SQL 7.0, is this true?

Thanks,
pivar
LVL 22
pivarAsked:
Who is Participating?
 
aliciaamConnect With a Mentor Commented:
The ability to search just against file system data already exists today in Index Server 2.0.  
This capability is further enhanced by the SQL Server 7.0 Distributed Query Processor which, in conjunction with the Microsoft OLE DB Provider for Index Server 2.0, can be used to issue both property and full-text searches against data in the file system and to join the results with data in the database.

Different Scenarios, Different Solutions
Below are three examples that highlight the different scenarios for using either Microsoft Index Server or Microsoft Site Server.
 1.      A customer who has data on one Microsoft Internet Information Server (IIS) Web site and only wishes to query across file system files should use Microsoft Index Server.
       2.      A customer who needs searches to occur on more than one Web server, file server files, databases, Exchange public folders or specified Internet sites, should use Microsoft Site Server.
       3.      Distributed Capability: Site Server can be distributed among different servers to provide distributed crawling and searching for a multinational corporation. The processing may be distributed to reduce bandwidth consumption, provide redundancy, or to optimize performance when there are slow connections between subsidiaries. In the example below, there is a crawler in the United States and in the United Kingdom headquarters. The multinational crawls locally and propagates the resulting catalogs to every server worldwide that offers Search capabilities. This enables local searching of multinational content.
        
Common Search Engine
Since both Index Server and the Search feature in Site Server use the same underlying powerful engine, you can find the following functionality in both products:
·      Querying:
·      Supports full-text and property querying.
·      Supports Boolean, proximity, word stemming, vector space, and free-text queries.
·      Queries can use Structured Query Language (SQL): The query capabilities include SQL syntax, which supports Microsoft ActiveX® scripting and can be used in Microsoft Visual Basic® development system, Java, and C++ applications. Or SQL syntax can be used in an ASP, using Visual Basic Scripting Edition or JScript™ development software to create dynamic, customized queries. With this, application developers can view the file system as another OLE-DB data source.
 
·      Extensible and Compatible Filter Architecture: Filters allow the extraction of text and properties from documents of various formats.
·      Security: Windows NT security enabled on filtering and searching.
·      Powerful Scripting: Compatible scripting language with minor differences.
·      Multiple Language Support: Supports indexing and searching of documents in several different languages, including English, French, German, Japanese, Dutch, Spanish, Italian, and Swedish, without making any changes to the files or the search engine.
      
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.