Hello Experts,
Problem:
I'm trying to track down an issue we're seeing with an intranet built on the MOSS 2007 SP1 platform. As one would expect, there is a mix of content: documents, list items, BDC data, and the pages themselves. Everything seems to be working fine, except that none of the .aspx pages appear to be indexed. This includes wikis (since they are built from .aspx pages), which seriously reduces the value of the wikis. When I search for a term that appears in the text of an .aspx page, the page itself is not returned, although the result set does include documents and list items that contain the same term.
Specs:
MOSS 2007 SP1
Windows 2003
SQL 2005
single server installation
Diagnostic information:
The first place I looked was the crawl log. It is filled with entries like the following (total items in index: 12682, errors in index: 5252):
http://website/Dept/Marketing/Pages/Brosky.org.aspx The filtering process has been terminated Local Office SharePoint Server sites 8/12/2008 11:26 AM
Digging deeper I found this series of entries in the diagnostic log files:
0x2218 Search Server Common MS Search Administration 8wni High Resuming default catalog with reason 'GPR_PROPAGATION' for application 'SharedServices1'...
08/12/2008 10:24:13.05 OWSTIMER.EXE (0x22C0) 0x2218 Search Server Common MS Search Administration 8wnj High Resuming anchor text catalog with reason GPR_PROPAGATION' for application 'SharedServices1'...
08/12/2008 10:24:13.10 OWSTIMER.EXE (0x22C0) 0x2218 Search Server Common MS Search Administration 7hmh High exception in SearchUpgradeProvisioner Keyword Config System.InvalidOperationException: jobServerSearchServiceInstance is null at Microsoft.Office.Server.Search.Administration.SearchUpgradeProvisioner..ctor(SearchServiceInstance searchServiceInstance) at Microsoft.Office.Server.Search.Administration.OSSPrimaryGathererProject.ProvisionContentSources()
08/12/2008 10:24:27.49 w3wp.exe (0x2944) 0x2334 CMS Publishing Cache 0 High Released semaphore for for ID CMSCACHE1997930116
08/12/2008 10:24:27.49 w3wp.exe (0x2944) 0x2334 CMS Publishing Cache 0 High Unregistered hosting environment for ID CMSCACHE1997930116
08/12/2008 10:24:27.46 w3wp.exe (0x2944) 0x2334 CMS Publishing 8fjh High AppDomainUnloadListener.Stop(False) called. ShutdownReason=HostingEnvironment, this=15368010
08/12/2008 10:24:27.46 w3wp.exe (0x2944) 0x2334 CMS Publishing 8x0f High AppDomainUnloadListener.Stop() about to AcquireWriterLock(), this.countdownSeconds=120
08/12/2008 10:24:27.46 w3wp.exe (0x2944) 0x2334 CMS Publishing 8x0i High AppDomainUnloadListener.Stop() finally{} block with (timedOut == false), ensured that all LRO thread finished
08/12/2008 10:24:29.52 w3wp.exe (0x2944) 0x03EC ULS Logging Unified Logging Service 8wsw High Now terminating ULS (w3wp.exe, Microsoft.Office.Server.Native.dll)
08/12/2008 10:24:29.52 w3wp.exe (0x2944) 0x03EC ULS Logging Unified Logging Service 8wsw High Now terminating ULS (w3wp.exe, onetnative.dll)
08/12/2008 10:24:55.01 w3wp.exe (0x1014) 0x0464 Windows SharePoint Services General 8kh7 High Access denied.
08/12/2008 10:24:55.10 w3wp.exe (0x1014) 0x0464 Windows SharePoint Services General 8kh7 High There is no Web named "/Dept/Marketing/Pages/Brosky.org.aspx".
08/12/2008 10:24:55.43 w3wp.exe (0x1014) 0x0464 Windows SharePoint Services General 72kg High #20001: Cannot open file "Resources.en-US.resx" for reading.
08/12/2008 10:24:55.43 w3wp.exe (0x1014) 0x0464 Windows SharePoint Services General 72kg High (#3: Cannot open file "Resources.en-US.resx" for reading.)
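For anyone who wants to sift through the same kind of diagnostic log, here is a rough Python sketch of how entries like the ones above could be filtered down to those mentioning a given page. It assumes the raw log file is tab-delimited with the columns in the order shown above (timestamp, process, thread ID, area, category, event ID, level, message); that layout is an assumption on my part, and the names parse_uls, filter_entries, count_by_event, and SAMPLE are made up for illustration. SAMPLE just re-types three of the entries above with tabs between fields.

```python
from collections import Counter

# Assumed column order of a tab-delimited ULS diagnostic log line.
FIELDS = ["timestamp", "process", "tid", "area",
          "category", "event_id", "level", "message"]

# Three entries from the excerpt above, re-typed with tab separators.
SAMPLE = "\n".join([
    "08/12/2008 10:24:27.49\tw3wp.exe (0x2944)\t0x2334\tCMS\t"
    "Publishing Cache\t0\tHigh\t"
    "Unregistered hosting environment for ID CMSCACHE1997930116",
    "08/12/2008 10:24:55.01\tw3wp.exe (0x1014)\t0x0464\t"
    "Windows SharePoint Services\tGeneral\t8kh7\tHigh\tAccess denied.",
    "08/12/2008 10:24:55.10\tw3wp.exe (0x1014)\t0x0464\t"
    "Windows SharePoint Services\tGeneral\t8kh7\tHigh\t"
    'There is no Web named "/Dept/Marketing/Pages/Brosky.org.aspx".',
])


def parse_uls(text):
    """Yield one dict per log line; lines with too few columns are skipped."""
    for line in text.splitlines():
        parts = [p.strip() for p in line.split("\t")]
        if len(parts) < len(FIELDS):
            continue
        # Any extra tabs are treated as part of the message column.
        head, message = parts[:7], "\t".join(parts[7:])
        yield dict(zip(FIELDS, head + [message]))


def filter_entries(entries, process=None, needle=None):
    """Keep entries whose process name starts with `process` and whose
    message contains `needle` (both case-insensitive, both optional)."""
    for entry in entries:
        if process and not entry["process"].lower().startswith(process.lower()):
            continue
        if needle and needle.lower() not in entry["message"].lower():
            continue
        yield entry


def count_by_event(entries):
    """Count entries per (event ID, category) pair."""
    return Counter((e["event_id"], e["category"]) for e in entries)


if __name__ == "__main__":
    entries = list(parse_uls(SAMPLE))
    for entry in filter_entries(entries, process="w3wp", needle="Brosky"):
        print(entry["timestamp"], entry["event_id"], entry["message"])
    print(count_by_event(entries))
```

Pointing parse_uls at the text of a real log file instead of SAMPLE should work the same way, provided the column assumption holds for that file.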
Troubleshooting steps so far:
1. Content access account - this is a domain account with the correct read policy applied to it. I can also open any of the above .aspx pages by browsing to them directly with this account, so I do not believe it is a permissions issue. I also tried switching the default content access account to my admin account and saw the same errors in the crawl log.
2. Search settings - I have verified that all the search settings appear to be configured properly. Since indexing works for documents and list items, I don't think this is a global search problem.
3. Crawl rules - I do not have any crawl rules excluding .aspx files.
4. File type inclusions - .aspx files are on the search file type inclusion list.
5. Search visibility - I confirmed that the site(s) in question are set to have all .aspx pages indexed regardless of fine-grained permissions (Site Settings ==> Search visibility). I also confirmed that the Pages libraries are configured to have their content included in search results (Document Library Settings ==> Advanced Settings).
6. Content refresh - I have reset IIS, rebooted the machine, and reset the crawled content. I also run a nightly full crawl and several incremental crawls during the day.
7. Searched Internet sites for hours, finding little or no information to point me in any meaningful direction.
I really don't know how to proceed at this point, so if anyone has suggestions or things to look at, I'm all ears. I can provide more extensive logging information if necessary. I feel like I have been fairly thorough in troubleshooting this issue, but clearly I am missing something.
Thanks in advance!
-Scott