CHtmlView::GetSource() Problem Retrieving Full Source

When I CHtmlView::Navigate2(..) to a particular Web page, a subsequent call to CHtmlView::GetSource(..) from CHtmlView::OnNavigateComplete2() does not retrieve the full HTML source code. Instead, it is cut short. That is, only some of the source is retrieved. For example, if I open up IE to the web page and View->Source, and the source is, say, 100kB, then calling CHtmlView::GetSource(..) after navigating to the same page retrieves something < 100kB.

Any ideas on how to get the full source? Thanks
Who is Participating?
DanRollinsConnect With a Mentor Commented:
The Microsoft documentation indicates that the two operations are functionally identical. In all of my experience, they are the same.
The only thing that comes to mind is... perhaps the anomaly relates to the containing object -- a CString. For instance, a CString will end at the first embedded NULL character. I don't know how one could get into the HTML source, but that's one thing I'd look at (in the debugger, use the memory viewer to see if there is eyeball-readable data beyond the end of the CString).
An error you might possibly be making is to to call CHtmlView::GetSource(..) before you have gotten the OnDocumentComplete() call(or, alternately, before the READYSTATE_COMPLETE status is set).
If you have checked these things, please provide a URL and I'll try an experiement to see if I can reproduce the problem.
RaymunAuthor Commented:
I want to add a little more info:

When I CHtmlView::Navigate2(..) to a page with a big source file, a subsequent call to CHtmlView::GetSource(..) from CMyHtmlView::OnNavigateComplete2(..) gives me a source full of scripting code and the actual HTML code (supposed to be at the end) is cut off. When I do the same procedure on a page with a small source file, I get the full code (script + HTML).

I also tried overriding GetSource(..) to use IHTMLwhatever to get the source:
CMyHtmlView::GetSource(CString &strRef)
   CComPtr<IDispatch> pDisp = GetHtmlDocument();
   CComPtr<IHTMLDocument2> pDoc;
   pDisp->QueryInterface(IID_HTMLDocument2, (void**)&pDoc);
   CComPtr<IHTMLElement> pBody;


However, after the call to get_body(..), pBody is always NULL regardless of what page I've navigated to, even though both pDisp and pDoc are NOT NULL.

Another idea i tried was overriding CHtmlView::GetSource(..) and keeping everything the same except changing
   hMemory = GlobalAlloc(GMEM_MOVEABLE, 0);
   hMemory = GlobalAlloc(GMEM_MOVEABLE, <some large number>);
and after testing different numbers, the returned source is still cut short.

I am running out of ideas. Please help.

Thanks for the input. What two identical functions are you referring to? Anyway I will look into the CString and see what I can find. I wasn't familiar with OnDocumentComplete(..). According to Microsoft, the event is fired after finishing downloading a Web page.  I thought CHtmlView::OnNavigateComplete2(..) is called on the same event? What's the difference?

RaymunAuthor Commented:
RaymunAuthor Commented:
OnDocumentComplete() did the trick. Thanks Dan
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.