www.website-watcher.com Forum Index www.website-watcher.com
HomeProductsNewsContact
 
 FAQFAQ   SearchSearch   RegisterRegister   ProfileProfile   Log inLog in 

Pages That Aren't Fully Captured?

 
Post new topic   Reply to topic    www.website-watcher.com Forum Index -> Local Website Archive
View previous topic :: View next topic  
Author Message
J-Mac



Joined: 15 Mar 2006
Posts: 75

PostPosted: Wed Jan 30, 2008 8:33 pm    Post subject: Pages That Aren't Fully Captured? Reply with quote

Since I have been using LWA (Lite version for a year and a half, and Pro for a day), a lot of pages I try to archive cannot be captured as is. Or part of the page does not show. Is this due to part of the page being served in a separate frame or called by a script or something?

It's almost always a "personalized" portion of a page that is not captured. Most often I am trying to get an archived copy of a purchase receipt to avoid having to print a hard copy and storing it. I'll see the parts of the page that are "boilerplate", but nothing related to me or my purchase.

However I know that the page can be "scraped", or screen captures. In these cases I must use Screenshot Captor, SnagIt, or even highlight what I need and clip it to Evernote or OneNote. But since these apps can grab all of the page, why doesn't LWA? Is there a method for doing this of which I am unaware? A setting or anything that would allow me to archive these pages?

A few moments ago it was an online order at Drugstore.com. When I tried to archive the page showing my paid order, LWA instead was trying to capture the login page.

Any help is appreciated, Martin!

Thank you.

Jim
_________________
J-Mac
Back to top
View user's profile Send private message
Martin Aignesberger
Site Admin
Site Admin


Joined: 11 May 2005
Posts: 5861

PostPosted: Thu Jan 31, 2008 11:42 am    Post subject: Reply with quote

Quote:
Since I have been using LWA (Lite version for a year and a half, and Pro for a day), a lot of pages I try to archive cannot be captured as is. Or part of the page does not show. Is this due to part of the page being served in a separate frame or called by a script or something?


Do you have a page where this can be reproduced?
And from which browser do you try to archive the page?

Quote:
A few moments ago it was an online order at Drugstore.com. When I tried to archive the page showing my paid order, LWA instead was trying to capture the login page.


This can happen if LWA has to reload in it's internal browser. I think it should work if you archive the page directly from Internet Explorer...
_________________
Martin Aignesberger [SUPPORT]
Back to top
View user's profile Send private message
J-Mac



Joined: 15 Mar 2006
Posts: 75

PostPosted: Thu Jan 31, 2008 5:17 pm    Post subject: Reply with quote

Martin Aignesberger wrote:
Do you have a page where this can be reproduced?
And from which browser do you try to archive the page?


Next time it happens I'll post the page. Though this occurs on most pages where I am logged in to my account.

I am using Firefox 2.0.0.11.

Quote:
This can happen if LWA has to reload in it's internal browser. I think it should work if you archive the page directly from Internet Explorer...


OK, that makes sense. If the internal browser tends to reload pages (because I am not reloading anything myself), then I can see where it would no longer be logged in to my account. Though I don't see how I could help this - as I said, I don't ask anything to reload; I believe that would be the program if it is reloading.

Although I guess I can try this in Internet Explorer - but just to test it to see if that helps; but I'm not going to start using IE as my browser - I have no desire to do that. When I come across a page that I decide to archive in LWA, I don't plan it ahead of time, and since I use Firefox for browsing, that's the browser LWA would have to work in.

Maybe I won't be able to use it as I had thought.

Thanks Martin.

Jim
_________________
J-Mac
Back to top
View user's profile Send private message
Martin Aignesberger
Site Admin
Site Admin


Joined: 11 May 2005
Posts: 5861

PostPosted: Fri Feb 01, 2008 8:18 am    Post subject: Reply with quote

LWA can only connect directly to IE, not to Firefox. If you use Firefox as your browser, LWA will reload the pages before they can be archived. The only work around for pages behind logins is to use the keystrokes method and let Firefox save the page.
_________________
Martin Aignesberger [SUPPORT]
Back to top
View user's profile Send private message
J-Mac



Joined: 15 Mar 2006
Posts: 75

PostPosted: Fri Feb 01, 2008 9:17 am    Post subject: Reply with quote

Martin Aignesberger wrote:
LWA can only connect directly to IE, not to Firefox. If you use Firefox as your browser, LWA will reload the pages before they can be archived. The only work around for pages behind logins is to use the keystrokes method and let Firefox save the page.


Well, I guess I won't be archiving much after all, then. I am not certain how to use the "Send Keystroke" method. Can't change it in the capture dialog. In Options>Supported Applications, I tried changing the two entries for Mozilla Firefox to "Send Keystrokes" but then I cannot grab anything at all from Firefox - it says it cannot find that application. I defined it and browsed to the Program Files path to list Firefox as an application, but LWA still cannot find Firefox.

Not sure if I did any of this correctly - Help file is not very verbose on how to do this.

Thanks.

Jim
_________________
J-Mac
Back to top
View user's profile Send private message
Martin Aignesberger
Site Admin
Site Admin


Joined: 11 May 2005
Posts: 5861

PostPosted: Fri Feb 01, 2008 3:47 pm    Post subject: Reply with quote

Quote:
it says it cannot find that application


I could reproduce this, will try to fix this in the next version.

Could you please try to delete the file "Mozilla Firefox15.ini" in the app folder. Then use the following content for "Mozilla Firefox.ini".

Code:
; ---------------------------------------------------------------------------
; Program template for WebSite-Watcher and Local Website Archive
; Will be overwritten with a new version of the software
; http://www.aignes.com
; ---------------------------------------------------------------------------

[General]
Name=Mozilla Firefox
IsBrowser=1
AppExeName=firefox.exe
AppWindowClass=MozillaUIWindowClass
AppWindowCaptionSubStr=right(- Mozilla Firefox)
GetNameUrlMethod=dde(Firefox)
RemoveFromCaptionLeft=
RemoveFromCaptionRight=","

Language=english,({Alt}f){$}a{$}<FILENAME>{Enter}


Then restart LWA. Is Firefox found after that change?
_________________
Martin Aignesberger [SUPPORT]
Back to top
View user's profile Send private message
J-Mac



Joined: 15 Mar 2006
Posts: 75

PostPosted: Fri Feb 01, 2008 4:48 pm    Post subject: Reply with quote

Martin,

I just tried that. This time it did not say it couldn't find Firefox; instead the LWA dialog disappeared and then a few different windows flashed open and closed - too fast to see what they were. And then I saw nothing. I opened LWA and it had the page I captured listed in my archive. I clicked on it to see what it actually captures in the internal browser window, but it is just hanging with an hourglass.

Windows Task Manager says it is "Not Responding", and the process pane indicates that the process wsarc.exe is using between 50% and 60% CPU.

It has now been that way for 12 minutes - doesn't appear it is going to change anytime soon...

OK - it started getting up around 70 - 80% CPU, so I had to kill it.

Well, it did something this time. Not quite what we expected, though!

Jim
_________________
J-Mac
Back to top
View user's profile Send private message
J-Mac



Joined: 15 Mar 2006
Posts: 75

PostPosted: Fri Feb 01, 2008 4:51 pm    Post subject: Reply with quote

Martin,

I re-opened LWA and this time I right-clicked on that item in the archive and clicked on "Open page in browser". It then opened in the internal browser - lower pane - but it was the login page, not the actual page I tried to archive.

Thanks.

Jim
_________________
J-Mac
Back to top
View user's profile Send private message
Martin Aignesberger
Site Admin
Site Admin


Joined: 11 May 2005
Posts: 5861

PostPosted: Mon Feb 04, 2008 8:48 am    Post subject: Reply with quote

Quote:
I just tried that. This time it did not say it couldn't find Firefox; instead the LWA dialog disappeared and then a few different windows flashed open and closed - too fast to see what they were.


Seems that these windows are the save-as dialogs of Firefox.


Quote:
And then I saw nothing. I opened LWA and it had the page I captured listed in my archive. I clicked on it to see what it actually captures in the internal browser window, but it is just hanging with an hourglass.

Windows Task Manager says it is "Not Responding", and the process pane indicates that the process wsarc.exe is using between 50% and 60% CPU.

It has now been that way for 12 minutes - doesn't appear it is going to change anytime soon...


strange. Do you have the same effect when you archive for example www.aignes.com with the keystrokes method?

And can you try to reproduce that problem. If you can reproduce it, please perform the following actions:

1) Download the tool madTraceProcess from: http://www.aignes.com/download/madtraceprocess.zip
2) Start it when LWA hangs, select "wsarc.exe" and press OK.
3) Then send me the resulting report by email - http://www.aignes.com/email.htm

Probably I can see where LWA hangs....

Quote:
I re-opened LWA and this time I right-clicked on that item in the archive and clicked on "Open page in browser". It then opened in the internal browser - lower pane - but it was the login page, not the actual page I tried to archive.


Yes, this command opens the online version of the page in the embedded IE, but without your login cookies from Firefox. That's why you get the login page here.
_________________
Martin Aignesberger [SUPPORT]
Back to top
View user's profile Send private message
Display posts from previous:   
Post new topic   Reply to topic    www.website-watcher.com Forum Index -> Local Website Archive All times are GMT
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group