Welcome to MobyThreads.com!
FAQFAQ   SearchSearch      ProfileProfile    Private MessagesPrivate Messages   Log inLog in
All support for the MobyThreads Threaded phpBB MOD can now be found on welsolutions at this forum

How do I parse this page?

 
   Web Hosting and Web Master Forums (Home) -> Webmaster RSS
Related Topics:
New Page won't come up. - I run a site for my group. From the main page, there is a link to the group's history (page name The member writing it sent me an updated version, which I uploaded the previous version). Many members are telling me they

View a page without CSS? - This may sound like a really stupid question but I was reading something about CSS today and it mentioned that the viewer could the style sheet. How do they do this? I would like to test my sites to see how they appear to those that have..

Keep page in frameset - Hi, is it possible to let a page stay in the frameset, so when a single page is loaded, the whole frameset is loaded greetz, neutje

page expiration - IE6 a user of a web database app I support complains about getting the page has page when paging back. So he has to refresh (and process re-run) But It does not happen on my system. I don't get the page. How do I fix this ? ..

Best designed web page ever - Be sure to scroll down to the blue text on the blue Anna
Next:  Online favorites manager...  
Author Message
nntp2

External


Since: Oct 25, 2004
Posts: 4



(Msg. 1) Posted: Tue Oct 26, 2004 4:25 pm
Post subject: How do I parse this page?
Archived from groups: alt>www>webmaster, others (more info?)

I am trying to parse
http://www.ebay.com without success.

I view the source, and I see a lot of ?/td>. This page is unsavable.

It displays perfectly in IE, but once the source is saved/viewed, it no long
display right in IE.

When I use LYNX to view it, it is formated perfectly.

My question is how Ebay allow any brower to display the content right
without allowing viewing source or safe as?

 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
usenet16

External


Since: Oct 26, 2004
Posts: 1



(Msg. 2) Posted: Tue Oct 26, 2004 9:59 pm
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

Quoth "nntp" <nntp.DeleteThis@rogers.com>:
 > I am trying to parse
 > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
 >
 > I view the source, and I see a lot of ?/td>. This page is unsavable.
 >
 > It displays perfectly in IE, but once the source is saved/viewed, it no long
 > display right in IE.
 >
 > When I use LYNX to view it, it is formated perfectly.
 >
 > My question is how Ebay allow any brower to display the content right
 > without allowing viewing source or safe as?

They can't. You've probably got character-set issues. Use LWP to retreive the
page.

Ben

--
I must not fear. Fear is the mind-killer. I will face my fear and
I will let it pass through me. When the fear is gone there will be
nothing. Only I will remain.
ben.DeleteThis@morrow.me.uk Frank Herbert, 'Dune'<!-- ~MESSAGE_AFTER~ -->

 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
news31

External


Since: Sep 23, 2004
Posts: 49



(Msg. 3) Posted: Wed Oct 27, 2004 12:10 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

"nntp" <nntp RemoveThis @rogers.com> wrote in message
news:_dydnarTGPNdFePcRVn-sQ@rogers.com...
 >I am trying to parse
 > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
 >
 > I view the source, and I see a lot of ?/td>. This page is unsavable.
 >
 > It displays perfectly in IE, but once the source is saved/viewed, it no
 > long
 > display right in IE.
 >
 > When I use LYNX to view it, it is formated perfectly.
 >
 > My question is how Ebay allow any brower to display the content right
 > without allowing viewing source or safe as?
 >

I don't have a copy of Lynx, so I can't duplicate your problem, but...
Opera saves the file with images and IE displays it just fine from the saved
files.

Ebay.com (index.html) uses an external CSS stylesheet. It also uses a
sizeable number of external javascript files and 68 images to make up the
page I looked at.

George<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
1usa

External


Since: Oct 26, 2004
Posts: 1



(Msg. 4) Posted: Wed Oct 27, 2004 12:25 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

"nntp" <nntp.TakeThisOut@rogers.com> wrote in
news:_dydnarTGPNdFePcRVn-sQ@rogers.com:

 > I am trying to parse
 > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
 >
 > I view the source, and I see a lot of ?/td>. This page is unsavable.

That ain't true. If you have any questions on parsing HTML using
HTML::Parser, please post them here. Otherwise, this waaay off-topic.

Sinan<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
usenet200410

External


Since: Oct 03, 2004
Posts: 111



(Msg. 5) Posted: Wed Oct 27, 2004 12:59 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

[F'ups set to a.w.w.]

nntp wrote:

 > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a>
 > I view the source, and I see a lot of ?/td>. This page is unsavable.
 > It displays perfectly in IE, but once the source is saved/viewed, it no long
 > display right in IE. My question is how Ebay allow any brower to
 > display the content right without allowing viewing source or safe as?

IE doesn't simply show you the source when you hit the "view source"
button. Oh no. That would be too easy. It does all kinds of weird crap
first and then shows you some modified source code. I'm guessing that some
of that weird crap screws up some of the characters.

Look at the source code in a different browser and it displays fine.

Not that you should try to emulate any of that code. It's pants.

--
Toby A Inkster BSc (Hons) ARCS
Contact Me ~ <a style='text-decoration: underline;' href="http://tobyinkster.co.uk/contact" target="_blank">http://tobyinkster.co.uk/contact</a><!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
nospam34

External


Since: Oct 20, 2003
Posts: 294



(Msg. 6) Posted: Wed Oct 27, 2004 7:03 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

nntp wrote:

 > I am trying to parse
 > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.

In Perl, try
<a style='text-decoration: underline;' href="http://search.cpan.org/~gaas/HTML-Parser-3.35/Parser.pm" target="_blank">http://search.cpan.org/~gaas/HTML-Parser-3.35/Parser.pm</a>

 > I view the source, and I see a lot of ?/td>. This page is unsavable.

 > It displays perfectly in IE, but once the source is saved/viewed, it no
 > long display right in IE.

Maybe it uses css, or needs images to provide formatting hints.

 > When I use LYNX to view it, it is formated perfectly.
 >
 > My question is how Ebay allow any brower to display the content right
 > without allowing viewing source or safe as?

Please don't clutter Perl newsgroups with web server questions.

gtoomey<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
nntp2

External


Since: Oct 25, 2004
Posts: 4



(Msg. 7) Posted: Wed Oct 27, 2004 7:03 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

  > > I am trying to parse
  > > <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
 >
 > In Perl, try
<font color=purple> > <a style='text-decoration: underline;' href="http://search.cpan.org/~gaas/HTML-Parser-3.35/Parser.pm</font" target="_blank">http://search.cpan.org/~gaas/HTML-Parser-3.35/Parser.pm</font</a>>
 >
  > > I view the source, and I see a lot of ?/td>. This page is unsavable.
 >
  > > It displays perfectly in IE, but once the source is saved/viewed, it no
  > > long display right in IE.
 >
 > Maybe it uses css, or needs images to provide formatting hints.
Have you looked at the source codes of <a style='text-decoration: underline;' href="http://www.ebay.com?" target="_blank">www.ebay.com?</a>
I don't know what you mean by uses images to provide formatting hints.<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
sbryce1

External


Since: Jan 24, 2004
Posts: 425



(Msg. 8) Posted: Wed Oct 27, 2004 7:03 am
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: alt>www>webmaster, others (more info?)

nntp wrote:

 > I don't know what you mean by uses images to provide formatting hints.

Transparent GIFs, perhaps?<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
spam10

External


Since: Feb 02, 2004
Posts: 2



(Msg. 9) Posted: Wed Oct 27, 2004 5:06 pm
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: alt>www>webmaster, others (more info?)

JRS: In article <Xns958EB135B42A7asu1cornelledu.DeleteThis@132.236.56.8>, dated
Tue, 26 Oct 2004 21:25:13, seen in news:comp.lang.javascript, A. Sinan
Unur <1usa.DeleteThis@llenroc.ude.invalid> posted :
 >"nntp" <nntp.DeleteThis@rogers.com> wrote in
 >news:_dydnarTGPNdFePcRVn-sQ@rogers.com:
 >
  >> I am trying to parse
  >> <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
  >>
  >> I view the source, and I see a lot of ?/td>. This page is unsavable.
 >
 >That ain't true. If you have any questions on parsing HTML using
 >HTML::Parser, please post them here. Otherwise, this waaay off-topic.

Please take greater, or at least better, thought before using a word
such as "here".

--
© John Stockton, Surrey, UK. ?@merlyn.demon.co.uk Turnpike v4.00 IE 4 ©
<URL:http://www.jibbering.com/faq/> JL/RC: FAQ of news:comp.lang.javascript
<URL:http://www.merlyn.demon.co.uk/js-index.htm> jscr maths, dates, sources.
<URL:http://www.merlyn.demon.co.uk/> TP/BP/Delphi/jscr/&c, FAQ items, links.<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
tadmc

External


Since: Oct 03, 2004
Posts: 2



(Msg. 10) Posted: Wed Oct 27, 2004 5:06 pm
Post subject: Re: How do I parse this page? [Login to view extended thread Info.]
Archived from groups: per prev. post (more info?)

Dr John Stockton <spam RemoveThis @merlyn.demon.co.uk> wrote:
 > JRS: In article <Xns958EB135B42A7asu1cornelledu RemoveThis @132.236.56.8>, dated
 > Tue, 26 Oct 2004 21:25:13, seen in news:comp.lang.javascript, A. Sinan
 > Unur <1usa RemoveThis @llenroc.ude.invalid> posted :
  >>"nntp" <nntp RemoveThis @rogers.com> wrote in
  >>news:_dydnarTGPNdFePcRVn-sQ@rogers.com:
  >>
   >>> I am trying to parse
   >>> <a style='text-decoration: underline;' href="http://www.ebay.com" target="_blank">http://www.ebay.com</a> without success.
   >>>
   >>> I view the source, and I see a lot of ?/td>. This page is unsavable.
  >>
  >>That ain't true. If you have any questions on parsing HTML using
  >>HTML::Parser, please post them here. Otherwise, this waaay off-topic.
 >
 > Please take greater, or at least better, thought before using a word
 > such as "here".


Please take greater, or at least better, notice of the Newsgroups
header before determining which "where" is "here".

Smile


--
Tad McClellan SGML consulting
tadmc RemoveThis @augustmail.com Perl programming
Fort Worth, Texas<!-- ~MESSAGE_AFTER~ -->
 >> Stay informed about: How do I parse this page? 
Back to top
Login to vote
Display posts from previous:   
   Web Hosting and Web Master Forums (Home) -> Webmaster All times are: Pacific Time (US & Canada) (change)
Page 1 of 1

 
You can post new topics in this forum
You can reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum



[ Contact us | Terms of Service/Privacy Policy ]