[solved] screen scraping with firefox - best practices?
#18
The problem is not in the regular expression; it is in how the HTML is returned by a.WebPageProp(0 0 html).
The HTML returned by a.WebPageProp(0 0 html) looks like this:

<a sl-processed="1" data-disqus-identifier="678089 http://thechive.com/?p=678089" class="comment-count" href="http://thechive.com/2014/01/17/so-its-a-bit-windy-in-nebraska-today-video/#disqus_thread" title="Comments for: So it’s a bit windy in Nebraska today&nbsp;(Video)">31</a>

whereas Firefox shows this:
<a class="comment-count" href="http://thechive.com/2014/01/17/so-its-a-bit-windy-in-nebraska-today-video/#comments" title="Comments for: So it’s a bit windy in Nebraska today&nbsp;(Video)"><span class="dsq-postid" rel="678089 http://thechive.com/?p=678089">View Comments</span></a>

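So a pattern written against what Firefox shows fails against the WebPageProp output: the attributes before class, the end of the href, and the link contents all differ.
If you use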
Code:
str pattern=
;class="comment-count" href="(.*)" title="Comments for:
it works, because that part of the tag is the same in both versions.
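
For anyone who wants to check this outside Quick Macros, here is a rough sketch in Python (not QM code) that runs the same pattern against both snippets quoted above. The names extract_href, WEBPAGEPROP_HTML and FIREFOX_HTML are made up for this illustration, and Python's re module is assumed to treat this simple pattern the same way QM's regex engine does.

```python
import re

# The pattern from the post: it anchors only on the part of the tag
# that is identical in both versions of the HTML.
PATTERN = re.compile(r'class="comment-count" href="(.*)" title="Comments for:')

# HTML as quoted in the post for a.WebPageProp(0 0 html).
WEBPAGEPROP_HTML = (
    '<a sl-processed="1" '
    'data-disqus-identifier="678089 http://thechive.com/?p=678089" '
    'class="comment-count" '
    'href="http://thechive.com/2014/01/17/so-its-a-bit-windy-in-nebraska-today-video/#disqus_thread" '
    'title="Comments for: So it’s a bit windy in Nebraska today&nbsp;(Video)">31</a>'
)

# HTML as quoted in the post for what Firefox shows.
FIREFOX_HTML = (
    '<a class="comment-count" '
    'href="http://thechive.com/2014/01/17/so-its-a-bit-windy-in-nebraska-today-video/#comments" '
    'title="Comments for: So it’s a bit windy in Nebraska today&nbsp;(Video)">'
    '<span class="dsq-postid" rel="678089 http://thechive.com/?p=678089">'
    'View Comments</span></a>'
)


def extract_href(html):
    """Return the captured href, or None if the pattern does not match."""
    match = PATTERN.search(html)
    return match.group(1) if match else None


if __name__ == "__main__":
    for label, html in (("WebPageProp", WEBPAGEPROP_HTML), ("Firefox", FIREFOX_HTML)):
        print(label, "->", extract_href(html))
```

Both calls return a link, which differs only in the fragment (#disqus_thread vs #comments). Note that (.*) is greedy, so if several such links ever end up on one line, something like ([^"]*) may be a safer capture.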

