Thread: How to scrape a WordPress blog without RSS?

  1. #1
    Noobie
    Join Date
    Apr 2008
    Posts
    41
    Thanks
    0
    Thanked 0 Times in 0 Posts

    How to scrape a WordPress blog without RSS?

    Let's say we have a third-party blog with no RSS feed, or at best an RSS feed that carries only the first few words of each post. How can I scrape the full content (images, links, etc.) of every new post? The blog is updated regularly, and I need code, a plugin, or a service that can scrape new posts and generate an RSS feed from them so that I can use it in my semi-autoblog. I know it is possible, and it shouldn't be that hard, because many people are doing it right now.

    Any suggestion or help will be appreciated.

  3. #2
    Noobie
    Join Date
    Apr 2008
    Posts
    31
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Match the content you need with PHP.

  4. #3
    Noobie
    Join Date
    Apr 2008
    Posts
    41
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Thanks mate, could you please explain a bit more? Is there a script out there for this?

  5. #4
    SEO Apprentice
    Join Date
    Aug 2011
    Posts
    12
    Thanks
    4
    Thanked 1 Time in 1 Post

    You're going to need to build a custom bot of some kind. The easiest way is to hire someone who knows a scripting language and regex.

    It's not terribly difficult if you know what you're doing. If you're on a tight budget and have time to invest, you'd benefit greatly from learning enough PHP to pull the page with cURL and then scrape the content with regex.
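
As a rough illustration of that pull-then-match approach, here is a minimal sketch. It is written in Python rather than PHP, purely so it runs with the standard library alone; the two steps (fetch the page, match the posts with a regex) are the same. The markup it expects (an `<article>` containing an `<h2>` title link and a `<div class="entry-content">` that is the last element in the article) is an assumption about the target theme, not something every WordPress blog uses, and the function names are made up for this example.

```python
import re
import urllib.request
import xml.etree.ElementTree as ET

# ASSUMPTION: each post is an <article> holding an <h2><a href=...> title link
# and a <div class="entry-content"> that closes right before </article>.
# Real themes differ, so this pattern has to be adapted per site.
POST_RE = re.compile(
    r'<article\b[^>]*>.*?'
    r'<h2[^>]*>\s*<a\s+href="(?P<link>[^"]+)"[^>]*>(?P<title>.*?)</a>\s*</h2>.*?'
    r'<div\s+class="entry-content"[^>]*>(?P<body>.*?)</div>\s*</article>',
    re.DOTALL | re.IGNORECASE,
)


def fetch(url, timeout=10):
    """Download a page and return it as text (the 'pull the page' step)."""
    req = urllib.request.Request(url, headers={"User-Agent": "feed-sketch/0.1"})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def extract_posts(html):
    """Return a list of {link, title, body} dicts matched by POST_RE."""
    posts = []
    for m in POST_RE.finditer(html):
        posts.append({
            "link": m.group("link"),
            # Drop any inline tags from the title; keep the body HTML intact
            # so images and links survive.
            "title": re.sub(r"<[^>]+>", "", m.group("title")).strip(),
            "body": m.group("body").strip(),
        })
    return posts


def build_rss(posts, feed_title, feed_link):
    """Serialize the posts as a bare-bones RSS 2.0 document (a string)."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = feed_link
    ET.SubElement(channel, "description").text = "Scraped posts from " + feed_link
    for post in posts:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = post["title"]
        ET.SubElement(item, "link").text = post["link"]
        # ElementTree escapes the HTML body, which is how RSS readers expect it.
        ET.SubElement(item, "description").text = post["body"]
    return ET.tostring(rss, encoding="unicode")
```

Something like `build_rss(extract_posts(fetch(blog_url)), "My feed", blog_url)`, run on a schedule, would regenerate the feed from the blog's front page. Regex matching breaks whenever the theme changes, so expect to adjust the pattern over time.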

    Good Luck
