+ Reply to Thread + Post New Thread
Results 1 to 3 of 3

Thread: Recognize sentences

  1. #1
    Noobie
    Join Date
    Apr 2008
    Posts
    2
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default Recognize sentences


  2. Shorten URL    SEO Services    Buy Xrumer

    Sponsored Links

  3. #2
    Noobie
    Join Date
    Apr 2008
    Posts
    27
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    Compare contents of each (div/table) tag against the search query used to locate the site. If a match is found, you have found your div tag which should contain the content. Also remember to strip eg. javascript (in order to avoid google ads/unrelated content in your article).

    Not a bulletproof solution but the best I could think of in a minute.

  4. #3
    Noobie
    Join Date
    Apr 2008
    Posts
    2
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    Thanks for the advice, heiska!

    What I came up with yesterday:

    1. Just allow a-z A-Z 0-9 , ! . ? -
    If there's any other character in it, it's not a sentence! This will filter out some correct sentences but works quite good...

    2. Check for the length and the number of spaces in it.

    3. Only grab content between p-html-tags!

    The results are pretty good now...

+ Reply to Thread

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts