Yahoo Filters...
Feb. 11, 2009, 05:42 PM
Post: #31
RE: Yahoo Filters...
I've got some matches after reloading a page.

Code:
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Banks-Rescue-Will-Make-Things-cnbc-14311295.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Below-Book-Value-Sign-Me-tsmp-14306701.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/BlackBerrymaker-RIM-outlook-rb-14323059.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Dems-Tentative-settlement-on-apf-14324154.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Five-ways-to-use-social-cbsmpf-14302317.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Madoffs-wife-withdrew-15-mln-rb-14324140.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/MillerCoors-4thquarter-profit-apf-14306436.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Oil-prices-steady-after-US-apf-14309696.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Peanut-Co-owner-urged-apf-14323497.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Peanut-company-in-salmonella-apf-14310406.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Senate-passes-stimulus-Next-cnnm-14311542.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Stock-Preacher-Issues-cnw-14305621.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Stocks-show-partial-rebound-apf-14323611.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Wall-St-tumbles-on-bank-plan-rb-14312204.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Wall-Street-CEOs-feel-the-rb-14324330.html
SiteSpec Yahoo c mod ysm clear     http://finance.yahoo.com/news/Why-Geithners-Graveyard-Touch-usnews-14320840.html
Feb. 11, 2009, 07:32 PM
Post: #32
RE: Yahoo Filters...
I've added the test string to the class value expression (rather than to the ID expression, as before). Did you do the same?
Feb. 11, 2009, 07:43 PM
Post: #33
RE: Yahoo Filters...
i tried it in about four different places, none axed it...
but i can't say as i can rule out any errors on my part...

are you talking about here:
Code:
"class=("
"$AV((ydirspns)\3)(*</div >&&(^*<div)*)$SET(1=solo \3)"
"|$AV((ad|lrec|m(d m|)inimantle*|ovt*|sponsor*|yad-*|ynw-(ov-box|rt-art-bd-ad)|yshp_ad_*|ysm)\3)"
"$INEST(<div,</div)</div >$SET(1=c \3)"
")"
Feb. 11, 2009, 08:01 PM
Post: #34
RE: Yahoo Filters...
Yep. So after the addition my expression looks like:
Code:
|<div(^$TST(script=*))[^>]++(
class=(
$AV((ydirspns)\3)(*</div >&&(^*<div)*)$SET(1=solo \3)|$AV((
ad|lrec|m(d m|)inimantle*|mod ysm*|ovt*|sponsor*|yad-*|ynw-(ov-box|rt-art-bd-ad)|yshp_ad_*|ysm
)\3)$INEST(<div,</div)</div >$SET(1=c \3)
)
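
For readers who don't speak Proxomitron syntax, here is a rough Python sketch of what the class alternation above does once "mod ysm*" is added. It is only an approximation: the name `is_ad_class` is made up for illustration, the Proxomitron wildcard `*` is rendered as `.*`, and the whole-value match that `$AV(...)` performs is rendered with `fullmatch`.

```python
import re

# Approximate regex rendering of the class alternation from the filter.
# Assumption: Proxomitron's "*" is translated to ".*" and matching is
# case-insensitive; this is not the filter itself.
_AD_CLASS = re.compile(
    r"ad|lrec|m(?:d m)?inimantle.*|mod ysm.*|ovt.*|sponsor.*|yad-.*"
    r"|ynw-(?:ov-box|rt-art-bd-ad)|yshp_ad_.*|ysm",
    re.IGNORECASE,
)


def is_ad_class(value: str) -> bool:
    """Return True if the whole class value matches one of the alternatives."""
    return _AD_CLASS.fullmatch(value) is not None
```

With this rendering, a class value of "mod ysm clear" (the one showing up in the log lines earlier in the thread) is caught by the new "mod ysm*" alternative, while a plain "mod" is not.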
Feb. 11, 2009, 09:56 PM
Post: #35
RE: Yahoo Filters...
works like a charm (my mishap was in "how" i was merging it in with the last "ysm" in one of the match rows, mea culpa)...

your "help-link-wrap" list is the perfect example for another addition, "y-feedback"...


updated version:
Code:
Name = "Yahoo: Clean up Site     7.09.07 [sd] (d.s) {po 2/11/09}"
Active = TRUE
URL = "$TYPE(htm)(^([^.]+.|)search.yahoo)[^/]++.yahoo.co(^$TST(keyword=*.a_code.*))"
Limit = 16384
Match = "<table("
        "[^>]+>("
        "([^>]+>)+{3} (sponsored [a-z]+)\3(*</table >&&(^*<t(r|able))*)"
        "( <table[^<]+<tr(*</table >&&(^*<t(r|able))*))+ $NEST(<table,</table >)$SET(1=table-1-\3)"
        "|([^>]+>)++{1,5} (advertisem[a-z]+)\3 <"
        "$INEST(<table,</table)</table >$SET(1=table-2-\3)"
        ")"
        "|[^>]++\s(width=[#750][^>]++height=([#112])\1|id=$AV(yfncbrobtn)$SET(1=Finance Top Ads))"
        "$INEST(<table,</table)</table >"
        ")"
        ""
        "|<td[^>]++("
        "class=$AV((ymapttd)\1|ygma$SET(1=+)*)$INEST(<td,</td)</td >"
        "|id=$AV(ygma(^greet)*)$SET(1=Search Bar)$INEST(<td,</td)</td >"
        ")"
        ""
        "|<div(^$TST(script=*))[^>]++("
        "class=("
        "$AV((ydirspns)\3)(*</div >&&(^*<div)*)$SET(1=solo \3)|$AV(("
        "ad|lrec|m(d m|)inimantle*|mod ysm*|ovt*|sponsor*|yad-*|ynw-(ov-box|rt-art-bd-ad)|yshp_ad_*|ysm"
        ")\3)$INEST(<div,</div)</div >$SET(1=c \3)"
        ")"
        "|id=("
        "$AV(("
        "help-link-wrap|y-feedback|nux-container|"
        "adlrec|ks-ypn-ads|marketplace|mantlecx|minimantle|news-sponsor|nns|northad|southad|spnsr"
        "|spons[oe]red*|srec|swads|ws_ie7|yat(gad|ad_*)|ydireast|yfi_(pf_)+(ad*|ysm)|ygmatop|ymadbn"
        "|ymap(clad|ead|nad|west|footer)[0-9]+|eyebrow$SET(2=<div style="line-height:7px">&nbsp;</div>"
        "<div id="eyebrow" style="display:none"><br id="sethomepage" /></div>)"
        ")\3)$INEST(<div,</div)</div >$SET(1=i \3)"
        "|$AV(footer)$SET(3=p)(*> )\#($INEST(<div,</div)&&("
        "*(<(ul|span)\3 id=$AV(copyright)*</$TST(\3) > |<p[ >]*</p > )\#)+{1,*}*"
        ")$SET(1=bot-\3)$SET(2=<div id="footer"\@)"
        ")"
        ")"
        ""
        "|<!-- (start north banner -*- end (north banner)\1|overtur(e*)\3 -*- / (overtur$TST(\3))\1) -- >"
        ""
        "|<script(^$TST(comment=[12]))("
        "(^[^>]++src=)[^>]+> $INEST(<script,?+{20}*,</script)</script > $NEST(<script,</script > )+"
        "$NEST(<noscript,*/(/((row|us).(bc)\1|??.(ard)\1).yahoo.com|(M)\1\=[0-9][0-9]+.[0-9]+.)*,</noscript >)"
        "|[^>]+> var ADF(adids)\1*</script >"
        ")"
        "&($TST(volat=*.log:2*)$ADDLST(Log-Main,[$DTM(d T)]\tSiteSpec Yahoo \1 \t\u)|)"
Replace = "\r\n<!-- PROX-S: Removed by Yahoo Cleaner (\1) -->\r\n\2"
          "$SET(sSpec=$GET(sSpec)\1.)"
Feb. 11, 2009, 10:18 PM
Post: #36
RE: Yahoo Filters...
Added. :)

BTW, you would really help me save time if you would finally bring yourself to post or send in the matches, no matter whether they are easy to find or not! ;)
Feb. 11, 2009, 10:24 PM
Post: #37
RE: Yahoo Filters...
sure thing, sorry 'bout that...
Feb. 21, 2009, 12:34 PM
Post: #38
RE: Yahoo Filters...
ProxRocks, while modifying the Yahoo filters for the stand-alone sets, i've noticed two glitches at Y!Search:

"Yahoo Search: Clean up Site" was missing an ad block (class="bbox"). We didn't see that at US Yahoo, b/c there it has a "SPONSOR RESULTS" heading, which is targeted by the generic ad filters.

While there, i verified the other matching expressions and ended up removing half of them (and adding two).
I've attached the matches for the new layout.


Doing a search at Yahoo now ends up with the typical tracking string:
http://search.yahoo.com/search;_ylt=A0geu.LJ8Z9JLz8BhBSl87UF?p=ipod&fr=sfp&fr2=&iscqry=

So i have extended "Yahoo et al.: Multi URL Untangler" to also match form action attributes. It works, but i'm unsure about this change, because i think i had it like that a long time ago, but removed it because it was breaking something. Now i don't remember what this "something" was.
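
To illustrate what stripping that tracking string amounts to, here is a small Python sketch. It is not the filter itself: the helper name `strip_ylt` is hypothetical, and it only mirrors the `;_yl[ct]=` branch with the same stop characters the filter uses (quotes, `?`, `&`, backslash, space), ignoring the extra check for embedded redirect targets.

```python
import re

# Same stop characters as the filter's character class:
# double quote, single quote, '?', '&', backslash and space.
_YLT = re.compile(r""";_yl[ct]=[^"'?&\\ ]+""")


def strip_ylt(url: str) -> str:
    """Remove the first ';_ylt=...' or ';_ylc=...' path parameter from url."""
    return _YLT.sub("", url, count=1)
```

Applied to the search URL above, this would leave `http://search.yahoo.com/search?p=ipod&fr=sfp&fr2=&iscqry=`.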

Code:
[Patterns]
Name = "Yahoo et al.: Multi URL Untangler     9.02.14 (multi) [sd] (d.s) TEST"
Active = TRUE
Multi = TRUE
URL = "($TYPE(htm)|$TYPE(js))([^/]++.yahoo.co|[^.]+.altavista.com/|(www.|)alltheweb.com/)"
Bounds = "<a*>|<form*>"
Limit = 1024
Match = "<(a(rea)+|form)\0(\s*(href|action)=)\2$AVQ((\\+"+(^ \+))\3("
        ""
        "*/("
        "click(%3f|\?)u\=http(s|)\7(%3a|:)(%2f|/)(%2f|/)+\8(%26|\&)$SET(4=http\7://$UESC(\8)\3)*"
        "|\?http(s|)\7(%3a|:)(%2f|/)(%2f|/)+\8$SET(4=http\7://$UESC(\8))"
        "|:http(s|)\7://+\8$SET(4=http\7://\8)"
        ")"
        "|"
        "*\*-+http(s|)\7("
        "(%3a|:)(%2f|\\+/)(%2f|(\\+)\9/)+(*\*-+http(s|)\7(%3a|:)(%2f|/)(%2f|/)+)+"
        "(\8(done\=*|search\?*|\=%22*)\5|\8;_yl[ct]\=[^"'?&\\ ]+\5|\8)$SET(4=http\7:\9/\9/$UESC(\8))"
        "|%253a%(25|)2f(%(25|)2f)+(*\*-+http(s|)\7%253a%(25|)2f(%(25|)2f)+)+"
        "(\8\&[^"' ]+\5|\8)$SET(4=http\7://$UESC($UESC(\8)))"
        ")"
        "|"
        "\4;_yl(c\=|t\=(^*\*-+http))[^"'?&\\ ]+\5"
        ""
        "))(\6)"
        "&(^$TST(comment=1)|$TST(tNoscript=1))($TST(\2=*class=*)|$SET(1= class=\3ProXPref\3))"
Replace = "<\0\1\2\3\4\5\6"

Name = "Yahoo Search: Clean up Site     9.02.14 [sd] (d.s)"
Active = TRUE
URL = "$TYPE(htm)([^.]+.search.yahoo.com/|search.([^.]+.|)yahoo.com/)"
Limit = 16384
Match = "<div[^>]++\s("
        "class=$AV(bbox|yschspns&\1)"
        "|id=($AV(east&&\1)|$AV(yschsec&\1)[^>]+> (<img*> )+<h[1-6])"
        ")$INEST(<div,</div)</div >"
        "|<h2> <a[^>]+> sponsor [^<]+</a > </h2 > $NEST(<ul,</ul >)$SET(1=sponsor)"
        "( <div[^>]++class=$AV(whatsthis)$INEST(<div,</div)</div >|)"
        "&($TST(volat=*.log:2*)$ADDLST(Log-Main,[$DTM(d T)]\tSiteSpec YSearch \1 \t\u)|)"
Replace = "\r\n<!-- PROX-S: Removed by Yahoo Search Cleaner (\1) -->\r\n"
          "$SET(sSpec=$GET(sSpec)\1.)"


Attached File(s)
.txt  ysearch-matches.txt (Size: 5.06 KB / Downloads: 470)
Feb. 26, 2009, 12:08 PM
Post: #39
RE: Yahoo Filters...
"Yahoo: Auto Login 8.09.03 (edit!) [sd] (o.s)"
was missing this login screen:
http://https-px-.login.yahoo.com/config/....yahoo.com
...linked from: http://briefcase.yahoo.com/bc//home

Apparently because the ".tries=1" query param is incrementing the corresponding input value to:
Code:
<input type="hidden" name=".tries" value="2">

I've changed the anti-looping test from name=$AV(.tries) value=[#2:*] to name=$AV(.tries) value=[#3:*] .
Also, the filter name wasn't indicating that it is set to multi-match.
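
A rough Python sketch of the anti-looping idea, for illustration only. It assumes the test is there to stop the auto-login once the ".tries" counter reaches a threshold, and that `[#3:*]` means "3 or higher"; the function name `should_auto_login` and the regex are made up and not part of the filter.

```python
import re

# Hypothetical extraction of the hidden ".tries" counter from the login page.
_TRIES = re.compile(r'name="?\.tries"?\s+value="?(\d+)', re.IGNORECASE)


def should_auto_login(html: str, max_tries: int = 3) -> bool:
    """Return False once the .tries counter has reached max_tries."""
    match = _TRIES.search(html)
    tries = int(match.group(1)) if match else 0
    return tries < max_tries
```

With the old threshold of 2, the briefcase login page (value="2") would already count as a loop and be skipped; with 3 it gets filled in.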


BTW, is anyone running the Yahoo Untangler "also match forms" test version?


Attached File(s)
.txt  ylogin_2-26.txt (Size: 2.44 KB / Downloads: 361)
Feb. 26, 2009, 01:09 PM
Post: #40
RE: Yahoo Filters...
many thanks...

(Feb. 26, 2009 12:08 PM)sidki3003 Wrote:  BTW, is anyone running the Yahoo Untangler "also match forms" test version?

are you referring to Yahoo et al.: Multi URL Untangler 9.02.14 (multi) [sd] (d.s) TEST?
if so, yep, i'm running it...


speaking of Yahoo filters, i've recently noticed that some my.yahoo.com news links are "bouncing through" http://feedproxy.google.com/ (funny! google has found a way to bring revenue in from a yahoo site, and it's all because yahoo turned down microsoft's merge/acquisition offer)...

i'm not seeing any way "around" the feedproxy links, so i'm wondering if it's just my obsessive paranoia or if these links shouldn't be "worried about"... 'cause i'm thinking about having Proxo "highlight" them similar to "mailto" links...
Feb. 26, 2009, 01:30 PM
Post: #41
RE: Yahoo Filters...
(Feb. 26, 2009 01:09 PM)ProxRocks Wrote:  are you referring to Yahoo et al.: Multi URL Untangler 9.02.14 (multi) [sd] (d.s) TEST?
if so, yep, i'm running it...

Yes. Because i'm still not sure whether this was a safe change, since form submits are somewhat critical.


Quote:i'm not seeing any way "around" the feedproxy links, so i'm wondering if it's just my obsessive paranoia or if these links shouldn't be "worried about"... 'cause i'm thinking about having Proxo "highlight" them similar to "mailto" links...

I haven't seen that yet. I think feedproxy.google.com is slowly replacing feeds.feedburner.com, after the former acquired the latter. So it's not just an ad domain. (Nonetheless, the config is blocking feedproxy.google.com/~s/* scripts, if "Block specific Third Party Scripts" is active.)
Feb. 26, 2009, 01:35 PM
Post: #42
RE: Yahoo Filters...
(Feb. 26, 2009 01:30 PM)sidki3003 Wrote:  
(Feb. 26, 2009 01:09 PM)ProxRocks Wrote:  are you referring to Yahoo et al.: Multi URL Untangler 9.02.14 (multi) [sd] (d.s) TEST?
if so, yep, i'm running it...

Yes. Because i'm still not sure whether this was a safe change, since form submits are somewhat critical.

i'm not sure "where" i'd test that, to be honest...
prxbx and SSL sites are about the only "form submits" that i can think of encountering...
i know some sites have "hidden forms", is the fear of safety in regards to "hidden forms"?
Feb. 26, 2009, 01:40 PM
Post: #43
RE: Yahoo Filters...
It's more about doing a Yahoo search, or logging in/out, or doing something in an application (e.g. Y!Mail), and ending up with an error page.
Feb. 26, 2009, 02:56 PM
Post: #44
RE: Yahoo Filters...
everything's working fine-and-dandy here then :D
Feb. 27, 2009, 11:57 PM
Post: #45
RE: Yahoo Filters...
Well, things are getting complicated. I hope it's worth the effort, and Yahoo isn't changing the code tomorrow. (I seriously wonder why people care to respond to config changes, even though the user base is well below 10K.)


Anyway, i did get an error page, after logging out (clicking "Abmelden"), at:
http://de.yahoo.com/
...but it's not the added <form> test.
-> extend $UESC for this subroutine.

Also, "5 Year CD" below the 2nd result at:
http://search.yahoo.com/search?p=loan&fr...2=&iscqry=
...didn't get you to the supposed link.
-> use double $UESC for this subroutine.

Also, the above-mentioned 2nd result, "E-LOAN: Savings Account,...", is a "mined" link. Click the link, but don't release the mouse button; drag instead, hit ESC, and see the link change.
-> look for "ourl" attribute.
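
The single vs. double $UESC point can be sketched in Python roughly like this. It is a simplified illustration, not a port of the filter: the name `untangle` and the marker regex are assumptions, and it only covers the case where the real target follows a literal `*` (optionally `*-`) in the redirect URL.

```python
import re
from urllib.parse import unquote

# A literal '*' (optionally followed by '-') right before an embedded,
# possibly URL-encoded, http(s) target. Illustrative pattern only.
_MARKER = re.compile(r"\*-*(?=https?(?:%253a|%3a|:))", re.IGNORECASE)


def untangle(url: str, max_rounds: int = 2) -> str:
    """Return the target after the last '*' marker, decoded up to max_rounds times."""
    markers = list(_MARKER.finditer(url))
    if not markers:
        return url  # nothing embedded, leave the link alone
    target = url[markers[-1].end():]
    for _ in range(max_rounds):
        if re.match(r"https?://", target, re.IGNORECASE):
            break  # scheme is readable, no further decoding needed
        target = unquote(target)
    return target
```

A target encoded once (`http%3a//...`) needs one decoding round, while a double-encoded one (`http%253a%252f%252f...`) only turns into a usable link after the second round, which is the situation the "5 Year CD" link ran into.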

Code:
[Patterns]
Name = "Yahoo et al.: Multi URL Untangler     9.02.27 (multi) [sd] (d.s) TEST6"
Active = TRUE
Multi = TRUE
URL = "($TYPE(htm)|$TYPE(js))([^/]++.yahoo.co|[^.]+.altavista.com/|(www.|)alltheweb.com/)"
Bounds = "<a*>|<form*>"
Limit = 1024
Match = "<(a(rea)+|form)\0(\s*(href|action)=)\2$AVQ((\\+"+(^ \+))\6\1)( ourl=($AV(*))\3|$TST(\1="
        ""
        "*/("
        "click(%3f|\?)u\=http(s|)\7(%3a|:)(%2f|/)(%2f|/)+\8(%26|\&)$SET(3=\6http\7://$UESC($UESC(\8))\6)*"
        "|\?http(s|)\7(%3a|:)(%2f|/)(%2f|/)+\8$SET(3=\6http\7://$UESC(\8))"
        "|:http(s|)\7://+\8$SET(3=\6http\7://\8)"
        ")"
        "|"
        "*\*-+http(s|)\7("
        "(%3a|:)(%2f|\\+/)(%2f|(\\+)\9/)+(*\*-+http(s|)\7(%3a|:)(%2f|/)(%2f|/)+)+"
        "((*(done\=|search\?|\=%22)*)\8|\8;_yl[ct]\=[^"'?&\\ ]+\5|\8)$SET(3=\6http\7:\9/\9/$UESC(\8)\5)"
        "|%253a%(25|)2f(%(25|)2f)+(*\*-+http(s|)\7%253a%(25|)2f(%(25|)2f)+)+"
        "(\8\&[^"' ]+\5|\8)$SET(3=\6http\7://$UESC($UESC(\8))\5)"
        ")"
        "|"
        "\8;_yl(c\=|t\=(^*\*-+http))[^"'?&\\ ]+$SET(3=\6\8\5)\5"
        ""
        "))(\4)"
        "&(^$TST(comment=1)|$TST(tNoscript=1))$SET(1=$TST(\2=(^*class=)*) class=\6ProXPref\6)"
Replace = "<\0\1\2\3\4"