Post Reply 
Cut: Chained Ad Path URLs
Mar. 21, 2009, 12:13 PM (This post was last modified: Apr. 02, 2009 08:50 PM by sidki3003.)
Post: #1
Cut: Chained Ad Path URLs
One of the new filters in the 2009 configs is "<script>: Cut: Chained Ad Path URLs", required to deal with concatenated scripts, which get more and more popular:
Code:
<script src="http://myserver.com/load?adscript.js,requiredscript.js,trackingscript.js"></script>

Example: http://www.spike.com/

So far so good. However, lately i also see concatenated offsite scripts:
Code:
<script src="http://myserver.com/load?http%3A//adserver.com/x.js,requiredscript.js,http%3A//trackingserver.com/y.js"></script>

Example: http://kirstenrokz.buzznet.com/user/


Below filter tests each chained component against the complete ad-list combo (AdHosts-J, AdDomains, etc.). I'm not sure whether the recursive expressions are correct and sufficiently robust, hence "WIP".
Code:
[Patterns]
Name = "<script>: Cut: Chained Ad Path URLs     9.03.20 (multi) [sd] (d.1)"
Active = TRUE
Multi = TRUE
URL = "$TYPE(htm)"
Bounds = "$NEST(<script\s,*src=$AV(*\?*,*)*,>)"
Limit = 1024
Match = "(*src=)\1$AVQ("
        "(*\?*\=)\2("
        "[^,="']+[,=]+"
        "&&"
        ",+((https+%3a)+//($LST(AdHosts-J))\8$SET(a=$GET(a) AdHj \8)|((^(^http))|(^http))("
        "$LST(AdList)$SET(a=$GET(a) \9)|(^$TST(keyword=*.a_track_s.*))"
        "((^http|[/.])|((https+%3a)+//[^/?]+)+*[/=_-])($LST(AdPaths-J)(^[a-z0-9]))\8$SET(a=$GET(a) AdPj \8)"
        "))(*(\&*)\#|*)"
        "|\#"
        ")+\#"
        ")\3"
        ""
        "&$TST(\8=*)"
        "$SET(eAdJS=$GET(eAdJS)"
        "%3Cspan class=%22ProxFly-Span%22>$GET(mHead) Chain URL:%3C/span>"
        "$ESC($GET(a))%3Cbr class=%22ProxFly-Br%22 />"
        ")"
        "($TST(volat=*.log:2*)$ADDLST(Log-Main,[$DTM(d T)]\tWEB JS_Chain_URL\t$GET(a) \t\u)|)"
        "($TST(volat=*.log:[12]*)$ADDLST(Log-Rare,WEB JS_Chain_URL\t$GET(a) \t\u)|)"
Replace = "\1\2\@\3$SET(a=)"



The benefit of extending the filter as described becomes especially obvious if you look at the second filter hit (as well as the resulting script) on latter example page, after adding below entry (found via Ghostery) to AdHosts-J:
Code:
# Ads - Lotame
[^/]++.crwdcntrl.net/$SET(7=var LOTCC={add:function(){},addAction:function()
  {},addBehavior:function(){},addInterest:function(){},addMedia:function(){},
  bcp:function(){}};)
  &&(($TST(volat=*.log:[12]*)\8&$ADDLST(Log-Rare,ALST AdHj \8 \t\u))|*)


edit: "WIP" flag removed.
Add Thank You Quote this message in a reply
Post Reply 


Messages In This Thread
Cut: Chained Ad Path URLs - sidki3003 - Mar. 21, 2009 12:13 PM
RE: Cut: Chained Ad Path URLs - ProxRocks - Mar. 21, 2009, 12:39 PM
RE: Cut: Chained Ad Path URLs - sidki3003 - Mar. 21, 2009, 12:50 PM
RE: Cut: Chained Ad Path URLs - ProxRocks - Mar. 21, 2009, 01:11 PM
RE: Cut: Chained Ad Path URLs - sidki3003 - Mar. 21, 2009, 01:14 PM
RE: Cut: Chained Ad Path URLs - lnminente - Mar. 21, 2009, 02:36 PM
RE: Cut: Chained Ad Path URLs - sidki3003 - Mar. 21, 2009, 03:01 PM
RE: Cut: Chained Ad Path URLs - lnminente - Mar. 21, 2009, 09:33 PM
RE: Cut: Chained Ad Path URLs - sidki3003 - Mar. 21, 2009, 10:06 PM
RE: Cut: Chained Ad Path URLs - lnminente - Mar. 21, 2009, 10:24 PM
RE: Cut: Chained Ad Path URLs - sidki3003 - Apr. 02, 2009, 08:48 PM

Forum Jump: