Post Reply 
Base: Speeding up ad-list
Feb. 19, 2009, 11:57 PM
Post: #1
Base: Speeding up ad-list
A short explaining how lists works:

A list file is very similar to writing (word1|word2|word3|...), but use to be more and more large. This post will speak about how i did to create an adlist for proxomitron starting from the famous easy list from adblock plus.

Taking a look to its keywords, most of them start by "http://","/" or ".". Let's try to use this list (of course after some adaptations to proxomitron):

Having the URL to parse in the variable \1, and being http://www.host.com/sub1/sub2.adbureau.example if we use $TST(\1=$LST(adlist)*), then it would parse only one time and would match if we find a keyword wich match with the beginning of our url

One possible code to use would be $TST(\1=*$LST(adlist)*) but it would be really really slow. It would look all the words in the list for:
Code:
http://www.host.com/sub1/sub2.adbureau.example
ttp://www.host.com/sub1/sub2.adbureau.example
tp://www.host.com/sub1/sub2.adbureau.example
p://www.host.com/sub1/sub2.adbureau.example
://www.host.com/sub1/sub2.adbureau.example
//www.host.com/sub1/sub2.adbureau.example
...
ub1/sub2.adbureau.example
...
ample
mple
ple
le
e


So after some days of research, a pencil, a paper and using the log window, i got to something useable.

Copy this filter to the clipboard and import it, go to the test window and test with this code in it:
href=http://www.host.com/sub1/sub2.adbureau.ext
Code:
[Patterns]
Name = "<example> Parsing Adlist Release Candidate {ln}090220"
Active = FALSE
Limit = 256
Match = "href=$AV(\1)"
        ""
        "$LOG(!C\1)$TST(\1=("
        "(\w)\3|"
        "*((^(http|ftp)://|//)(http|.|/)\w)\3"
        ")$LOG(W\3)$TST(\3="
        "(.|/|)(\w)\9$LOG(w\9)prxfail"
        "))"

In gray you will see which parts of the url will be parsed by the adlist.
Feel free to post suggestions or comment anything.
Add Thank You Quote this message in a reply
Post Reply 


Messages In This Thread
Base: Speeding up ad-list - lnminente - Feb. 19, 2009 11:57 PM
RE: Base: Speeding up ad-list - whenever - Feb. 20, 2009, 06:34 AM
RE: Base: Speeding up ad-list - sidki3003 - Feb. 20, 2009, 06:53 AM
RE: Base: Speeding up ad-list - lnminente - Feb. 20, 2009, 05:16 PM
RE: Base: Speeding up ad-list - sidki3003 - Feb. 20, 2009, 05:41 PM
RE: Base: Speeding up ad-list - Kye-U - Feb. 20, 2009, 06:17 PM
RE: Base: Speeding up ad-list - sidki3003 - Feb. 20, 2009, 07:30 PM
RE: Base: Speeding up ad-list - lnminente - Feb. 20, 2009, 06:30 PM
RE: Base: Speeding up ad-list - Kye-U - Feb. 20, 2009, 06:32 PM
RE: Base: Speeding up ad-list - lnminente - Feb. 20, 2009, 07:37 PM
RE: Base: Speeding up ad-list - Kye-U - Feb. 21, 2009, 04:54 AM
RE: Base: Speeding up ad-list - Siamesecat - Feb. 21, 2009, 06:31 AM
RE: Base: Speeding up ad-list - lnminente - Feb. 21, 2009, 03:39 PM
RE: Base: Speeding up ad-list - lnminente - Mar. 15, 2009, 07:13 PM
RE: Base: Speeding up ad-list - sidki3003 - Mar. 15, 2009, 08:35 PM
RE: Base: Speeding up ad-list - lnminente - Mar. 15, 2009, 09:47 PM
RE: Base: Speeding up ad-list - whenever - Mar. 16, 2009, 03:15 AM
RE: Base: Speeding up ad-list - lnminente - Mar. 16, 2009, 10:17 AM
RE: Base: Speeding up ad-list - sidki3003 - Mar. 16, 2009, 03:26 PM
RE: Base: Speeding up ad-list - lnminente - Mar. 16, 2009, 08:08 PM
RE: Base: Speeding up ad-list - ProxRocks - Mar. 17, 2009, 12:42 AM
RE: Base: Speeding up ad-list - lnminente - Mar. 17, 2009, 12:17 PM
RE: Base: Speeding up ad-list - lnminente - May. 27, 2009, 01:04 AM

Forum Jump: