Left: | ||
Right: |
OLD | NEW |
---|---|
(Empty) | |
1 abpcrawler | |
2 ========== | |
3 | |
4 Firefox extension that loads a range of websites and records which | |
Sebastian Noack
2015/04/27 14:55:50
Apparently its not only a Firefox extension but al
Wladimir Palant
2015/05/07 00:04:59
Done.
| |
5 elements are filtered by [Adblock Plus](http://adblockplus.org). | |
6 | |
7 Requirements | |
8 ------------ | |
9 | |
10 * [Python 2.x](https://www.python.org) | |
Sebastian Noack
2015/04/27 14:55:50
We actually require Python 2.7 specifically, as we
Wladimir Palant
2015/05/07 00:04:59
Done.
| |
11 * [The Jinja2 module](http://jinja.pocoo.org/docs) | |
12 * [mozrunner module](https://pypi.python.org/pypi/mozrunner) | |
Sebastian Noack
2015/04/27 14:55:50
I think you should add Firefox to that list as wel
Wladimir Palant
2015/05/07 00:04:59
Done.
| |
13 | |
14 Running | |
15 ------- | |
16 | |
17 Execute the following: | |
18 | |
19 ./run.py -b /usr/bin/firefox urls.txt outputdir | |
20 | |
21 This will run the specified Firefox binary to crawl the URLs from `urls.txt` | |
22 (one URL per line). The resulting data and screenshots will be written to the | |
23 `outputdir` directory. Firefox will close automatically once all URLs have been | |
24 processed. | |
25 | |
26 Optionally, you can provide the path to the Adblock Plus repository - Adblock | |
saroyanm
2015/05/04 18:13:43
Maybe make sense to also add some small notes abou
Wladimir Palant
2015/05/07 00:04:59
Done.
| |
27 Plus will no longer be downloaded then. | |
28 | |
29 License | |
saroyanm
2015/05/04 18:13:43
Is there a purpose why we use MPL instead of GPL ?
Wladimir Palant
2015/05/07 00:04:59
This extension was written before we switched to t
saroyanm
2015/05/07 13:19:00
Got it, thanks.
| |
30 ------- | |
31 | |
32 This Source Code is subject to the terms of the Mozilla Public License | |
33 version 2.0 (the "License"). You can obtain a copy of the License at | |
34 http://mozilla.org/MPL/2.0/. | |
OLD | NEW |