Index: sitescripts/crawler/README.md
===================================================================
new file mode 100644
--- /dev/null
+++ b/sitescripts/crawler/README.md
@@ -0,0 +1,44 @@
+crawler
+=======
+
+Backend for the Adblock Plus Crawler. It provides the following URLs
+(see the example requests below):
+
+* */crawlableSites* - Returns a list of sites to be crawled
+* */crawlerData* - Receives data on filtered elements
+
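+The exact base URL and HTTP methods depend on how the sitescripts WSGI
+application is deployed; as a rough sketch, assuming the handlers are
+exposed at the server root and protected by HTTP basic authentication
+(configured via the keys below), requests could look like this:
+
+    # Hypothetical host and credentials - adjust to your deployment
+    curl --user crawler:secret https://example.com/crawlableSites
+
+    # Assuming /crawlerData expects the collected data as a POST body
+    curl --user crawler:secret --data @crawler_data.json \
+         https://example.com/crawlerData
+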
+Required packages
+-----------------
+
+* [simplejson](http://pypi.python.org/pypi/simplejson/)
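+
+The package can be installed via _pip_, for example:
+
+    pip install simplejson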
+
+Database setup
+--------------
+
+Just execute the statements in _schema.sql_.
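+
+This README does not mandate a particular database server; as a sketch,
+assuming a MySQL database matching the _database_ and _dbuser_ keys
+described below, this could be done with:
+
+    # Placeholders - use the values from your sitescripts configuration
+    mysql -u dbuser -p database < schema.sql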
+
+Configuration
+-------------
+
+Just add an empty _crawler_ section to _/etc/sitescripts_ or _.sitescripts_.
+
+Also make sure that the following keys are configured in the _DEFAULT_
+section (see the example configuration below):
+
+* _database_
+* _dbuser_
+* _dbpassword_
+* _basic\_auth\_realm_
+* _basic\_auth\_username_
+* _basic\_auth\_password_
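+
+A minimal configuration could then look like this (all values are
+placeholders to be adapted to your setup):
+
+    [DEFAULT]
+    database=crawler
+    dbuser=crawler_user
+    dbpassword=changeme
+    basic_auth_realm=Crawler
+    basic_auth_username=crawler
+    basic_auth_password=changeme
+
+    [crawler]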
+
+Extracting crawler sites
+------------------------
+
+Make _filter\_list\_repository_ in the _crawler_ configuration section
+point to the local Mercurial repository of a filter list.
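+
+For example, with a hypothetical local clone of a filter list repository
+at _/home/user/easylist_, the _crawler_ section would become:
+
+    [crawler]
+    filter_list_repository=/home/user/easylist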
+
+Then execute the following:
+
+    python -m sitescripts.crawler.bin.extract_crawler_sites > crawler_sites.sql
+
+Now you can execute the insert statements from _crawler\_sites.sql_.
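+
+This works the same way as loading _schema.sql_ above; assuming a MySQL
+database:
+
+    mysql -u dbuser -p database < crawler_sites.sql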