Here is a list of technologies used for the Krawla Project (in no particular order):
python
zeromq
mongo
json
msgpack
-
binary serialization protocol
faster than json
good for data that needs to be sent over the wire
bad for files that a human needs to read
sqlite
flask
beautifulsoup
css selectors
regular expressions (regex)
requests
firefox
selenium
stem
git
virtualenv
ssh
pandas