Pydoop is a Python interface to Hadoop that allows you to write MapReduce applications in pure Python. Pydoop offers several features not commonly found in other Python libraries for Hadoop: -- a rich HDFS API; -- a MapReduce API that allows to write pure Python record readers / writers, partitioners and combiners; -- transparent Avro (de)serialization; -- easy installation-free usage; WWW: http://crs4.github.io/pydoop/