Since CSV (comma-separated values) files are supported by almost every statistics and/or math tool, I have stuck with the format for all my data collections.
Regarding the Python Programming Language
Yesterday I wanted to start using the Eclipse extension PyDev, because I was tired of using the pdb module. Even more sophisticated debugging modules like pudb did not satisfy my need for intuitive and comfortable Python debugging. Although I had heard of the PyDev extension quite a while ago, I never gave it a try, as I was reluctant to use the Eclipse framework for Python.
To parse a gzipped XML file using the minidom parser in Python, one has two options: either hand over the file object pointing to the XML file, or hand over the full content as a string. As I thought it would be the more efficient variant, I chose to use the file object returned by gzip.open(). This works for the SAX parser, but somehow fails for the minidom parser. This is surely a bug, but it persists over many versions and many operating systems (I tried Linux and Mac).
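The second option, handing over the full content as a string, can be sketched roughly as follows. This is only a minimal illustration, not a fix for the underlying problem: the function names `parse_gzipped_xml` and `write_sample` and the sample document are made up for this example, and it assumes the file is small enough to be decompressed into memory in one go.

```python
import gzip
import xml.dom.minidom


def parse_gzipped_xml(path):
    """Decompress the whole file, then let minidom parse the raw bytes."""
    with gzip.open(path, "rb") as handle:
        content = handle.read()
    # parseString() receives the content itself instead of a file object.
    return xml.dom.minidom.parseString(content)


def write_sample(path):
    """Write a tiny gzipped XML document (hypothetical sample data)."""
    with gzip.open(path, "wb") as handle:
        handle.write(b"<items><item id='1'>a</item><item id='2'>b</item></items>")
```

After calling `write_sample("sample.xml.gz")`, the document can be loaded with `parse_gzipped_xml("sample.xml.gz")` and queried as usual, for example via `getElementsByTagName("item")`. The price is that the whole decompressed file sits in memory before parsing starts, which is exactly the overhead the file-object variant was meant to avoid.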