The data "dump" was done because we were dealing with microarray data... which was/is a bit of a P.I.T.A.
The method we used was a generally accepted protocol at the time.
Nowadays, the procedures have changed a bit and I wouldn't dump as much data.
I was told by a nationally renowned statistician when we first started doing the arrays that if we gave him a set of array data, he could tell if they were created on the same day, if we used the same samples, if different technicians did them, if it was under a full moon..... Basically, they're VERY tricky to replicate and sensitive to all sorts of stuff...
But yah, I know dumping data is generally not the best thing to do. I'll dump 2 or 3 points out of 1000, but more than that gives me the heeby-jeebies nowadays.