[Haskell-cafe] Sneaking haskell in the workplace -- cleaning csv files

Tomasz Zielonka tomasz.zielonka at gmail.com
Sat Jun 16 08:01:14 EDT 2007


On Sat, Jun 16, 2007 at 12:08:22PM +0100, Jim Burton wrote:
> Tomasz Zielonka wrote:
> >It would be easier to experiment if you could provide us with an
> >example input file. If you are worried about revealing sensitive
> >information, you can change all characters other then newline,
> >~ and , to "A"s, for example. An accompanying output file, for checking
> >correctness, would be even nicer.
>
> Hi Tomasz, I can do that but they do essentially look like the example 
> above, except with 10 - 30 columns, more data in each column, and more 
> rows, maybe this side of a million. They are produced by an Oracle 
> export which escapes the delimiter (often a tilde) from within the cols. 
> The output file should have exactly one row per line, with extra 
> newlines replaced by a string given as a param (it might be a space or a 
> html tag -- I only just remembered this and my initial effort doesn't do 
> it).

I guess you've tried to convince Oracle to produce the right format in
the first place, so there would be no need for post-processing...?

I wonder what would you get if you set the delimiter to be a newline ;-)

Best regards
Tomek


More information about the Haskell-Cafe mailing list