Parsing Dates from CSV files with FileHelpers Library
In my current project I have found myself needing to parse a CSV file coming from a legacy C++ application. This is a fairly common task, and in the past I have always written the parsing engine myself, especially when there was some validation related to the business logic involved.
This time I have chosen to try the reportedly excellent FileHelpers library. Why reinvent the wheel yet another time? The library is well documented and in a matter of minutes I was able to start experimenting with some test data.
To simplify the discussion, suppose that the CSV file has the following structure:
# City, Date, Temperature Chicago; 20090913 07:00:00; 62.23 New York; 20090913 08:00:00; 81.3
Each row of the CSV file is mapped to the following class:
The attributes are pretty self-explanatory: they specify which is the separator value and which lines can be ignored.
The interesting part here is the way FileHelpers allows to specify a pattern to
convert a string into a DateTime
value. In this case, the pattern used
here is yyyyMMdd HH:mm:ss
to match the CSV file.
Once the mapping between the CSV and the .NET object is setup, I have created a class wrapping the FileHelpers engine (just in case some day I want to change the parsing engine), with the following (simplified) method:
This was working like a charm... until I moved the code to a separate test machine.
All of a sudden the parser would complain that it could not parse
the dates from the file. After a lot of thought, I realized that the test
machine had a different regional settings: while my machine had English formats,
the test machine had Italian formats. And for some reason FileHelpers was not able
to work correctly in this scenario (even though I had explicitly specified the date
and time pattern in the CsvRecord
class).
Luckily, FileHelpers lets you specify your own converters if the predefined ones won't do. I therefore created my custom converter for the dates as follows:
and rewritten the CsvRecord
class as follows:
This time it worked also with Italian locale.
Frankly, I don't know why the FileHelpers class had problems with the Italian locale (possibly with locales different from English ones). Probably having a look at the source code would reveal the truth.
I suspect that it is parsing the dates with a given format but without specifying an invariant culture format provider. But that is just a wild guess.
Leave a Comment