Post by JackPost by Adam CarterTo clean up csv files I use excel's find/replace to swap the commas
occurring within fields for something benign. How does this magic
work? Different character sets within the same file?
Is it possible to do this with shell scripting?
Once Excel (or LibreOffice) reads in a csv file, the commas are no
longer present, and it just searches within the cells. It might be
possible for a shell script to do it, but you need to parse the file to
distinguish any commas separating the fields from commas within the
fields. I'm sure there are plenty of utilities to do this, but it's
certainly not trivial.
The other thing is, look up the definition (such as there is) of CSVs.
Special characters (such as commas) can be quoted. Standard practice as
far as I can tell, is that any cell containing a comma will be
double-quoted, and the quotes are stripped on import.
The other trick I learnt is that to prevent Excel mangling text, you
precede it with a single quote - for example I want eg "+7" in a cell,
so I have to enter '+7.
Cheers,
Wol