CHCSVParser is an Objective-C parser for CSV files.
- Mac OS X 10.7+
- iOS 6+
In order to use
CHCSVParser, you'll need to include the following two files in your project:
CHCSVParser requires ARC.
CHCSVParser works very similarly to an
NSXMLParser, in that it synchronously parses the data and invokes delegate callback methods to let you know that it has found a field, or has finished reading a line, or has encountered a syntax error.
CHCSVParser can be created in one of three ways:
- With a URL to a file
- With the contents of an
- With an
CHCSVParser can be configured to parse other "character-separated" file formats, such as "TSV" (tab-separated). You can specify the delimiter of the parser during initialization. The delimiter can only be one character in length, and cannot be any newline character or
". Additionally, depending on which options you set on the parser, you may not use
= as the delimiter either
CHCSVParser will not sanitize the output of the fields; in other words, individual fields will be returned exactly as they are found in the CSV file. However, if you wish the fields to be cleaned (surrounding double quotes stripped, characters unescaped, etc), you can specify this by setting the
sanitizesFields property to
CHCSVParser has other properties to alter the parsing behavior:
recognizesBackslashesAsEscapesallows you to parse delimited files where special characters (the delimiter, newlines, etc) are escaped using a backslash. When this option is enabled, you may not use a backslash as a delimiter. This option is disabled by default.
recognizesCommentswill skip parsing fields that being with an octothorpe (
#). These fields are reported to the parser delegate as comments, and comments are terminated by an unescaped newline character. This option is disabled by default.
recognizesLeadingEqualSignallows quoted fields to begin with an
=. Some programs use a leading equal sign to indicate that the contents of the field should be interpreted explicitly, and things like insignificant digits should not be removed. This option is disabled by default.
CHCSVWriter has several methods for constructing CSV files:
-writeField: accepts an object and writes its
-description (after being properly escaped) out to the CSV file. It will also write field separator (
,) if necessary. You may pass an empty string (
nil to write an empty field.
-finishLine is used to terminate the current CSV line. If you do not invoke
-finishLine, then all of your CSV fields will be on a single line.
-writeLineOfFields: accepts an array of objects, sends each one to
-writeField:, and then invokes
-writeComment: accepts a string and writes it out to the file as a CSV-style comment.
If you wish to write CSV directly into an
NSString, you should create an
NSOutputStream for writing to memory and use that as the output stream of the
CHCSVWriter. For an example of how to do this, see the
-[NSArray(CHCSVAdditions) CSVString] method.
CHCSVWriter can be customized with a delimiter other than
, during initialization.
There are a couple of category methods on
NSString to simplify the common reading and writing of delimited files.
In addition, the convenience APIs allow for additional parsing options beyond what is provided by
CHCSVParser. When you specify the
CHCSVParserOptionUsesFirstLineAsKeys option, parsing will return an array of
CHCSVOrderedDictionary instances, instead of an array of arrays of strings.
CHCSVOrderedDictionary is an
NSDictionary subclass that maintains a specific order to its key-value pairs, and allows you to look up keys and values by index.
CHCSVParser relies on knowing the encoding of the content. It should work with pretty much any kind of file encoding, if you can provide what that encoding is. If you do not know the encoding of the file, then
CHCSVParser can make a naïve guess.
CHCSVParser will try to guess the encoding of the file from among these options:
- MacOS Roman (
NSMacOSRomanStringEncoding; the default/fallback encoding)
- UTF-8 (
- UTF-16BE (
- UTF-16LE (
- UTF-32BE (
- UTF-32LE (
- ISO 2022-KR (
CHCSVParser is conscious of low-memory environments, such as the iPhone or iPad. It can safely parse very large CSV files, because it only loads portions of the file into memory at a single time.
Credits & Contributors
CHCSVParser uses code to discover file encoding that was provided by Rainer Brockerhoff.
CHCSVParser is licensed under the MIT license, which is reproduced in its entirety here:
Copyright (c) 2014 Dave DeLong
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.