Csv Shredder Java Callout

This directory contains the Java source code and pom.xml file required to compile a set of Java callouts for Apigee, that involve parsing CSVs, and handling the output of that data.

callout	description
`CsvShredder`	parses a CSV, creates a Java map object from that data, and stores the map into a context variable. It also stores a JSON version of that data into a different context variable.
`MapExtractor`	retrieves items from a Java Map object, by key.

These callouts can work together or independently.

For example, you could use CsvShredder to parse a CSV within Apigee, then use PopulateCache to store the resulting Java Map object in the Apigee cache. Then LookupCache to retrieve a Java object from Cache, and then MapExtractor to query the cached object on subsequent API calls.

As another example, you could use CsvShredder to parse a CSV within Apigee, then just emit the resulting JSON to the response.

Disclaimer

This example is not an official Google product, nor is it part of an official Google product.

Two classes

com.google.apigee.callouts.CsvShredder - reads the ambient request or response message, parse the CSV, serialize as a Java map, and also as a json object.
com.google.apigee.callouts.MapExtractor - extract a value from a context variable that contains a Java map.

In each case, you can configure the callout with a set of properties.

CsvShredder configuration

The CsvShredder accepts some optional properties that affect its behavior.

property	description
`fieldlist`	a comma-separated list of names to apply to the fields in the CSV. The default behavior is to use the first row in the CSV as a header.
`trim-spaces`	a string, if "true", then the callout trims leading and trailing spaces from the values in the CSV
`output-format`	either `map`, or `list`. Defaults to `map`. If the value in the first column is not unique, consider parsing to a list.
`contrive-primary-key`	`true` or `false`. Setting this to `true` may be helpful when parsing to a map, and the first element is not unique. In this case, the callout will contrive a unique primary key for each element.

Example:

<JavaCallout name='Java-ShredCsv'>
  <Properties>
    <Property name="trim-spaces">true</Property>
    <Property name="output-format">list</Property>
  </Properties>
  <ClassName>com.google.apigee.callouts.CsvShredder</ClassName>
  <ResourceURL>java://apigee-csv-parse-20220104.jar</ResourceURL>
</JavaCallout>

See the example bundle for more configuration examples.

Using these callouts

You do not need to build the source code in order to use the callouts in Apigee. All you need is the built JAR, and the appropriate configuration for the callouts. If you want to build it, feel free. The instructions are at the bottom of this readme.

create a cache called 'csv-cache' in the Apigee environment. This is used by the demonstration apiproxy. You can use the Admin UI to do so.

Now deploy the API Proxy bundle with your favorite tool, for example importAndDeploy.js

# for Apigee Edge SaaS or Apigee X
node importAndDeploy.js -v -o $ORG -e $ENV -d bundle

Use a client to load a CSV into the cache, via the proxy. Eg,

# for Apigee X
endpoint=https://whatever-your-endpoint-is

curl -i -X POST \
    -H content-type:text/csv \
    $endpoint/csv-shredder/shred?name=sample \
    --data-binary @sample.csv

Use a client to query from the cache, via the proxy. Eg,

curl -i -X GET \
    $endpoint/csv-shredder/shred/sample/PRIMARY_KEY

Dependencies

Maven will resolve all the dependencies during the build / compile phase. The jars that are dependencies must be available as resources for the proxy at runtime. The maven pom file should copy those files to the right place, automatically.

Notes

The example proxy uses the CsvShredder callout to produce a Java object of type Map<String,Map<String,String>>, and then caches it using the CachePopulate policy. The example proxy uses the queryparam "name" to store the cached item. For this demonstration, you can have as many different cached maps as you like, each accessible by name.
The first row of the CSV is expected to be the header row, which defines the names of the fields in each row. If you don't do that, then you can specify a fieldlist property.
When reading into a map, for all data rows, the first field in each row of the CSV is treated as the primary key; in other words, it is treated as the key for the map. The remaining fields are a map of "field name" => "value", where the field names are those that are defined in the first row. If the first field is not unique in your data, then the resulting map will be "missing" some values. To work around this, use either the contrive-primary-key property, or set the output-format to list. Both of these properties are documented above.

Example 1: Simple CSV

For example, the super-simple.csv file has these contents:

   PK,field1,field2
   A,B,C
   D,E,F
   G,H,I

To shred that csv, use this command:

  curl -i -X POST \
    -H content-type:text/csv \
    $endpoint/csv-shredder/shred?name=simple \
     --data-binary @csv/super-simple.csv

Notice that the file is specified to curl with --data-binary. If you use -d or --data-ascii, curl will eliminate newlines, which will cause the CSV to be mangled before it is sent to the API Proxy.

The cache will then hold a Map with 3 key/value pairs. it will look like this:

  {
     A => { field1 => B, field2 => C, PK => A },
     D => { field1 => E, field2 => F, PK => D },
     G => { field1 => H, field2 => I, PK => G }
  }

To query the map, you must specify the map name, and the value of the "primary key", both of which are passed as url path elements.

For example,

curl -i $endpoint/csv-shredder/field/simple/A

result:

{
  "status": "ok",
  "data": {
    "field1" : "B",
    "field2" : "C",
    "PK" : "A"
  }
}

Example 2: Sacramento Real Estate Transactions

I got this sample CSV data for Sacramento real estate transactions from SpatialKey: https://support.spatialkey.com/spatialkey-sample-csv-data/

This CSV looks like this:

street,city,zip,state,beds,baths,sqft,type,sale_date,price,latitude,longitude
3526 HIGH ST,SACRAMENTO,95838,CA,2,1,836,Residential,Wed May 21 00:00:00 EDT 2008,59222,38.631913,-121.434879
51 OMAHA CT,SACRAMENTO,95823,CA,3,1,1167,Residential,Wed May 21 00:00:00 EDT 2008,68212,38.478902,-121.431028
2796 BRANCH ST,SACRAMENTO,95815,CA,2,1,796,Residential,Wed May 21 00:00:00 EDT 2008,68880,38.618305,-121.443839
2805 JANETTE WAY,SACRAMENTO,95815,CA,2,1,852,Residential,Wed May 21 00:00:00 EDT 2008,69307,38.616835,-121.439146
6001 MCMAHON DR,SACRAMENTO,95824,CA,2,1,797,Residential,Wed May 21 00:00:00 EDT 2008,81900,38.51947,-121.435768
...

You can see it has a header row, and then a series of lines, each with fields corresponding to the header row.

To shred this more complicated CSV and load it into a Java Map, which then gets inserted into cache, use this:

  curl -i -X POST \
    -H content-type:text/csv \
    $endpoint/csv-shredder/shred?name=sacramento \
     --data-binary @csv/Sacramento-RealEstate-Transactions.csv

Then, to query the map:

curl -i "$endpoint/csv-shredder/field/sacramento/51%20OMAHA%20CT"

the result:

{
  "status": "ok",
  "data": {
    "baths" : "1",
    "zip" : "95823",
    "beds" : "3",
    "price" : "68212",
    "street" : "51 OMAHA CT",
    "state" : "CA",
    "longitude" : "-121.431028",
    "latitude" : "38.478902",
    "type" : "Residential",
    "sqft" : "1167",
    "sale_date" : "Wed May 21 00:00:00 EDT 2008",
    "city" : "SACRAMENTO"
  }
}

Example 3: Converting CSV to JSON

This example just sends in a CSV, and gets back an equivalent JSON in response.

  curl -i -X POST \
    -H content-type:text/csv \
    $endpoint/csv-shredder/tojson \
     --data-binary @csv/Sacramento-RealEstate-Transactions.csv

result:

HTTP/1.1 200 OK
Date: Wed, 05 May 2021 17:35:36 GMT
Content-Type: application/json
Content-Length: 347604
Connection: keep-alive
apiproxy: csv-shredder r3
X-time-target-elapsed: 0.0
X-time-total-elapsed: 324.0

{
  "2109 HAMLET PL" : {
    "zip" : "95608",
    "baths" : "2",
    "city" : "CARMICHAEL",
    "sale_date" : "Tue May 20 00:00:00 EDT 2008",
    "street" : "2109 HAMLET PL",
    "price" : "484000",
    "latitude" : "38.602754",
    "sqft" : "1598",
    "state" : "CA",
    "beds" : "2",
    "type" : "Residential",
    "longitude" : "-121.329326"
  },
  "2100 BEATTY WAY" : {
    "zip" : "95747",
    "baths" : "2",
    "city" : "ROSEVILLE",
    "sale_date" : "Thu May 15 00:00:00 EDT 2008",
    "street" : "2100 BEATTY WAY",
    "price" : "208250",
    "latitude" : "38.737882",
    "sqft" : "1371",
    "state" : "CA",
    "beds" : "3",
    "type" : "Residential",
    "longitude" : "-121.308142"
  },
  "2103 BURBERRY WAY" : {
    "zip" : "95835",
    "baths" : "2",
    "city" : "SACRAMENTO",
    "sale_date" : "Mon May 19 00:00:00 EDT 2008",
    "street" : "2103 BURBERRY WAY",
    "price" : "362305",
    "latitude" : "38.67342",
    "sqft" : "1800",
    "state" : "CA",
    "beds" : "3",
    "type" : "Residential",
    "longitude" : "-121.508542"
  },
  ...

Notice: the order of the items in the JSON is not necessarily the same as the order of the items in the original CSV !

Building

unpack (if you can read this, you've already done that).
configure the build on your machine by loading the Apigee jars into your local cache.
```
bash ./buildsetup.sh
```
Build with maven.
```
mvn clean package
```

The above will copy the generated JAR and its dependencies to the bundle directory.

LICENSE

Bugs

The unit tests for this project are thin.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
bundle/apiproxy		bundle/apiproxy
callout		callout
csv		csv
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
NOTICE		NOTICE
Readme.md		Readme.md
buildsetup.sh		buildsetup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bundle/apiproxy

bundle/apiproxy

callout

callout

csv

csv

.gitignore

.gitignore

CONTRIBUTING.md

CONTRIBUTING.md

LICENSE

LICENSE

NOTICE

NOTICE

Readme.md

Readme.md

buildsetup.sh

buildsetup.sh

Repository files navigation

Csv Shredder Java Callout

Disclaimer

Two classes

CsvShredder configuration

Using these callouts

Dependencies

Notes

Example 1: Simple CSV

Example 2: Sacramento Real Estate Transactions

Example 3: Converting CSV to JSON

Building

LICENSE

Bugs

About

Releases

Packages

Languages

License

DinoChiesa/Apigee-Csv-Shredder

Folders and files

Latest commit

History

Repository files navigation

Csv Shredder Java Callout

Disclaimer

Two classes

CsvShredder configuration

Using these callouts

Dependencies

Notes

Example 1: Simple CSV

Example 2: Sacramento Real Estate Transactions

Example 3: Converting CSV to JSON

Building

LICENSE

Bugs

About

Topics

Resources

License

Stars

Watchers

Forks

Languages