Skip to content

Cobol program using MS SQL Server as backend data base #445

@jcstrydom

Description

@jcstrydom

Background

I am currently working on a project where a Cobol based system is using a MS SQL Server instance as its back end.

I am able to connect to the SQL server database via a JDBC connection which returns the table into a Spark Dataframe, however it is still encoded with EBCDIC encoding, which is an obvious problem when using AWS GLUE and wanting to post the data into parquet files for down stream processes. I am also able to parse the copybook via your copybook parser.

However, these two structures are vastly different, which are posing limitations to the process that I would like to build. I would still want to use your package as I believe there are inherent synergies.

Question

There are a few questions:

  1. Is there any advise that you can give me with regards my use case and using your package?
  2. Is there a way that I can just use your decoding technology while in flight, or after the data has landed in the dataframe?
  3. Is there a way to flatten the schema structure once the parser has completed?

Your assistance would be greatly appreciated.

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions