Skip to content
This repository has been archived by the owner on Jun 17, 2024. It is now read-only.
maheshwarishikha edited this page Jun 19, 2018 · 8 revisions

Welcome to the svn-connector-for-storediq wiki!

Short Name

Adding new data sources to IBM StoredIQ using Connector API SDK

Short Description

Demonstrates the methodology to build a custom connector for a new data source using Connector API SDK with IBM StoredIQ.

Offering Type

IBM StoredIQ, Python

Introduction

IBM StoredIQ enables organizations to gain visibility and control over unstructured data, rapidly connecting people to business-critical information. With IBM StoredIQ, organizations can identify, analyze, and act on dynamic data to meet their eDiscovery, information governance, data management, and records management requirements. IBM StoredIQ connects to many different data sources. Data sources include email servers, file shares, Box etc. A connector is a software component that establishes connection between a data source and StoredIQ. Using IBM StoredIQ Connector API SDK any developer can develop connectors to new data sources outside IBM StoredIQ development environment. This code pattern demonstrates the methodology for the same.

Author

By Shikha Maheshwari, Balaji Kadambi, Manjula Hosurmath

Code

Demo

N/A

Video

N/A

Overview

IBM StoredIQ platform helps organizations understand and analyze their data in place, identify relevant subsets, and then take action by providing a powerful, massively scalable platform for unstructured data management. IBM StoredIQ supports 85+ data sources out of the box. Some of the data sources include Box, Microsoft Office 365, FileNet etc.

Data Source is an important part for IBM StoredIQ solution. A software component of IBM Stored IQ that is used to connect to a data source is called connector. IBM StoredIQ provides a Connector API SDK which can be used by business partners and customers to create custom connector for new data sources which StoredIQ does not support. The IBM StoredIQ Connecter API SDK simplifies connector development by decoupling connector logic from the StoredIQ application logic. It can also be used to customize and extend existing connector. Once you create a new connector, you can use it to manage data in IBM StoredIQ just like you do it with the supported data sources.

This code pattern helps you to understand the methodology and the steps of building a connector for a new data source. When the user has completed this code pattern, they will understand how to:

  • Develop the IBM StoredIQ Connector
  • Integrate the connector with live IBM StoredIQ
  • Register the connector with live IBM StoredIQ
  • Test the connector

Flow

architecture

  1. Develop the connector for data source using Connector SDK.
  2. Integrate and Register the connector with StoredIQ Data Server and Gateway Server.
  3. Add volume for the new connector using Administrator Dashboard of StoredIQ, harvest it and create infoset.
  4. Content of the data source can be seen at Data Workbench dashboard.

Included components

  • IBM StoredIQ: IBM StoredIQ Platform provides scalable analysis and governance of unstructured data in-place across disparate and distributed email, file shares, desktops, and collaboration sites.

  • SVN Server: Subversion is a version control system that keeps track of changes made to files and folders or directories, thus facilitating data recovery and providing a history of the changes that have been made over time.

  • FTP Server: An FTP server is a computer which has a file transfer protocol (FTP) address and is dedicated to receiving an FTP connection. The FTP is a standard network protocol used for the transfer of computer files between a client and server on a computer network.

Featured technologies

  • Python: Python is a programming language that lets you work quickly and integrate systems more effectively.

  • IBM StoredIQ Connector API SDK: Using IBM StoredIQ Connector API SDK, developers can develop Connectors to new data sources outside IBM StoredIQ development environment.

Blog

Title - Adding new data sources to IBM StoredIQ using Connector API SDK

Data is growing exponentially. We are the creators and consumers of data. With this growth of data, organizations find it difficult to make business value out of data effectively. This growth in data has also contributed to new challenges like security, governing and protecting privacy. IBM StoredIQ platform provides powerful solutions for managing unstructured data in-place. It addresses the problems of records management, electronic discovery, compliance, storage optimization, and data migration initiatives.

Organizations have freedom to choose data sources for their need. It may involve multiple data sources with different versions. A data source can be considered as a location which contains unstructured content. By providing an in-depth assessment of unstructured data where it is, the StoredIQ gives organizations visibility into data to make more informed business and legal decisions. Data Source is an important part for IBM StoredIQ solution. StoredIQ provides flexibility to customers to choose data source and it supports 85+ data sources out of the box. Some of the data sources include Box, Microsoft Office 365, FileNet etc. All the features of IBM StoredIQ can be utilized by making a connection between a data source and StoredIQ. The connection between a data source and StoredIQ is established using a connector.

IBM StoredIQ provides a Connector API SDK which can be used by business partners and customers to create custom connector for new data sources which StoredIQ does not support. The IBM StoredIQ Connecter API SDK simplifies connector development by decoupling connector logic from the StoredIQ application logic. It can also be used to customize and extend existing connector. Once you create a new connector, you can use it to manage data in StoredIQ just like you do it with the supported data sources.

Our new code pattern Adding new data sources to IBM StoredIQ using Connector API SDK helps you to understand the methodology and the steps of building a connector of a new data source. When the user has completed this code pattern, they will understand how to develop, integrate, register and test the connector for IBM StoredIQ.

Links