
KFServing Inference Client

A Go re-implementation of seldon-batch-processor. See the docs for details on its usage.

The main reason we chose to re-implement it is that the original implementation follows the Seldon protocol, whereas we need the KFServing V2 protocol in order to use MLServer as the inference backend.
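For context, a KFServing V2 REST inference call is a POST to /v2/models/{model}/infer with a JSON body of named tensors. The standalone Go sketch below illustrates the request/response shape only; it is not the client's actual code, and the host localhost:8080, model name my-model, and tensor name input-0 are placeholder assumptions (in the client, the host and model come from the -host and -m flags).

package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

// InferTensor is one input or output tensor in a V2 inference payload.
type InferTensor struct {
	Name     string        `json:"name"`
	Shape    []int         `json:"shape"`
	Datatype string        `json:"datatype"`
	Data     []interface{} `json:"data"`
}

// InferRequest and InferResponse mirror the V2 REST body layout.
type InferRequest struct {
	Inputs []InferTensor `json:"inputs"`
}

type InferResponse struct {
	Outputs []InferTensor `json:"outputs"`
}

func main() {
	// Placeholder host and model name; the client's -host and -m flags
	// play the same role.
	host := "localhost:8080"
	model := "my-model"

	req := InferRequest{
		Inputs: []InferTensor{{
			Name:     "input-0", // placeholder tensor name
			Shape:    []int{1, 4},
			Datatype: "FP32",
			Data:     []interface{}{0.1, 0.2, 0.3, 0.4},
		}},
	}

	body, err := json.Marshal(req)
	if err != nil {
		panic(err)
	}

	// Standard V2 protocol inference endpoint.
	url := fmt.Sprintf("http://%s/v2/models/%s/infer", host, model)
	resp, err := http.Post(url, "application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Print the raw V2 response, which carries an "outputs" tensor list.
	out, err := io.ReadAll(resp.Body)
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}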

Usage

$ ./kfserving-inference-client -h
Usage of ./kfserving-inference-client:
  -host string
    	The hostname for the seldon model to send the request to, which can be the ingress of the Seldon model or the service itself
  -i string
    	The local filestore path where the input file with the data to process is located
  -m string
    	model name
  -o string
    	The local filestore path where the output file should be written with the outputs of the batch processing
  -u int
    	Batch size greater than 1 can be used to group multiple predictions into a single request. (default 100)
  -w int
    	The number of parallel request processor workers to run for parallel processing (default 100)
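For example, a typical invocation might look like the following; input.txt, output.txt, localhost:8080, and my-model are placeholders to substitute with your own paths, host, and model name:

$ ./kfserving-inference-client \
    -host localhost:8080 \
    -m my-model \
    -i input.txt \
    -o output.txt \
    -u 100 \
    -w 10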

Build Docker Image

$ make build