Skip to content

Latest commit

 

History

History
21 lines (14 loc) · 543 Bytes

File metadata and controls

21 lines (14 loc) · 543 Bytes

Hugging Face VSCode Endpoint Server

starcoder server for huggingface-vscode custom endpoint.

Can't handle distributed inference very well yet.

Usage

pip install -r requirements.txt
python main.py

Fill http://localhost:8000/api/generate/ into Hugging Face Code > Model ID or Endpoint in VSCode.

API

curl -X POST http://localhost:8000/api/generate/ -d '{"inputs": "", "parameters": {"max_new_tokens": 64}}'
# response = {"generated_text": ""}