
Why is GPT-CC significantly lower than Codex? #73

Closed

BitcoinNLPer opened this issue Oct 29, 2021 · 2 comments

Comments

@BitcoinNLPer

These are the Codex results:
[screenshot: Codex results]

The following are the GPT-CC results:
[screenshot: GPT-CC results]

  • Is it caused by the quality of the pre-training corpus data?

Thanks

@Symbolk

Symbolk commented Jan 18, 2022

I'm also curious. As far as I know, the HumanEval dataset contains 164 problems, and according to the latest results in the README, the model does not pass even one of them!

[screenshot: evaluation results table from the README]
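For context, HumanEval pass rates are typically computed with OpenAI's human-eval harness. The sketch below shows the usual sampling loop; `generate_one_completion` is a hypothetical wrapper around whichever model is being evaluated (GPT-CC, Codex, etc.), not part of the harness itself.

```python
# Sketch of generating samples for HumanEval pass@k scoring,
# using OpenAI's human-eval harness (pip install human-eval).
from human_eval.data import read_problems, write_jsonl


def generate_one_completion(prompt: str) -> str:
    """Hypothetical wrapper around the model under test.

    Replace with real model calls; it should return only the generated
    function body, not the echoed prompt.
    """
    raise NotImplementedError


problems = read_problems()   # the 164 HumanEval problems
num_samples_per_task = 1     # 1 sample is enough for pass@1; use more for pass@10/100

samples = [
    dict(task_id=task_id,
         completion=generate_one_completion(problems[task_id]["prompt"]))
    for task_id in problems
    for _ in range(num_samples_per_task)
]
write_jsonl("samples.jsonl", samples)

# Then score the samples with the harness's CLI:
#   evaluate_functional_correctness samples.jsonl
```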

@taisazero
Member

Yes, that's correct. We discovered issues with the pre-training corpus data. We have fixed the issue and released a new pre-training corpus, and we are in the process of processing the dataset further and pre-training a new GPT-CC.

In the meantime, check out these awesome models: https://huggingface.co/spaces/codeparrot/code-generation-models
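As an illustration, the models listed in that space can be loaded directly with transformers. The checkpoint name below (`codeparrot/codeparrot-small`) is just one example from that collection, used here as a placeholder.

```python
# Minimal sketch: sampling a completion from one of the code-generation
# models on the Hugging Face Hub.
from transformers import pipeline

generator = pipeline("text-generation", model="codeparrot/codeparrot-small")

prompt = "def fibonacci(n):\n"
out = generator(prompt, max_new_tokens=64, do_sample=True, temperature=0.2)
print(out[0]["generated_text"])
```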
