Code generation and code search for Python and JavaScript.
Similar to GitHub Copilot, with one major difference: code search is leveraged to make up for smaller models and less data. The generation model uses search results along with the surrounding code context to generate contextual code. Moreover, code search results from all over GitHub are displayed in a separate file, along with the exact URL of the source code.
https://marketplace.visualstudio.com/items?itemName=samipdahal.codeon
Currently, the GPT-Neo-125M model is used for generation.
Training code is under the 'gen' directory.
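To illustrate how search results and code context might be combined before generation, here is a minimal Python sketch. It is not the code in the 'gen' directory: the function name `build_prompt`, the `max_results` parameter, and the prompt layout are all assumptions made for this example.

```python
def build_prompt(query, context, search_results, max_results=2):
    """Assemble a prompt from retrieved snippets, editor context, and the query.

    Hypothetical layout: retrieved snippets first, then the user's own code,
    then the query as a comment for the model to continue from.
    """
    parts = []
    # Keep only the top few snippets so the prompt stays short for a small model.
    for snippet in search_results[:max_results]:
        parts.append("# Retrieved example:\n" + snippet.strip())
    # Include the code surrounding the cursor, if there is any.
    if context.strip():
        parts.append(context.rstrip())
    # The query goes last so that generation continues right after it.
    parts.append("# " + query)
    return "\n\n".join(parts) + "\n"


prompt = build_prompt(
    query="concat two dicts",
    context="a = {'x': 1}\nb = {'y': 2}",
    search_results=["def merge(d1, d2):\n    return {**d1, **d2}"],
)
print(prompt)
```

The resulting string would then be passed to the generation model. Placing the query last is a design choice of this sketch, so that the model's continuation reads as the answer to the query.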
The CodeBERT-base model is used for code search, along with ScaNN, an approximate nearest neighbor search library. The CodeSearchNet dataset is used as the codebase to search over.
Code to fine-tune the model and set up ScaNN is under the 'search' directory.
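The search step boils down to finding the stored code embeddings closest to the query embedding. The sketch below shows that idea with an exact, brute-force cosine similarity search in plain Python. It is a stand-in for illustration only: it does not use ScaNN or CodeBERT, and the three-dimensional vectors are made-up placeholders for real embeddings.

```python
import math


def normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def top_k(query_vec, corpus_vecs, k=2):
    """Return (index, cosine similarity) for the k most similar corpus vectors."""
    q = normalize(query_vec)
    scored = []
    for idx, vec in enumerate(corpus_vecs):
        v = normalize(vec)
        # Dot product of unit vectors equals cosine similarity.
        score = sum(a * b for a, b in zip(q, v))
        scored.append((score, idx))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(idx, score) for score, idx in scored[:k]]


# Made-up embeddings; a real setup would store one vector per code snippet.
corpus = [
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.8, 0.2, 0.1],
]
query = [1.0, 0.0, 0.0]
results = top_k(query, corpus, k=2)
print([idx for idx, _ in results])
```

An exact scan like this is linear in the size of the codebase, which is why an approximate method such as ScaNN is preferable for a dataset the size of CodeSearchNet.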
#YOUR_QUERY.
//YOUR_QUERY.
(Note the dot ‘.’ at the end.)
# concat two dicts.
// merge two arrays.
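The query format above (a comment that ends with a dot) can be recognized with a few string checks. The following Python sketch is an assumption about how such a trigger could be detected, not the extension's actual logic; `extract_query` and `COMMENT_PREFIXES` are hypothetical names.

```python
import os

# Comment prefix per supported file extension (hypothetical mapping).
COMMENT_PREFIXES = {".py": "#", ".js": "//"}


def extract_query(filename, line):
    """Return the query text if the line is a trigger comment, else None."""
    ext = os.path.splitext(filename)[1]
    prefix = COMMENT_PREFIXES.get(ext)
    if prefix is None:
        return None  # unsupported file type
    text = line.strip()
    # A trigger line starts with the comment prefix and ends with a dot.
    if not text.startswith(prefix) or not text.endswith("."):
        return None
    query = text[len(prefix):-1].strip()
    return query or None


print(extract_query("utils.py", "# concat two dicts."))
print(extract_query("utils.js", "// merge two arrays."))
print(extract_query("utils.py", "# an ordinary comment"))
```

Requiring the trailing dot keeps ordinary comments from triggering a search or generation request.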
- The extension currently supports only Python and JavaScript and won't run on files that do not end with '.py' or '.js'.
VS Code 1.59.0 or later.
If you spot any mistakes or possible improvements, please feel free to let me know. Contributions are welcome!