>_ cosmonaut code

purpose

it's a code explorer, explainer and assessment tool.

to be honest, we built this tool because we needed it for the work we do. sure, there are some great tools out there, but none of them quite hit the mark for our needs.

generative ai will 10x developer output. but what about the quality? this tool aims to address that, i.e., use the source of the problem to provide a solution.

goals

provide a viable tool for local codebase analysis.
help new developers quickly get up to speed on a large or legacy codebase.
provide a tool that will help developers and code maintainers manage their code and start a conversation on quality.
allow code owners to check the overall health of the code in a simple way.
output an actionable report with auto-PRs that will improve the code base over time.

non-goals

~~provide a tool that will automagically improve a codebase.~~
provide a tool that does not require further conversation, review or analysis of the code.
provide a tool that does not require thinking or discussion.

use cases

new developers on a project.
new maintainers of a codebase, or takeover of a codebase or project.
entry-point for a formal external audit of code base.
entry-point for a formal external security audit of code base.
entry-point for due diligence of technology assets.
code owner reporting on technical-debt and general health of asset.

current state

currently, most things are working and a solid report is produced in either json or html.

the most stable and tested provider is openai. the best results, by far, are with the gpt-4 service, which uses the latest preview model. the openai gpt-3.5 service works, but tends to overstate issues, and the resolutions it offers are not as good. it does, however, run faster and cost less.

google are late to the party, but have come in the door with a half-drunken bottle. the public instance of gemini-pro is faster and cheaper than openai's gpt-3.5, and produces better results. it is slightly behind the gpt-4 preview model, but not far. do your own testing; we've found the late 2023 comparisons online to be highly misleading. the google provider is not as tested as openai, so you will see more errors in the log output. it should recover from these errors, but it is less robust.

disclaimer

this is really early days. running over a really big repo with the latest model will be super slow and may fail. we've tested it on up to ~1500 code files; with timeout retries etc., that takes a couple of hours and costs about 5 usd. your mileage may vary. we think the value will come when it can be run over multiple models, with the results compared and filtered.

as with all similar tools, it does produce false flags. it overplays or (rarely) downplays security issues. in some cases it may flag so many issues that the response is truncated, creating an error. we are working on this.

there is significant variation between models and even review runs on the same repository with the same model, particularly with older models. some models are silent on obvious issues and transfixed on trivial issues.

there are issues with the language file type matching via the github linguist regex. we will likely move to something more robust, or fix the crate that causes the mismatching.

we recommend that you run it multiple times at first to gain a baseline; fix the big issues and then let it run periodically.
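one way to turn several runs into a baseline is to keep only the findings that recur. the sketch below is not part of the tool: the `Finding` key (file path plus issue label) and the `recurring_findings` helper are hypothetical names, and matching on exact strings is a simplification, since models rarely word the same issue identically twice.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Hypothetical finding key: (file path, issue label).
/// The real report format is not assumed here.
type Finding = (String, String);

/// Returns the findings reported in at least `min_runs` separate runs,
/// sorted by file path and then by issue label.
fn recurring_findings(runs: &[Vec<Finding>], min_runs: usize) -> Vec<Finding> {
    let mut counts: BTreeMap<&Finding, usize> = BTreeMap::new();
    for run in runs {
        // Deduplicate within a run so a single noisy run cannot vote twice.
        let unique: BTreeSet<&Finding> = run.iter().collect();
        for finding in unique {
            *counts.entry(finding).or_insert(0) += 1;
        }
    }
    counts
        .into_iter()
        .filter(|(_, n)| *n >= min_runs)
        .map(|(finding, _)| finding.clone())
        .collect()
}
```

with three runs and `min_runs` set to 2, an issue flagged in only one run is dropped, which is a cheap first filter for false flags before a human reviews the rest.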

right now it's deliberately a barebones offering. it works well, and we have gotten value from it, but there is a lot more to do. it's been fun to do.

the google public api provider works, but is less robust than openai.

there is a local instance wired up. it does work, but it is highly fragile and unlikely to complete. it currently uses lm studio.

usage

download pre-release

MacOS Apple Silicon

configuration

configure: add a settings.json, maybe in the settings folder, with the following:

{
    "sensitive": {
        "api_key": "[YOUR_API_KEY]"
    },
    "repository_path": "[FULL_PATH_TO_REPO]",
    "report_output_path": "[FULL_PATH_TO_OUTPUT]",
    "chosen_provider": "[CHOICE OF PROVIDER]",
    "chosen_service": "[CHOICE OF SERVICE]",
    "output_type": "html"
}

chosen_provider is in:

openai (default)
google (note API key only, ADC does not work as this is the public version)

chosen_service is in:

gpt-4 (default)
gpt-3.5
gemini-pro (for google provider)

output_type is in:

html
json (default)
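the options above only work in certain combinations, e.g. gemini-pro belongs to the google provider. a minimal sketch of how those rules could be checked is below. it is not the tool's actual code: the `Provider` enum and the three functions are hypothetical names, and rejecting a google provider with no chosen_service is an assumption, made because the documented default (gpt-4) is not a google service.

```rust
/// Providers listed in this README (type name is hypothetical).
#[derive(Debug, Clone, Copy, PartialEq)]
enum Provider {
    OpenAi,
    Google,
}

/// Maps the `chosen_provider` setting to a provider; openai is the default.
fn parse_provider(value: Option<&str>) -> Result<Provider, String> {
    match value {
        None | Some("openai") => Ok(Provider::OpenAi),
        Some("google") => Ok(Provider::Google),
        Some(other) => Err(format!("unknown chosen_provider: {other}")),
    }
}

/// Checks that `chosen_service` belongs to the provider; gpt-4 is the default.
fn resolve_service(provider: Provider, value: Option<&str>) -> Result<&'static str, String> {
    match (provider, value) {
        (Provider::OpenAi, None) | (Provider::OpenAi, Some("gpt-4")) => Ok("gpt-4"),
        (Provider::OpenAi, Some("gpt-3.5")) => Ok("gpt-3.5"),
        (Provider::Google, Some("gemini-pro")) => Ok("gemini-pro"),
        // Assumption: no default is documented for google, so ask for one.
        (Provider::Google, None) => Err("set chosen_service for google".to_string()),
        (p, Some(other)) => Err(format!("{other} is not a service of {p:?}")),
    }
}

/// Maps the `output_type` setting to a known format; json is the default.
fn parse_output_type(value: Option<&str>) -> Result<&'static str, String> {
    match value {
        None | Some("json") => Ok("json"),
        Some("html") => Ok("html"),
        Some(other) => Err(format!("unknown output_type: {other}")),
    }
}
```

each function takes `None` when the key is missing from settings.json, so the defaults listed above live in one place and a mismatched pair such as openai with gemini-pro fails early with a readable message.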

run:

export SENSITIVE_SETTINGS_PATH=[PATH_TO_YOUR_SETTINGS.JSON]

download release above

mv cosmonaut_code_0.2.0_macos-aarch64 cosmonaut_code

./cosmonaut_code

via rust locally

tldr

install rust; clone the repo; cd repo; add config (see above); cargo run.

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

git clone https://github.com/cosmonaut-nz/cosmonaut-code.git

cd cosmonaut-code

add settings (as per above)

cargo run

contributing

yes please!!

see contributing for the rules; they are standard, though.

work status

we do our best to release working code. we hacked this out pretty quickly so the code's quality is not all that right now.

status today is: "it works, and the happy path is pretty solid. deviate from the path and there be dragons"

outline tasks

>_ we are cosmonaut
