Docs fix#826
Conversation
jwilber
left a comment
There was a problem hiding this comment.
Added some small changes and approved:
models/geneformer.md: Changed 20M of 26M cells to 20M of the 26M cellspretrain.md: Changed Pott's model to Potts model (no apostrophe). Added comma here after the word tier here "we are working on a free tier so a credit card...". Remove training after pretraining (redundant) here: To load pretraining training and validation data with mapped UniRef90
sequences to UniRef50 clustersinitialization-guide.md: Separated sentence here: The port number for a Jupyter Lab server, default port is 8888 -> The port number for a Jupyter Lab server. The default port is 8888
training-models.md: Changed two word issues: context specific -> context-specific, usecase -> use case.
|
@trvachov thanks for sharing this for my review. It looks like the text-related issues in the bug were fixed by the LLM, but issues with charts (outlined in point 2 of the bug report) are still remaining. What is the plan to fix these? (Also just a note that it looks like the "!!! note" rendering issue is still there on the GitHub preview of the file - see screenshot - but don't know if this will be fine on the docs?)
|
4a1b8eb to
f86b91a
Compare
|
/ok to test |
@trvachov, there was an error processing your request: See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/ |
f86b91a to
bfd1cee
Compare
|
/ok to test bfd1cee |
Codecov ReportAll modified and coverable lines are covered by tests ✅
✅ All tests successful. No failed tests found. Additional details and impacted files@@ Coverage Diff @@
## main #826 +/- ##
==========================================
- Coverage 84.40% 84.39% -0.02%
==========================================
Files 138 138
Lines 8685 8685
==========================================
- Hits 7331 7330 -1
- Misses 1354 1355 +1 |
Signed-off-by: Timur Rvachov <trvachov@nvidia.com>
bfd1cee to
38cd661
Compare
|
/ok to test |
@trvachov, there was an error processing your request: See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/1/ |
|
/ok to test 38cd661 |
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com> Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
|
/ok to test 8e79b71 |
### Description
### For the Geneformer documentation:
1. **Capitalization standardization**:
- Fixed capitalization of "BioNeMo", "Geneformer", "HuggingFace",
"ReLU", "BERT MLM"
- Corrected spelling of "Crohn's disease" (previously "Chron's disease")
- Fixed "children" (previously "chidlren")
2. **Formatting improvements**:
- Properly formatted model version bullet points with nesting
- Added proper headings for property categories
- Fixed displayed values (e.g., ".5M" → "0.5M")
- Standardized formatting of data collection/labeling methods sections
3. **Image captions**:
- Replaced low-quality image captions with descriptive, properly
formatted titles
- Made chart descriptions more professional and consistent
4. **Grammatical improvements**:
- Fixed article usage and punctuation
- Improved sentence structure and clarity
- Fixed section headings capitalization and consistency
5. **Fixed broken notes**:
- Corrected `!! note` to `!!! note` for proper rendering
### For the ESM-2 pretraining documentation:
1. **Grammar and clarity improvements**:
- Fixed article usage ("a ESM-2" → "an ESM-2")
- Fixed formatting of numeric values (e.g., "1." → "1.0")
- Fixed typos ("depreciation" → "deprecation")
- Fixed "trainiing" → "training"
2. **Consistency in terminology**:
- Standardized "BioNeMo" capitalization
- Ensured consistent treatment of "ESM-2" references
3. **Structure and formatting**:
- Improved spacing and paragraph breaks
- Fixed section formatting and readability
### For the training-models documentation:
1. **Capitalization and consistency**:
- Standardized capitalization of model sizes (8M, 650M, 3B)
- Fixed capitalization of "ESM2", "Geneformer", "Python", "YAML"
- Changed "WandB" to "Weights and Biases" consistently
2. **Formatting improvements**:
- Changed code blocks consistently to include language tags
- Added proper spacing and improved paragraph formatting
- Fixed punctuation in lists and note sections
3. **Grammar and clarity**:
- Added missing commas after introductory phrases
- Fixed formatting of lists for better readability
- Made bulleted explanations more consistent
### Type of changes
<!-- Mark the relevant option with an [x] -->
- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Refactor
- [x] Documentation update
- [ ] Other (please describe):
### CI Pipeline Configuration
Configure CI behavior by applying the relevant labels:
-
[SKIP_CI](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#skip_ci)
- Skip all continuous integration tests
-
[INCLUDE_NOTEBOOKS_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_notebooks_tests)
- Execute notebook validation tests in pytest
-
[INCLUDE_SLOW_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_slow_tests)
- Execute tests labelled as slow in pytest for extensive testing
> [!NOTE]
> By default, the notebooks validation tests are skipped unless
explicitly enabled.
#### Authorizing CI Runs
We use
[copy-pr-bot](https://docs.gha-runners.nvidia.com/apps/copy-pr-bot/#automation)
to manage authorization of CI
runs on NVIDIA's compute resources.
* If a pull request is opened by a trusted user and contains only
trusted changes, the pull request's code will
automatically be copied to a pull-request/ prefixed branch in the source
repository (e.g. pull-request/123)
* If a pull request is opened by an untrusted user or contains untrusted
changes, an NVIDIA org member must leave an
`/ok to test` comment on the pull request to trigger CI. This will need
to be done for each new commit.
### Usage
<!--- How does a user interact with the changed code -->
```python
TODO: Add code snippet
```
### Pre-submit Checklist
<!--- Ensure all items are completed before submitting -->
- [ ] I have tested these changes locally
- [ ] I have updated the documentation accordingly
- [ ] I have added/updated tests as needed
- [ ] All existing tests pass successfully
---------
Signed-off-by: Timur Rvachov <trvachov@nvidia.com>
Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com>
Signed-off-by: Cory Ye <cye@nvidia.com>
### Description
### For the Geneformer documentation:
1. **Capitalization standardization**:
- Fixed capitalization of "BioNeMo", "Geneformer", "HuggingFace",
"ReLU", "BERT MLM"
- Corrected spelling of "Crohn's disease" (previously "Chron's disease")
- Fixed "children" (previously "chidlren")
2. **Formatting improvements**:
- Properly formatted model version bullet points with nesting
- Added proper headings for property categories
- Fixed displayed values (e.g., ".5M" → "0.5M")
- Standardized formatting of data collection/labeling methods sections
3. **Image captions**:
- Replaced low-quality image captions with descriptive, properly
formatted titles
- Made chart descriptions more professional and consistent
4. **Grammatical improvements**:
- Fixed article usage and punctuation
- Improved sentence structure and clarity
- Fixed section headings capitalization and consistency
5. **Fixed broken notes**:
- Corrected `!! note` to `!!! note` for proper rendering
### For the ESM-2 pretraining documentation:
1. **Grammar and clarity improvements**:
- Fixed article usage ("a ESM-2" → "an ESM-2")
- Fixed formatting of numeric values (e.g., "1." → "1.0")
- Fixed typos ("depreciation" → "deprecation")
- Fixed "trainiing" → "training"
2. **Consistency in terminology**:
- Standardized "BioNeMo" capitalization
- Ensured consistent treatment of "ESM-2" references
3. **Structure and formatting**:
- Improved spacing and paragraph breaks
- Fixed section formatting and readability
### For the training-models documentation:
1. **Capitalization and consistency**:
- Standardized capitalization of model sizes (8M, 650M, 3B)
- Fixed capitalization of "ESM2", "Geneformer", "Python", "YAML"
- Changed "WandB" to "Weights and Biases" consistently
2. **Formatting improvements**:
- Changed code blocks consistently to include language tags
- Added proper spacing and improved paragraph formatting
- Fixed punctuation in lists and note sections
3. **Grammar and clarity**:
- Added missing commas after introductory phrases
- Fixed formatting of lists for better readability
- Made bulleted explanations more consistent
### Type of changes
<!-- Mark the relevant option with an [x] -->
- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Refactor
- [x] Documentation update
- [ ] Other (please describe):
### CI Pipeline Configuration
Configure CI behavior by applying the relevant labels:
-
[SKIP_CI](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#skip_ci)
- Skip all continuous integration tests
-
[INCLUDE_NOTEBOOKS_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_notebooks_tests)
- Execute notebook validation tests in pytest
-
[INCLUDE_SLOW_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_slow_tests)
- Execute tests labelled as slow in pytest for extensive testing
> [!NOTE]
> By default, the notebooks validation tests are skipped unless
explicitly enabled.
#### Authorizing CI Runs
We use
[copy-pr-bot](https://docs.gha-runners.nvidia.com/apps/copy-pr-bot/#automation)
to manage authorization of CI
runs on NVIDIA's compute resources.
* If a pull request is opened by a trusted user and contains only
trusted changes, the pull request's code will
automatically be copied to a pull-request/ prefixed branch in the source
repository (e.g. pull-request/123)
* If a pull request is opened by an untrusted user or contains untrusted
changes, an NVIDIA org member must leave an
`/ok to test` comment on the pull request to trigger CI. This will need
to be done for each new commit.
### Usage
<!--- How does a user interact with the changed code -->
```python
TODO: Add code snippet
```
### Pre-submit Checklist
<!--- Ensure all items are completed before submitting -->
- [ ] I have tested these changes locally
- [ ] I have updated the documentation accordingly
- [ ] I have added/updated tests as needed
- [ ] All existing tests pass successfully
---------
Signed-off-by: Timur Rvachov <trvachov@nvidia.com>
Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com>
Signed-off-by: Farhad Ramezanghorbani <farhadr@nvidia.com>
### Description
### For the Geneformer documentation:
1. **Capitalization standardization**:
- Fixed capitalization of "BioNeMo", "Geneformer", "HuggingFace",
"ReLU", "BERT MLM"
- Corrected spelling of "Crohn's disease" (previously "Chron's disease")
- Fixed "children" (previously "chidlren")
2. **Formatting improvements**:
- Properly formatted model version bullet points with nesting
- Added proper headings for property categories
- Fixed displayed values (e.g., ".5M" → "0.5M")
- Standardized formatting of data collection/labeling methods sections
3. **Image captions**:
- Replaced low-quality image captions with descriptive, properly
formatted titles
- Made chart descriptions more professional and consistent
4. **Grammatical improvements**:
- Fixed article usage and punctuation
- Improved sentence structure and clarity
- Fixed section headings capitalization and consistency
5. **Fixed broken notes**:
- Corrected `!! note` to `!!! note` for proper rendering
### For the ESM-2 pretraining documentation:
1. **Grammar and clarity improvements**:
- Fixed article usage ("a ESM-2" → "an ESM-2")
- Fixed formatting of numeric values (e.g., "1." → "1.0")
- Fixed typos ("depreciation" → "deprecation")
- Fixed "trainiing" → "training"
2. **Consistency in terminology**:
- Standardized "BioNeMo" capitalization
- Ensured consistent treatment of "ESM-2" references
3. **Structure and formatting**:
- Improved spacing and paragraph breaks
- Fixed section formatting and readability
### For the training-models documentation:
1. **Capitalization and consistency**:
- Standardized capitalization of model sizes (8M, 650M, 3B)
- Fixed capitalization of "ESM2", "Geneformer", "Python", "YAML"
- Changed "WandB" to "Weights and Biases" consistently
2. **Formatting improvements**:
- Changed code blocks consistently to include language tags
- Added proper spacing and improved paragraph formatting
- Fixed punctuation in lists and note sections
3. **Grammar and clarity**:
- Added missing commas after introductory phrases
- Fixed formatting of lists for better readability
- Made bulleted explanations more consistent
### Type of changes
<!-- Mark the relevant option with an [x] -->
- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Refactor
- [x] Documentation update
- [ ] Other (please describe):
### CI Pipeline Configuration
Configure CI behavior by applying the relevant labels:
-
[SKIP_CI](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#skip_ci)
- Skip all continuous integration tests
-
[INCLUDE_NOTEBOOKS_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_notebooks_tests)
- Execute notebook validation tests in pytest
-
[INCLUDE_SLOW_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_slow_tests)
- Execute tests labelled as slow in pytest for extensive testing
> [!NOTE]
> By default, the notebooks validation tests are skipped unless
explicitly enabled.
#### Authorizing CI Runs
We use
[copy-pr-bot](https://docs.gha-runners.nvidia.com/apps/copy-pr-bot/#automation)
to manage authorization of CI
runs on NVIDIA's compute resources.
* If a pull request is opened by a trusted user and contains only
trusted changes, the pull request's code will
automatically be copied to a pull-request/ prefixed branch in the source
repository (e.g. pull-request/123)
* If a pull request is opened by an untrusted user or contains untrusted
changes, an NVIDIA org member must leave an
`/ok to test` comment on the pull request to trigger CI. This will need
to be done for each new commit.
### Usage
<!--- How does a user interact with the changed code -->
```python
TODO: Add code snippet
```
### Pre-submit Checklist
<!--- Ensure all items are completed before submitting -->
- [ ] I have tested these changes locally
- [ ] I have updated the documentation accordingly
- [ ] I have added/updated tests as needed
- [ ] All existing tests pass successfully
---------
Signed-off-by: Timur Rvachov <trvachov@nvidia.com>
Signed-off-by: Timur Rvachov <120140748+trvachov@users.noreply.github.com>
Co-authored-by: lvojtku <lvojtku@nvidia.com>
Signed-off-by: Ubuntu <camirr@nvidia.com>

Description
For the Geneformer documentation:
Capitalization standardization:
Formatting improvements:
Image captions:
Grammatical improvements:
Fixed broken notes:
!! noteto!!! notefor proper renderingFor the ESM-2 pretraining documentation:
Grammar and clarity improvements:
Consistency in terminology:
Structure and formatting:
For the training-models documentation:
Capitalization and consistency:
Formatting improvements:
Grammar and clarity:
Type of changes
CI Pipeline Configuration
Configure CI behavior by applying the relevant labels:
Note
By default, the notebooks validation tests are skipped unless explicitly enabled.
Authorizing CI Runs
We use copy-pr-bot to manage authorization of CI
runs on NVIDIA's compute resources.
automatically be copied to a pull-request/ prefixed branch in the source repository (e.g. pull-request/123)
/ok to testcomment on the pull request to trigger CI. This will need to be done for each new commit.Usage
Pre-submit Checklist