feat(downsample_cells_template): add downsample_cells_template functi… by fangliu117 · Pull Request #393 · FNLCR-DMAP/SCSAWorkflow

fangliu117 · 2025-08-04T13:07:39Z

…on and unit tests

Copilot

Pull Request Overview

This PR adds a new downsample_cells_template function that provides a platform-agnostic interface for cell downsampling operations, converted from NIDAP template functionality. The implementation includes comprehensive parameter handling, multiple input format support, and flexible output options.

Implements run_from_json function with support for JSON, dict, and direct DataFrame inputs
Adds comprehensive test coverage including I/O workflows, error validation, and function mocking
Provides CLI interface and handles multiple file formats (CSV, pickle)

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
`src/spac/templates/downsample_cells_template.py`	Main template implementation with parameter parsing, data loading, and downsample execution
`tests/templates/test_downsample_cells_template.py`	Comprehensive unit tests covering various input formats, error cases, and function validation

Copilot · 2025-08-04T13:09:05Z

src/spac/templates/downsample_cells_template.py

+import pickle
+
+# Add parent directory to path for imports
+sys.path.append(str(Path(__file__).parent.parent.parent))


Modifying sys.path at runtime is an anti-pattern that can cause import issues. Consider using relative imports or proper package structure instead.

Suggested change

sys.path.append(str(Path(__file__).parent.parent.parent))

Copilot · 2025-08-04T13:09:06Z

tests/templates/test_downsample_cells_template.py

+sys.path.append(
+    os.path.dirname(os.path.realpath(__file__)) + "/../../src"
+)


Modifying sys.path in tests is an anti-pattern. Consider using proper test discovery mechanisms or PYTHONPATH configuration instead.

Suggested change

sys.path.append(

os.path.dirname(os.path.realpath(__file__)) + "/../../src"

)

Copilot · 2025-08-04T13:09:06Z

src/spac/templates/downsample_cells_template.py

+    print("Downsampled! Processed dataset info:")
+    print(down_sampled_dataset.info())


[nitpick] Using print statements in library code reduces flexibility for users who may want to control logging. Consider using the logging module instead.

Suggested change

print("Downsampled! Processed dataset info:")

print(down_sampled_dataset.info())

logging.info("Downsampled! Processed dataset info:")

logging.info(down_sampled_dataset.info())

Copilot · 2025-08-04T13:09:06Z

src/spac/templates/downsample_cells_template.py

+        print(
+            f"Downsample Cells completed → {saved_files[output_file]}"
+        )
+        return saved_files
+    else:
+        # Return the dataframe directly for in-memory workflows
+        print("Returning DataFrame (not saving to file)")
+        return down_sampled_dataset
+
+
+# CLI interface
+if __name__ == "__main__":


[nitpick] Using print statements in library code reduces flexibility for users who may want to control logging. Consider using the logging module instead.

Suggested change

print(

f"Downsample Cells completed → {saved_files[output_file]}"

)

return saved_files

else:

# Return the dataframe directly for in-memory workflows

print("Returning DataFrame (not saving to file)")

return down_sampled_dataset

# CLI interface

if __name__ == "__main__":

logging.info(

f"Downsample Cells completed → {saved_files[output_file]}"

)

return saved_files

else:

# Return the dataframe directly for in-memory workflows

logging.info("Returning DataFrame (not saving to file)")

return down_sampled_dataset

# CLI interface

if __name__ == "__main__":

logging.basicConfig(level=logging.INFO)

Copilot · 2025-08-04T13:09:07Z

src/spac/templates/downsample_cells_template.py

+        return saved_files
+    else:
+        # Return the dataframe directly for in-memory workflows
+        print("Returning DataFrame (not saving to file)")


[nitpick] Using print statements in library code reduces flexibility for users who may want to control logging. Consider using the logging module instead.

feat(downsample_cells_template): add downsample_cells_template functi…

47adf3e

…on and unit tests

Copilot AI review requested due to automatic review settings August 4, 2025 13:07

Copilot AI reviewed Aug 4, 2025

View reviewed changes

fangliu117 merged commit 71dc3d6 into dev Aug 4, 2025
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(downsample_cells_template): add downsample_cells_template functi…#393

feat(downsample_cells_template): add downsample_cells_template functi…#393
fangliu117 merged 1 commit intodevfrom
template_downsample_cells

fangliu117 commented Aug 4, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Aug 4, 2025

Uh oh!

Copilot AI Aug 4, 2025

Uh oh!

Copilot AI Aug 4, 2025

Uh oh!

Copilot AI Aug 4, 2025

Uh oh!

Copilot AI Aug 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	sys.path.append(
	os.path.dirname(os.path.realpath(__file__)) + "/../../src"
	)

		print("Downsampled! Processed dataset info:")
		print(down_sampled_dataset.info())

Conversation

fangliu117 commented Aug 4, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Aug 4, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants