Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extend the Matrix Class by Color Dimension #152

Open
Samyssmile opened this issue Dec 20, 2023 · 1 comment
Open

Extend the Matrix Class by Color Dimension #152

Samyssmile opened this issue Dec 20, 2023 · 1 comment
Assignees
Labels
blocker enhancement New feature or request
Milestone

Comments

@Samyssmile
Copy link
Owner

Extend Matrix Class to Support Multiple Channels in Image Data

Background

Our current implementation of the Matrix class in the machine learning library is limited to handling image data solely based on rows and columns. This structure implies that each image in a dataset is represented by a single Matrix instance. A critical limitation of this approach is the loss of color information in the images.

Objective

To enhance the representation of image data in our library, it's essential to introduce the concept of "channels" within the Matrix class. This addition will allow us to represent images in a more comprehensive manner, preserving color information.

Specifications

  1. Channels Integration:

    • Modify the Matrix class to include a new property: channels.
    • The channels attribute indicates the depth of color information in an image.
      • For instance, channels=3 would correspond to an RGB (Red, Green, Blue) image, whereas channels=1 would indicate a grayscale image.
  2. Representation of Images:

    • Adapt the image representation so that each image in the dataset is depicted by multiple matrices, corresponding to the number of channels.
    • Specifically, a standard RGB image will be represented by three Matrix instances within the dataset.
  3. Network Learning Adaptation:

    • Ensure that the neural network learns using multiple matrices per image. This change is crucial for capturing the full spectrum of information present in colored images.

Expected Outcome

Implementing this feature will significantly enhance the library's ability to process and learn from colored images, leading to potentially more accurate and nuanced model performances.

For local testing use the fractality 256x256 dataset

@Samyssmile Samyssmile added this to the 1.0.9 milestone Dec 20, 2023
@Samyssmile Samyssmile added the enhancement New feature or request label Dec 20, 2023
@Samyssmile Samyssmile self-assigned this Feb 18, 2024
@Samyssmile
Copy link
Owner Author

After invenstigations, it looks like this one is a block for three Layers requested for CNN.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
blocker enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant