Change the implementation of the cached_* functions to be structs that hide the cache implementation details#22

Merged
Axect merged 26 commits into Axect:v0.3.0 from JSorngard:v0.3.0
Sep 12, 2024

Conversation

@JSorngard
Contributor

This PR switches the cached_* functions out for structs that contain the HashMap inside themselves, as discussed in #12. This way the user does not have to think about the implementation details and cannot accidentally insert values into the cache that mess with the results.

This PR does not add tests.
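The pattern described above can be sketched roughly as follows. This is a minimal illustration, not the crate's actual API: the method name `besseljy` and the placeholder computation are assumptions, but the key idea matches the PR, namely that the HashMap field is private, so values can only enter the cache through the checked computation path.

```rust
use std::collections::HashMap;

/// Sketch of a cache struct that hides its HashMap (illustrative only).
pub struct CachedBesselJY {
    // Private: callers cannot insert arbitrary values.
    cache: HashMap<(u64, u64), (f64, f64, f64, f64)>,
}

impl CachedBesselJY {
    pub fn new() -> Self {
        Self { cache: HashMap::new() }
    }

    /// Returns the cached value for (nu, x), computing and storing it on a miss.
    /// Hypothetical method name; the f64 inputs are keyed by their bit patterns.
    pub fn besseljy(&mut self, nu: f64, x: f64) -> (f64, f64, f64, f64) {
        let key = (nu.to_bits(), x.to_bits());
        *self.cache.entry(key).or_insert_with(|| {
            // Placeholder for the real Bessel J/Y computation.
            (nu.sin(), nu.cos(), x.sin(), x.cos())
        })
    }

    pub fn len(&self) -> usize {
        self.cache.len()
    }
}
```

Because the only way to populate the cache is through the computation itself, the invariant "every stored value is a correct result" holds by construction.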

@JSorngard
Contributor Author

JSorngard commented Sep 5, 2024

I am unsure about my documentation of the panics.

Mostly I don't really feel like I know what a "too large x value" is. I may have misunderstood the implementation, or just been unable to find what the limit on x values is.

While I think it may be a good idea to do this, this is not the right PR for that.

This reverts commit a638170.
@JSorngard JSorngard marked this pull request as draft September 5, 2024 12:40
@JSorngard JSorngard marked this pull request as ready for review September 5, 2024 14:11
…ways. Add a remove function to the caches."

This is a massive change, and the purpose of the caches is not really to iterate over the contents.

This reverts commits 9e7f5b6 and b5d83c6.
@JSorngard
Contributor Author

JSorngard commented Sep 5, 2024

I have added some convenience functions to the caches for when the user wants to, e.g., get the value corresponding to some inputs but does not want to compute it if it is not in the cache.

I was thinking about adding functions that iterate over the values in the cache in various ways, but that required a lot of code and API changes to make it nice, so I have not included them in this PR. The attempt is visible in commit 9e7f5b6.

@JSorngard
Contributor Author

There may be other functionality one wishes to add, but I think this is a good first step: some convenience functions and some tools for affecting memory allocation.

@Axect
Owner

Axect commented Sep 10, 2024

Thank you for your excellent PR. I've reviewed the files, and the implementation of CachedBesselJY, CachedBesselIK, CachedJnuYnu, and CachedInuKnu looks good.

I have one suggestion: I noticed that all these structs have similar methods (new, with_capacity, contains, clear, get, is_empty, len, shrink_to_fit, reserve, retain). It would be better to manage these through a common trait (e.g., a Cached trait).

Since the implementations of these functions are also common, we can implement them at the trait level. This will significantly reduce the amount of code and eliminate duplication.

Then, we would only need to separately implement the actual Bessel function calculation code for each struct.

What do you think about this approach? I believe it will make the code more maintainable and substantially reduce duplication.

Let me know your thoughts on this suggestion.

@JSorngard
Contributor Author

Thank you for the feedback! It is a great idea to try to unify the implementation since it is the same for all the caches.

I tried to do it with a trait, but ran into the issue that each cache must implement the trait, which means that each trait impl still essentially contains all the same code, so nothing is really saved. The user must also import the trait in addition to the cache in order to use it. The latter point is not much of a problem, but the former meant that the total amount of code actually increased. However, if there is a way to do this that I just missed, I welcome feedback.

I think I managed to kind of achieve your suggestion in another way though! I made a macro that creates an impl block that implements the common functionality. This way we only have to call the macro once per cache, and a bunch of code is saved.

What do you think of this solution?
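The macro approach described above might look something like this. This is a hypothetical sketch, not the PR's actual macro: one invocation per cache type generates the shared inherent methods, while the HashMap field stays private so callers cannot corrupt the cached values.

```rust
use std::collections::HashMap;

// Hypothetical macro generating the common cache methods for a struct
// with a private `cache: HashMap<K, V>` field.
macro_rules! impl_common_cache_methods {
    ($cache:ty, $key:ty, $value:ty) => {
        impl $cache {
            pub fn new() -> Self {
                Self { cache: HashMap::new() }
            }
            pub fn with_capacity(capacity: usize) -> Self {
                Self { cache: HashMap::with_capacity(capacity) }
            }
            pub fn contains(&self, key: &$key) -> bool {
                self.cache.contains_key(key)
            }
            pub fn len(&self) -> usize {
                self.cache.len()
            }
            pub fn is_empty(&self) -> bool {
                self.cache.is_empty()
            }
            pub fn clear(&mut self) {
                self.cache.clear()
            }
        }
    };
}

pub struct CachedBesselJY {
    cache: HashMap<(u64, u64), (f64, f64, f64, f64)>,
}

// One macro call per cache type replaces a full hand-written impl block.
impl_common_cache_methods!(CachedBesselJY, (u64, u64), (f64, f64, f64, f64));
```

Unlike a public trait, the generated methods are plain inherent methods, so nothing about the internal HashMap is exposed and no extra import is needed to call them.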

@Axect
Owner

Axect commented Sep 10, 2024

Thank you for your implementation using macros. While it's a good effort to reduce code duplication, I believe we can achieve even better results using traits with default implementations. Let me explain:

We can define a Cached trait with generic types and default implementations for common methods:

use std::collections::HashMap;
use std::hash::Hash;

type BesselKey = (u64, u64);
type BesselValue = (f64, f64, f64, f64);

pub trait Cached<K: Hash + Eq, V> {
    // required methods
    fn as_hashmap(&self) -> &HashMap<K, V>;
    fn as_hashmap_mut(&mut self) -> &mut HashMap<K, V>;
    fn new() -> Self;
    fn with_capacity(capacity: usize) -> Self;

    // common methods from required methods
    // we can define default implementations for them
    fn contains(&self, key: &K) -> bool {
        self.as_hashmap().contains_key(key)
    }

    // Other common methods with default implementations...
}

Then, for each cache struct like CachedBesselJY, we only need to implement the fundamental methods and any struct-specific methods:

pub struct CachedBesselJY {
    cache: HashMap<BesselKey, BesselValue>,
}

impl Cached<BesselKey, BesselValue> for CachedBesselJY {
    fn as_hashmap(&self) -> &HashMap<BesselKey, BesselValue> {
        &self.cache
    }

    fn as_hashmap_mut(&mut self) -> &mut HashMap<BesselKey, BesselValue> {
        &mut self.cache
    }

    fn new() -> Self {
        Self { cache: HashMap::new() }
    }

    fn with_capacity(capacity: usize) -> Self {
        Self { cache: HashMap::with_capacity(capacity) }
    }
}

impl CachedBesselJY {
    pub fn calculate(&mut self, nu: f64, x: f64) -> BesselValue {
        // Struct-specific Bessel computation goes here.
        todo!()
    }
}

This approach offers several advantages over using macros:

  1. Code Reduction and Maintainability: Common methods are implemented in the trait, reducing code duplication across structs. This makes the code easier to maintain and update.

  2. Type Safety and Compiler Optimizations: The Rust compiler can perform better type checking and optimizations with traits at compile-time.

  3. Developer Experience: Most IDEs provide better support for traits, including auto-completion and go-to-definition functionality. This, combined with improved readability, makes the code easier to work with.

  4. Flexibility and Extensibility: It's easier to add or modify methods in a trait, and new cache types can be easily added by implementing the Cached trait.

  5. Documentation: Traits can be easily documented using standard Rust doc comments, which integrate well with rustdoc.

  6. Compile Time Efficiency: Traits often lead to faster compile times compared to macros, especially for large codebases. Macros are expanded early in the compilation process, generating new code that then needs to be parsed, type-checked, and compiled. In contrast, traits are a core language feature that the Rust compiler is highly optimized to handle. When you modify a macro, all code that uses that macro needs to be recompiled. With traits, changes often require less recompilation.

By using type aliases like BesselKey and BesselValue, we also improve code readability and maintainability.

This approach maintains the DRY principle while leveraging Rust's powerful type system and trait capabilities. For large projects with many cache types, the benefits of using traits over macros can be substantial, leading to more efficient development cycles and better long-term maintainability.

What do you think about this approach? Would you like me to elaborate on any part of this explanation?

P.S. Please note that I'm currently writing this on my mobile phone while commuting home by bus. As such, there might be some minor errors in the code examples. I appreciate your understanding, and please feel free to point out any issues you notice. Once I'm back at my desk, I'll be happy to review and refine the code if necessary. Thank you for your patience!

@JSorngard
Contributor Author

JSorngard commented Sep 10, 2024

Thank you for the feedback!

This was one of the things I tried; however, it unfortunately has one issue: it enables the user to obtain mutable access to the HashMap. This means that we can't rely on the invariant that the HashMap contains correct values, which is the very problem this reimplementation aims to solve.

We could skip the as_hashmap_mut function and implement the functions in the trait manually for the caches anyway though.
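The concern can be made concrete with a short sketch against the proposed trait (the struct, key, and values below are illustrative): once as_hashmap_mut is part of the public trait, any caller can plant an arbitrary "result" that later lookups would happily return.

```rust
use std::collections::HashMap;
use std::hash::Hash;

pub trait Cached<K: Hash + Eq, V> {
    fn as_hashmap_mut(&mut self) -> &mut HashMap<K, V>;
}

pub struct CachedBesselJY {
    cache: HashMap<(u64, u64), (f64, f64, f64, f64)>,
}

impl CachedBesselJY {
    pub fn new() -> Self {
        Self { cache: HashMap::new() }
    }
}

impl Cached<(u64, u64), (f64, f64, f64, f64)> for CachedBesselJY {
    fn as_hashmap_mut(&mut self) -> &mut HashMap<(u64, u64), (f64, f64, f64, f64)> {
        &mut self.cache
    }
}

// With the trait in scope, nothing stops a caller from corrupting the cache:
//
//     let mut cache = CachedBesselJY::new();
//     cache.as_hashmap_mut().insert((1, 2), (0.0, 0.0, 0.0, 0.0));
//
// Subsequent lookups for those inputs would return this bogus value
// instead of the true Bessel function values.
```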

@Axect
Owner

Axect commented Sep 12, 2024

Thank you for your insightful response. I apologize for the delay in getting back to you - I had a hospital appointment yesterday that prevented me from doing any work.

I appreciate you raising that excellent point. You're absolutely right; by insisting on a trait implementation, we would inevitably expose the cache to the user. While users accessing as_hashmap_mut would likely be proficient enough to use the cache correctly, I agree with your concern about maintaining the integrity of the cache's contents.

Given this consideration, I concur that implementing with macros is indeed the better approach in this case. It allows us to maintain better control over the cache's invariants and prevent unintended modifications.

Based on this, I'm happy to approve the merge of your pull request. Your implementation using macros is a solid solution that addresses the issues at hand while maintaining the necessary encapsulation.

Once again, thank you for your excellent implementation and for sharing your thoughtful perspective on this matter. Your contribution is greatly appreciated.

@Axect Axect merged commit 60c50a7 into Axect:v0.3.0 Sep 12, 2024