Optimize Feature Engineering Part 2 #9

jylee-bcm · 2024-07-01T21:45:16Z

Applied scalar multiplication to vector, not on matrix
Used indexing technique
Transformed from float64 to float32

Changed Output

There's subtle change on outputs, but I think they are ignorable.

>>> pd.testing.assert_series_equal(a["diffuse_Phrank_STRING"], b["diffuse_Phrank_STRING"])
Series values are different (0.04513 %)

Reduced the running time

Before:

After:

* Applied scalar multiplication to vector * Used indexing technique * Transformed from float64 to float32

hyunhwan-bcm · 2024-07-02T00:52:06Z

Changed Output

There's subtle change on outputs, but I think they are ignorable.
>>> pd.testing.assert_series_equal(a["diffuse_Phrank_STRING"], b["diffuse_Phrank_STRING"])
Series values are different (0.04513 %)

This is an expected behavior (confirmed by Chaozhong), but not sure why. We can ignore this anyway.

hyunhwan-bcm

a minor comment. thanks for doing this.

hyunhwan-bcm · 2024-07-02T00:51:02Z

bin/mod5_diffusion.py

    for i in range(0, max_iter):
        Fs = np.append(Fs, F, axis=1)
-        F = alpha * nn @ Fs[:, [i]] + fY
+        F = nn @ (alpha * Fs[:, [i]]) + (1 - alpha) * y


any benefit for using (1 - alpha) * y instead of using fY?

any benefit for using (1 - alpha) * y instead of using fY?

I thought it adds intuitiveness to the equation, and no drawbacks on speed.

jylee-bcm · 2024-07-02T15:26:12Z

This is an expected behavior (confirmed by Chaozhong), but not sure why. We can ignore this anyway.

Because I used float32 instead of float64, for the purpose of speed optimization, but not necessary. It only improves 30s.

hyunhwan-bcm

Clear all good

Optimize Feature Engineering Part 2

369a36e

* Applied scalar multiplication to vector * Used indexing technique * Transformed from float64 to float32

jylee-bcm added the enhancement New feature or request label Jul 1, 2024

jylee-bcm requested a review from hyunhwan-bcm July 1, 2024 21:45

jylee-bcm self-assigned this Jul 1, 2024

hyunhwan-bcm reviewed Jul 2, 2024

View reviewed changes

hyunhwan-bcm approved these changes Jul 3, 2024

View reviewed changes

hyunhwan-bcm merged commit 246503e into nextflow_conversion Jul 3, 2024

jylee-bcm mentioned this pull request Jul 5, 2024

Optimize mod5 diffusion LiuzLab/AI_MARRVEL#31

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize Feature Engineering Part 2 #9

Optimize Feature Engineering Part 2 #9

jylee-bcm commented Jul 1, 2024

hyunhwan-bcm commented Jul 2, 2024

Changed Output

hyunhwan-bcm left a comment

hyunhwan-bcm Jul 2, 2024

jylee-bcm Jul 2, 2024

jylee-bcm commented Jul 2, 2024 •

edited

Loading

hyunhwan-bcm left a comment

Optimize Feature Engineering Part 2 #9

Optimize Feature Engineering Part 2 #9

Conversation

jylee-bcm commented Jul 1, 2024

Changed Output

Reduced the running time

Before:

After:

hyunhwan-bcm commented Jul 2, 2024

Changed Output

hyunhwan-bcm left a comment

Choose a reason for hiding this comment

hyunhwan-bcm Jul 2, 2024

Choose a reason for hiding this comment

jylee-bcm Jul 2, 2024

Choose a reason for hiding this comment

jylee-bcm commented Jul 2, 2024 • edited Loading

hyunhwan-bcm left a comment

Choose a reason for hiding this comment

jylee-bcm commented Jul 2, 2024 •

edited

Loading