
Fix type hints for dropout, dropout parameter references, and add docs for FCLayer and FCStack. #2061

Merged
merged 3 commits into from
Jun 3, 2022

Conversation

justinxzhao
Collaborator

No description provided.

@github-actions

github-actions bot commented May 26, 2022

Unit Test Results

6 files ±0   6 suites ±0   2h 24m 31s ⏱️ −41s
2 802 tests +2   2 770 ✔️ +2   32 💤 ±0   0 ±0
8 406 runs +6   8 306 ✔️ +6   100 💤 ±0   0 ±0

Results for commit fc7a9c0. ± Comparison against base commit b920c86.

♻️ This comment has been updated with latest results.

@w4nderlust
Collaborator

It's not super clear to me why in some cases we are renaming parameters from dropout to recurrent_dropout and in some cases we are dropping them altogether (no pun intended). Can you give a bit more context @justinxzhao ?

@justinxzhao
Collaborator Author

It's not super clear to me why in some cases we are renaming parameters from dropout to recurrent_dropout and in some cases we are dropping them altogether (no pun intended). Can you give a bit more context @justinxzhao ?

The changes in this PR mainly make the use of separate dropout parameters consistent across all Ludwig modules that already take multiple dropout parameters in their constructors. New conventions established in this PR:

  • H3RNN:
    • dropout -> H3Embed
    • recurrent_dropout -> RecurrentStack
  • StackedRNN:
    • dropout -> EmbedSequence
    • recurrent_dropout -> RecurrentStack
    • fc_dropout -> FCStack
  • StackedCNNRNN:
    • dropout -> EmbedSequence
    • conv_dropout -> Conv1DStack
    • recurrent_dropout -> RecurrentStack
    • fc_dropout -> FCStack
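The routing convention above can be sketched in a few lines. Note these are hypothetical stand-in classes for illustration, not Ludwig's actual implementations; each constructor-level dropout parameter feeds exactly one sub-module:

```python
class RecurrentStack:
    """Hypothetical stand-in for Ludwig's RecurrentStack: its kwarg is `dropout`."""
    def __init__(self, dropout: float = 0.0):
        self.dropout = dropout

class FCStack:
    """Hypothetical stand-in for Ludwig's FCStack."""
    def __init__(self, dropout: float = 0.0):
        self.dropout = dropout

class StackedRNN:
    """Hypothetical stand-in showing the routing convention: each separate
    dropout parameter in the constructor feeds exactly one sub-module."""
    def __init__(self, dropout: float = 0.0, recurrent_dropout: float = 0.0,
                 fc_dropout: float = 0.0):
        self.embed_dropout = dropout  # dropout -> EmbedSequence
        # recurrent_dropout -> RecurrentStack (whose own kwarg is `dropout`)
        self.recurrent_stack = RecurrentStack(dropout=recurrent_dropout)
        self.fc_stack = FCStack(dropout=fc_dropout)  # fc_dropout -> FCStack

encoder = StackedRNN(dropout=0.1, recurrent_dropout=0.3, fc_dropout=0.5)
print(encoder.recurrent_stack.dropout)  # 0.3
```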

As for removing recurrent_dropout=... -- the constructor of RecurrentStack doesn't actually have a recurrent_dropout kwarg; the parameter is simply called dropout=....
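To illustrate why those call sites had to change: passing a keyword argument that a constructor doesn't declare raises a TypeError. A minimal sketch with a hypothetical stand-in class (not Ludwig's actual code):

```python
class RecurrentStack:
    """Hypothetical stand-in: the only dropout-related kwarg is `dropout`."""
    def __init__(self, dropout: float = 0.0):
        self.dropout = dropout

try:
    RecurrentStack(recurrent_dropout=0.3)  # no such kwarg exists
except TypeError as err:
    print(type(err).__name__)  # TypeError

# The caller's recurrent_dropout value must be passed as `dropout`:
stack = RecurrentStack(dropout=0.3)
print(stack.dropout)  # 0.3
```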

On a side note, I wonder if it's reasonable to simplify/consolidate all of the different dropout parameters into a more global dropout parameter that we can use for everything. Curious to get people's thoughts @w4nderlust @dantreiman @geoffreyangus @ShreyaR

@w4nderlust
Collaborator

@justinxzhao got it, makes sense.
That said, it seems to me that StackedCNNRNN has a couple of issues:

  • It doesn't have type hints.
  • The dropout parameter is placed in the wrong position; it should be near the embedding parameters at the beginning.
  • In some cases the default value for dropouts is 0, in others it is 0.0 (not a big issue, but consistency is great).
  • In the recurrent stack we have dropout=dropout, but it should be dropout=recurrent_dropout.
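A minimal before/after sketch of the last issue, using hypothetical stand-in classes rather than Ludwig's actual code: the bug silently forwards the embedding dropout rate to the recurrent stack.

```python
class RecurrentStack:
    """Hypothetical stand-in: its constructor kwarg is `dropout`."""
    def __init__(self, dropout: float = 0.0):
        self.dropout = dropout

class BuggyStackedCNNRNN:
    def __init__(self, dropout: float = 0.0, recurrent_dropout: float = 0.0):
        # Bug: the embedding dropout rate is forwarded to the recurrent stack.
        self.recurrent_stack = RecurrentStack(dropout=dropout)

class FixedStackedCNNRNN:
    def __init__(self, dropout: float = 0.0, recurrent_dropout: float = 0.0):
        # Fix: the recurrent stack receives its own dedicated rate.
        self.recurrent_stack = RecurrentStack(dropout=recurrent_dropout)

buggy = BuggyStackedCNNRNN(dropout=0.1, recurrent_dropout=0.5)
fixed = FixedStackedCNNRNN(dropout=0.1, recurrent_dropout=0.5)
print(buggy.recurrent_stack.dropout)  # 0.1 (wrong rate)
print(fixed.recurrent_stack.dropout)  # 0.5
```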

A side note: in all these cases, dropout is the one used in embeddings, while other modules have their own. It could be better to rename it to embedding_dropout or embed_dropout for clarity and consistency potentially. What do you think?

@justinxzhao justinxzhao added this to In progress in Ludwig Documentation May 26, 2022
@justinxzhao justinxzhao moved this from In progress to Done in Ludwig Documentation May 26, 2022
@justinxzhao justinxzhao moved this from Done to In progress in Ludwig Documentation May 26, 2022
@justinxzhao justinxzhao removed this from In progress in Ludwig Documentation May 26, 2022
@justinxzhao justinxzhao added this to To do in Code Quality May 26, 2022
@justinxzhao justinxzhao moved this from To do to In progress in Code Quality May 26, 2022
@justinxzhao
Collaborator Author

I'll defer to #1924 to add type hints everywhere.

Perhaps once we have schemas checked in, we can use them to automatically generate docstrings and do a grand substitution/update everywhere.

Fixed dropout=recurrent_dropout in the StackedCNNRNN.

A side note: in all these cases, dropout is the one used in embeddings, while other modules have their own. It could be better to rename it to embedding_dropout or embed_dropout for clarity and consistency potentially. What do you think?

I'll leave it as dropout for now, to ensure backwards compatibility.

I'm beginning to lean towards consolidating all of the per-module dropout parameters into a single parameter (filed #2080 to track), unless we see strong evidence that using dramatically different dropout rates yields significant performance gains. Perhaps we can continue the discussion on that issue and make that change later.

Collaborator

@geoffreyangus left a comment


LGTM! Left a comment re: dropout here: #2080

@justinxzhao justinxzhao merged commit e43053a into master Jun 3, 2022
@justinxzhao justinxzhao deleted the type_hint_fixes branch June 3, 2022 17:59
Code Quality automation moved this from In progress to Done Jun 3, 2022