Rnn scorer #190

SeBorgey · 2025-04-24T22:17:23Z

No description provided.

SeBorgey · 2025-04-24T22:17:58Z

Decide what to do with the dumper.
There is a strange error when importing Adam.

voorhs · 2025-04-25T05:16:09Z

autointent/modules/scoring/_rnn.py

+
+    def __init__(
+        self,
+        rnn_config: RNNConfig | str | dict[str, Any] | None = None,


It's better not to do it this way. All parameters should be direct parameters of the constructor; otherwise it won't be convenient to specify the search space.
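To illustrate why direct constructor parameters help here, below is a minimal sketch, assuming a search space is simply a mapping from constructor argument names to candidate values. `TinyScorer` and `expand_grid` are hypothetical names used only for this illustration; this is not how the library's optimizer is implemented.

```python
from itertools import product


def expand_grid(search_space):
    """Turn {name: [candidates]} into a list of keyword-argument dicts."""
    names = list(search_space)
    value_lists = [search_space[name] for name in names]
    return [dict(zip(names, values)) for values in product(*value_lists)]


class TinyScorer:
    """Hypothetical stand-in for a scorer with flat constructor parameters."""

    def __init__(self, hidden_dim=512, n_layers=2):
        self.hidden_dim = hidden_dim
        self.n_layers = n_layers


# Each key matches a constructor argument, so no nested config is needed.
search_space = {"hidden_dim": [256, 512], "n_layers": [1, 2]}
candidates = [TinyScorer(**kwargs) for kwargs in expand_grid(search_space)]
```

With a nested config object, each candidate would first need to be packed into that object before the scorer could be built.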

class RNNConfig(BaseModel):
    model_config = ConfigDict(extra="forbid")
    model_name: str = Field("rnn", description="Name of the RNN model.")
    embed_dim: int = Field(128, description="Dimension of word embeddings.")
    hidden_dim: int = Field(512, description="Dimension of hidden states in RNN.")
    n_layers: int = Field(2, description="Number of RNN layers.")
    dropout: float = Field(0.1, description="Dropout rate.")
    device: str = Field(None, description="Torch notation for CPU or CUDA.")
    max_seq_length: int = Field(128, description="Maximum sequence length.")
    padding_idx: int = Field(0, description="Index used for padding.")
    pretrained_embs: Any = Field(None, description="Pretrained embedding weights if available.")
    batch_size: PositiveInt = Field(32, description="Batch size for model inference.")

    @classmethod
    def from_search_config(cls, values: dict[str, Any] | str | BaseModel | None) -> Self:
        if values is None:
            return cls()
        if isinstance(values, BaseModel):
            return values  # type: ignore[return-value]
        if isinstance(values, str):
            return cls(model_name=values)
        return cls(**values)

Do I understand correctly that I should remove this class again and duplicate all of its parameters in the scorer class itself, wherever they are needed?

Then `__init__` will become very large, and so will `from_context`.
The same parameters will be listed three times in a row in the code: first in the `__init__` arguments, then in the `__init__` body, and then in the `from_context` arguments.
ruff forbids putting more than 10 parameters into a function's arguments.

That's right overall. If needed, we'll ignore the ruff error, because the search space is more important.

That said, I see that some of these parameters are not really hyperparameters but genuinely config. I would do it like this:

Move to the config: device, max_seq_length, padding_idx

Move to init: embed_dim, hidden_dim, n_layers, dropout

Remove: model_name, pretrained_embs
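For reference, ruff reports long signatures under rule `PLR0913` (too-many-arguments), and the threshold depends on the project's settings. A minimal sketch of suppressing it on a single constructor follows; `VerboseScorer` and its argument list are hypothetical and only mirror the names discussed in this thread.

```python
from __future__ import annotations


class VerboseScorer:
    """Hypothetical scorer whose constructor exceeds the argument limit."""

    def __init__(  # noqa: PLR0913
        self,
        embed_dim: int = 128,
        hidden_dim: int = 512,
        n_layers: int = 2,
        dropout: float = 0.1,
        max_seq_length: int = 128,
        padding_idx: int = 0,
        batch_size: int = 32,
        device: str | None = None,
    ) -> None:
        # Every argument is stored under the same name, so a search space
        # can address each of them directly.
        self.embed_dim = embed_dim
        self.hidden_dim = hidden_dim
        self.n_layers = n_layers
        self.dropout = dropout
        self.max_seq_length = max_seq_length
        self.padding_idx = padding_idx
        self.batch_size = batch_size
        self.device = device


scorer = VerboseScorer(n_layers=3, dropout=0.2)
```

The `noqa` comment sits on the `def` line, so the suppression applies to this one signature and the rule stays active for the rest of the module.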
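A minimal sketch of the split proposed above, under stated assumptions: `RNNRuntimeConfig` and `RNNScorerSketch` are hypothetical names, a stdlib dataclass stands in for the pydantic model, and the defaults are copied from the `RNNConfig` quoted earlier in this thread. It is not the code that was merged.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class RNNRuntimeConfig:
    """Hypothetical holder for settings that are not tuned."""

    device: str | None = None
    max_seq_length: int = 128
    padding_idx: int = 0


class RNNScorerSketch:
    """Illustrative stand-in for the scorer, not the actual class."""

    def __init__(
        self,
        embed_dim: int = 128,
        hidden_dim: int = 512,
        n_layers: int = 2,
        dropout: float = 0.1,
        config: RNNRuntimeConfig | dict[str, Any] | None = None,
    ) -> None:
        # Tunable hyperparameters stay as plain constructor arguments.
        self.embed_dim = embed_dim
        self.hidden_dim = hidden_dim
        self.n_layers = n_layers
        self.dropout = dropout
        # Non-tunable settings travel together in one config object.
        if config is None:
            config = RNNRuntimeConfig()
        elif isinstance(config, dict):
            config = RNNRuntimeConfig(**config)
        self.config = config


scorer = RNNScorerSketch(hidden_dim=256, config={"max_seq_length": 64})
```

This keeps the constructor at four hyperparameters plus one config argument, which also shortens the signature that ruff complained about.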

autointent/modules/scoring/_rnn.py

tests/modules/scoring/test_rnn.py

voorhs · 2025-04-25T05:24:13Z

The dumper is already implemented in the PR for CNNScorer, but it seems it doesn't fully work yet. You could discuss it with Lera and collaborate.

SeBorgey added 5 commits April 24, 2025 23:44

first code for rnn scorer

4907eb0

config fix

759ceb0

last ruff fix

bd1305c

tests

64c575c

typing

42ea0cd

SeBorgey requested a review from voorhs April 24, 2025 22:18

voorhs requested changes Apr 25, 2025

SeBorgey added 3 commits April 28, 2025 10:07

Merge remote-tracking branch 'origin/dev' into rnn_scorer

9cf1a49

device

cc455b5

dump load test

330ae6d

SeBorgey requested a review from voorhs April 28, 2025 09:47

SeBorgey and others added 17 commits April 30, 2025 22:04

parameters

344e776

upgrade dumper for rnn, upgrade tests for new config

b2c21d9

mypy fix

106d8ba

Merge remote-tracking branch 'origin/dev' into rnn_scorer

0cc749d

fix tests exept dumpload

8e4cb26

dumpload test fix

5fe84d3

pull dev

4dc0193

codestyle

4aea62c

pull dev

ca639fa

refactor working with vocab

1e0ea07

refactor cnn and rnn

55d24c6

fix typing and codestyle

30eeea9

refactor init arguments of rnn and cnn scorers

1d7a786

add strict mode to Dumper

27d11ed

codestyle

4d9f63c

Update optimizer_config.schema.json

8abc4ad

bug fix

9fc1a0f

voorhs approved these changes Jun 16, 2025

voorhs added 4 commits June 17, 2025 02:49

add TODO comments

6bcffc6

try to implement early stopping

2aa4131

pull dev

e1119ab

add logging messages

b869817

voorhs merged commit 72cc7dc into dev Jun 18, 2025
21 of 22 checks passed

voorhs deleted the rnn_scorer branch June 18, 2025 19:35

3 participants
