| Role | Name |
|------|------|
| Author | Denis Gudkov |
| Consultant | Daniil Dorin |
| Advisor | Andrii Hrabovyi |
Modern vision models excel at recognition but fail to grasp geometric relationships such as symmetry composition. This work investigates whether neural networks can internalize the algebraic structure of the dihedral group D₄ (the symmetries of a square) rather than memorizing visual patterns. Using a Siamese encoder with an autoregressive Transformer decoder, we train a model to predict whether two images are related by a D₄ transformation and to identify the specific group element. We demonstrate that the model learns true group properties: equivalence of operation sequences (horizontal_flip → vertical_flip ≡ rotate_180), consistency under composition (g₂·g₁), canonical element representation, and correct rejection of unrelated pairs. Analysis of attention maps and embeddings reveals an internal encoding of the D₄ multiplication table. Unlike vision-language models (VLMs), which fail at such tasks, our architecture captures symbolic geometric structure.
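The group identities above can be checked directly on pixel arrays. A minimal sketch (not from this repository's code) verifying that a horizontal flip followed by a vertical flip equals a 180° rotation, using NumPy array operations:

```python
import numpy as np

# A small test "image" with distinct pixel values.
img = np.arange(16).reshape(4, 4)

def hflip(a):
    """Horizontal flip (mirror left-right)."""
    return a[:, ::-1]

def vflip(a):
    """Vertical flip (mirror top-bottom)."""
    return a[::-1, :]

def rot180(a):
    """Rotate by 180 degrees (two 90-degree rotations)."""
    return np.rot90(a, 2)

# D4 composition identity: vflip ∘ hflip == rot180.
assert np.array_equal(vflip(hflip(img)), rot180(img))

# The composition is also order-independent for this pair:
assert np.array_equal(hflip(vflip(img)), rot180(img))
```

The same pattern extends to the full multiplication table: enumerate all eight D₄ elements as compositions of `np.rot90` and flips, and compare the composed arrays pairwise.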
If you find our work helpful, please cite us.
```bibtex
@article{citekey,
  title={Title},
  author={Name Surname, Name Surname (consultant), Name Surname (advisor)},
  year={2025}
}
```

Our project is MIT licensed. See LICENSE for details.