Refactored data clumps with the help of LLMs (research project) #26790

compf · 2024-04-25T18:30:09Z

Hello maintainers,

I am working on a master's thesis project focused on improving code quality through automated refactoring of data clumps, assisted by Large Language Models (LLMs).

Data clump definition

A data clump exists if

two methods (in the same or in different classes) have at least three common parameters and neither method overrides the other, or
at least three fields in a class are common with the parameters of a method (in the same or in a different class), or
two different classes have at least three common fields.
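
A minimal sketch of the first criterion and one possible refactoring may make the definition concrete. All names here (`buildUrl`, `describe`, `ConnectionTarget`, the `jdbc:example` URL scheme) are hypothetical and are not taken from the DBeaver codebase or from this pull request. The sketch assumes Java 16 or newer, since it uses a record as the extracted parameter object.

```java
public class DataClumpExample {

    // Before: both methods share the three parameters host, port and database,
    // and neither overrides the other, so they form a data clump.
    static String buildUrl(String host, int port, String database) {
        return "jdbc:example://" + host + ":" + port + "/" + database;
    }

    static String describe(String host, int port, String database, String user) {
        return user + "@" + host + ":" + port + "/" + database;
    }

    // After: the shared parameters are extracted into one parameter object.
    // (Hypothetical type, introduced only for this illustration.)
    record ConnectionTarget(String host, int port, String database) {}

    static String buildUrl(ConnectionTarget target) {
        return "jdbc:example://" + target.host() + ":" + target.port() + "/" + target.database();
    }

    static String describe(ConnectionTarget target, String user) {
        return user + "@" + target.host() + ":" + target.port() + "/" + target.database();
    }

    public static void main(String[] args) {
        ConnectionTarget target = new ConnectionTarget("localhost", 5432, "demo");
        // Both variants produce the same text; only the signatures differ.
        System.out.println(buildUrl("localhost", 5432, "demo"));
        System.out.println(buildUrl(target));
        System.out.println(describe(target, "admin"));
    }
}
```

The refactored signatures carry one argument instead of three, so adding a further shared value later would only change the record rather than every method that takes the clump.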

See also the following UML diagram as an example

I believe these refactorings can contribute to the project by reducing complexity and improving the readability of your source code.

Pursuant to the EU AI Act, I fully disclose the use of LLMs in generating these refactorings, emphasizing that all changes have undergone human review for quality assurance.

Even if you decide not to integrate my changes into your codebase (which is perfectly fine), I ask you to fill out a feedback survey, which will be scientifically evaluated to determine the acceptance of AI-supported refactorings. You can find the feedback survey at https://campus.lamapoll.de/Data-clump-refactoring/en

Thank you for considering my contribution. I look forward to your feedback. If you have any other questions or comments, feel free to write a comment or email me at tschoemaker@uni-osnabrueck.de.

Best regards,
Timo Schoemaker
Department of Computer Science
University of Osnabrück

ShadelessFox

Thanks for the contribution.

Unfortunately, all I see is formatted code and questionable code extraction that doesn't even use the newest Java features, such as records. Moreover, the added code doesn't include the copyright header or use nullability annotations as the rest of the codebase does.

ShadelessFox · 2024-04-30T11:37:09Z

You can still help us make DBeaver better by:

Fixing a bug: https://github.com/dbeaver/dbeaver/issues?q=is%3Aissue+is%3Aopen+label%3Abug
Localizing it to your native language or improving an existing localization: https://github.com/dbeaver/dbeaver/wiki/Localization

compf · 2024-04-30T12:31:16Z

Thank you for the feedback.

refactored data clumps

1a280a0

E1izabeth added the external label Apr 29, 2024

ShadelessFox reviewed Apr 30, 2024

ShadelessFox closed this Apr 30, 2024
