-
-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question regarding mining strategy #109
Comments
Hi @sims1253! I will try to answer your question, and suggest you a couple of articles to read to clarify this point. It's quite complicated as you can imagine :) The answer to your question is no, doing When you do
and we do
and there is not the merge commit, Now let's move to
left parent is Depending on what you need, More information on merge commits: look this |
Thank you so much for taking the time :) Nice :) And thanks again! |
Indeed that's correct. You're welcome! Happy mining 😄 |
This is not entirely about pydriller but I figured there might be enough experience here to help with this:
I am trying to get the history of a branch and have to work with the diffs of those commits.
Before I did this manually by using
git log --first-parent
and some parsing to get all the hashes that I am interested in and then usinghash^!
to get the diffs for each commit.First, I am not entirely sure this did what I intentioned. Merges are occult stuff I can not seem to wrap my head around.
However, I tried to recreate the behaviour with RepositoryMining using
order="reverse", "only_in_branch=<origin_branch>
and while comparing the results of the two approaches I noticed they don't return the same commits.Shouldn't
only_in_branch=<origin_branch>
lead to the same behaviour as--first-parent
?And maybe someone can help me with understanding the merging stuff. Am I right in thinking that I do have to include merge commits in the history, otherwise there will be jumps I can not follow?
The text was updated successfully, but these errors were encountered: