RepositoryIterator and ReferenceIterator implementations #20

ajnavarro · 2017-09-04T13:27:22Z

Using a base abstract class RootedRepoIterator, we add two implementations, one of them to iterate repository metadata (repository id, urls, is fork) and references metadata (repository_id, name, hash).

With this RootedRepoIterator we should be able to implement CommitIterator and BlobIterator too.

Filter logic must be implemented before start with BlobIterator.

Split test logic into Traits to be able to use them in all the Specs.
Added a BaseRootedRepoIterator trait with a helper to test iterators more easly.

ajnavarro · 2017-09-04T13:27:54Z

Merge #19 first.

codecov · 2017-09-04T13:38:00Z

Codecov Report

Merging #20 into master will decrease coverage by 1.82%.
The diff coverage is 78%.

@@             Coverage Diff              @@
##             master      #20      +/-   ##
============================================
- Coverage     88.46%   86.63%   -1.83%     
- Complexity       11       25      +14     
============================================
  Files             6       10       +4     
  Lines           182      232      +50     
  Branches         17       23       +6     
============================================
+ Hits            161      201      +40     
- Misses           14       19       +5     
- Partials          7       12       +5

Impacted Files	Coverage Δ	Complexity Δ
...tech/sourced/api/iterator/RootedRepoIterator.scala	`50% <50%> (ø)`	`6 <6> (?)`
...in/scala/tech/sourced/api/util/GitUrlsParser.scala	`88.88% <88.88%> (ø)`	`0 <0> (?)`
...tech/sourced/api/iterator/RepositoryIterator.scala	`90.9% <90.9%> (ø)`	`3 <3> (?)`
.../tech/sourced/api/iterator/ReferenceIterator.scala	`92.85% <92.85%> (ø)`	`4 <4> (?)`
...tech/sourced/api/provider/RepositoryProvider.scala	`81.81% <0%> (+2.27%)`	`4% <0%> (+1%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 990ae19...53ced47. Read the comment docs.

erizocosmico · 2017-09-04T13:48:05Z

src/main/scala/tech/sourced/api/util/GitUrlsParser.scala

+      } catch {
+        case _: URISyntaxException => None
+      }
+    }).distinct.min


Is min used because of something specific or just to get one of the results?

min === sorted.head

yes, that's what I meant, do we need them sorted or do we just want the first?

we want the first after sort them.

Discussed IRL: sorted is needed, lgtm then

I must admit that it's not clear to me either, why sorting is needed.
Could be a good idea to document that

erizocosmico · 2017-09-04T13:52:33Z

src/main/scala/tech/sourced/api/provider/RepositoryProvider.scala

+}
+
+object RepositoryProvider {
+  var provider: RepositoryProvider = _


if this is a singleton, what about moving everything to the RepositoryProvider object instead of having the class and manually manage the singleton?

Also, if you do RepositoryProvider("foo") and then RepositoryProvider("bar") what you get is a repository provider with "foo" as localPath, which is a bit misleading.

This code is from another PR. Can you comment this there?: #19

Sure!

Done: https://github.com/src-d/spark-api/pull/19/files#r136828187

erizocosmico · 2017-09-04T13:54:51Z

src/main/scala/tech/sourced/api/provider/SivaRDDProvider.scala

+object SivaRDDProvider {
+  var provider: SivaRDDProvider = _
+
+  def apply(sc: SparkContext): SivaRDDProvider = {


same as with RepositoryProvider

This code is from another PR. Can you comment this there?: #19

Done: https://github.com/src-d/spark-api/pull/19/files#r136828317

bzz

Having all changes from a different PRs together make review harder

ajnavarro · 2017-09-06T08:13:33Z

Having all changes from a different PRs together make review harder

Yes, sorry about that, but at this early stage on the project, is really difficult split functionality and go forward without depending of another in-process functionalities.

Using a base abstract class RootedRepoIterator, we add two implementations, one of them to iterate repository metadata (repository id, urls, is fork) and references metadata (repository_id, name, hash). With this RootedRepoIterator we should be able to implement CommitIterator and BlobIterator too. Filter logic must be implemented before start with BlobIterator. - Split test logic into Traits to be able to use them in all the Specs. - Added a BaseRootedRepoIterator trait with a helper to test iterators more easly.

erizocosmico reviewed Sep 4, 2017

View reviewed changes

This was referenced Sep 4, 2017

[DS] Model iterators #12

Closed

CommitIterator implementation #21

Merged

erizocosmico approved these changes Sep 5, 2017

View reviewed changes

bzz self-requested a review September 6, 2017 07:59

bzz approved these changes Sep 6, 2017

View reviewed changes

ajnavarro force-pushed the feature/repository-iterator branch from 402afa9 to 53ced47 Compare September 6, 2017 09:18

ajnavarro merged commit 5a0bf4a into src-d:master Sep 6, 2017

ajnavarro deleted the feature/repository-iterator branch September 6, 2017 09:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RepositoryIterator and ReferenceIterator implementations #20

RepositoryIterator and ReferenceIterator implementations #20

ajnavarro commented Sep 4, 2017 •

edited

ajnavarro commented Sep 4, 2017

codecov bot commented Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017

ajnavarro Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017

ajnavarro Sep 4, 2017

erizocosmico Sep 4, 2017

bzz Sep 6, 2017

erizocosmico Sep 4, 2017 •

edited

ajnavarro Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017

ajnavarro Sep 4, 2017

erizocosmico Sep 4, 2017

bzz left a comment

ajnavarro commented Sep 6, 2017

RepositoryIterator and ReferenceIterator implementations #20

RepositoryIterator and ReferenceIterator implementations #20

Conversation

ajnavarro commented Sep 4, 2017 • edited

ajnavarro commented Sep 4, 2017

Merge #19 first.

codecov bot commented Sep 4, 2017 • edited

Codecov Report

Choose a reason for hiding this comment

ajnavarro Sep 4, 2017 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

erizocosmico Sep 4, 2017 • edited

Choose a reason for hiding this comment

ajnavarro Sep 4, 2017 • edited

Choose a reason for hiding this comment

erizocosmico Sep 4, 2017 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bzz left a comment

Choose a reason for hiding this comment

ajnavarro commented Sep 6, 2017

ajnavarro commented Sep 4, 2017 •

edited

codecov bot commented Sep 4, 2017 •

edited

ajnavarro Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017 •

edited

ajnavarro Sep 4, 2017 •

edited

erizocosmico Sep 4, 2017 •

edited