merging upstream #1

codegod100 · 2021-11-21T00:11:59Z

No description provided.

This PR fixes an error returned if there are no uids for geo query. { tourist(func: near(loc, [-122.469829, 37.771935], 1000) ) { name } } It used to return an error while it should return an empty result.

Bumps [pyyaml](https://github.com/yaml/pyyaml) from 5.3.1 to 5.4. - [Release notes](https://github.com/yaml/pyyaml/releases) - [Changelog](https://github.com/yaml/pyyaml/blob/master/CHANGES) - [Commits](yaml/pyyaml@5.3.1...5.4) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

) Bumps [pyyaml](https://github.com/yaml/pyyaml) from 5.3.1 to 5.4. - [Release notes](https://github.com/yaml/pyyaml/releases) - [Changelog](https://github.com/yaml/pyyaml/blob/master/CHANGES) - [Commits](yaml/pyyaml@5.3.1...5.4) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Bumps [rsa](https://github.com/sybrenstuvel/python-rsa) from 3.4.2 to 4.1. - [Release notes](https://github.com/sybrenstuvel/python-rsa/releases) - [Changelog](https://github.com/sybrenstuvel/python-rsa/blob/main/CHANGELOG.md) - [Commits](sybrenstuvel/python-rsa@version-3.4.2...version-4.1) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* update VAULT_VERSION=latest for vault docker image. * add instructions for using bound_cidr_list role where sercret-id is no longer needed.

This commit implements map-reduce based online restore. The offline restore has been removed. It also implements backup-export on top of map-reduce based restore. Co-authored-by: Manish R Jain <manish@dgraph.io>

…gn (#7666) This PR along with the previous restore PR is a BREAKING change. Marking this PR as breaking, because we forgot to mark the previous one. - Make restore map run concurrently for faster execution. - Add progress updates every second for both map and reduce phase. - Refactor code to break out the map and reduce code into separate files. - Make reduce cheap by avoiding marshal-unmarshal step. With these changes, I see map phase is faster than reduce and both finish in about 2 mins each. Map runs at 200 MBps, while Reduce runs at 130 MBps, processing 20GB of uncompressed data in under 5 mins. Changes: * Work on optimizing restore * Some file moves * Moved map output to a temp directory. * Add restoreTs in export-backup Co-authored-by: Ahsan Barkati <ahsanbarkati@gmail.com>

@recurse

…t levels with same alias are selected. (#7639) We are generating incorrect order of the fields in a result of the @normalize directive when there are multiple fields with same Alias at nested levels. As a result of that in resulting JSON, the nodes with the same Alias were not converted to list. For example, if we have below data in dgraph . { "_id": "Process", "Name": "Process", "Extends": { "_id": "Core/Namespace", "Name": "Namespace" } } And if we do below query with @recurse and @normalize directive { item as var(func:eq(_id,"Process")) b2(func:uid(item))@normalize@recurse { Name:Name id:_id Extends } } We will get this response from Dgraph where fields with same alias are not together and while converting response to JSON , we are not able to combine fields with same Alias in a list. "b2": [ { "Name": "Process", "id": "Process", "Name":"Namespace", "id": "Core/Namespace" } ] In this PR we fix this in normalize, by putting all fields with the same Alias together. So now response of above query will be "b2": [ { "Name": [ "Process", "Namespace" ], "id": [ "Process", "Core/Namespace" ] } ] This behaviour is broken by PR #6362. Prior to this , we used sorting on fields which guarantee that fields with same name or Alias comes together.

Append the directory prefix in the s3 paths for rename operation.

… (GRAPHQL-1118) (#6649) This PR makes Zero HTTP endpoints available in GraphQL admin. It also adds a flag `--disable_admin_http` in Zero which can be used to disable the administrative HTTP endpoints in Zero. The following are considered as the administrative endpoints for Zero: * `/state` * `/removeNode` * `/moveTablet` * `/assign` * `/enterpriseLicense` RFC: https://discuss.dgraph.io/t/moving-zero-http-endpoints-to-alpha-graphql-admin/10725

…7665) Bumps [pygments](https://github.com/pygments/pygments) from 2.6.1 to 2.7.4. - [Release notes](https://github.com/pygments/pygments/releases) - [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES) - [Commits](pygments/pygments@2.6.1...2.7.4) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

* update compose/run.sh to work on mac, support new bin path, modularize script for readability

- Gzip compression is slow. For backups, I've switched to using snappy for compression. This change is backwards compatible. - Refactor how backup reader is generated to deal with the compression that's used for creating backup. - Avoid creating a z.Buffer for every call to marshal uid list. Instead, reuse the same one over and over. With these changes, we can backup 150GB of data in ~50 mins on my workstation, using 8 goroutines and under 1.5GB of jemalloc memory.

With the refactor of File System and Minio handlers in enterprise backup code, the operations that they now do are pretty standard. So, we can move that code out to x package, under open source, to be used by export as well.

@dgraph

This PR adds support for language tags in GraphQL. This feature is available in Dgraph. We exposed language tags to GraphQL using @dgraph directive. RFC: https://discuss.dgraph.io/t/rfc-allow-language-tag-support-in-graphql/13027 Co-authored-by: Abhimanyu Singh Gaur <12651351+abhimanyusinghgaur@users.noreply.github.com>

* Remove unwanted arrays * Simplify code and add comments * Fix lint errors

* Remove unused function * Add tracing for existence query execution * Fix tests

) There might be a scenario where in a multi-tenant cluster a malicious user leases out a large number of UIDs or bump it to a large number, causing an issue for the other tenants. This PR adds a super flag --limit to zero. This flag limits the number of UIDs (uid-lease=100000;) that can be leased by a namespace within an interval specified by refill-interval=30s;. This PR also changes the xidmap's New() API as we need to pass in the dgo client. Along with it, it does some minor cleaning where flag parsing was in the critical path.

…#7632) The GetNew function may fetch the posting list from the cache. This cached copy of the posting list might be still in use at other places and so we need to acquire a lock on it before we read it. Co-authored-by: NamanJain8 <jnaman806@gmail.com>

Fix queries involving `regexp`, `allofterms`, `alloftext` and `match` function with pagination. These functions relies on indexes and needs to fetch postings/uids for each relevant index key to generate the final result. Remove early pagination for these cases.

There was some noise in logs due to errors printed when JWT was not found in context. This creates a bad user experience and creates confusion for users. This PR just don't print these error logs. The proper handling is done in the downstream functions.

* Return error if operand for math functions are invalid

For the upgrade in versions <= 20.07, dgraph upgrade tool updates the acl relates data. Hence, it had --acl flag. With the update of the upgrade tool for 21.03, it should work without ACL as well. This PR adds that support.

… path (#7679)

When there was a single plist to start from and there was nothing in mutationMap, the encode does not happen and hence the splits were not getting generated. This PR fixes that issue.

Open-source the backup and restore feature.

…isition (#8068) Optimize populateSchema() by avoiding repeated lock acquisition. we can get the schema for the predicate once and then check for the required field without taking a read lock.

The bulk loader creates the p directory and hence should have the correct external magic version. With the change that introduced the magic version, somehow bulk loader got missed out.

Negative offsets (e.g., offset: -4) can cause panics when sorting. This can happen when the query has the following characteristics: 1. The query is sorting on an indexed predicate 2. The results include nodes that also don't have the sorted predicate 3. A negative offset is used. (panic trace is from v20.11.2-rc1-23-gaf5030a5) panic: runtime error: slice bounds out of range [-4:] goroutine 1762633 [running]: github.com/dgraph-io/dgraph/worker.sortWithIndex(0x1fb12e0, 0xc00906a880, 0xc009068660, 0x0) /ext-go/1/src/github.com/dgraph-io/dgraph/worker/sort.go:330 +0x244d github.com/dgraph-io/dgraph/worker.processSort.func2(0x1fb12e0, 0xc00906a880, 0xc009068660, 0xc0090686c0) /ext-go/1/src/github.com/dgraph-io/dgraph/worker/sort.go:515 +0x3f created by github.com/dgraph-io/dgraph/worker.processSort /ext-go/1/src/github.com/dgraph-io/dgraph/worker/sort.go:514 +0x52a

The Docker SDK will not allow mapping a port that it not explicitly exposed in Dockerfile (the docker cli does not have the same restrictions, any port can be mapped with the -P command line parameter). For more background: https://discuss.dgraph.io/t/docker-compose-dgraph-standalone/14635

Fixes 2 race conditions: - Txn's MaxAssignedSeen and AppliedIndexSeen were accessed without locks, this leads to a read-write race. - Rollups are not supposed to modify the List.plist. But it was modifying the plist by assigning it in out.plist that further modifies it by running splits.

Update ACL tests to cover the behavior for dgraph.all permissions with reverse edges. When the dgraph.all permission is set, it also applies to reverse edges.

@default

…es at create and update (#8017) Adds support for @default directive and don't remove fields from input types.

* chore(release): adding changelog for v21.03 (#7696) * adding changelog for v21.03 (cherry picked from commit a77bbe8) * update changelog (#7905) Update the changelog. (cherry picked from commit ea1cb5f) * doc(changelog): Add v21.03.2 changelog. (#8000) (cherry picked from commit a3d41b1) Co-authored-by: aman bansal <bansalaman2905@gmail.com> Co-authored-by: Ahsan Barkati <ahsan@dgraph.io> Co-authored-by: Daniel Mai <daniel@dgraph.io>

…#8092) This PR improves the performance of the rollups. Also, it fixes memory issues of the bulk loader. - Use the optimized Split API from the Sroar to split the bitmap (while doing rollup). This is an improvement over the recursive binary split that was very slow. - In bulk loader, use HandoverSkipList API instead of WriteBatch to avoid the memory constraints of doing a huge batch write. - Also, it introduces BitForbidPosting which limits the capability to store large posting lists that generate splits of length greater than the --limit max-splits flag. Once a posting list is marked as Forbidden it cannot be recovered.

…8097) This will attempt to connect to Kafka over TLS using the system certs. * Add helper function x.TLSBaseConfig. Sets the min TLS version to v1.2 along with the minimum cipher suites.

(cherry picked from commit d4aeea2)

… the tablets (#8… (#8100) * feat: adding bulk call for alpha to inform zero about the tablets (#8089) * adding bulk call for alpha to inform zero about the tablets (#8088) (cherry picked from commit c204d0a) * fix(zero): fix update membership to make bulk tablet proposal instead of multiple small (#8090) Converting the smaller tablet proposals into a single bulk call makes the execution faster. Else, this update becomes of the order of O(n^2), where n is the number of tablet updates. (cherry picked from commit 695c6d7) Co-authored-by: aman bansal <bansalaman2905@gmail.com>

…8101) There is an issue in ExtractNamespaceFromPredicate. The issue is the parsing was done assuming ns in <ns>-<attr> to be decimal (actually it is hexadecimal). This leads to the following issues. A predicate a-name, it was skipped. A predicate 11-name was parsed as namespace 11, actually it is namespace 17 (0x11). (cherry picked from commit 0c9f601)

(cherry picked from commit 89b573a)

* chore(changelog): adding changelog for 21.12 * adding tag name

* Go 1.17 * Node v14

…8105)

"Dgraph version" should now be based on v21.12.

* opening up whitelist for local setup * update to private address cidrs * increase lint test timeout * Revert "increase lint test timeout" This reverts commit 5f2ded8. * Update golangci-lint.yml Fix OOM errors using GOGC Co-authored-by: Akon Dey <akon-dey@users.noreply.github.com>

* [draft] fix(ci): ci porting * adding more build stages adding next build stages * proto & unit tests adds unit tests & Proto tests * protobuf compiler adds protobuf compiler step * Update go.yml * debug commit env specific debug commit * go path go path issues * Update go.yml * test fixes test fixes * testing testing * Update and rename go.yml to CI.yml finalized all steps * port PR #8144 * fix gitignore for test logs * fix hmac, dummy secrets were pushed * rm dummy secrets * Update golangci-lint.yml increase timeouts

aman-bansal and others added 30 commits March 25, 2021 12:51

fixing sys backup test

2949f56

return if no uids exist in queries for Geo (#7651)

0f1ee1e

This PR fixes an error returned if there are no uids for geo query. { tourist(func: near(loc, [-122.469829, 37.771935], 1000) ) { name } } It used to return an error while it should return an empty result.

Chore(vault) - Update with instructions for cidr_bound_list (#7661)

d05980c

* update VAULT_VERSION=latest for vault docker image. * add instructions for using bound_cidr_list role where sercret-id is no longer needed.

Perf(restore): Implement map-reduce based restore (#7664)

98f6828

This commit implements map-reduce based online restore. The offline restore has been removed. It also implements backup-export on top of map-reduce based restore. Co-authored-by: Manish R Jain <manish@dgraph.io>

Fix s3 backup copy (#7669)

b5be8ce

Append the directory prefix in the s3 paths for rename operation.

chore(compose) - update run.sh wrapper script (#7676)

1df37a9

* update compose/run.sh to work on mac, support new bin path, modularize script for readability

fix(run.sh): Add a needed quote to make run.sh work.

9702095

Move UriHandler code out to x package (#7684)

1169d2e

With the refactor of File System and Minio handlers in enterprise backup code, the operations that they now do are pretty standard. So, we can move that code out to x package, under open source, to be used by export as well.

picking up changelog from 20.11 to master (#7697)

ef382b1

fix(export): use UriHandler for exports (#7690)

d174d7a

Improve IsDir and IsFile (#7704)

c48623d

Chore(GraphQL): Minor refactoring of mutation_rewriter.go (#7653)

17f7e90

* Remove unwanted arrays * Simplify code and add comments * Fix lint errors

Fix(GraphQL): Fix Execution Trace for Add and Update Mutations (#7656)

5bc5c26

* Remove unused function * Add tracing for existence query execution * Fix tests

Use Vault without secret ID (#7642)

f1f26cb

fix(Query): [Breaking] Return error for illegal math operations. (#7631)

48a8bd4

* Return error if operand for math functions are invalid

fix(flag): fix bulk loader flag and remove flag parsing from critical…

eda10f7

… path (#7679)

NamanJain8 and others added 30 commits October 1, 2021 02:28

fix(split): enable split of posting list with single plist (#8062)

3592353

When there was a single plist to start from and there was nothing in mutationMap, the encode does not happen and hence the splits were not getting generated. This PR fixes that issue.

Make backup-restore an open source feature (#8067)

898ecd1

Open-source the backup and restore feature.

opt(schema): Optimize populateSchema() by avoiding repeated lock acqu…

d935b8b

…isition (#8068) Optimize populateSchema() by avoiding repeated lock acquisition. we can get the schema for the predicate once and then check for the required field without taking a read lock.

fix(magic): fix the magic version in bulk loader etc (#8070)

84c4ea2

The bulk loader creates the p directory and hence should have the correct external magic version. With the change that introduced the magic version, somehow bulk loader got missed out.

contrib(sbs): add support for mutations replay in sbs (#8072)

4a716ad

fix(lambda): upgrade lambda dependencies to fix vulnerabilities (#8074)

81977e4

fix(fragment): merge the nested fragments fields (#8075)

3f514fe

test(acl): Test reverse edge access with dgraph.all. (#8093)

0fee18e

Update ACL tests to cover the behavior for dgraph.all permissions with reverse edges. When the dgraph.all permission is set, it also applies to reverse edges.

chore(deps): update badgerdb to latest to get manifest issue fix (#8094)

55fc5b2

feat(graphql): adds @default directive for setting default field valu…

7e6f813

…es at create and update (#8017) Adds support for @default directive and don't remove fields from input types.

feat(cdc): Add superflag to enable TLS without CA or certs. (#7946) (#…

3bfd269

…8097) This will attempt to connect to Kafka over TLS using the system certs. * Add helper function x.TLSBaseConfig. Sets the min TLS version to v1.2 along with the minimum cipher suites.

fix(restore): return nil if there is error (#7899) (#8098)

ebe64f7

(cherry picked from commit d4aeea2)

Don't ban namespace in export_backup (#8099)

da16e22

(cherry picked from commit 89b573a)

chore(changelog): adding changelog for 21.12 and adding tagname (#8096)

4aeb11f

* chore(changelog): adding changelog for 21.12 * adding tag name

build: Update release.sh. (#8102)

021e21f

* Go 1.17 * Node v14

fix(test): increase pending query limit and max splits (#8104)

26ef562

fix(deps): update badger to fix race condition with drop operation (#…

4fa5c04

…8105)

updating gqlparser version with race fix (#8106)

d62ed5f

build: Fix Dgraph version string for master branch. (#8108)

d6dbacb

"Dgraph version" should now be based on v21.12.

chore(issues): Reopen GitHub issues. (#8114)

85a3dea

Update dgraph.bib (#8128)

90b8c76

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

merging upstream #1

merging upstream #1

Uh oh!

codegod100 commented Nov 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

30 participants

merging upstream #1

Are you sure you want to change the base?

merging upstream #1

Uh oh!

Conversation

codegod100 commented Nov 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

30 participants