Skip to content

Releases: moj-analytical-services/splink_graph

v0.8.2 splink_graph update for spark 3.1.1 and graphframes 0.8.2

20 Feb 00:56
Compare
Choose a tag to compare

splink_graph updated so it works on contemporary version of Spark (3.1.1) and graphframes (0.8.2 )

Full Changelog: v0.5.0...v0.8.2

v0.5.0 documentation update and a few more metrics

14 Mar 08:52
e926687
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.21...v0.5

v0.4.21 latest version with fixes and some new cluster metrics

19 Oct 10:20
ffbd691
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.19...v0.4.21

v0.4.19 latest version with small fixes

31 Aug 15:39
774a4f9
Compare
Choose a tag to compare

main additions from previous release:

  • support for python 3.6 added and tested
  • graph hash function has its own pandas_udf

version to be used in AWS Glue and ONS DAP

04 Aug 12:05
Compare
Choose a tag to compare

v0.4.12 added connected components

01 Aug 09:14
7e057ec
Compare
Choose a tag to compare

added connected components functionality

  • graphframes (working in a distributed manner) version / needs jars (included) / util functions help setting up
  • networkx (working on local master node) version since not every case needs scale
  • nx based cc also saves optionaly each subcluster in networkx edgelist format (this is useful for embeddings ... tested and it works)

added number of bridges cluster metric

  • number of bridges cluster metric is added. Useful for finding problematic clusters/subgraphs

v0.4.9 API for demos etc

21 Jul 06:13
3190efc
Compare
Choose a tag to compare
Merge pull request #42 from moj-analytical-services/APIfixes

fix some default parameter nonsense i had on edge_metrics

more consistent API

12 Jul 14:01
9bdde26
Compare
Choose a tag to compare

From this release owardn, functions to be seperated logically in appropriate submodules:

eg

from splink_graph.cluster_metrics import x
from splink_graph.node_metrics import x
from splink_graph.edge_metrics import x

Also column names are more consistent.

splink-graph 0.3.12

08 Jun 11:46
91c3775
Compare
Choose a tag to compare
  • added more cluster statistics functions
  • added custom fields to pandas udf functions
  • added more tests

Tidy up and consolidate cluster statistics functionality

27 Apr 17:07
cc49c22
Compare
Choose a tag to compare
  • linting
  • diameter_radius_transitivity function added in order to get most important cluster statistics
    from one pandas_udf function