Skip to content

Unipisa/WeaviateNET

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Weaviate.NET

This library is designed to wrap Weaviate vector DB for .NET core. The core API has been wrapped with NSwagStudio and then some extra layer has been added. The API mapping is incomplete and Unit tests should be generalized (they currently use a demo instance).

A great attention has been paid to the type system so that many errors will be detected at compile time.

It is designed to manipulate Weaviate objects and not just to adapt the connection to some LLM framework as other NuGet packages do.

What's new in version 1.23.9.1

important: This is a major release of the library. Now configuration of modules is fully supported through attributes.

  • Tested against version 1.23.9 of Weaviate
  • Added support for module configuration both at class and property level

Version 1.21.6.2

  • Added support for WeaviateRef type, but it is still in alpha.
  • Fixed a bug when using nullable value types in the schema.
  • Fixed a bug in the query generator for GeoCoordinates type.

Version 1.21.1.5

important: This is a major release with significant improvements in the query support.

  • Implemented type safe GraphQL Generator for Get and Aggregate queries: now it is possible to write queries in a more concise way. See the example below. Results are still to be parsed using JQuery.

PowerShell support

The library is now available through PowerShell cmdlets though the usage requires some level of understanding of the Newtonsoft.Json library and in particular how to access JObject values from PowerShell. However, sometimes is useful to access the object model from PowerShell. Over time access to generic methods will be supported, for the moment the module is really an experimental feature.

The module is available from PowerShell Gallery and can be installed with:

Install-Module PSWeaviateNET

To configure the module:

Import-Module PSWeaviateNET

$conf = new-object PSWeaviateNET.WeaviateConfiguration
$conf.WeaviateEndpoint = "https://yourWeaviateUrl/v1"
$conf.$conf.WeaviateApiKey = "Your key if needed"

ConvertTo-Json $conf > weaviate.conf

To use the module:

Import-Module PSWeaviateNET

Connect-Weaviate -ConfigurationFile weaviate.conf
Find-WeaviateObjects -Query "{ Get { MovieDBTest { film genre } } }"
$s = Get-Schema
$s.Classes

Implementation status

The library implements almost all the schema, class, and object manipulation. It is possible to perform GraphQL queries though the current implementation is pretty raw. Thanks to Newtonsoft JObject it is possible to parse returned data pretty easily.

Note that it is still possible to use the NSwag generated API from Weaviate swagger file by using the Client property of the WeaviateDB connection. By missing APIs we intend that the calls have not yet been nicely wrapped in the object model and tested.

Missing APIs

  • References management
  • /batch/references endpoint is not implemented
  • Explore query generator
  • Parser of query results

Untested APIs

  • Tenant
  • Shards
  • Backups
  • Classifications

Roadmap

After implementing the missing endpoints in the object model we will work in providing a better GraphQL experience. In the ideal world a LINQ provider would be great to integrate queries into C#.

Package versioning

The Weaviate.NET package on NuGet follows the versioning schema of Weaviate: version 1.20.5.1 of the package is the first release tested on version 1.20.5 of Weaviate. So look at the last number to check the library revision.

Using the library

The data model is respectful of the database object model. To start create a connection (using the authorization key) and update the Schema to load the configuration:

weaviateDB = new WeaviateDB("https://yourhost/v1", "WEAVIATE_KEY");
await weaviateDB.Update();

In the Weaviate model objects are made of a set of fields (i.e. properties) with the appropriate data type (see WeaviateDataType class for the full list). Weaviate.NET maps properties to class fields (with few restrictions of course). So we can define the class Movie to describe a movie document:

[IndexNullState] // annotation to configure indexing options
public class Movie
{
    public string? film;
    public string? genre;
    public string? leadStudio;
    public int audienceScore;
    public double profitability;
    public int rottenTomatoes;
    public double worldWideGross;
    public int year;
}

Notice that all names follow the camel notation, if you use pascal notation Weaviate will lower case the first letter.

If we want to expose the id field in the Movie class we can use the JsonIgnore attribute to avoid mapping it to a property:

    [JsonIgnore]
    public Guid? id;

Important: If you use a class with less properties the library will work, this is useful to perform schema updates or property masking. Magic is performed by Newtonsoft serializer/deserializer.

We can create the class in the schema and load data (of type WeaviateObject< Movie>):

var mc = weaviateDB.Schema.NewClass<Movie>("MovieDBTest");
Movie[] movies = ReadMoviesDBFromCsv();
var toadd = new List<WeaviateObject<Movie>>();
foreach (var m in movies) {
  var d = mc.Create(); // create the document
  d.Properties = m;    // set the properties (of type Movie)
  toadd.Add(d);
}
await mc.Add(toadd);

To query the database you can use a query generator which allows you to compose a query from typed constituents. The generator allows you to generate the query text. This is an evolution of the library though it is not yet the final versione of query support.

        var qg = mc.CreateGetQuery(selectall: true);
        mc.Filter
          .Limit(5)
          .NearText("robot in the future");
        var q = new GraphQLQuery();
        q.Query = qg.ToString();
        var ret = await weaviateDB.Schema.RawQuery(q);
        var d = ret.Data["Get"];
        var a = data.ToObject<Movie[]>();

In any case you can use explicitly the GraphQL syntax and the ability of JObject to deserialize a field.

        var q = new GraphQLQuery();
        q.Query = @"{
  Get {
    MovieDBTest(
      limit: 5
      nearText: { concepts: [ "robot in the future" ] }
    )
    {
      film
      genre
      leadStudio
      audienceScore
      profitability
      rottenTomatoes
      worldWideGross
      year
    }
  }
}";
        var ret = await weaviateDB.Schema.RawQuery(q);
        var d = ret.Data["Get"];
        var a = data.ToObject<Movie[]>();

Previous changes

Available in the Wiki documentation

About

dotnet core library to use Weaviate Vector DB

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published