Pyorient Create_Record Passes Null Values to OrientDB When Unicode Object is Used Instead of String #6757

JSv4 · 2016-09-29T18:41:09Z

OrientDB Version, operating system, or hardware.

v2.2.1

Operating System

[X ] Linux - Debian 8 x64

Expected behavior and actual behavior

Just for starters, I'm new to OrientDB and am not a python expert, so if this is an easy fix, my apologies and thanks in advance.

I have an OrientDB with a class called Nodes. The Node class has the following variables:

Abs_Address (String) [Also the index... no duplicates allowed]
Content (String)
Heading (String)
Type (String)
Value (String)
Parent (Link -->Nodes)
Children (Linkset -->Nodes)

I have a pyorient method to create a new node that has created literally thousands of them with no problems... so long as Abs_Address is not a unicode object. However, I've run into a weird problem that I only noticed when the Abs_Address was a unicode object. When my Abs_Address str object contained the unicode character u"\u2013", python treated it as type (unicode). When that happens, I started noticing I was getting a duplicate entry error from OrientDB. This was odd to me, so I traced it and found that pyorient's create record method was working in my code... unless the record I created included a unicode object... whenever that happened... the record that OrientDB reported getting was full of null data. This explained the duplicate record error. Every record that had a unicode character (I assume I can't check tens of thousands of possible records) was either passed by pyorient or interpreted by OrientDB as being null. This clearly happened more than once and that is why the duplicate record error was being thrown.

Here's the code I used to add records



def add_Node(self, Ono = None):

    if Ono!=None:

        new_Node = {'@Nodes':
                {
                    "Abs_Address":Ono.absolute_address,
                    'Content':Ono.content,
                    'Heading':Ono.heading,
                    'Type':Ono.type,
                    'Value':Ono.value
                }
       }

        try:
            print 'trying to add'
            rec_position = self.pyo_client.record_create(14, new_Node)  
            return rec_position

        except Exception as inst:
            print 'Exception:'
            print inst
            print 'Absolute address arg is'
            print Ono.absolute_address
            print self.get_Node_By_Address(Ono.absolute_address)
            print 'Return rid of new object'
            return self.get_Node_By_Address(Ono.absolute_address)[0] #this is what to use where we suspect the Abs_Address already exists... originally this only caught duplicate entry errors but I expanded it to figure out what's wrong with the code. 

    else:
        return 'NONE'

#where record already exists... get it by its absolute address
def get_Node_By_Address(self,Abs_Address=''):
    return self.pyo_client.query("SELECT * FROM Nodes WHERE Abs_Address='"+str(Abs_Address.replace(u"\u2013","-"))+"'")

For more info and some of the error message output, see this stackexchange post: http://stackoverflow.com/questions/39761684/general-python-unicode-ascii-casting-issue-causing-trouble-in-pyorient

Steps to reproduce the problem

As I said, the code above threw an error that there was a duplicate key anytime Abs_Address was type unicode. I found a way to get around this, but it's not a workable fix for me. Basically, I scanned for the specific unicode character causing trouble (an ndash u"u\2013") and replaced with a minus sign. This works, but there are several problems with this. First of all, I can't be sure this is the only unicode address I'll get, so there may still be errors. Worse, I know other variables will likely have unicode string data as there will be unusual symbols and such. Looking at the errors I was getting, it's clear that, whenever OrientDB got something that was a unicode object, it saw it as a null object. My short-term fix is to check for ndash, replace with minus, and then cast to type str(). This won't work long-term:

new_Node = {"@Nodes":
                        {
                            "Abs_Address":str(Ono.absolute_address.replace(u"\u2013","-")), # replace utf-8 symbol (ndash) to ascii (-)... WHY CAN'T I USE UTF-8
                            "Content":Ono.content.replace(u"\u2013","-"),
                            "Heading":Ono.heading.replace(u"\u2013","-"),
                            "Type":Ono.type.replace(u"\u2013","-"),
                            "Value":Ono.value.replace(u"\u2013","-")
                        }
        }

The text was updated successfully, but these errors were encountered:

tglman · 2016-09-30T09:42:41Z

hi @jmanjohns2,

did you already raised and issue here : https://github.com/mogui/pyorient/ ? that's the correct repository for pyorient.

Regards

JSv4 · 2016-09-30T13:04:18Z

Hi @tglman,

On stackexchange, people suggested I report it here as it might be a problem with OrientDB too. I'll submit the same post to pyorient's github as well.

It's now on the pyorient issue list too:

mogui/pyorient#219

G4br13l3 · 2016-12-27T18:10:44Z

Hi @jmanjohns2, I'm closing this issue as this is the wrong repository. As already reported by @tglman the correct one is:

https://github.com/mogui/pyorient/

Thanks.

lvca added the bug label Sep 29, 2016

lvca assigned tglman Sep 29, 2016

lvca added this to the 2.2.x (next hotfix) milestone Sep 29, 2016

G4br13l3 added the wrong repository label Dec 27, 2016

G4br13l3 closed this as completed Dec 27, 2016

robfrank modified the milestones: 2.2.x (next hotfix), 2.2.15 Jan 16, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pyorient Create_Record Passes Null Values to OrientDB When Unicode Object is Used Instead of String #6757

Pyorient Create_Record Passes Null Values to OrientDB When Unicode Object is Used Instead of String #6757

JSv4 commented Sep 29, 2016 •

edited

tglman commented Sep 30, 2016

JSv4 commented Sep 30, 2016 •

edited

G4br13l3 commented Dec 27, 2016

Pyorient Create_Record Passes Null Values to OrientDB When Unicode Object is Used Instead of String #6757

Pyorient Create_Record Passes Null Values to OrientDB When Unicode Object is Used Instead of String #6757

Comments

JSv4 commented Sep 29, 2016 • edited

OrientDB Version, operating system, or hardware.

Operating System

Expected behavior and actual behavior

Steps to reproduce the problem

tglman commented Sep 30, 2016

JSv4 commented Sep 30, 2016 • edited

G4br13l3 commented Dec 27, 2016

JSv4 commented Sep 29, 2016 •

edited

JSv4 commented Sep 30, 2016 •

edited