NA_character_ not identified as NaN after importing it into Python #983

psads-git · 2023-01-24T16:04:43Z

I am using the following code inside a R magic cell:

%%R -o df

library(tibble)

df <- tibble(x = c("a", "b", NA))

However, when I run in another cell (a Python one):

df.isna()

I get

       x
1  False
2  False
3  False

In fact, the imported dataframe is

               x
1              a
2              b
3  NA_character_

The following fixes the problem:

df['x'] = df['x'].map(lambda val: np.nan if isinstance(val, rpy2.rinterface_lib.sexp.NACharacterType) else val)

My question is: Should not this be done automatically by rpy2?

(For more details, please see: https://stackoverflow.com/questions/75223099/na-character-not-identidied-as-nan-after-importing-it-into-python-with-rpy2 )

The text was updated successfully, but these errors were encountered:

ievgennaida · 2023-01-27T10:23:29Z

Might be related:
#979

lgautier · 2023-01-29T17:12:59Z

Thanks. This seems like a mistake in the conversion rules. The type of the column is object, and it leaves the special R value NA_character_ as a Python object (and pandas does not consider it a missing value).

In [8]: df.dtypes
Out[8]: 
x    object
dtype: object

(issue #983)

* The numpy converter did not list CHARSXP R objects as vectors. (issue #983) Also made the lookup for vector types a set (constant lookup time).

lgautier · 2023-02-05T22:59:22Z

With the PR #989 merged one now gets:

In [5]: df
Out[5]: 
      x
1     a
2     b
3  None

In [6]: df.isna()
Out[6]: 
       x
1  False
2  False
3  True

FedericoCozziUni · 2023-10-23T09:21:20Z

Hello,
I get this behavior with Python 3.11.3 & rpy2 3.5.14

>>> import rpy2.robjects as ro
>>> ro.r("b = c(NA,'def')")
>>> ro.r("df = data.frame(b)")
>>> rdf = ro.r('df')
>>> print(rdf)
     b
1 <NA>
2  def
>>> from rpy2.robjects.conversion import localconverter
>>> from rpy2.robjects import pandas2ri
>>> with localconverter(ro.default_converter + pandas2ri.converter):
...     df = ro.conversion.rpy2py(rdf)
>>> print(df)
               b
1  NA_character_
2            def

I would like to get a Python value (e.g. None) instead of NA_character_
Which rpy2 version should I use?

D3SL · 2024-06-03T09:13:13Z

I'm having this bug with RPY 3.5.16. I was able to get around it briefly by using an older version of rpy2 (due to #1106) and converting through pandas2ri.activate(). In 3.5.16 there doesn't seem any way around this.

3.5.16 seems to be a severe regression in general when it comes to conversions. Previously I could run code like foo=ro.r('''RCODE''') or ro.globalenv['bar']=pydata without issue. Now I have to use the below undocumented patterns to convert between R and Python:

with (ro.default_converter + pandas2ri.converter).context():
   ro.r.assign('foo',ro.conversion.py2rpy(data) )

 with (ro.default_converter + pandas2ri.converter).context():
   foo=ro.conversion.rpy2py(ro.globalenv['bar'])

psads-git added the bug Something isn't working label Jan 24, 2023

psads-git changed the title ~~NA_character_ not identidied as NaN after importing it into Python~~ NA_character_ not identified as NaN after importing it into Python Jan 27, 2023

lgautier added a commit that referenced this issue Jan 29, 2023

The numpy converter did not list CHARSXP R objects as vectors.

2fefbe1

(issue #983)

lgautier mentioned this issue Jan 29, 2023

The numpy converter did not list CHARSXP R objects as vectors. #987

Merged

lgautier added a commit that referenced this issue Feb 5, 2023

The numpy converter did not list CHARSXP R objects as vectors. (#987)

144a59e

* The numpy converter did not list CHARSXP R objects as vectors. (issue #983) Also made the lookup for vector types a set (constant lookup time).

lgautier closed this as completed Feb 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NA_character_ not identified as NaN after importing it into Python #983

NA_character_ not identified as NaN after importing it into Python #983

psads-git commented Jan 24, 2023

ievgennaida commented Jan 27, 2023

lgautier commented Jan 29, 2023

lgautier commented Feb 5, 2023

FedericoCozziUni commented Oct 23, 2023

D3SL commented Jun 3, 2024 •

edited

Loading

NA_character_ not identified as NaN after importing it into Python #983

NA_character_ not identified as NaN after importing it into Python #983

Comments

psads-git commented Jan 24, 2023

ievgennaida commented Jan 27, 2023

lgautier commented Jan 29, 2023

lgautier commented Feb 5, 2023

FedericoCozziUni commented Oct 23, 2023

D3SL commented Jun 3, 2024 • edited Loading

D3SL commented Jun 3, 2024 •

edited

Loading