error in method _scores_to_accuracy() of Matcher.py #23

caixiaocherry · 2019-10-03T01:53:24Z

Got error when i tried to run the example at:
m.fit_scores(balance=True, nmodels=10)
When the function calls the static method _scores_to_accuracy(), got error of mis matching size.
In this function, y is a DataFrame with shape as (n, 1), while preds is a list. I fixed the code by convert preds to a matrix

def _scores_to_accuracy(m, X, y):
    preds = [1.0 if i >= .5 else 0.0 for i in m.predict(X)]
    return (y == preds).sum() * 1.0 / len(y)

def _scores_to_accuracy(m, X, y):
    preds = [1.0 if i >= .5 else 0.0 for i in m.predict(X)]
    # return (y == preds).sum() * 1.0 / len(y)
    return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

The code above works for me.

The text was updated successfully, but these errors were encountered:

xiaolinzhuo · 2019-10-03T19:54:56Z

I found the same problem! I fixed it with
return (y.values == preds).sum() * 1.0 / len(y)

swyoon · 2019-10-06T09:53:23Z

The solution by @caixiaocherry worked for me, but @xiaolinzhuo 's didn't.

tszumowski · 2019-11-15T14:00:52Z

@caixiaocherry 's solution worked for me. This was with these versions:

pandas==0.25.1
-e git+git@github.com:benmiroglio/pymatch.git@982778f3fe438f6d6b2905472a3951722edca266#egg=pymatch

@caixiaocherry it may be worth making a PR given multiple people had this issue?

tszumowski · 2019-11-15T14:02:08Z

Actually on additional inspection, this seems similar to
#12

diogoalvesderesende · 2020-05-07T07:17:51Z

Hey everyone, I am quite new to python and am having this issue as well. Can anyone tell me how to make the changes that @caixiaocherry suggested? I tried to google it but to no avail. Thanks a lot!

caixiaocherry · 2020-05-07T15:59:04Z

@diogoalvesderesende , you only need to comment out the following return statement:
return (y == preds).sum() * 1.0 / len(y)
to following return statement:
return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

This shall solve the problem.

w2998 · 2020-05-16T22:38:34Z

@caixiaocherry 's solution did not for me, it returned error Fitting Models on Balanced Samples: 1\100Error: 'DataFrame' object has no attribute 'to_numipy' While. @xiaolinzhuo 's solution works, but the average accuracy seems wrong, it returned value, which was greater than 1. I was using lending club's data as this article https://medium.com/@bmiroglio/introducing-the-pymatch-package-6a8c020e2009.

caixiaocherry · 2020-05-16T22:55:08Z

U sure u wrote the correct function name? It should be to_numpy not to_numipy.

On Sat, May 16, 2020 at 3:38 PM w2998 ***@***.***> wrote: @caixiaocherry <https://github.com/caixiaocherry> 's solution did not for me, it returned error Fitting Models on Balanced Samples: 1\100Error: 'DataFrame' object has no attribute 'to_numipy' While. @xiaolinzhuo <https://github.com/xiaolinzhuo> 's solution works, but the average accuracy seems wrong, it returned value, which was greater than 1. I was using lending club's data as this article ***@***.***/introducing-the-pymatch-package-6a8c020e2009 . — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#23 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AIG56LA2LKGIIGY4NXGA6ZDRR4IXLANCNFSM4I45MO3Q> .

-- Xiao Cai *Data Science Team | 502.296.7789 (c) | 425.298.6877 (o) | xiao@astrumu.com <xiao@astrumu.com>* *AstrumU*

w2998 · 2020-05-16T22:57:40Z

Oh, yes. that was a typo. Now, it works, and results make sense.

brendachang12 · 2020-07-31T16:23:27Z

I tried both
return (y.values == preds).sum() * 1.0 / len(y)
return (y.to_numpy().T == preds).sum() * 1.0 / len(y)
in the Matcher.py file but neither of them worked. Is there another way to get around this error?

This is the error I'm receiving:
Fitting Models on Balanced Samples: 1\100Error: Unable to coerce to Series, length must be 1: given 4484
Fitting Models on Balanced Samples: 1\100Error: Unable to coerce to Series, length must be 1: given 4484
Fitting Models on Balanced Samples: 1\100Error: Unable to coerce to Series, length must be 1: given 4484
Fitting Models on Balanced Samples: 1\100Error: Unable to coerce to Series, length must be 1: given 4484
Fitting Models on Balanced Samples: 1\100Error: Unable to coerce to Series, length must be 1: given 4484

Average Accuracy: nan%

caixiaocherry · 2020-08-11T16:25:44Z

Unable to coerce to Series, length must be 1

Could you print out y and preds to check the shape? It seems the broadcast failed.

PeterOrmosi · 2020-08-20T13:17:24Z

@diogoalvesderesende , you only need to comment out the following return statement:
return (y == preds).sum() * 1.0 / len(y)
to following return statement:
return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

This shall solve the problem.

this worked for me fine, thanks

HongruZhai · 2020-09-21T03:07:05Z

@caixiaocherry 's solution works like a charm. Thanks!

Fixed the error from this issue benmiroglio#23.

RishabhArora90 · 2020-10-02T21:01:58Z

I am facing the same issue. Can anyone please explain how to change in the source code?

caixiaocherry · 2020-10-06T23:21:48Z

@RishabhArora90 , i think this bug had been patched, so you might only need to re install the package?

Edited based on the issue: benmiroglio#23

ziadzee · 2021-08-09T12:50:44Z

Hello,

I am currently having this issue although I have added @caixiaocherry solution to the source code:

    @staticmethod
    def _scores_to_accuracy(m, X, y):
        preds = [1.0 if i >= .5 else 0.0 for i in m.predict(X)]
        # return (y == preds).sum() * 1.0 / len(y)
        return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

Am I missing something? Do I need to downgrade pandas?

Thanks

adriennekline · 2021-09-03T20:06:22Z

I made the recommended change but now I am getting an accuracy of 219400.0%. The error was stated as 'Static column dropped: resultError: Perfect separation detected, results not available.

CarlosDullius · 2021-09-15T14:46:04Z

I found the same problem! I fixed it with
return (y.values == preds).sum() * 1.0 / len(y)

It worked for me. I have installed the package today

medcharleslaidi · 2022-02-07T03:04:29Z

I installed pymatch and I also getting this error. I understand that I need to change the pymatch code. I downloaded the package using pip install pymatch. Could someone point me a tutorial to help me change the code and use this function ? Thank you very much

CarlosDullius · 2022-02-07T20:57:47Z

Go on
C:\Users{YOURUSER}\AppData\Local\Programs\Python\Python36\Lib\site-packages\pymatch
or if you use anaconda
C:\Users{YOURUSER}\Anaconda3\Lib\site-packages\pymatch
And edit into Matcher.py

return (y == preds).sum() * 1.0 / len(y)
to following statement:
return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

PS: I think linux are the same .../Python36/Lib/site-packages/pymatch

kelleyjbrady · 2022-04-01T21:24:29Z

I am encountering this error in April 2022, but I can see that Matcher.py has already been fixed in lines: 523-526 to:

@staticmethod
  def _scores_to_accuracy(m, X, y):
      preds = [[1.0 if i >= .5 else 0.0 for i in m.predict(X)]]
      return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

Is there another solution to this bug?

philffm · 2022-04-06T23:48:58Z

@kelleyjbrady same error here - also in April 2022 🤷🏽‍♂️

CarlosDullius · 2022-04-07T11:56:26Z

@kelleyjbrady and @philffm follow my tutorial above, it will solve your problem I am sure.
They din't fixed it in

master/build/lib/pymatch/Matcher.py
@kelleyjbrady you probably are looking to

master/pymatch/Matcher.py

But when you install it with PIP, then you will get from the build folder.

kelleyjbrady · 2022-04-15T21:59:22Z

Thanks @CarlosDullius I will check it out. I ended up using a propensity matching package that is being presented in July at EMBC 2022, the author has written a blog post on Medium and uploaded to PyPi, but the author (@adriennekline) hasn’t updated the github page yet. I think anyone who has followed this thread all the way down here will be able to figure out how to use the package despite current lack of extensive documentation (the PyPi page has a 'quick start'). It was pretty easy to get it up and running on a simple age+sex match I was working on. @philffm may be interested.

Kwakyejin · 2022-12-18T14:23:48Z

@staticmethod
def _scores_to_accuracy(m, X, y):
    preds = [1.0 if i >= .5 else 0.0 for i in m.predict(X)]
    # return (y == preds).sum() * 1.0 / len(y)
    return (y.to_numpy().T == preds).sum() * 1.0 / len(y)

Ayeshasaeedhaq · 2023-01-13T21:52:32Z

Has anyone updated the code in the package? or has anyone created a clone with corrected code? I am not sure if I am able to correct the code at my end. Because now I am getting the following error

'bool' object has no attribute 'sum'

SinKasula mentioned this issue Apr 30, 2020

ValueError: Unable to coerce to Series, length must be 1: given 4000 #34

Open

antgonza mentioned this issue May 23, 2020

Error in Example.ipynb "ValueError: negative dimensions are not allowed" #38

Open

tklimonova added a commit to hyper-island-data-analyst/pymatch that referenced this issue Sep 21, 2020

Update Matcher.py

3b03d7a

Fixed the error from this issue benmiroglio#23.

caixiaocherry closed this as completed Oct 6, 2020

beespinosa referenced this issue in beespinosa/pymatch Oct 27, 2020

🐛 fix error of mis matching size

70897dd

beespinosa mentioned this issue Oct 27, 2020

🐛 fix error of mis matching size #43

Merged

batra-akshita added a commit to batra-akshita/pymatch that referenced this issue Aug 3, 2021

Updated for PSM issues/23

ac3f713

Edited based on the issue: benmiroglio#23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

error in method _scores_to_accuracy() of Matcher.py #23

error in method _scores_to_accuracy() of Matcher.py #23

caixiaocherry commented Oct 3, 2019

xiaolinzhuo commented Oct 3, 2019

swyoon commented Oct 6, 2019 •

edited

tszumowski commented Nov 15, 2019

tszumowski commented Nov 15, 2019

diogoalvesderesende commented May 7, 2020

caixiaocherry commented May 7, 2020

w2998 commented May 16, 2020

caixiaocherry commented May 16, 2020 via email

w2998 commented May 16, 2020

brendachang12 commented Jul 31, 2020

caixiaocherry commented Aug 11, 2020

PeterOrmosi commented Aug 20, 2020

HongruZhai commented Sep 21, 2020

RishabhArora90 commented Oct 2, 2020

caixiaocherry commented Oct 6, 2020

ziadzee commented Aug 9, 2021

adriennekline commented Sep 3, 2021 •

edited

CarlosDullius commented Sep 15, 2021

medcharleslaidi commented Feb 7, 2022

CarlosDullius commented Feb 7, 2022 •

edited

kelleyjbrady commented Apr 1, 2022

philffm commented Apr 6, 2022

CarlosDullius commented Apr 7, 2022

kelleyjbrady commented Apr 15, 2022 •

edited

Kwakyejin commented Dec 18, 2022

Ayeshasaeedhaq commented Jan 13, 2023 •

edited

error in method _scores_to_accuracy() of Matcher.py #23

error in method _scores_to_accuracy() of Matcher.py #23

Comments

caixiaocherry commented Oct 3, 2019

xiaolinzhuo commented Oct 3, 2019

swyoon commented Oct 6, 2019 • edited

tszumowski commented Nov 15, 2019

tszumowski commented Nov 15, 2019

diogoalvesderesende commented May 7, 2020

caixiaocherry commented May 7, 2020

w2998 commented May 16, 2020

caixiaocherry commented May 16, 2020 via email

w2998 commented May 16, 2020

brendachang12 commented Jul 31, 2020

caixiaocherry commented Aug 11, 2020

PeterOrmosi commented Aug 20, 2020

HongruZhai commented Sep 21, 2020

RishabhArora90 commented Oct 2, 2020

caixiaocherry commented Oct 6, 2020

ziadzee commented Aug 9, 2021

adriennekline commented Sep 3, 2021 • edited

CarlosDullius commented Sep 15, 2021

medcharleslaidi commented Feb 7, 2022

CarlosDullius commented Feb 7, 2022 • edited

kelleyjbrady commented Apr 1, 2022

philffm commented Apr 6, 2022

CarlosDullius commented Apr 7, 2022

kelleyjbrady commented Apr 15, 2022 • edited

Kwakyejin commented Dec 18, 2022

Ayeshasaeedhaq commented Jan 13, 2023 • edited

swyoon commented Oct 6, 2019 •

edited

adriennekline commented Sep 3, 2021 •

edited

CarlosDullius commented Feb 7, 2022 •

edited

kelleyjbrady commented Apr 15, 2022 •

edited

Ayeshasaeedhaq commented Jan 13, 2023 •

edited