Add OFV market resolver #225

kongzii · 2024-05-07T06:56:36Z

Proposed changes

Implements a new tool that can be used for market resolution.

Types of changes

What types of changes does your code introduce? (A breaking change is a fix or feature that would cause existing functionality and APIs to not work as expected.)
Put an x in the box that applies

Non-breaking fix (non-breaking change which fixes an issue)
Breaking fix (breaking change which fixes an issue)
Non-breaking feature (non-breaking change which adds functionality)
Breaking feature (breaking change which adds functionality)
Refactor (non-breaking change which changes implementation)
Messy (mixture of the above - requires explanation!)

Checklist

Put an x in the boxes that apply.

I have read the CONTRIBUTING doc
I am making a pull request against the main branch (left side). Also you should start your branch off our main.
Lint and unit tests pass locally with my changes
- mypy found 54 errors, but in files that aren't created or modified by me
I have added tests that prove my fix is effective or that my feature works

Further comments

I used a customised fork of OpenFactVerifier (a PR is currently open against their main branch) to create a new market resolver.

The steps are:

1. Verify that the given market is reasonable to resolve using the is_predictable_binary function.
2. Rewrite the market's question into a statement. For example, "Will former Trump Organization CFO Allen Weisselberg be sentenced to jail by 15 April 2024?" would be rewritten to "Former Trump Organization CFO Allen Weisselberg was sentenced to jail by 15 April 2024."
3. Use OpenFactVerifier to verify the given statement. The result is either True or False, or None if there is some kind of problem.
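
The three steps above can be sketched as a small pipeline. This is only an illustration of the control flow, not the code in this PR: resolve_market and its three callable parameters are hypothetical names, and the lambdas are canned stand-ins for is_predictable_binary, the question rewriting and the OpenFactVerifier call, so the sketch runs without any LLM or network access.

```python
from typing import Callable, Optional


def resolve_market(
    question: str,
    is_predictable: Callable[[str], bool],
    to_statement: Callable[[str], str],
    verify: Callable[[str], Optional[bool]],
) -> Optional[bool]:
    """Return True/False for a resolved market, or None if it cannot be resolved."""
    # Step 1: skip markets that are not reasonable to resolve.
    if not is_predictable(question):
        return None
    # Step 2: turn the question into a declarative statement.
    statement = to_statement(question)
    # Step 3: fact-check the statement; None signals a verification problem.
    return verify(statement)


QUESTION = (
    "Will former Trump Organization CFO Allen Weisselberg "
    "be sentenced to jail by 15 April 2024?"
)
STATEMENT = (
    "Former Trump Organization CFO Allen Weisselberg "
    "was sentenced to jail by 15 April 2024."
)

# Hypothetical stand-ins: each lambda returns a canned answer, not a real check.
result = resolve_market(
    QUESTION,
    is_predictable=lambda q: q.endswith("?"),
    to_statement=lambda q: STATEMENT,
    verify=lambda s: True if s == STATEMENT else None,
)
print(result)
```

Passing the three steps in as callables keeps the flow testable on its own; in the real tool each step would presumably call an LLM or a search backend.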

I also implemented a benchmark where I compare (1) current resolution visible on Omen, (2) resolution obtained by packages.napthaai.customs.resolve_market_reasoning.resolve_market_reasoning by running it today and (3) resolution by OFV. The results are:

Current accuracy: 73.47%
Original's new run accuracy: 57.14%
OFV's accuracy: 91.84%

However, I don't understand why (2) is so low. I expected it to be at least a little better than (1), because new information is available on the internet. If you can point out a bug in my implementation, I'll re-run the benchmark.

The benchmark is run over 50 markets that I resolved manually and the full results are available here.
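
For reference, the accuracy figures above can be computed along these lines. This is a minimal sketch, not the benchmark script from this PR: the accuracy helper is a hypothetical name, and counting a None prediction as a miss is an assumption. The last loop is only an arithmetic observation: the three reported percentages match 36, 28 and 45 correct answers out of 49 scored markets, a denominator that is inferred from the numbers and not stated anywhere in the PR.

```python
from typing import Optional, Sequence


def accuracy(predictions: Sequence[Optional[bool]], truth: Sequence[bool]) -> float:
    """Share of markets where the prediction equals the manual resolution.

    Assumption: a None prediction (the resolver gave up) counts as a miss.
    """
    if not truth or len(predictions) != len(truth):
        raise ValueError("predictions and truth must be non-empty and equally long")
    hits = sum(p is not None and p == t for p, t in zip(predictions, truth))
    return hits / len(truth)


# Toy data, not taken from the benchmark: 2 of 4 predictions are correct.
manual = [True, False, True, True]
predicted = [True, False, None, False]
print(f"{accuracy(predicted, manual):.2%}")

# Inferred counts (see the lead-in): k correct answers out of 49 markets.
for label, hits in [("Current", 36), ("Original's new run", 28), ("OFV", 45)]:
    print(f"{label}: {hits / 49:.2%}")
```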

I will pick just the ones where OFV made mistakes:

Will Apple introduce spatial computing in Mac's screen by 14th April 2024?
- This is questionable: should Vision Pro count? I guess nobody really expected MacBooks to get spatial computing by themselves by April 2024, but Vision Pro does allow interacting with Macs.
Will McDonald's successfully buy back all its Israeli restaurants by 12 April 2024?
- I found articles saying that it will happen, but none saying that it already happened. I found an article from April 5 saying they signed the deal, but that shouldn't count, as the market was created on April 8.
Will Samsung release the Galaxy Z Fold 6 Ultra on or before 11 April 2024?
Will Disney Plus implement its password-sharing crackdown by 11 April 2024?
- It's questionable whether the announcement should make this YES, or the fact that it will only be used from June should make it NO.

As we can see, 3 out of 4 mistakes are questionable.

kongzii · 2024-05-07T07:18:24Z

pyproject.toml

-py-multicodec = "==0.2.1"
-grpcio = "==1.53.0"
+python = ">=3.10,<3.12"
+open-autonomy = { git = "https://github.com/kongzii/open-autonomy.git", rev = "13344d6551222224492024623cd10aa79ad1a13e" }


What's the plan for the dependency updates in related repositories, please?

And I know Evan was working on some dependency-related stuff as well; I'll get in touch with him about the current status.

0xArdi · 2024-05-07

What's the issue you're facing? Did you have to make adjustments to open-autonomy? If so, feel free to make a PR on the main open-autonomy repo, we can take a look.

kongzii · 2024-05-07

Yes, a PR for open-autonomy is open: valory-xyz/open-autonomy#2223
And one for tomte too: valory-xyz/tomte#27

The issue we are facing is that with the current strict versioning, it's almost impossible to use it with other libraries.

0xArdi · 2024-07-05

If we were to merge this, it means that we need to keep using your fork until we reflect your changes on our framework.

kongzii · 2024-07-05

Then maybe you could merge and release changes in these two PRs so I can use them properly here?

kongzii · 2024-05-08T09:18:44Z

@richardblythman Could you please explain why this pattern of doing def run(**kwargs) is necessary? I saw it in other tools as well but I didn't understand, because from the caller's perspective, having

def run(**kwargs): ...

or having

def run(
  prompt: str,
  api_keys: dict[str, str],
): ...

is called in the same way by doing run(prompt=..., api_keys=...). But the second way makes it clearer which arguments the function expects and what their types are.

richardblythman · 2024-05-08T09:22:22Z

def run(
  prompt: str,
  api_keys: dict[str, str],
): ...

@kongzii I think this would have worked, but it's not what you had originally.

kongzii · 2024-05-08T09:26:03Z

I think this would have worked, but it's not what you had originally.

@richardblythman Yeah, I see the changes in arguments; it's clear to me that the tool needs the same interface as other tools to make it simpler to call from agents/mech. I just didn't understand why the **kwargs.

richardblythman · 2024-05-08T09:30:07Z

@kongzii I just copied and pasted. Not sure if the team at Valory prefer one over the other.
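
To make the trade-off in this thread concrete, here is a small sketch comparing the two styles, plus a third that combines them. The names run_kwargs, run_explicit and run_both are hypothetical and none of them is the mech's actual tool interface; the extra "tool" key is likewise made up for illustration. The one behavioural difference is standard Python: a function with only named parameters raises TypeError when the caller passes a keyword it does not declare, while **kwargs accepts it.

```python
import inspect
from typing import Any


def run_kwargs(**kwargs: Any) -> str:
    # Arguments are looked up by name at runtime; a missing key fails here.
    return f"{kwargs['prompt']} ({len(kwargs['api_keys'])} keys)"


def run_explicit(prompt: str, api_keys: dict[str, str]) -> str:
    # The signature itself documents the expected arguments and their types.
    return f"{prompt} ({len(api_keys)} keys)"


def run_both(prompt: str, api_keys: dict[str, str], **kwargs: Any) -> str:
    # Named parameters stay documented; undeclared extra keys are ignored.
    return f"{prompt} ({len(api_keys)} keys)"


call = {"prompt": "Will it rain?", "api_keys": {"openai": "dummy"}}
# All three are called the same way and return the same result.
assert run_kwargs(**call) == run_explicit(**call) == run_both(**call)

# Only the explicit signature exposes the argument names to tooling.
print(list(inspect.signature(run_kwargs).parameters))
print(list(inspect.signature(run_explicit).parameters))

# A caller that passes one extra key breaks the explicit-only version.
extra = {**call, "tool": "ofv"}
try:
    run_explicit(**extra)
except TypeError as err:
    print(f"run_explicit rejected the extra key: {err}")
print(run_both(**extra))
```

If callers may pass keys that a given tool does not use, the run_both form keeps the typed parameters while still tolerating the extras.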

# Conflicts:
#   packages/packages.json
#   packages/valory/agents/mech/aea-config.yaml
#   packages/valory/customs/prediction_request/component.yaml
#   packages/valory/services/mech/service.yaml
#   packages/valory/skills/task_execution/skill.yaml

0xArdi and others added 3 commits April 24, 2024 12:54

feat: add keychain

4791548

Make tomte a dev dependency

0cf7a8a

Add OFV market resolver

2cff00a

kongzii had a problem deploying to ci May 7, 2024 08:07 — with GitHub Actions Failure

richardblythman had a problem deploying to ci May 8, 2024 08:38 — with GitHub Actions Failure

fix run() interface

d6e8353

angrybayblade and others added 12 commits May 10, 2024 21:14

chore: remove redundant package

0b4694f

gemini request tool

a3b87ab

cleaned up testing prompts

9011c96

Fix for run signature changes

36f2386

fix types

3237dcf

fix return value

4789d18

fix wrong version of secretstr

f7d6255

another fixes

5d01be1

eval each question 3x

5db1b4d

remove old comment

3473cb2

ipfs updates

81e8e57

add kwargs

010f2e9

kongzii mentioned this pull request May 17, 2024

Experiment with OpenFactVerification gnosis/prediction-market-agent#74

Closed

kongzii had a problem deploying to ci May 17, 2024 11:08 — with GitHub Actions Failure

victorpolisetty and others added 5 commits May 17, 2024 09:54

update error handling for response.text

d1a3d59

feat: update tools to use KeyChain

eecbf34

feat: add max retries

3a08009

fix: prediction_request_rag tool resposne

9715316

kongzii added 6 commits July 9, 2024 07:47

fix mack

3498fe7

fix gitleaks ignore

efb8709

trying to fix tools

9c64692

revert cpu only torch

1be3d41

relock

760d126

try to fix tox

f2481ad

kongzii had a problem deploying to ci July 9, 2024 08:18 — with GitHub Actions Failure

maybe maybe!

3e5f5b7

kongzii force-pushed the peter/ofv-resolver branch from 07ce9a2 to 3e5f5b7 Compare July 9, 2024 08:29

fix aeaconfig

2facdcc

kongzii had a problem deploying to ci July 9, 2024 08:52 — with GitHub Actions Failure

kongzii added 2 commits July 9, 2024 10:57

update locks

4f7fd61

Move OFV to gnosis folder instead of kongzii

3a5678a

kongzii had a problem deploying to ci July 9, 2024 09:33 — with GitHub Actions Failure

add graphapi keyy

df9df8e

kongzii force-pushed the peter/ofv-resolver branch from 4bd1df3 to df9df8e Compare July 9, 2024 12:20

kongzii added 4 commits July 9, 2024 14:40

black

4ba88c0

remove sentene transformers

cab249d

never enough of fixes!

4f78c3d

lock

5db5ec0

kongzii had a problem deploying to ci July 9, 2024 17:38 — with GitHub Actions Failure

0xArdi mentioned this pull request Aug 7, 2024

feat: add ofv_market_resolver tool #245

Closed

10 tasks

0xArdi changed the base branch from main to feat/ofv-2 August 7, 2024 15:00

0xArdi changed the base branch from feat/ofv-2 to main August 7, 2024 15:02

0xArdi changed the base branch from main to feat/ofv-2 August 7, 2024 15:03

0xArdi force-pushed the feat/ofv-2 branch from b306bb2 to 82036d0 Compare August 7, 2024 15:04

0xArdi merged commit 678a47f into valory-xyz:feat/ofv-2 Aug 7, 2024
5 of 7 checks passed

This was referenced Aug 7, 2024

Feat/ofv 2 #246

Closed

Feat/ofv merged #247

Merged

evangriffiths mentioned this pull request Aug 23, 2024

Use the Open Fact Verifier mech to resolve markets valory-xyz/market-creator#114

Closed
