Add: learning performance-improving code edits 🥧 #65

SwayamInSync · 2023-04-23T18:19:03Z

PR to add LEARNING PERFORMANCE-IMPROVING CODE EDITS with PIE dataset, few shot evaluations for program performance improvement

Muennighoff · 2023-04-23T18:22:24Z

Nice! Some comments:

Remove .pyc .DS_Store and other files from the PR that are not important
Can't we upload all those .txt files to the hub? / Do we even need them? This PR adds way too many files imo
Can you share any results you got?

SwayamInSync · 2023-04-23T18:24:35Z

@Muennighoff Those are test cases and are needed in order to evaluate the correctness of the generated program.

Muennighoff · 2023-04-23T18:37:24Z

@Muennighoff Those are test cases and are needed in order to evaluate the correctness of the generated program.

I think they should be uploaded to a dataset on the HF Hub that is then loaded like it's done for the other eval tasks

SwayamInSync · 2023-04-24T05:15:00Z

Nice! Some comments:

Remove .pyc .DS_Store and other files from the PR that are not important

Can't we upload all those .txt files to the hub? / Do we even need them? This PR adds way too many files imo

Can you share any results you got?

@Muennighoff

.pyc , DS_Store and other irrelevant files are removed
test cases are pushed to hub and integrated into the code without creating any variance during runtime evaluation

Author evaluated on Python and C++, but for now, we are only evaluating on Python, since C++ data was not available. I am creating the dataset for C++ too, as its done will push into the hub

Muennighoff · 2023-04-24T06:15:42Z

lm_eval/tasks/pie_perf.py

+        cmd = "git clone https://huggingface.co/datasets/rootacess/pie-perf-testcases lm_eval/tasks/custom_metrics/pie_perf_metric/public_test_cases"
+        process = subprocess.Popen(cmd.split(), stdout=subprocess.PIPE)
+        output, error = process.communicate()
+        logging.error(f'An error occurred: {error}')
+
+        # running evaluations
+        res = compute(generations, references, dataset=self.get_dataset()[:limit])
+
+        # cleaning up
+        cmd = "rm -rf lm_eval/tasks/custom_metrics/pie_perf_metric/public_test_cases"
+        process = subprocess.Popen(cmd.split(), stdout=subprocess.PIPE)
+        output, error = process.communicate()
+        logging.error(error)


Can't we load the test cases with HF datasets?
If not at least should check that the path doesn't already exist I think

Due to the invariable numbers of test cases per type of problem, it is not suitable to convert it into Dataset format. I can add the condition to check if path exists

SwayamInSync added 2 commits April 23, 2023 23:35

adding pie-perf

f824010

pie-perf: fixed dataset loading

36843cb

SwayamInSync added 2 commits April 24, 2023 10:40

added --limit to the task and pushed testcases to hub

026998c

Remove .DS_Store and .pyc files

a509a37

Muennighoff reviewed Apr 24, 2023

View reviewed changes

Muennighoff requested a review from loubnabnl April 24, 2023 06:17

check if cloning repo path already exists

bc7c27e

arjunguha changed the title ~~Add: LEARNING PERFORMANCE-IMPROVING CODE EDITS 🥧~~ Add: learning performance-improving code edits 🥧 May 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add: learning performance-improving code edits 🥧 #65

Add: learning performance-improving code edits 🥧 #65

SwayamInSync commented Apr 23, 2023

Muennighoff commented Apr 23, 2023

SwayamInSync commented Apr 23, 2023

Muennighoff commented Apr 23, 2023

SwayamInSync commented Apr 24, 2023 •

edited

Loading

Muennighoff Apr 24, 2023

SwayamInSync Apr 24, 2023

Add: learning performance-improving code edits 🥧 #65

Are you sure you want to change the base?

Add: learning performance-improving code edits 🥧 #65

Conversation

SwayamInSync commented Apr 23, 2023

Muennighoff commented Apr 23, 2023

SwayamInSync commented Apr 23, 2023

Muennighoff commented Apr 23, 2023

SwayamInSync commented Apr 24, 2023 • edited Loading

Muennighoff Apr 24, 2023

Choose a reason for hiding this comment

SwayamInSync Apr 24, 2023

Choose a reason for hiding this comment

SwayamInSync commented Apr 24, 2023 •

edited

Loading