# Find Comment IDs from Submission IDs
This notebook shows how to use the `search_submission_comment_ids` method from `PMAW` to retrieve all Reddit comment IDs for a list of submission IDs. Details about the underlying endpoint are in the Pushshift [documentation](https://github.com/pushshift/api#get-all-comment-ids-for-a-particular-submission).
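
For orientation, the request behind each lookup can be sketched roughly as follows. This assumes the URL pattern given in the linked Pushshift documentation (`/reddit/submission/comment_ids/{id}`). `comment_ids_url` is a hypothetical helper, not part of PMAW, and the snippet only builds the URL without sending a request.

```python
# Hypothetical helper: builds the endpoint URL for one submission ID.
# Assumes the URL pattern from the Pushshift README; no request is sent.
BASE_URL = "https://api.pushshift.io/reddit/submission/comment_ids"


def comment_ids_url(submission_id: str) -> str:
    """Return the comment-IDs endpoint URL for a base-36 submission ID."""
    return f"{BASE_URL}/{submission_id}"


print(comment_ids_url("kxi2w8"))
```

PMAW issues these requests itself and handles batching and retries, so a helper like this is only useful for seeing what a single lookup refers to.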

In [1]:
import pandas as pd
from pmaw import PushshiftAPI

In [2]:
# instantiate
api = PushshiftAPI()

## Data Preparation

In [3]:
# import the semicolon-delimited test data into a dataframe
posts_df = pd.read_csv('./test_data.csv', delimiter=';', header=0)
posts_df.head(5)

Unnamed: 0,all_awardings,allow_live_comments,author,author_flair_css_class,author_flair_richtext,author_flair_text,author_flair_type,author_fullname,author_patreon_flair,author_premium,...,author_cakeday,distinguished,suggested_sort,crosspost_parent,crosspost_parent_list,category,top_awarded_type,poll_data,steward_reports,comment_ids
0,[],False,nf_hades,,[],,text,t2_hriq1b,False,False,...,,,,,,,,,,"gjacwx5,gjad2l6,gjadatw,gjadc7w,gjadcwh,gjadgd..."
1,[],False,MyLittleDeku,,[],,text,t2_7dj62vj2,False,False,...,,,,,,,,,,gjacn1r
2,[],False,lilirucaarde12,,[],,text,t2_6i04uaxw,False,False,...,,,,,,,,,,"gjac5fb,gjacdy5,gjaco45,gjasj4f,gjbxfeg"
3,[],False,[deleted],,,,,,,,...,,,,,,,,,,gjac9d6
4,[],False,sirdimpleton,,[],,text,t2_bznmn4i,False,False,...,,,,,,,,,,"gjaocmg,gjb2jsj,gjbisrw,gjbjbk8"


In [4]:
len(posts_df)

2500

The data in `posts_df` contains 2500 submissions and their metadata, extracted from a subreddit submission search. The `comment_ids` column was added after the search with additional requests.
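
In the preview, the `comment_ids` column holds each submission's comment IDs as a single comma-separated string. A minimal sketch of splitting such a cell into a list, assuming commas are the only separator and that submissions without comments show up as empty or missing values (`split_comment_ids` is a hypothetical helper, and the sample strings are copied from the preview):

```python
def split_comment_ids(cell):
    """Split one comma-separated `comment_ids` cell into a list of IDs."""
    # Missing values (None, or NaN as pandas reads empty cells) are not strings.
    if not isinstance(cell, str):
        return []
    # Drop surrounding whitespace and skip empty fragments.
    return [part.strip() for part in cell.split(",") if part.strip()]


print(split_comment_ids("gjac5fb,gjacdy5,gjaco45,gjasj4f,gjbxfeg"))
print(split_comment_ids("gjacn1r"))
print(split_comment_ids(float("nan")))
```

With pandas, the same function could be applied to the whole column via `posts_df['comment_ids'].apply(split_comment_ids)`.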

In [5]:
# create submission ID list
post_ids = list(posts_df.loc[:, 'id'])
post_ids[:3]

['kxi2w8', 'kxi2g1', 'kxhzrl']

## Comment IDs for a Single Submission

In [6]:
comment = api.search_submission_comment_ids(ids=post_ids[0])

Total:: Success Rate: 100.00% - Requests: 1 - Batches: 1 - Items Remaining: 0


## Comment IDs for Multiple Submissions

In [7]:
%%time
comment_ids = api.search_submission_comment_ids(ids=post_ids)

Checkpoint:: Success Rate: 89.00% - Requests: 100 - Batches: 10 - Items Remaining: 1405
Total:: Success Rate: 83.57% - Requests: 140 - Batches: 14 - Items Remaining: 0
Wall time: 2min 17s


### Save Comment IDs

Because the `search_submission_comment_ids` method returns a `Response` object, which is a generator, we need to store the comment IDs in a list before we start working with them.

In [8]:
comment_id_list = [_id for _id in comment_ids]
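
The reason for copying into a list can be illustrated with a plain Python generator, used here as a stand-in for the `Response` object. `fake_response` is hypothetical and not part of PMAW, and whether `Response` behaves exactly like this is an assumption. The point is only that a generator yields nothing on a second pass once it has been iterated.

```python
def fake_response():
    """Hypothetical stand-in for a generator of comment IDs."""
    yield from ["gjac5fb", "gjacdy5", "gjaco45"]


gen = fake_response()
first_pass = list(gen)   # consumes the generator
second_pass = list(gen)  # the generator is already exhausted

print(len(first_pass), len(second_pass))
```

Storing the IDs in a list once avoids this pitfall and also makes `len()` and indexing available.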

In [10]:
len(comment_id_list)

2500