<h1>Arxiv</h1>
<h3>Document: https://lukasschwab.me/arxiv.py/arxiv.html``</h3>

In [14]:
import arxiv
import pandas as pd

# Construct the default API client.
client = arxiv.Client()

# Search for the 100 most recent articles matching the keyword "quantum."
search = arxiv.Search(
  query = "thailand",
  max_results = 100,
  sort_by = arxiv.SortCriterion.SubmittedDate
)

results = client.results(search)
author=[]
abstract=[]
title=[]
# `results` is a generator; you can iterate over its elements one by one...
for r in client.results(search):
  title.append(r.title)
  abstract.append(r.summary)
  author.append(r.authors)

df=pd.DataFrame({'title':title,'abstract':abstract,'author':author})
df.head()


INFO:arxiv:Requesting page (first: True, try: 0): https://export.arxiv.org/api/query?search_query=thailand&id_list=&sortBy=submittedDate&sortOrder=descending&start=0&max_results=100
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): export.arxiv.org:443
DEBUG:urllib3.connectionpool:https://export.arxiv.org:443 "GET /api/query?search_query=thailand&id_list=&sortBy=submittedDate&sortOrder=descending&start=0&max_results=100 HTTP/11" 200 None
INFO:arxiv:Got first page: 100 of 273 total results


Unnamed: 0,title,abstract,author
0,Thai Financial Domain Adaptation of THaLLE -- ...,Large Language Models (LLMs) excel in general ...,"[KBTG Labs, Atthakorn Petchsod, Pornchanan Bal..."
1,TomoATT: An open-source package for Eikonal eq...,"TomoATT is an open-source software package, ai...","[Jing Chen, Masaru Nagaso, Mijian Xu, Ping Tong]"
2,One-Stage-TFS: Thai One-Stage Fingerspelling D...,The Thai One-Stage Fingerspelling (One-Stage-T...,"[Siriwiwat Lata, Sirawan Phiphitphatphaisit, E..."
3,EcoCropsAID: Economic Crops Aerial Image Datas...,The EcoCropsAID dataset is a comprehensive col...,"[Sangdaow Noppitak, Emmanuel Okafor, Olarik Su..."
4,AIWR: Aerial Image Water Resource Dataset for ...,Effective water resource management is crucial...,"[Sangdaow Noppitak, Emmanuel Okafor, Olarik Su..."


In [2]:
# ...or exhaust it into a list. Careful: this is slow for large results sets.
all_results = list(results)
print([r.title for r in all_results])

['Thai Financial Domain Adaptation of THaLLE -- Technical Report', 'TomoATT: An open-source package for Eikonal equation-based adjoint-state traveltime tomography for seismic velocity and azimuthal anisotropy', 'One-Stage-TFS: Thai One-Stage Fingerspelling Dataset for Fingerspelling Recognition Frameworks', 'EcoCropsAID: Economic Crops Aerial Image Dataset for Land Use Classification', 'AIWR: Aerial Image Water Resource Dataset for Segmentation Analysis', 'IoT-Based Water Quality Monitoring System in Philippine Off-Grid Communities', 'Design and Feasibility of a Community Motorcycle Ambulance System in the Philippines', 'Exploring 5G Network Performance: Comparison of Inner and Outer City Areas in Phetchaburi Province', 'Integrating PETs into Software Applications: A Game-Based Learning Approach', 'Study of the $μ$ Cet Binary with Speckle Interferometric, Photometric and Spectroscopic Techniques']


In [3]:
# For advanced query syntax documentation, see the arXiv API User Manual:
# https://arxiv.org/help/api/user-manual#query_details
search = arxiv.Search(query = "au:del_maestro AND ti:checkerboard")
first_result = next(client.results(search))
print(first_result)

http://arxiv.org/abs/cond-mat/0603029v1


In [4]:
# Search for the paper with ID "1605.08386v1"
search_by_id = arxiv.Search(id_list=["1605.08386v1"])
# Reuse client to fetch the paper, then print its title.
first_result = next(client.results(search))
print(first_result.title)

From stripe to checkerboard order on the square lattice in the presence of quenched disorder


<h2>Fetching results with a custom client</h2>

In [None]:
big_slow_client = arxiv.Client(
  page_size = 1000,
  delay_seconds = 10.0,
  num_retries = 5
)

# Prints 1000 titles before needing to make another request.
for result in big_slow_client.results(arxiv.Search(query="quantum")):
  print(result.title)

<h2>Logging</h2>

In [6]:
import logging
logging.basicConfig(level=logging.DEBUG)
client = arxiv.Client()
paper = next(client.results(arxiv.Search(id_list=["1605.08386v1"])))

INFO:arxiv:Requesting page (first: True, try: 0): https://export.arxiv.org/api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): export.arxiv.org:443
DEBUG:urllib3.connectionpool:https://export.arxiv.org:443 "GET /api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100 HTTP/11" 200 978
INFO:arxiv:Got first page: 1 of 1 total results
