## Getting to know requests

### First request

In [130]:
import requests
import os
from dotenv import load_dotenv

In [131]:
r = requests.get('https://api.github.com/events')

In [132]:
r

<Response [200]>

### Exploring the library

In [133]:
r.status_code

200

In [134]:
r.url

'https://api.github.com/events'

In [135]:
r.text



In [136]:
r.json()

[{'id': '46467489089',
  'type': 'PushEvent',
  'actor': {'id': 77738923,
   'login': 'atyrell3',
   'display_login': 'atyrell3',
   'gravatar_id': '',
   'url': 'https://api.github.com/users/atyrell3',
   'avatar_url': 'https://avatars.githubusercontent.com/u/77738923?'},
  'repo': {'id': 708950142,
   'name': 'ScottLarge-NOAA/bsb',
   'url': 'https://api.github.com/repos/ScottLarge-NOAA/bsb'},
  'payload': {'repository_id': 708950142,
   'push_id': 22602684104,
   'size': 2,
   'distinct_size': 2,
   'ref': 'refs/heads/main',
   'head': '1d3c3cdce939936ce2d96b0317545f1c001c9d23',
   'before': 'b415832f5161de5e514d4c5bbd3416ced3c1d8c7',
   'commits': [{'sha': 'b82f115909428ee3a0ae37ab3ef0ba19f5e7da9e',
     'author': {'email': 'abigail.tyrell@noaa.gov', 'name': 'atyrell3'},
     'message': 'figure updates',
     'distinct': True,
     'url': 'https://api.github.com/repos/ScottLarge-NOAA/bsb/commits/b82f115909428ee3a0ae37ab3ef0ba19f5e7da9e'},
    {'sha': '1d3c3cdce939936ce2d96b0317545f

Using another endpoint

In [137]:
r = requests.get('https://api.github.com/versions')
r.status_code

200

In [138]:
r.json()

['2022-11-28']

## Extracting data

### Fetching repository data

In [139]:
# specify the API version

headers = {'X-GitHub-Api-Version': '2022-11-28'}

In [140]:
api_base_url = 'https://api.github.com'
owner = 'amzn'  # username whose data will be collected
url = f'{api_base_url}/users/{owner}/repos'

In [141]:
url

'https://api.github.com/users/amzn/repos'

In [142]:
response = requests.get(url, headers=headers)
response.status_code

200

In [143]:
response.json()

[{'id': 171339259,
  'node_id': 'MDEwOlJlcG9zaXRvcnkxNzEzMzkyNTk=',
  'name': '.github',
  'full_name': 'amzn/.github',
  'private': False,
  'owner': {'login': 'amzn',
   'id': 8594673,
   'node_id': 'MDEyOk9yZ2FuaXphdGlvbjg1OTQ2NzM=',
   'avatar_url': 'https://avatars.githubusercontent.com/u/8594673?v=4',
   'gravatar_id': '',
   'url': 'https://api.github.com/users/amzn',
   'html_url': 'https://github.com/amzn',
   'followers_url': 'https://api.github.com/users/amzn/followers',
   'following_url': 'https://api.github.com/users/amzn/following{/other_user}',
   'gists_url': 'https://api.github.com/users/amzn/gists{/gist_id}',
   'starred_url': 'https://api.github.com/users/amzn/starred{/owner}{/repo}',
   'subscriptions_url': 'https://api.github.com/users/amzn/subscriptions',
   'organizations_url': 'https://api.github.com/users/amzn/orgs',
   'repos_url': 'https://api.github.com/users/amzn/repos',
   'events_url': 'https://api.github.com/users/amzn/events{/privacy}',
   'received_ev

In [144]:
len(response.json())

30

### Authentication

Authenticated requests have a higher rate limit. When a user makes an authenticated request, they provide credentials that prove their identity, which lets the API trust them and grant access to additional resources and features.

Most APIs also cap the number of requests a user can make in a given period of time, known as the "rate limit". Authenticated users are generally allowed more requests per period than anonymous ones, because authentication ties each request to a known identity.

In [145]:
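load_dotenv()  # load variables from a local .env file into the environment
access_token = os.getenv("API_TOKEN")  # None if API_TOKEN is not defined
# the concatenation below raises TypeError when the token is missing
headers = {'Authorization': 'Bearer ' + access_token,
           'X-GitHub-Api-Version': '2022-11-28'}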
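
The rate limit described above can be read from the headers GitHub returns with each response. The sketch below assumes the `X-RateLimit-Limit`, `X-RateLimit-Remaining` and `X-RateLimit-Reset` headers are present; the helper name `rate_limit_info` and the sample values are hypothetical and only illustrate the parsing.

```python
from datetime import datetime, timezone


def rate_limit_info(headers):
    """Read GitHub's rate-limit headers from a response's header mapping."""
    limit = int(headers["X-RateLimit-Limit"])
    remaining = int(headers["X-RateLimit-Remaining"])
    # X-RateLimit-Reset is a Unix timestamp in seconds (UTC).
    reset_at = datetime.fromtimestamp(
        int(headers["X-RateLimit-Reset"]), tz=timezone.utc
    )
    return {"limit": limit, "remaining": remaining, "reset_at": reset_at}


# Hypothetical header values, for illustration only.
example_headers = {
    "X-RateLimit-Limit": "60",
    "X-RateLimit-Remaining": "57",
    "X-RateLimit-Reset": "1700000000",
}
info = rate_limit_info(example_headers)
print(info["limit"], info["remaining"], info["reset_at"].isoformat())
```

In the notebook, the same helper could be called as `rate_limit_info(response.headers)` after any request. Comparing the result with and without the `Authorization` header should show the difference in the limit.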

### Paginating the repositories

In [146]:
api_base_url = 'https://api.github.com'
owner = 'amzn'  # username whose data will be collected
url = f'{api_base_url}/users/{owner}/repos'

url

'https://api.github.com/users/amzn/repos'

In [147]:
repos_list = []
for number_page in range(1, 7):
    try:
        url_page = f'{url}?page={number_page}'
        response = requests.get(url_page, headers=headers)
        repos_list.append(response.json())
    except (requests.RequestException, ValueError):
        # network failure or a body that is not valid JSON
        repos_list.append(None)

In [148]:
repos_list

[[{'id': 171339259,
   'node_id': 'MDEwOlJlcG9zaXRvcnkxNzEzMzkyNTk=',
   'name': '.github',
   'full_name': 'amzn/.github',
   'private': False,
   'owner': {'login': 'amzn',
    'id': 8594673,
    'node_id': 'MDEyOk9yZ2FuaXphdGlvbjg1OTQ2NzM=',
    'avatar_url': 'https://avatars.githubusercontent.com/u/8594673?v=4',
    'gravatar_id': '',
    'url': 'https://api.github.com/users/amzn',
    'html_url': 'https://github.com/amzn',
    'followers_url': 'https://api.github.com/users/amzn/followers',
    'following_url': 'https://api.github.com/users/amzn/following{/other_user}',
    'gists_url': 'https://api.github.com/users/amzn/gists{/gist_id}',
    'starred_url': 'https://api.github.com/users/amzn/starred{/owner}{/repo}',
    'subscriptions_url': 'https://api.github.com/users/amzn/subscriptions',
    'organizations_url': 'https://api.github.com/users/amzn/orgs',
    'repos_url': 'https://api.github.com/users/amzn/repos',
    'events_url': 'https://api.github.com/users/amzn/events{/privac

In [149]:
len(repos_list)  # number of repository pages

6

In [150]:
len(repos_list[0])  # repositories on a single page

30
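
The loop above hard-codes six pages. A possible alternative, sketched here under assumptions, is to keep requesting pages until one comes back shorter than the page size. The function `fetch_all_pages` and the stand-in `fake_fetch` are hypothetical names; no network call is made, so the stopping rule can be checked in isolation. Unlike `repos_list`, the result is one flat list rather than a list of pages.

```python
def fetch_all_pages(fetch_page, per_page=100, max_pages=50):
    """Collect items page by page until a page is shorter than per_page."""
    items = []
    for page in range(1, max_pages + 1):
        batch = fetch_page(page, per_page)
        items.extend(batch)
        if len(batch) < per_page:
            # A short (or empty) page means there is nothing left to fetch.
            break
    return items


# Stand-in for the API: 157 made-up repositories served in pages.
fake_repos = [{"name": f"repo-{i}"} for i in range(157)]


def fake_fetch(page, per_page):
    start = (page - 1) * per_page
    return fake_repos[start:start + per_page]


all_repos = fetch_all_pages(fake_fetch, per_page=30)
print(len(all_repos))
```

Against the real API, `fetch_page` could wrap `requests.get(url, headers=headers, params={'page': page, 'per_page': per_page}).json()`. GitHub accepts a `per_page` value of up to 100, which reduces the number of requests needed.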

## Transforming the data

### Repository names

In [151]:
repos_list

[[{'id': 171339259,
   'node_id': 'MDEwOlJlcG9zaXRvcnkxNzEzMzkyNTk=',
   'name': '.github',
   'full_name': 'amzn/.github',
   'private': False,
   'owner': {'login': 'amzn',
    'id': 8594673,
    'node_id': 'MDEyOk9yZ2FuaXphdGlvbjg1OTQ2NzM=',
    'avatar_url': 'https://avatars.githubusercontent.com/u/8594673?v=4',
    'gravatar_id': '',
    'url': 'https://api.github.com/users/amzn',
    'html_url': 'https://github.com/amzn',
    'followers_url': 'https://api.github.com/users/amzn/followers',
    'following_url': 'https://api.github.com/users/amzn/following{/other_user}',
    'gists_url': 'https://api.github.com/users/amzn/gists{/gist_id}',
    'starred_url': 'https://api.github.com/users/amzn/starred{/owner}{/repo}',
    'subscriptions_url': 'https://api.github.com/users/amzn/subscriptions',
    'organizations_url': 'https://api.github.com/users/amzn/orgs',
    'repos_url': 'https://api.github.com/users/amzn/repos',
    'events_url': 'https://api.github.com/users/amzn/events{/privac

In [152]:
repos_list[0][3]['name']

'alexa-coho'

In [153]:
repos_name = []
for page in repos_list:
    for repo in page:
        repos_name.append(repo['name'])

In [154]:
repos_name[10]

'amazon-hub-counter-api-samples'

In [155]:
len(repos_name)

157

### Repository languages

In [156]:
repos_language = []
for page in repos_list:
    for repo in page:
        repos_language.append(repo['language'])

In [157]:
repos_language

[None,
 'Jupyter Notebook',
 'Smarty',
 'JavaScript',
 None,
 'Python',
 'PHP',
 'Java',
 'Python',
 'CSS',
 'Java',
 'Java',
 'PowerShell',
 'Java',
 'C#',
 'PHP',
 'Ruby',
 'JavaScript',
 'Python',
 'PHP',
 'Python',
 'Jupyter Notebook',
 'C#',
 'Java',
 'JavaScript',
 'PHP',
 'Ruby',
 'C#',
 'Java',
 'PHP',
 'Python',
 'Ruby',
 'PHP',
 'Kotlin',
 'PHP',
 'Python',
 'C',
 None,
 'Swift',
 'Python',
 'C++',
 'Python',
 'Go',
 'C',
 'Python',
 'Python',
 'Jupyter Notebook',
 'Python',
 'Python',
 None,
 'Java',
 'Kotlin',
 'Python',
 'Python',
 'TypeScript',
 'TypeScript',
 'Python',
 None,
 'Jupyter Notebook',
 'Python',
 'Python',
 'Python',
 'Java',
 'Jupyter Notebook',
 'Python',
 'Python',
 'Java',
 'Objective-C',
 'JavaScript',
 'TypeScript',
 'Java',
 None,
 'Python',
 'Python',
 'Python',
 'Java',
 'Java',
 'Kotlin',
 'Java',
 'C#',
 'C#',
 'JavaScript',
 'JavaScript',
 'Go',
 'Java',
 'TypeScript',
 'Python',
 'C++',
 None,
 'Python',
 'Python',
 'Java',
 'C#',
 'HTML',
 'Kotl

In [158]:
len(repos_language)

157
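
The two loops above walk `repos_list` twice and assume every page is a list. Since the pagination cell appends `None` when a request fails, a single defensive pass may be safer. The sketch below is one way to do that; `flatten_repos` is a hypothetical helper and `sample_pages` is a small made-up sample shaped like `repos_list`.

```python
def flatten_repos(pages):
    """Turn a list of pages into one list of name/language rows."""
    rows = []
    for page in pages:
        if not isinstance(page, list):
            # Skip failed pages (None) and error payloads (dicts).
            continue
        for repo in page:
            rows.append({
                "repository_name": repo["name"],
                "language": repo.get("language"),
            })
    return rows


# Made-up sample shaped like repos_list, including one failed page.
sample_pages = [
    [{"name": ".github", "language": None},
     {"name": "alexa-coho", "language": "JavaScript"}],
    None,
    [{"name": "zeek-plugin-tds", "language": "Zeek"}],
]
rows = flatten_repos(sample_pages)
print(rows)
```

A list of dictionaries like `rows` can be passed directly to `pd.DataFrame(rows)`, which would give the same two columns built in the next section.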

### Creating a DataFrame

In [159]:
import pandas as pd

In [160]:
dados_amz = pd.DataFrame()
dados_amz['repository_name'] = repos_name
dados_amz['language'] = repos_language

In [None]:
dados_amz

| | repository_name | language |
|---|---|---|
| 0 | .github | |
| 1 | ads-advanced-tools-docs | Jupyter Notebook |
| 2 | ads-pao-amznjs-gtm-template | Smarty |
| 3 | alexa-coho | JavaScript |
| 4 | alexa-skills-kit-js | |
| ... | ... | ... |
| 152 | zeek-plugin-enip | Zeek |
| 153 | zeek-plugin-profinet | Zeek |
| 154 | zeek-plugin-s7comm | Zeek |
| 155 | zeek-plugin-tds | Zeek |


Saving the DataFrame

In [162]:
dados_amz.to_csv('amazon.csv')

## Storing the data

### Creating a repository with POST

### File format

### Uploading a file with PUT