
Sourcery Starbot ⭐ refactored mikeasilva/cgr-work #1

Open
wants to merge 1 commit into
base: master
from

Conversation

sourcery-ai-bot

Thanks for starring sourcery-ai/sourcery ✨ 🌟 ✨

Here's your pull request refactoring your most popular Python repo.

If you want Sourcery to refactor all your Python repos and incoming pull requests, install our bot.

Review changes via command line

To manually merge these changes, make sure you're on the master branch, then run:

git fetch https://github.com/sourcery-ai-bot/cgr-work master
git merge --ff-only FETCH_HEAD
git reset HEAD^

- holes = list()
+ holes = []

Lines 96-96 refactored with the following changes:

  • Replace list() with []
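
A minimal illustration of this change, using hypothetical variable names rather than anything from the repository: the constructor call and the literal produce equal empty lists, and the literal is simply the shorter, more conventional spelling.

```python
# Hypothetical names for illustration; not taken from the repository.
holes_from_call = list()    # constructor call, as in the original line
holes_from_literal = []     # literal form suggested by the refactoring

# Both are empty lists and compare equal, but they are distinct objects.
same_value = holes_from_call == holes_from_literal
same_object = holes_from_call is holes_from_literal
```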

- if n%2 == 0 or (n+1)%2 == 0:
-     return True
- return False
+ return n%2 == 0 or (n+1)%2 == 0

Function validMetaData refactored with the following changes:

  • Simplify conditional into return statement
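
A sketch of the before and after shapes of this refactoring, with hypothetical function names (the real signature of validMetaData is not shown here). Both versions return the same boolean. Note that for integer inputs the condition holds for every value, because either n or n + 1 is even, so the check may not be testing what was intended.

```python
def valid_before(n):
    # Original shape: explicit branches returning True / False.
    if n % 2 == 0 or (n + 1) % 2 == 0:
        return True
    return False


def valid_after(n):
    # Refactored shape: return the boolean expression directly.
    return n % 2 == 0 or (n + 1) % 2 == 0


# The two versions agree across a range of integers.
agree = all(valid_before(n) == valid_after(n) for n in range(-10, 11))
# For integers the condition is always true (n or n + 1 is even).
always_true = all(valid_after(n) for n in range(-10, 11))
```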

- if n%2 == 0 or (n+1)%2 == 0:
-     return True
- return False
+ return n%2 == 0 or (n+1)%2 == 0

Function validMetaData refactored with the following changes:

  • Simplify conditional into return statement

- if n%2 == 0 or (n+1)%2 == 0:
-     return True
- return False
+ return n%2 == 0 or (n+1)%2 == 0

Function validMetaData refactored with the following changes:

  • Simplify conditional into return statement

- file_names = list()
+ file_names = []

Lines 41-91 refactored with the following changes:

  • Replace assignment with augmented assignment
  • Replace list() with []
  • Simplify logical expression

Comment on lines -16 to +18
- return sum([i*j for i,j in zip(a, b)])/(math.sqrt(sum([i*i for i in a]))* math.sqrt(sum([i*i for i in b])))
+ return sum(i * j for i, j in zip(a, b)) / (
+     math.sqrt(sum(i * i for i in a)) * math.sqrt(sum(i * i for i in b))
+ )

Function cosine_similarity refactored with the following changes:

  • Replace unneeded comprehension with generator
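
For reference, a self-contained sketch of the refactored function. The intermediate names `dot`, `norm_a` and `norm_b` are added here for readability and are not in the diff. Passing generator expressions to sum() avoids building temporary lists.

```python
import math


def cosine_similarity(a, b):
    # Generator expressions feed sum() directly, so no temporary lists are built.
    dot = sum(i * j for i, j in zip(a, b))
    norm_a = math.sqrt(sum(i * i for i in a))
    norm_b = math.sqrt(sum(i * i for i in b))
    return dot / (norm_a * norm_b)


parallel = cosine_similarity([1, 2, 3], [2, 4, 6])   # same direction
orthogonal = cosine_similarity([1, 0], [0, 1])       # perpendicular
```

As in the original, a zero vector would raise ZeroDivisionError; this sketch does not guard against that.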

@@ -5,6 +5,7 @@
@author: Michael
"""


Lines 25-74 refactored with the following changes:

  • Use any() instead of for loop
  • Simplify inverted any() and all() calls
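
The changed lines are not shown for this comment, so the following is only a generic sketch of the two refactorings named above, with hypothetical function names and data.

```python
def has_negative_loop(values):
    # Loop form: return as soon as a match is found.
    for v in values:
        if v < 0:
            return True
    return False


def has_negative_any(values):
    # any() short-circuits on the first truthy item, like the loop above.
    return any(v < 0 for v in values)


def all_non_negative(values):
    # "not any(v < 0 ...)" rewritten as all() with the condition inverted.
    return all(v >= 0 for v in values)


sample = [3, 0, -2, 5]
```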

Comment on lines -46 to +55
temp_dict = dict()
temp_dict = {}

# Open the file
f = open(path, 'r')

for line in f:
# Loop through the items
line_number = line_number + 1
line_number += 1
temp_dict = dict()
for i in range(0,len(head)):
for i in range(len(head)):

Function fbi_ucr refactored with the following changes:

  • Replace assignment with augmented assignment
  • Replace range(0, x) with range(x)
  • Replace dict() with {}
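
A small sketch combining the three changes listed above. The header and rows are made-up stand-ins for the file being parsed, and `records` is a hypothetical name. The names `head`, `line_number` and `temp_dict` follow the diff.

```python
head = ["year", "state", "count"]                      # hypothetical header row
rows = [["2014", "NY", "10"], ["2015", "NY", "12"]]    # hypothetical data rows

line_number = 0
records = []
for row in rows:
    line_number += 1               # instead of line_number = line_number + 1
    temp_dict = {}                 # instead of dict()
    for i in range(len(head)):     # instead of range(0, len(head))
        temp_dict[head[i]] = row[i]
    records.append(temp_dict)
```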

for i, l in enumerate(f):
pass
pass

Function file_len refactored with the following changes:

  • Hoist statements out of for/while loops

Comment on lines +8 to +20

import requests
import pandas as pd
from sqlalchemy import create_engine

# This will hold the data scrapped from the API
- scrapped_data = list()
+ scrapped_data = []

for year in range(2013, 2016):
url = "http://map.feedingamerica.org/api/v1/map/county."+str(year)+".overall.null.null.null?sideload[]=county&sideload[]=national&&sideload[]=states"
response = requests.get(url)
j = response.json()
- states = dict()
+ states = {}

Lines 13-34 refactored with the following changes:

  • Replace list() with []
  • Replace dict() with {}

@@ -4,6 +4,7 @@

@author: Michael
"""


Lines 15-36 refactored with the following changes:

  • Replace if statement with if expression
  • Replace list() with []

- data = dict()
+ data = {}

Function parse_lehd_html refactored with the following changes:

  • Replace dict() with {}

Comment on lines -42 to +48

sql = 'SELECT CGR_GEO_ID, NAME, LEHD_URL FROM CGR_GeographyIndex WHERE CI_GEO=1 AND LEHD_URL IS NOT NULL'

query = conn.execute(sql)
browser = webdriver.Chrome()

- data = list()
+ data = []

Lines 42-68 refactored with the following changes:

  • Replace list() with []

@@ -5,6 +5,7 @@
@author: Michael Silva
"""


Lines 18-61 refactored with the following changes:

  • Replace list() with []

Comment on lines +7 to +14

from bs4 import BeautifulSoup
import requests as re
import pandas as pd

base_url = 'http://www.artsindexusa.org/where-i-live?c4='
data_url = 'http://www.artsindexusa.org/fetchCounty.php?selectedCounty='
- data = list()
+ data = []

Lines 13-43 refactored with the following changes:

  • Replace assignment with augmented assignment
  • Replace list() with []
  • Replace dict() with {}

Comment on lines -71 to +72
- # the enb extension.
- file_name_list = list()
+ # the enb extension.
+ file_name_list = []

Function get_file_name_list refactored with the following changes:

  • Replace list() with []

@@ -5,6 +5,7 @@
@author: Michael Silva
"""


Lines 35-78 refactored with the following changes:

  • Replace if statement with if expression
  • Replace list() with []

Comment on lines +7 to +18

import requests
import pandas as pd

- opportunity_index_data = list()
+ opportunity_index_data = []

# Loop through the list of fips codes
  for line in open('fips.csv', 'r', encoding='latin-1'):
-     csv_row = line.split(',')
+     csv_row = line.split(',')
      # Pull the name
      pos = line.find('"')
-     if pos > 0:
-         name = line[pos:].strip().replace('"','')
-     else:
-         name = csv_row[2].strip()
+     name = line[pos:].strip().replace('"','') if pos > 0 else csv_row[2].strip()

Lines 10-35 refactored with the following changes:

  • Replace if statement with if expression
  • Replace list() with []
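
To show what the if expression does, here is a sketch that wraps the same logic in a hypothetical helper, `extract_name`. The sample rows are assumptions about the layout of fips.csv, not actual contents of the file.

```python
def extract_name(line):
    # Hypothetical helper wrapping the logic from the diff.
    csv_row = line.split(',')
    pos = line.find('"')
    # A quoted name may itself contain commas, so take everything from the
    # first quote onward; otherwise fall back to the third CSV field.
    return line[pos:].strip().replace('"', '') if pos > 0 else csv_row[2].strip()


quoted = extract_name('36,055,"Monroe County, NY"\n')   # assumed row layout
plain = extract_name('36,055,Monroe County\n')          # assumed row layout
```

Because the test is `pos > 0`, a quote in the very first column (index 0) is treated the same as no quote at all. This matches the original if/else.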

Comment on lines +7 to +14

from bs4 import BeautifulSoup
import requests as re
import pandas as pd

base_url = 'http://www.artsindexusa.org/where-i-live?c4='
data_url = 'http://www.artsindexusa.org/fetchCounty.php?selectedCounty='
- data = list()
+ data = []

Lines 13-43 refactored with the following changes:

  • Replace assignment with augmented assignment
  • Replace list() with []
  • Replace dict() with {}

@@ -5,6 +5,7 @@
@author: Michael Silva
"""


Lines 19-19 refactored with the following changes:

  • Replace list() with []

- file_name_list = list()
+ file_name_list = []

Function get_file_name_list refactored with the following changes:

  • Replace list() with []

Comment on lines +8 to +14

import requests
from sqlalchemy import create_engine
import pandas as pd

# This will hold the data
- data = list()
+ data = []

Lines 13-71 refactored with the following changes:

  • Replace list() with []
  • Replace dict() with {}

Comment on lines +35 to +139
"RAC1P",
"SCH",
"SCHL",
"PWGTP",
"PWGTP1",
"PWGTP2",
"PWGTP3",
"PWGTP4",
"PWGTP5",
"PWGTP6",
"PWGTP7",
"PWGTP8",
"PWGTP9",
"PWGTP10",
"PWGTP11",
"PWGTP12",
"PWGTP13",
"PWGTP14",
"PWGTP15",
"PWGTP16",
"PWGTP17",
"PWGTP18",
"PWGTP19",
"PWGTP20",
"PWGTP21",
"PWGTP22",
"PWGTP23",
"PWGTP24",
"PWGTP25",
"PWGTP26",
"PWGTP27",
"PWGTP28",
"PWGTP29",
"PWGTP30",
"PWGTP31",
"PWGTP32",
"PWGTP33",
"PWGTP34",
"PWGTP35",
"PWGTP36",
"PWGTP37",
"PWGTP38",
"PWGTP39",
"PWGTP40",
"PWGTP41",
"PWGTP42",
"PWGTP43",
"PWGTP44",
"PWGTP45",
"PWGTP46",
"PWGTP47",
"PWGTP48",
"PWGTP49",
"PWGTP50",
"PWGTP51",
"PWGTP52",
"PWGTP53",
"PWGTP54",
"PWGTP55",
"PWGTP56",
"PWGTP57",
"PWGTP58",
"PWGTP59",
"PWGTP60",
"PWGTP61",
"PWGTP62",
"PWGTP63",
"PWGTP64",
"PWGTP65",
"PWGTP66",
"PWGTP67",
"PWGTP68",
"PWGTP69",
"PWGTP70",
"PWGTP71",
"PWGTP72",
"PWGTP73",
"PWGTP74",
"PWGTP75",
"PWGTP76",
"PWGTP77",
"PWGTP78",
"PWGTP79",
"PWGTP80",
]

for chunk in pd.read_csv("psam_p06.csv", low_memory = False, chunksize = chunksize):
chunk = chunk[p_vars]
chunk = chunk[chunk["PUMA"].isin(keep_me)]
data.append(chunk)

df = pd.concat(data)
df.to_csv("psam_p06_small.csv", index=False)

with zipfile.ZipFile("csv_pca_small.zip", "w") as z:
z.write("psam_p06_small.csv")
z.write("ACS2013_2017_PUMS_README.pdf")


Lines 36-139 refactored with the following changes:

  • Hoist statements out of for/while loops
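
The snippet above shows only the resulting code, so exactly which statement was hoisted is not visible. As a generic sketch of the technique, using hypothetical function names and made-up rows keyed by column names from the list above:

```python
def select_columns_before(rows):
    selected = []
    for row in rows:
        # Loop-invariant: this list is rebuilt on every iteration.
        wanted = ["PUMA", "SCHL", "PWGTP"]
        selected.append({name: row[name] for name in wanted})
    return selected


def select_columns_after(rows):
    # Hoisted: the list is built once, before the loop starts.
    wanted = ["PUMA", "SCHL", "PWGTP"]
    selected = []
    for row in rows:
        selected.append({name: row[name] for name in wanted})
    return selected


# Made-up rows; the values are illustrative only.
rows = [
    {"PUMA": 3701, "SCHL": 21, "PWGTP": 15, "RAC1P": 1},
    {"PUMA": 3702, "SCHL": 16, "PWGTP": 9, "RAC1P": 2},
]
same_result = select_columns_before(rows) == select_columns_after(rows)
```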

- if val == 1:
-     return False
- else:
-     return True
+ return val != 1

Function recode_hisp refactored with the following changes:

  • Simplify conditional into return statement

Comment on lines -59 to +56
- elif val == 3 or val == 4 or val == 5:
+ elif val in [3, 4, 5]:

Function recode_rac1p refactored with the following changes:

  • Replace multiple comparisons of same variable with in operator
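
A short sketch showing that the membership test matches the chained comparisons. The function names are hypothetical, since the rest of recode_rac1p is not shown.

```python
def in_group_before(val):
    # Original shape: one equality comparison per candidate value.
    return val == 3 or val == 4 or val == 5


def in_group_after(val):
    # Refactored shape: a single membership test against the candidates.
    return val in [3, 4, 5]


# The two versions agree for every value checked here.
agree = all(in_group_before(v) == in_group_after(v) for v in range(0, 10))
```

A tuple `(3, 4, 5)` or a set `{3, 4, 5}` would work equally well as the container.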

Comment on lines -29 to +30
- est = list()
- moe = list()
+ est = []
+ moe = []

Function aggregate_acs_data refactored with the following changes:

  • Replace list() with []

- inflator = base_cpi / cpi
- return inflator
+ return base_cpi / cpi

Function get_cpi_inflator refactored with the following changes:

  • Inline variable that is only used once
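
A sketch of the inlined form. The original signature of get_cpi_inflator is not shown, so taking both CPI values as parameters is an assumption, and the figures are made up.

```python
def get_cpi_inflator(base_cpi, cpi):
    # Assumed signature. Returns the factor that converts an amount
    # priced at `cpi` into base-period terms.
    return base_cpi / cpi


# Made-up CPI values: 100.0 priced at CPI 200.0, restated at base CPI 250.0.
adjusted = 100.0 * get_cpi_inflator(base_cpi=250.0, cpi=200.0)
```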

Comment on lines -90 to +122
- aggregate_data = list()
- for a in aggregate_me:
-     if a[1] not in geos:
-         aggregate_data.append({'geo_id': a[0], 'aggregate_id': a[1]})
+ aggregate_data = [
+     {'geo_id': a[0], 'aggregate_id': a[1]}
+     for a in aggregate_me
+     if a[1] not in geos
+ ]

aggregate_data = pd.DataFrame(aggregate_data)

# Merge it with our data (inner join)
df = pd.merge(df, aggregate_data, on='geo_id')

# Set some common values
df['NAME'] = df['geo_id'] = df['aggregate_id']

# Drop the extra column
df = df.drop(['aggregate_id'], axis=1)

# Determine the columns
- cols_to_keep = list(df.columns.values)
+ cols_to_keep = list(df.columns.values)
cols_to_aggregate = [e for e in cols_to_keep if e not in ('NAME', 'geo_id')]

# Deal with the margin of error
if 'count_moe' in cols_to_aggregate:
# Square the Margin of Error
df['count_moe'] = df['count_moe'] ** 2

# Aggregate (Sum)
df = df.groupby(['NAME', 'geo_id'])[cols_to_aggregate].sum().reset_index()

# Deal with margin of error
if 'count_moe' in cols_to_aggregate:
# Square root the sum of squares
df['count_moe'] = df['count_moe'] ** .5


Function get_sum_aggregates refactored with the following changes:

  • Replace list() with []
  • Convert for loop into list comprehension
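
A plain-Python sketch of two things in the snippet above: the filtered list comprehension, and the margin-of-error handling (square, sum, then square root). The sample pairs, the contents of `geos` and the MOE values are made up. The real function does the aggregation with pandas.

```python
import math

# Hypothetical (geo_id, aggregate_id) pairs and already-present geographies.
aggregate_me = [("36055", "region_a"), ("36051", "region_a"), ("36069", "36069")]
geos = {"36069"}

# Filtered comprehension: keep pairs whose aggregate id is not already in geos.
aggregate_data = [
    {"geo_id": a[0], "aggregate_id": a[1]}
    for a in aggregate_me
    if a[1] not in geos
]

# Margins of error combine as the square root of the sum of squares.
moes = [3.0, 4.0]   # made-up component margins of error
combined_moe = math.sqrt(sum(m ** 2 for m in moes))
```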

Comment on lines -125 to +131
- aggregate_data = list()
- for a in aggregate_me:
-     aggregate_data.append({'geo_id': a[0], 'aggregate_id': a[1]})
+ aggregate_data = [{'geo_id': a[0], 'aggregate_id': a[1]} for a in aggregate_me]
aggregate_data = pd.DataFrame(aggregate_data)

# Merge it with our data (inner join)
df = pd.merge(df, aggregate_data, on='geo_id')


Function get_weighted_average_aggregate refactored with the following changes:

  • Replace list() with []
  • Convert for loop into list comprehension

- model = dict()
+ model = {}

Lines 12-12 refactored with the following changes:

  • Replace dict() with {}
