These Python scripts retrieve and parse the "Risk Factors" section in Apple's annual SEC filings from 2007 through 2015. The output is an .xlsx file with a dictionary of words used in each filing.

pjswords/Apple-SEC-filings

Apple-SEC-filings

These Python scripts retrieve and parse Apple's annual SEC filings from 2007 through 2015. Apple-SEC-filings-py2.py works with Python 2, and Apple-SEC-filings-py3.py works with Python 3. Both scripts perform the same function.

After retrieving a report from Apple's website, the script searches for the section titled "Item 1A. Risk Factors" and builds a dictionary from the words in that section. The script then places the contents of the dictionary into an OpenPyXL object that corresponds to a Microsoft Excel worksheet. While building the worksheets, the script also provides console output showing which words have the highest frequency. Once a worksheet is built for each report, the script saves the worksheets in a single XLSX workbook that can be used with Tableau or other analytics software.
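The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the code in the repository's scripts: the function names `extract_risk_factors`, `word_frequencies`, and `build_workbook` are hypothetical, a regular expression stands in for nltk tokenization, the input is assumed to be plain text already extracted from the filing's HTML (the BeautifulSoup step is omitted), and "Item 1B" is assumed to mark the end of the Risk Factors section.

```python
import re
from collections import Counter


def extract_risk_factors(text):
    """Return the text between 'Item 1A. Risk Factors' and 'Item 1B'.

    Assumption: the first match is the section heading. A real filing may
    list the heading in its table of contents first, which this sketch
    does not handle.
    """
    start = re.search(r"Item\s+1A\.?\s+Risk\s+Factors", text, re.IGNORECASE)
    if start is None:
        return ""
    rest = text[start.end():]
    end = re.search(r"Item\s+1B\.?", rest, re.IGNORECASE)
    return rest[:end.start()] if end else rest


def word_frequencies(section):
    """Count lowercase alphabetic words (a regex stand-in for nltk tokenization)."""
    return Counter(re.findall(r"[a-z]+", section.lower()))


def build_workbook(freqs_by_year, path):
    """Write one worksheet per filing year, rows sorted by descending count.

    Assumes freqs_by_year holds at least one year, since a workbook needs
    at least one sheet to be saved.
    """
    # Imported here so the helpers above run without OpenPyXL installed.
    from openpyxl import Workbook

    wb = Workbook()
    wb.remove(wb.active)  # drop the default empty sheet
    for year, freqs in freqs_by_year.items():
        ws = wb.create_sheet(title=str(year))
        ws.append(["word", "count"])
        for word, count in freqs.most_common():
            ws.append([word, count])
    wb.save(path)


# Made-up sample text, used only to show the console output step.
sample = (
    "Item 1A. Risk Factors The Company faces risks. "
    "Risks include supply risks. Item 1B. Unresolved Staff Comments"
)
freqs = word_frequencies(extract_risk_factors(sample))
print(freqs.most_common(3))
```

Calling `build_workbook({2015: freqs}, "risk-factors.xlsx")` would then save a single workbook with one sheet per year; the file name here is only an example.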

Note: These scripts require the BeautifulSoup, OpenPyXL, and NLTK libraries.
