text-mining-for-annual-reports

#逻辑 #先利用16年年报数据的一部分数据训练数据 """ 1.选取特定股票 2.统计这些股票年报所有动词与名词的词频 3.取前100的关键词 4.统计选定的年报中这些关键词的次数，按顺序构成一个X数据集 (可以加一些交叉项来提高拟合准确度) 5.利用16年的每股收益率作为y数据集拟合上面的x数据集，得到系数k和截距b """ #然后利用模型分析16年全部数据 """ 1.统计16年所有年报的上面选定的关键词的词频，构成x数据集 2.利用上面的拟合结果k和b,来预测y 3.将y和平均每股收益率做统计分析(z检验)，如果效果好，则进行下一步操作 """ #然后利用16年全部数据训练数据集 """ 1.统计这些股票年报中上面选定的关键词的词频构成x数据集(和上面一样) 2.利用16年的平均每股收益作为y拟合x，得到k和b (可以加一些交叉项来提高拟合准确度) """ #然后分析17年全部数据 """ 1.统计17年所有年报的已选定关键词的频次，构成x数据集 2.按照所得的k和b来预测y 3.将结果和真实情况做统计分析(z检验)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
download the annual reports.py		download the annual reports.py
get the key word list.py		get the key word list.py
notes about the code.txt		notes about the code.txt
predition.py		predition.py
train the model.py		train the model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

text-mining-for-annual-reports

About

Releases

Packages

Languages

Jessicawwww/text-mining-for-annual-reports

Folders and files

Latest commit

History

Repository files navigation

text-mining-for-annual-reports

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages