Company2Vec

This repository contains the code and pre-trained company embeddings from the paper Learning Company Embeddings from Annual Reports for Fine-grained Industry Characterization (FinNLP 2020).

Code

The sections below describe how to obtain company embeddings from a corpus.

Requirements:

Python 3.8.2 (package dependencies are listed in requirements.txt)

Dataset and Models

You can download the dataset and models from the following link.

Run Code

Training and evaluation are provided as Jupyter notebooks, one pair per language:

-training (English): English/learning.ipynb

-evaluation (English): English/evaluation.ipynb

-training (Japanese): Japanese/learning.ipynb

-evaluation (Japanese): Japanese/evaluation.ipynb
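
The notebooks can be opened interactively in Jupyter. For unattended runs, the sketch below shows one possible way to execute them from Python through `jupyter nbconvert`. It is not part of the repository: the `NOTEBOOKS` table and the `nbconvert_command` and `run_notebook` helpers are hypothetical names introduced here. It assumes Jupyter is installed, that the command is run from the repository root, and that each notebook runs top to bottom without manual input.

```python
import subprocess
from typing import Dict, List, Tuple

# Notebook paths as listed in this README, keyed by (language, notebook name).
NOTEBOOKS: Dict[Tuple[str, str], str] = {
    ("English", "learning"): "English/learning.ipynb",
    ("English", "evaluation"): "English/evaluation.ipynb",
    ("Japanese", "learning"): "Japanese/learning.ipynb",
    ("Japanese", "evaluation"): "Japanese/evaluation.ipynb",
}


def nbconvert_command(notebook: str) -> List[str]:
    """Build a `jupyter nbconvert` command that executes a notebook in place."""
    return [
        "jupyter", "nbconvert",
        "--to", "notebook",
        "--execute",
        "--inplace",
        notebook,
    ]


def run_notebook(language: str, name: str) -> None:
    """Hypothetical helper: execute one notebook and raise if any cell fails."""
    command = nbconvert_command(NOTEBOOKS[(language, name)])
    subprocess.run(command, check=True)
```

With these helpers, `run_notebook("English", "learning")` followed by `run_notebook("English", "evaluation")` would train and then evaluate the English model, provided the downloaded dataset is placed wherever the notebooks expect it.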

Reference paper

If you use any of these resources, please cite the following paper:

@InProceedings{tomokicompany2vec,
  author = "Tomoki Ito and Jose Camacho Collados and Hiroki Sakaji and Steven Schockaert",
  title = "Learning Company Embeddings from Annual Reports for Fine-grained Industry Characterization",
  booktitle = "Proceedings of the 2nd Workshop on Financial Technology and Natural Language Processing",
  year = "2020"
}

License

Code and data in this repository are released open-source.

