Selection of Related Stocks using Financial Text Mining
Published in | 2018 IEEE International Conference on Data Mining Workshops (ICDMW) pp. 191 - 198 |
---|---|
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.11.2018 |
Summary: | We propose a method to select and rank stocks related to a given theme. The method has two stages: obtaining related words, and selecting related stocks based on those words. First, given a theme word, the method selects words with high similarity to it using an ensemble of word2vec models, then modifies each similarity based on how the words match company information, including investor relations documents and company homepages. Second, the top-10 similar words are matched against the company data, and sentences related to the theme are extracted from each company's data. Each company's final similarity is the sum of the modified similarities of the related words in its extracted sentences, and the top-n related stocks are selected by this final similarity. On Japanese documents, companies, and stocks, the method achieved 0.49 accuracy (precision, recall, and F1-value), which is better than random selection. In addition, a comparison with results obtained for a completely different theme verified that the method works correctly and can filter related stocks effectively. |
---|---|
ISSN: | 2375-9259 |
DOI: | 10.1109/ICDMW.2018.00036 |
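
The two-stage pipeline described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not say how the word2vec ensemble is combined or how similarities are modified, so the sketch assumes a plain average over models and a hypothetical document-frequency penalty. All function names, the substring matching, the sentence splitter, and the toy data are likewise assumptions made for illustration.

```python
import re

def ensemble_similarity(per_model_scores):
    """Average each word's similarity to the theme over all models.
    ASSUMPTION: the ensemble rule is a plain mean; the summary does not specify it."""
    totals = {}
    for scores in per_model_scores:
        for word, sim in scores.items():
            totals[word] = totals.get(word, 0.0) + sim
    n_models = len(per_model_scores)
    return {word: total / n_models for word, total in totals.items()}

def modify_similarity(similarity, company_texts):
    """Adjust similarities using word matches in company information.
    ASSUMPTION: a hypothetical penalty that down-weights words found in many
    companies' texts and zeroes words that match no company at all."""
    n_companies = len(company_texts)
    modified = {}
    for word, sim in similarity.items():
        doc_freq = sum(1 for text in company_texts.values() if word in text)
        modified[word] = sim * (1.0 - doc_freq / n_companies) if doc_freq else 0.0
    return modified

def split_sentences(text):
    """Split on Japanese or Western sentence terminators (a simplification)."""
    return [s.strip() for s in re.split(r"[。.!?]", text) if s.strip()]

def rank_companies(modified, company_texts, top_words=10, top_n=3):
    """Score each company by summing the modified similarity of the related
    words found in its sentences, then return the top-n (company, score) pairs."""
    related = sorted(modified, key=modified.get, reverse=True)[:top_words]
    scores = {}
    for company, text in company_texts.items():
        score = 0.0
        for sentence in split_sentences(text):
            # Each related word counts once per sentence that contains it.
            score += sum(modified[w] for w in related if w in sentence)
        scores[company] = score
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_n]

# Toy data: per-model similarities of candidate words to a made-up theme word.
per_model_scores = [
    {"solar": 0.9, "battery": 0.7, "energy": 0.8},
    {"solar": 0.7, "battery": 0.5, "energy": 0.6},
]
company_texts = {
    "A": "We build solar panels. Our solar farms use battery storage.",
    "B": "We trade energy futures.",
    "C": "We run restaurants.",
    "D": "We supply energy and battery systems.",
}

similarity = ensemble_similarity(per_model_scores)
modified = modify_similarity(similarity, company_texts)
ranked = rank_companies(modified, company_texts, top_words=10, top_n=3)
print(ranked)
```

In practice the per-model scores would come from trained word2vec models rather than hand-written dictionaries, and Japanese text would normally be tokenized before matching; the substring test here only keeps the sketch self-contained.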