This webpage provides a
- Crowdsourced reannotation of WordSim-353 word pairs:
Finkelstein, Lev, et al. "Placing search in context: The concept revisited." Proceedings of the 10th international conference on World Wide Web. ACM, 2001. - The original WordSim-353 guidelines elicit word relatedness. To elicit word similarity, we reannotated the word pairs according to guidelines explicitly listing similarity relations and the implicit similarity as word interchangeability (WIN) guidelines . For details see:
Kliegr, Tomáš, and Ondřej Zamazal. Antonyms are similar: Towards paradigmatic association approach to rating similarity in SimLex-999 and WordSim-353. Data & Knowledge Engineering 115 (2018): 174-193. - A Czech version of WordSim353 reannotated using similarity as word interchangeability (WIN) guidelines. The translated pairs were adopted from:
Cinková, Silvie. "WordSim353 for Czech." In International Conference on Text, Speech, and Dialogue, pp. 190-197. Springer, Cham, 2016.
Agirre, Eneko, et al. "A study on similarity and relatedness using distributional and wordnet-based approaches." Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 2009.
Dataset | File |
---|---|
WordSim353crowd WordSim353 word pairs reannotated according to the original WordSim353 guidelines using crowdsourcing. |
zip |
WIN-353 WordSim353 word pairs reannotated according to the word interchangeability guidelines. |
zip |
ExplictSim353 WordSim353 word pairs reannotated dataset according to explicit similarity guidelines. |
zip |
WIN-353cs WordSim353 word pairs reannotated according to the word interchangeability guidelines - CZECH version . |
zip |
Dataset | File |
---|---|
Automatic mappings - WordSim353 | csv |
Crowdsourced mappings - WordSim353 | csv |