Distributed Sentence Similarity Base on Word Mover's Distance

Algorithm:ci

Refrence from one ICML15 paper: Word Mover's Distance.get

1. First use Google's word2vec tool to get distributed word representing aka. word vectors.it

2. Then use earth mover's distance as similarity measure metric.io

3. Solve the EMD problem as transportation problem by Hungarian Algorithm.im


 

Outcome:margin

Result looks not bad, but still have ways to improve the precision.word

For example: use n-gram to keep a little bit sentence structure.di

相關文章
相關標籤/搜索