如何比對字串資料庫相似度?
當有一個句子,例如 This is a book. 希望可以在資料庫中找出相似的語句時,
有哪些工具可以利用?
資料庫範例
- This is my book.
- These are books.
- This is a blue book.
- This is a car
字串相似度比對工具
https://pypi.python.org/pypi/gensim
https://radimrehurek.com/gensim/
http://www.codeproject.com/Articles/13525/Fast-memory-efficient-Levenshtein-algorithm