Mining Quality Phrases from Massive Text Corpora (2015)
Tags:
framework that extracts quality phrases from text corpora integrated with phrasal segmentation. > The framework requires only limited training but the quality of phrases so generated is close to human judgment. Moreover, the method is scalable: both computation time and required space grow linearly as corpus size increases [Related blog post](https://medium.com/@SherlockHumus/mining-quality-phrases-from-not-so-massive-text-corpora-part-i-b20b8336520a) Used in [this Entity Linking method](/doc/?uri=https%3A%2F%2Farxiv.org%2Fabs%2F1807.06036)
About This Document
File info