http://www.semanlink.net/tag/pdf_extract ; OCR AND Google Cloud Platform
Common descendants