http://www.semanlink.net/tag/pdf_extract AND OCR
Common descendants