Entry | Pdftotext |
Definition |
pdftotext is an open-source command-line utility that converts PDF files to plain text files, that is, it extracts the text data contained in a PDF file. It is freely available, is included by default with many Linux distributions, and is also available for Windows as part of the Xpdf Windows port. Text extraction is complicated because PDF files are built internally from page-drawing primitives, so the boundaries between words and paragraphs must often be inferred from their positions on the page. pdftotext is part of the Xpdf software suite. Poppler, which is derived from Xpdf, also includes an implementation of pdftotext. On most Linux distributions, pdftotext is provided by the poppler-utils package.[1]
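As a rough illustration of how the utility is typically driven from a script, the following Python sketch builds a pdftotext command line and captures the extracted text. The helper names build_pdftotext_command and extract_text are hypothetical and not part of Xpdf or Poppler. The sketch assumes a pdftotext binary on the PATH that accepts the -f, -l, -layout and -enc options and treats "-" as "write to standard output"; behaviour may differ between versions.

```python
import shutil
import subprocess
from typing import List, Optional


def build_pdftotext_command(
    pdf_path: str,
    first_page: Optional[int] = None,
    last_page: Optional[int] = None,
    keep_layout: bool = False,
) -> List[str]:
    """Assemble an argument list for pdftotext (hypothetical helper)."""
    # Ask for UTF-8 output explicitly, since the default encoding
    # can depend on the build and its configuration.
    command = ["pdftotext", "-enc", "UTF-8"]
    if first_page is not None:
        command += ["-f", str(first_page)]  # first page to convert
    if last_page is not None:
        command += ["-l", str(last_page)]  # last page to convert
    if keep_layout:
        # Try to preserve the physical layout of the page
        # instead of emitting text in reading order.
        command.append("-layout")
    # Assumption: "-" as the output file sends the text to stdout.
    command += [pdf_path, "-"]
    return command


def extract_text(pdf_path: str, **options) -> str:
    """Run pdftotext and return the extracted text (hypothetical helper)."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext was not found on PATH")
    result = subprocess.run(
        build_pdftotext_command(pdf_path, **options),
        capture_output=True,
        check=True,  # raise CalledProcessError on a non-zero exit status
    )
    return result.stdout.decode("utf-8", errors="replace")
```

With a real file, a call such as extract_text("report.pdf", first_page=1, last_page=2, keep_layout=True) would return the text of the first two pages; the file name here is only a placeholder. Keeping command construction separate from execution makes the argument list easy to inspect without the binary installed.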
References
1. ^ "poppler-utils". linuxappfinder.com. http://linuxappfinder.com/package/poppler-utils. Retrieved 2018-09-14.
Categories: Linux text-related software | Free PDF software |