How to extract the title of a PDF document from within a script for renaming?

前端未结

关注

 6  1768

长情又很酷 2021-02-01 21:34

I have thousands of PDF files in my computers which names are from a0001.pdf to a3621.pdf, and inside of each there is a title; e.g. \"aluminum carbona

6条回答

忘掉有多难 (楼主)

2021-02-01 21:44
What you need is a library that can actually read PDF files. For example pdfrw:
```
In [8]: from pdfrw import PdfReader

In [9]: reader = PdfReader('example.pdf')

In [10]: reader.Info.Title
Out[10]: 'Example PDF document'
```
0 讨论(0)

查看其它6个回答
发布评论:

提交评论
- 加载中...