Bookmarks
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
  • Weekly Feed
  • Monthly Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
jorisschellekens/ptext-release https://github.com/jorisschellekens/ptext-release
Thu 21 Jan 2021 01:20:14 PM PST archive.org

pText is a pure python library to read, write and manipulate PDF documents. It represents a PDF document as a JSON-like datastructure of nested lists, dictionaries and primitives. Extract and edit metadata, extract and edit text and images, add annotations.

Seems like it would be useful for a large-scale indexing effort.

python modules pdf data metadata editor export import converter
4995 links, including 380 private
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn