Public Notes on
View Public Collections
Loading...

The PDF library used by the Chromium project. Contribute to chromium/pdfium development by creating an account on GitHub.

#pdf #library #chrome

Show More
Loading...

iDev - @zongmumask666 - 我是一个独立开发者,最近上线了一款 macOS 上的 PDF 阅读器,最初是因为自己处理 PDF 时总觉得不够顺手,就想着自己做一个,功能更聚焦、体验更清爽一点。开发过程中我选择了 PDFiu




via: https://www.v2ex.com/member/zongmumask666/topics

#pdf #macos #app #reader

Show More
Loading...

Edit PDFs with ease—add text, images, signatures, merge files, create fillable forms, and password protect documents. Try our all-in-one PDF editor now!

via: https://hckrnews.com/

#web #app #pdf #editor

Show More
Loading...

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。 - opendatalab/MinerU

via: https://www.google.com/

#pdf #markdown #convert #llm

Show More
Loading...
Pulse www.runpulse.com
Pulse understands your complex data

#llm #ai #api #document #convert #markdown #ocr #extraction #etl #pdf

Show More
Loading...
Unstructured helps you get your data ready for AI by transforming it into a format that large language models can understand. Easily connect your data to LLMs.

#llm #ai #api #document #convert #markdown #ocr #extraction #etl #pdf

Show More
Loading...
Python tool for converting files and office documents to Markdown. - microsoft/markitdown

#python #html #pdf #convert #markdown

Show More
Loading...
Papermark is the open-source DocSend alternative with built-in analytics and custom domains. - mfts/papermark

#document #pdf #collaboration #tracking #docsend #alternative #sharing

Show More
Loading...
An in-browser, local-first Markdown resume builder. - Renovamen/oh-my-cv

#markdown #resume #pdf

Show More
Loading...
Polytype – Home polytype.dev

A Rosetta stone for typesetting engines.


This project's goal is to provide a chrestomathy for typesetting similar to what Rosetta Code does for programming languages. The samples here are designed to compare and/or contrast the approaches taken to various typesetting situations by different typesetting engines.


The emphasis is less on document markup languages, programming languages, or actual content and more on the way layout and orthographic features are achieved. Sometimes similar input wi...

#typography #pdf #latex #pattern #best-practice

Show More
Loading...
Paged.js — pagedjs.org

Paged.js is a free and open source JavaScript library that paginates content in the browser to create PDF output from any HTML content. This means you can design works for print (eg. books) using HTML and CSS!


Paged.js follows the Paged Media standards published by the W3C (ie the Paged Media Module, and the Generated Content for Paged Media Module). In effect Paged.js acts as a polyfill for the CSS modules to print content using features that are not yet natively supported by browsers.

#javascript #library #html #pdf #typography #printing

Show More
Loading...
📄 Create PDF files using React. Contribute to diegomura/react-pdf development by creating an account on GitHub.

#react #pdf #library #component

Show More
Loading...

#PDF #mmarkdownhtml #docx #convert #python

Show More
Loading...
iLovePDF is an online service to work with PDF files completely free and easy to use. Merge PDF, split PDF, compress PDF, office to PDF, PDF to JPG and more!

#pdf #convert #tool #online

Show More
Loading...
Improved file parsing for LLM’s. Contribute to Filimoa/open-parse development by creating an account on GitHub.

#llm #parser #text #convert #markdown #split #extraction #content #python #library #pdf #ocr

Show More
Loading...
A Unified Toolkit for Deep Learning Based Document Image Analysis - Layout-Parser/layout-parser

#pdf #layout #parser #llm #python #image #ocr

Show More
Loading...
Amazon Textract is a machine learning (ML) service that uses optical character recognition (OCR) to automatically extract text, handwriting, and data from scanned PDF documents, forms, and tables.

#content #extraction #ocr #pdf #parser #api

Show More
Loading...
Community maintained fork of pdfminer - we fathom PDF - pdfminer/pdfminer.six

#python #pdf #content #extraction #parser #library

Show More
Loading...
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. - pymupdf/PyMuPDF

#python #pdf #content #extraction #parser #library

Show More
Loading...
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#llm #model #table #pdf #content #extraction

Show More
Loading...
UniTable: Towards a Unified Table Foundation Model - poloclub/unitable

#pdf #table #content #extraction #llm #machine-learning

Show More
Loading...
Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets. - xournalpp/xournalpp

#drawing #notebook #note-taking #app #crossplatform #handwriting #pdf

Show More