Public Notes on
View Public Collections
Python tool for converting files and office documents to Markdown. - microsoft/markitdown

#python #html #pdf #convert #markdown

Show More
Papermark is the open-source DocSend alternative with built-in analytics and custom domains. - mfts/papermark

#document #pdf #collaboration #tracking #docsend #alternative #sharing

Show More
An in-browser, local-first Markdown resume builder. - Renovamen/oh-my-cv

#markdown #resume #pdf

Show More
Polytype – Home polytype.dev

A Rosetta stone for typesetting engines.


This project's goal is to provide a chrestomathy for typesetting similar to what Rosetta Code does for programming languages. The samples here are designed to compare and/or contrast the approaches taken to various typesetting situations by different typesetting engines.


The emphasis is less on document markup languages, programming languages, or actual content and more on the way layout and orthographic features are achieved. Sometimes similar input wi...

#typography #pdf #latex #pattern #best-practice

Show More
Paged.js — pagedjs.org

Paged.js is a free and open source JavaScript library that paginates content in the browser to create PDF output from any HTML content. This means you can design works for print (eg. books) using HTML and CSS!


Paged.js follows the Paged Media standards published by the W3C (ie the Paged Media Module, and the Generated Content for Paged Media Module). In effect Paged.js acts as a polyfill for the CSS modules to print content using features that are not yet natively supported by browsers.

#javascript #library #html #pdf #typography #printing

Show More
📄 Create PDF files using React. Contribute to diegomura/react-pdf development by creating an account on GitHub.

#react #pdf #library #component

Show More

#PDF #mmarkdownhtml #docx #convert #python

Show More
iLovePDF is an online service to work with PDF files completely free and easy to use. Merge PDF, split PDF, compress PDF, office to PDF, PDF to JPG and more!

#pdf #convert #tool #online

Show More
Improved file parsing for LLM’s. Contribute to Filimoa/open-parse development by creating an account on GitHub.

#llm #parser #text #convert #markdown #split #extraction #content #python #library #pdf #ocr

Show More
A Unified Toolkit for Deep Learning Based Document Image Analysis - Layout-Parser/layout-parser

#pdf #layout #parser #llm #python #image #ocr

Show More
Amazon Textract is a machine learning (ML) service that uses optical character recognition (OCR) to automatically extract text, handwriting, and data from scanned PDF documents, forms, and tables.

#content #extraction #ocr #pdf #parser #api

Show More
Community maintained fork of pdfminer - we fathom PDF - pdfminer/pdfminer.six

#python #pdf #content #extraction #parser #library

Show More
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. - pymupdf/PyMuPDF

#python #pdf #content #extraction #parser #library

Show More
We’re on a journey to advance and democratize artificial intelligence through open source and open science.

#llm #model #table #pdf #content #extraction

Show More
UniTable: Towards a Unified Table Foundation Model - poloclub/unitable

#pdf #table #content #extraction #llm #machine-learning

Show More
Xournal++ is a handwriting notetaking software with PDF annotation support. Written in C++ with GTK3, supporting Linux (e.g. Ubuntu, Debian, Arch, SUSE), macOS and Windows 10. Supports pen input from devices such as Wacom Tablets. - xournalpp/xournalpp

#drawing #notebook #note-taking #app #crossplatform #handwriting #pdf

Show More
网易有道速读,主要针对快速从文档提取,定位,汇总信息,为您提供ai阅读论文,论文阅读软件等一站式论文、文档速读方面的问题。

#ai #pdf

Show More
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. - Unstructured-IO/unstructured: Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

#rag #pdf #document #processing

Show More
Poppler poppler.freedesktop.org
Poppler is a PDF rendering library based on the xpdf-3.0 code base.


#pdf #renderer #engine

Show More
Macro | Home macro.com
Macro Document Workspace

#pdf #editor #document #app

Show More
QPDF: A content-preserving PDF document transformer - qpdf/qpdf

#pdf #cli #transform #convert

Show More
Build and generate PDF using React 📄 UI kit for PDFs and print documents. Simple, reusable components and templates to create great invoices, docs, brochures. Use your favorite front-end framework React to build your next PDF. - OnedocLabs/react-print-pdf

#react #component #html #css #pdf #generator

Show More