A data extraction tool to convert PDF to Markdown and JSON
Label data with an open‑source annotation tool