OCXLY
Dev Tools · Convert

Anything to Markdown_

Turn PDFs, Word documents, plain text, and even images with text into clean Markdown — entirely in your browser. Nothing is uploaded. Images are read with on-device OCR; Word files keep their headings, lists, and links.

Drop a file or click to choose

PDF · DOCX · DOC · TXT · JPG / PNG / WebP (with text)

PDF → MD DOCX → MD TXT → MD Image → MD (OCR)

How each format is handled

Plain text (.txt) is already close to Markdown, so it passes through directly. PDF text is extracted page-by-page with PDF.js and pages are separated with a horizontal rule. Word (.docx) is the richest conversion: Mammoth reads the document structure and Turndown turns it into Markdown, preserving headings, bold and italic, lists, links, and tables.

Images with text (OCR)

JPG, PNG, and WebP files are read with Tesseract.js, an optical-character-recognition engine that runs entirely on your device. The first image takes a little longer because the recognition model downloads once, then it’s cached. OCR works best on clear, high-contrast text; handwriting and low-resolution scans are less reliable.

A note on legacy .doc

The old binary .doc format (Word 97–2003) has no reliable in-browser parser, so this tool does a best-effort text extraction and flags the result. For a faithful conversion, open the file in Word or Google Docs and save it as .docx first, then convert that.

Private by design. Built with AI assistance. Your files never leave your device — parsing, OCR, and conversion all run locally in the browser.