Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonNanoNets/docstrange

docstrange

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

51.8/100
1.4KForks: 125
View on GitHubHomepage →
Loading report...

Similar Projects

local_ai_ocr

71

An local, offline (after initial setup), portable OCR software that can process images and PDF files, using DeepSeek-OCR AI (running directly on your machine).

Python721

ZerolanLiveRobot

72

AI VTuber with LLM, ASR, TTS, OCR, CV and more technologies to live stream or play Minecraft with you.

Python612

langchain

94

The agent engineering platform

Python129.8K

open-webui

94

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python127.5K
Back to List