
Surya : OCR and line detection in 90+ languages
Date : 2024-01-10
Description
This summary was drafted with mixtral-8x7b-instruct-v0.1.Q5_K_M.gguf
Surya is an open-source document OCR toolkit developed by Vik Paruchuri. It offers accurate OCR in 90+ languages and line-level text detection in any language. It accepts images, PDFs, and folders of images or PDFs as input; table and chart detection is listed as coming soon. The toolkit includes a Streamlit app for trying Surya interactively on your own images or PDF files. The name comes from the Hindu sun god Surya, who has universal vision.
GitHub repo here
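The description says the toolkit accepts a single image, a single PDF, or a folder of either. As a rough illustration of that input handling, here is a minimal sketch of how a path could be expanded into a list of documents to process. It is not Surya's own code: the function name collect_inputs and the extension sets are assumptions made for this example, and the formats Surya actually accepts may differ.

```python
from pathlib import Path

# Hypothetical extension sets for illustration only;
# the formats Surya actually accepts may differ.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp", ".webp"}
PDF_EXTS = {".pdf"}


def collect_inputs(path):
    """Expand a file or folder path into a sorted list of image/PDF files."""
    p = Path(path)
    if p.is_dir():
        # Only look at the folder's direct children, not subfolders.
        candidates = sorted(c for c in p.iterdir() if c.is_file())
    elif p.is_file():
        candidates = [p]
    else:
        raise FileNotFoundError(str(path))
    # Keep files whose extension (case-insensitive) is an image or a PDF.
    wanted = IMAGE_EXTS | PDF_EXTS
    return [c for c in candidates if c.suffix.lower() in wanted]
```

Calling collect_inputs on a folder returns every image and PDF directly inside it, while calling it on a single file returns a one-element list (or an empty list if the file is neither an image nor a PDF). The resulting list could then be handed to whichever OCR or detection step is used.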