Guillem Hernandez Sola f753a78ba4 Split
2026-04-21 17:12:32 +02:00

Manga Translator OCR Pipeline

A robust manga/comic OCR + translation pipeline with:

  • EasyOCR (default, reliable on macOS M1)
  • Optional PaddleOCR (falls back to EasyOCR automatically if unavailable)
  • Bubble clustering and line-level boxes
  • Robust reread pass (multi-preprocessing + slight rotation)
  • Translation export + debug overlays
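
The optional-backend behaviour listed above can be illustrated with a small sketch. This is not the pipeline's actual code: the function names `is_installed` and `choose_ocr_backend` are hypothetical, and the module names `easyocr` and `paddleocr` are assumed to be the importable names of the two engines. The idea is simply to probe for a package without importing it and fall back to the next candidate.

```python
import importlib.util
from typing import Callable, Sequence


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located, without importing it."""
    return importlib.util.find_spec(module_name) is not None


def choose_ocr_backend(
    preferred: str = "easyocr",
    candidates: Sequence[str] = ("easyocr", "paddleocr"),  # assumed module names
    probe: Callable[[str], bool] = is_installed,
) -> str:
    """Return the preferred backend if available, else the first available candidate."""
    # Try the preferred engine first, then every other candidate in order.
    order = [preferred] + [name for name in candidates if name != preferred]
    for name in order:
        if probe(name):
            return name
    raise RuntimeError(f"No OCR backend available; tried: {', '.join(order)}")
```

Passing `probe` explicitly keeps the selection logic testable on machines where neither engine is installed.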

Features

  • OCR from raw manga pages
  • Noise filtering (BOX debug artifacts, tiny garbage tokens, symbols)
  • Speech bubble grouping
  • Reading order estimation (ltr / rtl)
  • Translation output (output.txt)
  • Structured bubble metadata (bubbles.json)
  • Visual debug output (debug_clusters.png)
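
As an illustration of reading order estimation, here is a minimal sketch under stated assumptions; the pipeline's own heuristic may differ. It assumes each bubble is an axis-aligned box `(x_min, y_min, x_max, y_max)`, groups boxes into rows by vertical center, and then sorts each row left-to-right or right-to-left. The name `reading_order` and the `row_tolerance` parameter are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def reading_order(
    boxes: List[Box], direction: str = "rtl", row_tolerance: float = 0.5
) -> List[int]:
    """Return indices of `boxes` in estimated reading order (top rows first)."""
    if direction not in ("ltr", "rtl"):
        raise ValueError("direction must be 'ltr' or 'rtl'")

    def center_y(i: int) -> float:
        return (boxes[i][1] + boxes[i][3]) / 2

    def center_x(i: int) -> float:
        return (boxes[i][0] + boxes[i][2]) / 2

    def height(i: int) -> float:
        return boxes[i][3] - boxes[i][1]

    # Walk boxes from top to bottom and group them into rows: a box joins the
    # current row if its center is close to the center of the row's first box.
    rows: List[List[int]] = []
    for i in sorted(range(len(boxes)), key=center_y):
        if rows:
            anchor = rows[-1][0]
            limit = row_tolerance * max(height(i), height(anchor))
            if abs(center_y(i) - center_y(anchor)) <= limit:
                rows[-1].append(i)
                continue
        rows.append([i])

    # Within each row, order horizontally according to the reading direction.
    ordered: List[int] = []
    for row in rows:
        row.sort(key=center_x, reverse=(direction == "rtl"))
        ordered.extend(row)
    return ordered
```

For a 2x2 grid of bubbles, `direction="rtl"` yields top-right, top-left, bottom-right, bottom-left, which matches the usual manga page flow; real pages with staggered panels would likely need a more careful grouping step.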

🧰 Requirements

  • macOS (Apple Silicon supported)
  • Python 3.11 recommended
  • Homebrew (for Python install)

🚀 Setup (Python 3.11 venv)

cd /path/to/manga-translator

# 1) Create venv with 3.11
/opt/homebrew/bin/python3.11 -m venv venv

# 2) Activate
source venv/bin/activate

# 3) Verify interpreter
python -V
# expected: Python 3.11.x

# 4) Install dependencies
python -m pip install --upgrade pip setuptools wheel
python -m pip install -r requirements.txt

# Optional: Paddle runtime ("|| true" ignores a failed install)
python -m pip install paddlepaddle || true
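
The interpreter check in step 3 can also be done from Python. The following is a small sketch, not part of the repository; the function name `environment_warnings` is hypothetical. It reports when the running interpreter is not 3.11 or when no virtual environment appears to be active (detected by comparing `sys.prefix` with `sys.base_prefix`).

```python
import sys
from typing import List, Sequence


def environment_warnings(
    version: Sequence[int] = tuple(sys.version_info[:2]),
    prefix: str = sys.prefix,
    base_prefix: str = sys.base_prefix,
) -> List[str]:
    """Return human-readable warnings about the current Python environment."""
    warnings: List[str] = []
    # The README recommends Python 3.11; other versions only trigger a warning.
    if tuple(version[:2]) != (3, 11):
        warnings.append(
            f"Python 3.11 is recommended, found {version[0]}.{version[1]}"
        )
    # Inside a venv, sys.prefix points at the venv and differs from sys.base_prefix.
    if prefix == base_prefix:
        warnings.append("No virtual environment appears to be active")
    return warnings


if __name__ == "__main__":
    for message in environment_warnings():
        print("warning:", message)
```

Run it with the activated venv's `python`; no output means both checks passed.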