StillHammer/couple-repo

StillHammer 7425f4af2e Reorganize Projects structure by status + update tracking files

## Projects Organization
- Create status-based folders: WIP/PAUSE/CONSTANT/CONCEPT/ARCHIVE
- Move 17 projects to appropriate status folders
- Delete obsolete README.md

### WIP (4 projects)
- GroveEngine, SEO_Article_Generator, AISSIA, SecondVoice

### PAUSE (6 projects)
- Warfactory, chinese_audio_tts_pipeline, MCP_Game_Asset_Pipeline
- ocr_pdf_service, Essay_Writing_Tingting, shipping_strategy/

### CONSTANT (3 projects)
- ClassGen (Analysis + 2.0), Database_Cours_Chinois, civjdr

### CONCEPT (5 projects)
- pokrovsk_last_day, pokrovsk_drone_command (NEW full design doc)
- social_network_manager, vps_tunnel_china, Claude_Workflow_Optimization

### ARCHIVE (3 items)
- MCP_Creative_Amplification, Backlog_9-10_Octobre_2025, LeBonCoup/

## Tracking Files Updated
- Status_Projets.md: Complete rewrite with current state (Nov 2025)
- planning/TODO_data.md: Updated with new structure and all projects by status
- CLAUDE.md: Updated relation status, Projects section, daily check stats

## Daily Check System
- Add card ACTION-008: essay_writing_tingting
- Update card_database.md: 21 total cards (15 Tingting, 3 Personal, 1 Family, 1 Tech, 1 Comm)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-11-20 11:25:53 +08:00

5.7 KiB

Raw Blame History

OCR PDF Service - Service OCR en Ligne

Created: 19/11/2025 Status: Conception Stack: À définir (probablement Node.js + PaddleOCR Python backend)

Concept

Service en ligne d'OCR pour PDFs avec deux modes de sortie :

Extraction texte brut - PDF → Texte extrait
PDF avec texte intégré - PDF scanné → PDF searchable (texte OCR intégré dans le PDF)

Use Cases

Mode 1: Extraction Texte

Upload PDF scanné
Service fait OCR sur toutes les pages
Retourne fichier texte structuré

Mode 2: PDF Searchable

Upload PDF scanné (images uniquement)
Service fait OCR + intègre texte invisible dans le PDF
Retourne PDF avec couche texte (Ctrl+F fonctionne, sélection texte possible)

Stack Potentielle

Backend OCR:

PaddleOCR (déjà validé sur ClassGen - 99.97% précision chinois)
Support multilingue (CN/EN/FR/etc.)
API Python

Service Web:

Node.js + Express (API REST)
Upload handling (multipart/form-data)
Queue system pour jobs OCR (Redis + Bull ?)

PDF Processing:

PDF.js ou pdf-lib (manipulation PDF côté Node)
PyPDF2 ou reportlab (Python - intégration texte dans PDF)

Frontend (optionnel):

Simple upload form
Progress tracking
Download résultat

Architecture Proposée

┌─────────────┐
│   Client    │
│  (Browser)  │
└──────┬──────┘
       │ Upload PDF
       ▼
┌─────────────────┐
│   Node.js API   │
│   (Express)     │
└────────┬────────┘
         │ Enqueue job
         ▼
┌─────────────────┐
│   Job Queue     │
│   (Redis/Bull)  │
└────────┬────────┘
         │ Process
         ▼
┌─────────────────┐
│  Python Worker  │
│  (PaddleOCR)    │
└────────┬────────┘
         │ OCR Result
         ▼
┌─────────────────┐
│  PDF Generator  │
│  (PyPDF2/etc)   │
└────────┬────────┘
         │ Output PDF
         ▼
┌─────────────────┐
│  Storage/CDN    │
│  (Download)     │
└─────────────────┘

Features MVP

Core

Upload PDF (max size ?)
Détection langue automatique
OCR via PaddleOCR
Export texte brut (.txt)
Export PDF searchable

Nice-to-Have

Batch processing (multiple PDFs)
Support images (JPG, PNG) en plus des PDFs
Choix manuel langue OCR
Preview avant download
API key pour usage programmatique
Webhook pour notification fin de job

Différenciation vs Concurrence

Concurrents:

Adobe Acrobat (payant, lourd)
Online OCR services (limites, confidentialité ?)
Google Drive OCR (limites format)

Notre avantage:

Gratuit (ou freemium)
Open source (si tu veux)
Privacy-focused : Upload → Process → Delete (pas de stockage permanent)
Multi-langue optimisé : Chinois excellemment supporté (PaddleOCR)
Deux modes : Texte brut OU PDF searchable
API publique : Intégration dans workflows

Monétisation Potentielle

Freemium Model:

Free tier: 10 PDFs/mois, max 5MB, watermark optionnel
Pro tier: 100 PDFs/mois, max 50MB, pas de watermark, API access
Enterprise: Unlimited, self-hosted option, support

Alternative:

Pur gratuit + donations
Ou pur gratuit comme portfolio piece

Timeline Estimée

Phase 1 - MVP (1-2 semaines):

Setup backend Python (PaddleOCR déjà validé)
API Node.js upload/download
Mode extraction texte brut
Interface web minimaliste

Phase 2 - PDF Searchable (1 semaine):

Intégration texte dans PDF original
Tests qualité (alignement texte/image)

Phase 3 - Polish (1 semaine):

UI/UX améliorée
Error handling robuste
Rate limiting
Documentation API

Total: 3-4 semaines pour version production-ready

Risques & Challenges

Technique:

Alignement texte OCR avec position dans PDF (complexe)
Performance pour gros PDFs (100+ pages)
Gestion mémoire (PaddleOCR peut être gourmand)

Business:

Coût serveur (OCR = CPU-intensive)
Scaling si succès
Légal : Respecter copyright des PDFs uploadés

Produit:

Beaucoup de concurrence
Besoin USP clair (pourquoi utiliser le nôtre ?)

Lien avec ClassGen

Synergie:

Pipeline OCR déjà validé (99.97% précision)
Code réutilisable (PaddleOCR setup, correction IA)
Même stack backend

Différence:

ClassGen : OCR → JSON structuré → Gamification (usage perso)
OCR Service : OCR → PDF/Texte → Download (usage général)

Décision à Prendre

Questions:

Priorité : Avant ou après ClassGen stable ?
Scope : MVP simple ou service complet ?
Monétisation : Gratuit, freemium, ou portfolio piece ?
Hébergement : VPS, serverless, ou local d'abord ?

Recommandation:

Attendre ClassGen livré + utilisé 1-2 semaines
Valider pipeline OCR en usage réel
Puis décider si ce service a du sens commercialement

Notes

Pattern observé : Encore un projet de conception brillante. Attention à ne pas tomber dans le piège "design mais jamais livré".

Solution :

Time-box strict (4h sessions max)
MVP ultra-minimaliste d'abord
Livrer même si "pas parfait"
Améliorer selon feedback réel

Question socratique : Pourquoi ce projet maintenant ? Quel problème concret ça résout pour toi ou pour d'autres ? Ou c'est juste "ça serait cool" ?