init: miscellaneous scripts (crawlers, converters, scrapers)

2026-03-05 20:38:36 +00:00
commit 6ac6f4be2a
925 changed files with 850330 additions and 0 deletions

pdf-extractor/README.md

@@ -0,0 +1,25 @@
# pdf-extractor - PDF to Markdown with AI
Extracts text from PDFs and converts it to Markdown using AI (OpenRouter/Mistral).
## Setup
```bash
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
# Edit .env and set your OpenRouter API key
```
## Usage
```bash
# Place PDFs in the input/ folder
mkdir -p input output
cp my-file.pdf input/
python pdfmd.py
# Results are written to output/
```
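The folder convention above (PDFs go in `input/`, Markdown comes out in `output/`) can be sketched roughly as follows. This only illustrates the path mapping and is not the actual contents of `pdfmd.py`; the function name `plan_outputs` and the `.md` naming rule are assumptions.

```python
from pathlib import Path


def plan_outputs(input_dir: str = "input", output_dir: str = "output") -> list[tuple[Path, Path]]:
    """Pair each PDF in input_dir with the Markdown path it would map to in output_dir.

    Hypothetical helper: the real script may name its outputs differently.
    """
    src = Path(input_dir)
    dst = Path(output_dir)
    pairs = []
    # sorted() keeps the processing order deterministic across runs
    for pdf in sorted(src.glob("*.pdf")):
        pairs.append((pdf, dst / (pdf.stem + ".md")))
    return pairs


if __name__ == "__main__":
    for pdf, md in plan_outputs():
        print(f"{pdf} -> {md}")
```

Running this before `python pdfmd.py` is a quick way to check which files in `input/` would be picked up; non-PDF files are ignored by the glob.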
## Configuration
`.env`:
- `OPENROUTER_API_KEY` - OpenRouter API key (required)
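
Because the key is required, a script can fail early with a clear message when it is missing. The sketch below uses only the standard library and assumes a plain `KEY=VALUE` `.env` format; the helper names `read_env_file` and `get_api_key` are hypothetical, and the real script may load the file differently (for example with a library such as python-dotenv).

```python
import os
from pathlib import Path


def read_env_file(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines; blank lines and # comments are skipped."""
    values: dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for line in env_path.read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_api_key(path: str = ".env") -> str:
    """Return OPENROUTER_API_KEY from the process environment or the .env file."""
    key = os.environ.get("OPENROUTER_API_KEY") or read_env_file(path).get("OPENROUTER_API_KEY")
    if not key:
        raise RuntimeError("OPENROUTER_API_KEY is not set; add it to .env")
    return key
```

A value already exported in the shell takes precedence over the file here, which is a design choice of this sketch rather than documented behavior of the tool.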