Python Developer Needed to Build Local Document Redaction Microservice (PDF/DOCX)
Description:
We’re looking for a skilled developer to create a lightweight Python-based microservice that automatically redacts personal information from uploaded documents (PDF and DOCX). This service will run locally on our private IONOS Linux (Plesk) dedicated server and integrate with a WordPress site via API.
Project Goals:
Build a local Python microservice (Flask-based) to:
Accept uploaded PDF and Word documents via API
Automatically identify and redact key text patterns (e.g. names, phone numbers, email addresses, and dates)
Return a redacted version with text replaced or blacked out
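To illustrate the kind of pattern-based redaction described above, here is a minimal sketch using only Python's standard library. The pattern set, the placeholder labels, and the function name redact_text are assumptions made for illustration, not part of the brief. Names are deliberately left out, since they would likely need an NLP model such as spaCy rather than regular expressions.

```python
import re

# Illustrative patterns only (assumed for this sketch); real documents
# would need broader, locale-aware rules and an NLP pass for names.
# DATE is listed before PHONE so hyphenated dates are not taken for phone numbers.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "DATE": re.compile(r"\b\d{1,2}[/.-]\d{1,2}[/.-]\d{2,4}\b"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}


def redact_text(text: str) -> str:
    """Replace each matched pattern with a bracketed label such as [EMAIL]."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text


sample = "Mail jane.doe@example.com or call +44 20 7946 0958 before 12/05/2024."
print(redact_text(sample))
```

The same function could be applied to text extracted from either file type, leaving the PDF and DOCX handlers to deal only with reading and writing.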
WordPress Integration:
PHP hook to intercept document uploads
Send file to Flask API and replace with redacted version
Start small (a couple hundred documents per month)
Ensure the solution can scale to high volume (millions of documents per month in the future)
Tech Requirements:
Python 3.10+
Flask (or FastAPI)
spaCy or similar NLP library
python-docx for Word
PyMuPDF (fitz) for PDFs
WordPress PHP basics (to hook into upload filter)
Experience deploying Python apps on Linux servers
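As a rough idea of how the PDF side might use PyMuPDF from the list above, the sketch below blacks out email addresses in a PDF held in memory. The function name redact_pdf_emails and the email-only pattern are assumptions for illustration; a real service would plug in its full detection logic, and this is not a finished implementation.

```python
import re

# Assumed email pattern for this sketch only.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def redact_pdf_emails(pdf_bytes: bytes) -> bytes:
    """Return a copy of the PDF with every detected email address blacked out."""
    # PyMuPDF is imported inside the function so this module can still be
    # loaded on machines where the library is not installed.
    import fitz

    doc = fitz.open(stream=pdf_bytes, filetype="pdf")
    for page in doc:
        # Find candidate strings in the page text, then locate them on the page.
        for match in set(EMAIL_RE.findall(page.get_text())):
            for rect in page.search_for(match):
                page.add_redact_annot(rect, fill=(0, 0, 0))
        # Applying redactions removes the underlying text, not just its appearance.
        page.apply_redactions()
    result = doc.tobytes()
    doc.close()
    return result
```

Because the redaction is applied rather than drawn on top, the original text should no longer be recoverable by copy and paste from the returned file.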
Environment:
WordPress (PHP)
Linux with Plesk (IONOS Dedicated Server)
All processing must stay local (no third-party APIs)
Ideal Developer:
Experienced in document processing (PDF/Word)
Comfortable working with both Python and PHP
Familiar with WordPress hooks
Can deliver a simple working MVP with future scaling in mind