Bulk File Loader v1.0.1: German Patent Office Downloads

Bulk File Loader v1.0.1: German Patent Office Downloads

Bulk File Loader reached version v1.0.1 with a new data source: the German Patent and Trademark Office (DPMA). The application now downloads bulk patent data from DPMA alongside existing sources, built on top of the dpma-connect-plus Go client library I released earlier.

What It Does

Bulk File Loader is a self-hosted application with a web UI for managing automated patent data downloads. You configure which data products you need, and it handles scheduling, downloading, and organizing the files. The web interface shows what's available, what's been downloaded, and what's in progress.

Adding DPMA means you can now pull German patent publications, utility models, designs, and trademark data from the same tool that handles your other patent office downloads.

DPMA Adapter

DPMA organizes bulk data by calendar week. The adapter understands this structure and maps it to the same interface used by other data sources. All DPMA product types are covered: disclosure documents, patent specifications, utility models, European patent specifications published in Germany, designs, and trademarks - in both XML and PDF where available.

Downloads use streaming to handle DPMA's multi-gigabyte weekly publication packages without loading everything into memory.

New in 1.0

Beyond the DPMA adapter, this release adds real-time progress tracking. Server-Sent Events stream download status to the browser - you see which files are downloading, their progress, and estimated completion. The previous version only showed completed downloads.

The download progress UI includes active download indicators, progress bars, and a pending files view for queued downloads.

Cross-Platform

Pre-built binaries are available for macOS (Intel and Apple Silicon), Linux (amd64 and arm64), and Windows. Docker images are published as well.

Repository: github.com/patent-dev/bulk-file-loader
License: MIT


For help setting up patent data pipelines or integrating bulk downloads into your infrastructure, reach out through patent.dev.