GLAMpipe is an extensible open source web-app for transforming datasets and uploading data and media to online repositories.
GROBID (requires separate install)
The basic idea of GLAMpipe is to read data and files from different sources, process the metadata and export the data and files to different destinations. The workflow is based on documents and nodes instead of rows and cells.
A building block of the data flow is called a node. Each node can do one thing. For example, a node can import data from a source. Another node can modify the data in different ways: split text, combine strings, format Wikimedia templates etc. Finally, a node can export data as files or as data in different formats and to different services like Wikimedia Commons or Wikidata.