Monitor harvesting
Track harvesting health at both system and job levels.
In this guide
Monitor background processes
When to check: If harvests aren't running as expected.
Three background processes must run continuously for harvesting to work:
ckan_gather_consumer: Manages the gathering of data sources to be harvestedckan_fetch_consumer: Responsible for fetching the data from the sources identified by the gather processrun: Responsible for triggering the harvester at the end of each specified time interval
To check process status:
-
Access the CKAN container:
docker compose exec -it ckan-dev bash -
Check all processes:
supervisorctl status -
Verify output:
ckan_fetch_consumer RUNNING
ckan_gather_consumer RUNNING
run RUNNING
All three processes should show RUNNING status. If any process is not running correctly, see Manage harvest sources.
Monitor harvest jobs
When to check: If you want to review harvest history and results.
- Go to Harvest Sources and select your source.
- Select the Jobs tab to view:
- Harvest history with timestamps
- Job status
- Number of datasets added, updated, deleted, and not modified
- Error messages and logs
- Select a specific job to view detailed logs and troubleshooting information.