Is anyone using recipeops, "job history" as a builtin way of queueing and resubmitting jobs that failed.
Seems like this would work really well for scenarios where you want to resubmit failed jobs, or jobs that couldnt complete due to end point being offline.