Project

General

Profile

Feature #14303

Implement a procedure to avoid that jobs exceed resource requirements

Added by Vito Di Benedetto almost 4 years ago. Updated over 3 years ago.

Status:
Closed
Priority:
Normal
Category:
-
Target version:
Start date:
Due date:
% Done:

100%

Estimated time:
Spent time:
Experiment:
Stakeholders:
Duration:

Description

This issue is to have a procedure that monitors the resource usage of the experiment script.
The resource to monitor are: execution time, memory usage, disk usage.
If the job reach 99% of the required resources, the process need to be killed to avoid the get the jobs held.
The experiment script will return different exit codes depending on the resource that triggered the process to be killed.

History

#1 Updated by Vito Di Benedetto almost 4 years ago

  • Tracker changed from Task to Feature

#2 Updated by Vito Di Benedetto almost 4 years ago

  • Assignee set to Michele Fattoruso

#3 Updated by Vito Di Benedetto over 3 years ago

  • Assignee changed from Michele Fattoruso to Vito Di Benedetto
  • % Done changed from 0 to 40

The check about the expected_lifetime is complete

#4 Updated by Vito Di Benedetto over 3 years ago

  • Status changed from New to Work in progress

#5 Updated by Vito Di Benedetto over 3 years ago

  • Status changed from Work in progress to Resolved
  • % Done changed from 40 to 100

The DAG structure used doesn't require all jobs survive.

#6 Updated by Vito Di Benedetto over 3 years ago

  • Status changed from Resolved to Closed

#7 Updated by Vito Di Benedetto over 3 years ago

  • Start date deleted (10/27/2016)

#8 Updated by Vito Di Benedetto over 3 years ago

  • Parent task deleted (#13748)


Also available in: Atom PDF