Monitor
Monitors are a way to collect data from the simulation. They are automatically collected for resources, queues and states. On top of that the user can define its own monitors.
Monitors can be used to get statistics and as a feed for graphical tools, like matplotlib.
There are two types of monitors:
level monitors Level monitors are useful to collect data about a variable that keeps its value over a certain length of time, such as the length of a queue or the colour of a traffic light.
non level monitors Non level monitors are useful to collect data about a values that occur just once. Examples, are the length of stay in a queue and the number of processing steps of a part.
For both types, the time is always collected, along with the value.
Non level monitors can be weighted, if required.
Non level monitor
Non level monitors collects values which do not reflect a level, e.g. the processing time of a part.
We define the monitor with processingtime = sim.Monitor('processingtime')
and then
collect values by processingtime.tally(this_duration)
By default, the collected values are stored in a list. Alternatively, it is possible to store the values in an array of one of the following types:
type |
stored as |
lowerbound |
upperbound |
number of bytes |
---|---|---|---|---|
‘any’ |
list |
N/A |
N/A |
depends on data |
‘bool’ |
integer |
False |
True |
1 |
‘int8’ |
integer |
-128 |
127 |
1 |
‘uint8’ |
integer |
0 |
255 |
1 |
‘int16’ |
integer |
-32768 |
32767 |
2 |
‘uint16’ |
integer |
0 |
65535 |
2 |
‘int32’ |
integer |
2147483648 |
2147483647 |
4 |
‘uint32’ |
integer |
0 |
4294967295 |
4 |
‘int64’ |
integer |
-9223372036854775808 |
9223372036854775807 |
8 |
‘uint64’ |
integer |
0 |
18446744073709551615 |
8 |
‘float’ |
float |
-inf |
inf |
8 |
Monitoring with arrays takes up less space. Particularly when tallying a large number of values, this is strongly advised.
Note that if non numeric values are stored (only possible with the default setting (‘any’)), a tallied value is converted, if required, to a numeric value if possible, or 0 if not.
It is possible to use monitors with weighted data. In that case, just add a second parameter to tally, which defaults to 1. All statistics will take the weights into account.
There is set of statistical data available, which will be all weighed according to the tallied weights (1 by default):
number_of_entries
number_of_entries_zero
weight
weight_zero
mean
std
minimum
median
maximum
percentile
bin_number_of_entries (number of entries between two given values)
bin_weight (total weight of entries between two given values)
value_number_of_entries (number of entries equal to a given value or set of values)
value_weight (total weight of entries equal to a given value or set of values)
For all these statistics, it is possible to exclude zero entries,
e.g. m.mean(ex0=True)
returns the mean, excluding zero entries.
Besides, it is possible to get all collected values as an array with x(). In the case of ‘any’ monitors,
the values might have to be converted. By specifying force_numeric=False
the collected values will be returned as stored.
With the monitor method, the monitor can be enabled or disabled. Note that a tally is just ignored when the monitor is disabled.
Also, the current monitor status (enabled/disabled) can be retrieved.
proctime.monitor(False) # disable monitoring
proctime.monitor(True) # enable monitoring
if proctime.monitor():
print('proctime is enabled')
Calling m.reset() will clear all tallied values.
The statistics of a monitor can be printed with print_statistics()
.
E.g: waitingline.length_of_stay.print_statistics()
:
Statistics of Length of stay in waitingline at 50000
all excl.zero zero
-------------- ------------ ------------ ------------
entries 4995 4933 62
mean 84.345 85.405
std.deviation 48.309 47.672
minimum 0 0.006
median 94.843 95.411
90% percentile 142.751 142.975
95% percentile 157.467 157.611
maximum 202.153 202.153
And, a histogram can be printed with print_histogram()
. E.g.
waitingline.length_of_stay.print_histogram(30, 0, 10)
:
Histogram of Length of stay in waitingline
all excl.zero zero
-------------- ------------ ------------ ------------
entries 4995 4933 62
mean 84.345 85.405
std.deviation 48.309 47.672
minimum 0 0.006
median 94.843 95.411
90% percentile 142.751 142.975
95% percentile 157.467 157.611
maximum 202.153 202.153
<= entries % cum%
0 62 1.2 1.2 |
10 169 3.4 4.6 ** |
20 284 5.7 10.3 **** |
30 424 8.5 18.8 ****** |
40 372 7.4 26.2 ***** |
50 296 5.9 32.2 **** |
60 231 4.6 36.8 *** |
70 192 3.8 40.6 *** |
80 188 3.8 44.4 *** |
90 136 2.7 47.1 ** |
100 352 7.0 54.2 ***** |
110 491 9.8 64.0 ******* |
120 414 8.3 72.3 ****** |
130 467 9.3 81.6 ******* |
140 351 7.0 88.7 ***** |
150 224 4.5 93.2 *** |
160 127 2.5 95.7 ** |
170 67 1.3 97.0 * |
180 59 1.2 98.2 |
190 61 1.2 99.4 |
200 24 0.5 99.9 |
210 4 0.1 100 |
220 0 0 100 |
230 0 0 100 |
240 0 0 100 |
250 0 0 100 |
260 0 0 100 |
270 0 0 100 |
280 0 0 100 |
290 0 0 100 |
300 0 0 100 |
inf 0 0 100
If neither number_of_bins, nor lowerbound nor bin_width are specified, the histogram will be autoscaled.
Histograms can be printed with their values, instead of bins. This is particularly useful for non numeric tallied values, such as names:
import salabim as sim
sim.yieldless(False)
env = sim.Environment()
monitor_names= sim.Monitor(name='names')
for _ in range(10000):
name = sim.Pdf(('John', 30, 'Peter', 20, 'Mike', 20, 'Andrew', 20, 'Ruud', 5, 'Jan', 5)).sample()
monitor_names.tally(name)
monitor_names.print_histogram(values=True)
The ouput of this:
Histogram of names
entries 10000
value entries
Andrew 2031( 20.3%) ****************
Jan 495( 5.0%) ***
John 2961( 29.6%) ***********************
Mike 1989( 19.9%) ***************
Peter 2048( 20.5%) ****************
Ruud 476( 4.8%) ***
It is also possible to specify the values to be shown
import salabim as sim
sim.yieldless(False)
env = sim.Environment()
monitor_names= sim.Monitor(name='names')
for _ in range(10000):
name = sim.Pdf(('John', 30, 'Peter', 20, 'Mike', 20, 'Andrew', 20, 'Ruud', 5, 'Jan', 5)).sample()
monitor_names.tally(name)
monitor_names.print_histogram(values=('John', 'Andrew', 'Ruud', 'Fred'))
The output of this:
Histogram of names
entries 10000
value entries %
John 2961 29.6 ***********************
Andrew 2031 20.3 ****************
Ruud 476 4.8 ***
Fred 0 0
<rest> 4532 45.3 ************************************
It is also possible to sort the histogram on the weight (or number of entries) of the value
import salabim as sim
sim.yieldless(False)
env = sim.Environment()
monitor_names= sim.Monitor(name='names')
for _ in range(10000):
name = sim.Pdf(('John', 30, 'Peter', 20, 'Mike', 20, 'Andrew', 20, 'Ruud', 5, 'Jan', 5)).sample()
monitor_names.tally(name)
monitor_names.print_histogram(values=True, sort_on_weight=True)
The output of this:
Histogram of names
entries 10000
value entries %
John 2961 29.6 ***********************
Peter 2048 20.5 ****************
Andrew 2031 20.3 ****************
Mike 1989 19.9 ***************
Jan 495 5.0 ***
Ruud 476 4.8 ***
Level monitor
Level monitors tally levels along with the current (simulation) time. e.g. the number of parts a machine is working on.
A level monitor is defined by specifying level=True
in the initialization of Monitor, e.g.
working_on_parts = sim.Monitor(name='working_on_parts', level=True, initial_tally=0)
By default, the collected x-values are stored in a list. Alternatively, it is possible to store the x-values in an array of one of the following types:
type |
stored as |
lowerbound |
upperbound |
number of bytes |
do not tally (=off) |
---|---|---|---|---|---|
‘any’ |
list |
N/A |
N/A |
depends on data |
N/A` |
‘bool’ |
integer |
False |
True |
1 |
255 |
‘int8’ |
integer |
-127 |
127 |
1 |
-128 |
‘uint8’ |
integer |
0 |
254 |
1 |
255 |
‘int16’ |
integer |
-32767 |
32767 |
2 |
-32768 |
‘uint16’ |
integer |
0 |
65534 |
2 |
65535 |
‘int32’ |
integer |
2147483647 |
2147483647 |
4 |
2147483648 |
‘uint32’ |
integer |
0 |
4294967294 |
4 |
4294967295 |
‘int64’ |
integer |
-9223372036854775807 |
9223372036854775807 |
8 |
-9223372036854775808 |
‘uint64’ |
integer |
0 |
18446744073709551614 |
8 |
18446744073709551615 |
‘float’ |
float |
-inf |
inf |
8 |
-inf |
Monitoring with arrays takes up less space. Particularly when tallying a large number of values, this is strongly advised.
Note that if non numeric x-values are stored (only possible with the default setting (‘any’)), the tallied values are converted, if required, to a numeric value if possible, or 0 if not.
During the simulation run, it is possible to retrieve the last tallied value (which represents the ‘current’ value)
by calling Monitor.get().
It’s also possible to directly call the level monitor to get the current value, e.g.
mylevel = sim.Monitor('level', level=True, initial_tally=0)
...
mylevel.tally(10)
yield self.hold(1)
print(mylevel()) # will print 10
For the same reason, the standard length monitor of a queue can be used to get the current length of a queue: q.length()
although
the more Pythonic len(q)
is prefered.
When Monitor.get() is called with a time parameter or a direct call with a time parameter, the value at that time will be returned
print (mylevel.get(4)) # will print the value at time 4
print (mylevel(4)) # will print the value at time 4
There is set of statistical data available, which are all weighted with their duration:
duration
duration_zero (time that the value was zero)
mean
std
minimum
median
maximum
percentile
bin_duration (total duration of entries between two given values)
value_duration (total duration of entries equal to a given value or set of values)
For all these statistics, it is possible to exclude zero entries, e.g. m.mean(ex0=True)
returns the mean, excluding zero entries.
The individual x-values and their duration can be retrieved xduration(). By default, the x-values will be returned as an array, even if
the type is ‘any’. In case the type is ‘any’ (stored as a list), the tallied x-values will be converted to a numeric value or 0 if
that’s not possible. By specifying force_numeric=False
the collected x-values will be returned as stored.
The individual x-values and the associated timestamps can be retrieved with xt() or tx(). By default, the x-values will be returned as an array, even if
the type is ‘any’. In case the type is ‘any’ (stored as a list), the tallied x-values will be converted to a numeric value or 0 if
that’s not possible. By specifying force_numeric=False
the collected x-values will be returned as stored.
When monitoring is disabled, an off value (see table above) will be tallied. All statistics will ignore the periods from this
off to a non-off value. This also holds for the xduration() method, but NOT for xt() and tx(). Thus,
the x-arrays of xduration() are not necessarily the same as the x-arrays in xt() and tx(). This is
the reason why there’s no x() or t() method.
It is easy to get just the x-array with xduration()[0] or xt()[0].
It is important that a user never tallies an off value! Instead use Monitor.monitor(False)
With the monitor method, a level monitor can be enabled or disabled.
Also, the current monitor status (enabled/disabled) can be retrieved.
mylevel.monitor(False) # disable monitoring
mylevel.monitor(True) # enable monitoring
if mylevel.monitor():
print('level is enabled')
It is strongly advised to keep tallying even when monitoring is off, in order to be able to access the current value at any time. The values tallied when monitoring is off are not stored.
Calling m.reset() will clear all tallied values and timestamps.
The statistics of a level monitor can be printed with print_statistics()
.
E.g: waitingline.length.print_statistics()
:
Statistics of Length of waitingline at 50000
all excl.zero zero
-------------- ------------ ------------ ------------
duration 50000 48499.381 1500.619
mean 8.427 8.687
std.deviation 4.852 4.691
minimum 0 1
median 9 10
90% percentile 14 14
95% percentile 16 16
maximum 21 21
And, a histogram can be printed with print_histogram()
. E.g.
waitingline.length.print_histogram(30, 0, 1)
Histogram of Length of waitingline
all excl.zero zero
-------------- ------------ ------------ ------------
duration 50000 48499.381 1500.619
mean 8.427 8.687
std.deviation 4.852 4.691
minimum 0 1
median 9 10
90% percentile 14 14
95% percentile 16 16
maximum 21 21
<= duration % cum%
0 1500.619 3.0 3.0 **|
1 2111.284 4.2 7.2 *** |
2 3528.851 7.1 14.3 ***** |
3 4319.406 8.6 22.9 ****** |
4 3354.732 6.7 29.6 ***** |
5 2445.603 4.9 34.5 *** |
6 2090.759 4.2 38.7 *** |
7 2046.126 4.1 42.8 *** |
8 1486.956 3.0 45.8 ** |
9 2328.863 4.7 50.4 *** |
10 4337.502 8.7 59.1 ****** |
11 4546.145 9.1 68.2 ******* |
12 4484.405 9.0 77.2 ******* |
13 4134.094 8.3 85.4 ****** |
14 2813.860 5.6 91.1 **** |
15 1714.894 3.4 94.5 ** |
16 992.690 2.0 96.5 * |
17 541.546 1.1 97.6 |
18 625.048 1.3 98.8 * |
19 502.291 1.0 99.8 |
20 86.168 0.2 100.0 |
21 8.162 0.0 100 |
22 0 0 100 |
23 0 0 100 |
24 0 0 100 |
25 0 0 100 |
26 0 0 100 |
27 0 0 100 |
28 0 0 100 |
29 0 0 100 |
30 0 0 100 |
inf 0 0 100
If neither number_of_bins, nor lowerbound nor bin_width are specified, the histogram will be autoscaled.
Histograms can be printed with their values, instead of bins. This is particularly useful for non numeric tallied values, like names of production stages. For example
Histogram of Status
duration 300
value duration %
idle 70 23.3 ******************
package 42 14.0 ***********
prepare 48 16 ************
stage A 12 4 ***
stage B 50 16.7 *************
stage C 54 18 **************
stage D 24 8 ******
Merging of monitors
Monitors can be merged, to create a new monitor, nearly always to collect aggregated data.
The method Monitor.merge() is used for that, like
mc = m0.merge(m1, m2)
Then we can just get the mean of the monitors m0, m1 and m2 combined by
mc.mean()
,but also directly with :
m0.merge(m1, m2).mean()
Alternatively, monitors can be merged with the + operator, like
mc = m0 + m1 + m2
And then get the mean of the aggregated monitors with
mc.mean()
, but also with
(m0 + m1 + m2).mean()
It is also possible to use the sum function to merge a number of monitors. So
print(sum((m0, m1, m2)).mean())
Finally, if ms = (m0, m1, m2), it is also possible to use
print(sum(ms).mean())
A practical example of this is the case where the list waitinglines contains a number of queues.
Then to get the aggregated statistics of the length of all these queues, use
sum(waitingline.length for waitingline in waitinglines).print_statistics()
For non level monitors, all of the tallied x-values are copied from the to be merged monitors. For level monitors, the x-values are summed, for all the periods where all the monitors were on. Periods where one or more monitors were disabled, are excluded. Note that the merge only takes place at creation of the (timestamped) monitor and not dynamically later.
Sample usage:
Suppose we have three types of products (a, b, c) and that each have a queue for processing, so a.processing, b.processing, c.processing. If we want to print the histogram of the combined (=summed) length of these queues
a.processing.length.merge(b.processing.length, c.processing.length, name='combined processing length')).print_histogram()
and to get the minimum of the length_of_stay for all queues
(a.processing.length_of_stay + b.processing.length_of_stay + c.processing.length_of_stay).minimum()
Note that it is possible to rename a merged monitor (particularly those created with + or sum) with the rename() method:
sum(waitingline.length for waitingline in waitinglines).rename('aggregated length of waitinglines').print_statistics()
Merged monitors are disabled and cannot be enabled again.
Slicing of monitors
It is possible to slice a monitor with Monitor.slice(), which has two applications:
to get statistics on a monitor with respect to a given time period, most likely a subrun
to get statistics on a monitor with respect to a recurring time period, like hour 0-1, hour 0-2, etc.
Examples
for i in range(10):
start = i * 1000
stop = (i+1) * 1000
print(f'mean length of q in [{start},{stop})={q.length.slice(start,stop).mean()}'
print(f'mean length of stay in [{start},{stop})={q.length_of_stay.slice(start,stop).mean()}'
for i in range(24):
print(f'mean length of q in hour {i}={q.length.slice(i, i+1, 24).mean()}'
print(f'mean length of stay of q in hour {i}={q.length_of_stay.slice(i, i+1, 24).mean()}'
Instead of slice(), a monitor can be sliced as well with the standard slice operator [], like
q.length[1000:2000].print_histogram()
q.length[2:3:24].print_histogram()
print(q.length[1000].mean())
Note that it is possible to rename a sliced monitor (particularly those created []) with the rename() method:
waitingline.length[1000:2000].rename('length of waitingline between t=1000 and t-2000').print_statistics()
Sliced monitors are disabled and cannot be enabled again.
Using monitored values in pandas
It is possible to save the t- and x- values of a monitor into a pandas dataframe, like
non_level_monitor.as_dataframe()
This results, for instance, in this dataframe:
t non level monitor.x
0 1.0 11.0
1 2.0 12.0
2 3.0 13.0
3 4.5 14.5
4 5.0 15.0
5 10.0 20.0
The same can be done for a level monitor. But it might be difficult to use this dataframe, as in most analyses, a uniform (equidistant) t-scale is required and that is not how salabim collects timeseries data.
That’s where Monitor.as_resampled_dataframe()
is useful. Here, you specify the frequency (delta_t)
of the resample. The smaller delta_t, the more accurate the dataframe will follow the collected
data.
As an example, suppose we have a monitor like this:
When we now resample with delta_t=1, we will see:
, whereas delta_t=0.1, is much more accurate (at the expense of a bigger dataframe):
The Monitor.as_resampled_dataframe
method can be used to make a dataframe containing the x-values of several monitors.
If datetime0 (see also the Miscellaneous chapter) is specified, both Monitor.as_dataframe
and Monitor.as_resampled_dataframe
can emit the time as datetime.datatime values.
See the reference section for details.
Using monitored values in other packages, like matplotlib
For high quality, reproduction ready, graphs, it can be useful to use additional packages, most notably matplotlib.
The sampled values from a non level monitor can be retrieved with Monitor.x(). If the moment of the sample is required as well, either Monitor.xt() or Monitor.tx() can be used.
For level monitors, there is choice of :
Monitor.xt()
Monitor.tx()
Monitor.xduration()
To get a proper display of a level monitor, we advise something like
import matplotlib.pyplot as plt
...
plt.plot(*waitingline.length.tx(), drawstyle="steps-post")
plt.show()
, which might produces something like:
Pickling a monitor
Monitor.freeze() returns a ‘frozen’ monitor that can be used to store the results not depending on the current environment.
This is particularly useful for pickling a monitor.
E.g. use
with open("mon.pickle", "wb") as f:
pickle.dump(f, mon.freeze())
to save the monitor mon, and
with open("mon.pickle", "rb") as f:
mon_retrieved = pickle.load(f)
to retrieve the monitor, later.
Both level and non level monitors are supported. Frozen monitors get the name of the original monitor padded with ‘.frozen’ unless specified differently.
Stats_only monitors
Monitors can optionally collect statistics only.
This minimizes memory usage as individual tallies will not be stored in that case. But beware that some important functionality is not available in stats_only monitors (see below).
You can define the stats_only status at creation time of the monitor, like
m = sim.Monitor('m', stats_only=True)
But, it is also possible to reset a monitor with a different stats_only value. This can be useful if you want a system monitor, like Component.mode or Component.status to not keep individual tallied values
class Car(sim.Component):
def setup(self):
self.mode.reset(stats_only=True)
When stats_only is active, values are always forced to numeric, with a fallback value of 0.
Monitor with stats_only=True support
__call__
/get
(without arguments), base_name
, deregister
, duration
, duration_zero
, maximum
, mean
,
minimum
, monitor
, name
, number_of_entries
, number_of_entries_zero
, print_histogram
,
print_histograms
, print_statistics
, register
, rename
, reset
, reset_monitors
, sequence_number
,
setup
, stats_only
, std
, t
, tally
, weight
, weight_zero
Monitors with stats_only=True do NOT support (these will raise a NotImplementedError)
__call__
/get
(with an argument), arithmeric operations, animate
, bin_duration
,
bin_number_of_entries
, bin_weight
, freeze
, histogram_autoscale
, median
, merge
, multiply
,
percentile
, slice
, slicing
, to_days
, to_hours
, to_microseconds
, to_milliseconds
, to_minutes
,
to_seconds
, to_time_unit
, to_weeks
, to_years
, tx
, value_duration
, value_number_of_entries
,
value_weight
, values
, x
, xduration
, xt
, xweight
In line with the above, Queue.reset_monitors
, Resource.reset_monitors
, State.reset_monitors
have a stats_only parameter, with which all monitors can set/reset the stats_only status.
So, if you want all eight monitors belonging to a resource, but the requesters’ to be stats_only, you can write
r.reset_monitors(stats_only=True)
r.requesters.monitors(stats_only=False)
The current stats_only mode can be queried with Monitor.stats_only()
.