We now present one possibility to reduce the size of an IP-table. The idea is to collapse a certain number of IP-table entries, say a , into one. We call this a condensation of an IP-table by a and refer to it as I'(R,a) if the original IP-table is I(R). The parameter a is called the condensation factor . Condensation means that N' new timepoints are created with
(27) |
(28) |
The collapse of also implies that - logically - all intervals that started at one of these points are now considered to start at t'j. We use a new function which describes this fact:
(29) |
Theorem 29365
Let be a partition for R with pk-1 < pk for , and let and pm=t'N'. Then the following holds
(30) |
Now we can show the following
Corollary 29381
Let be a partition for R with pk-1 < pk for , and let and pm=t'N'. Then the following holds
(31) |
In the case that all definitions remain the same as shown above for . For j=N' we define
for the same reasons as above. However, it pays attention to the fact that t'N' does not comprise the same number a of the original timepoints but only . The proof of theorem 5 is not changed by this situation. Therefore theorem 5 and corollary 1 hold for this case too.
The condensation process divides the size of the IP-table by a. As a consequence, the information in the IP-table becomes coarser and less precise. This might decrease the quality of the resulting partitions but, for example, can make the process of deriving the partition more efficient. Just consider the optimal partitioning algorithm IP-opt whose runtime is a function of N. In chapter 10, we will perform experiments with different values of a and look at the impact it has on the quality of the partitions that are derived.
Figure 7.7 shows the condensed IP-table
I'(R,2) for the example of figure 7.2. The notion
of condensation can be applied to create a function
with providing the number of
intervals that ended within the condensed timepoint's range, i.e. within (t'j-1,t'j]. Similarly, we can define a function
with providing the number of
intervals that intersect with the range of t'j, i.e. (t'j-1,t'j]. Then the formulas (5.1),
(5.2) and (5.3) can be used by replacing
,
and
through , and respectively.
This is straightforward for the same reasons that applied in the case
of
,
and
(see section 5.2), just that
condensed timepoints are used. Please note that condensation assumes
that