Systemic
Learning in the Context of Observation.
By
Michael
P. Byron, Ph.D.
Abstract.
The hypothesis that the real world political
system constitutes a complex adaptive learning system is evaluated. Relevant parameters of this system are
abstracted into a difference equation driven BASIC language computer model of
the world system. These difference
equations impose global, “top-down” transformation rules upon the system. This transformative process is highly
modified by “bottom-up” interactions among the model system’s constituent
actor-elements. This interplay between
order and chaos allows for the emergence, and subsequent evolution of a complex
adaptive learning system.
This
model is utilized to generate empirical data for a Singular Predictive Instance
(SPI) of systemic learning: warfare frequencies. These data are compared with corresponding
real world empirical data for this SPI.
A finding that the two sets of data are closely correlated allows for
extrapolation of model systemic findings to the real world system.
The real
world system’s trajectory through learning space is found to resemble that of
typical model systems—with the exception that its attractor-bound trajectory is
as yet incomplete. The potential effects
of conscious observation of the world system upon its evolutionary learning
trajectory are evaluated. A conclusion
is reached that the effects of this observation process may, potentially, be
significant, with respect to the world system’s subsequent trajectory through
learning space.
Keywords: complexity, complex adaptive systems,
systemic learning, warfare frequencies, empirical corroboration, computer
modeling, attractors.
Section I:
Conceptual Foundations of the Learning Model.
This study presents and then evaluates the
hypothesis that the modern international political system is a complex adaptive
learning system. Subsequently it
investigates possible effects of conscious observation upon the world system’s trajectory
through learning space.
To evaluate the hypothesis that the
world system is a complex adaptive learning system I utilize a singular
predictive instance (“SPI”) of systemic behavior: warfare frequencies. Warfare frequencies are selected because
empirical data for the 1495-1975 time period (Levy, 1983) are available. These data allow for falsifiable empirical
testing of my computer model (Byron, 1996) for crisis-driven systemic
learning.
Systemic learning, with respect to
warfare frequencies, is hypothesized to have occurred in response to increasingly
rapid increases in total systemic power.
These power increases cause the adverse consequences of warfare, in
general, and great-power warfare, specifically, to evince a corresponding
increase. This creates ‘selection
pressure’ upon the system to ‘learn’ to
avoid this ever-increasing danger to its existence. Conscious intervention arising from research
into the nature of this learning process, if possible, is thus desirable, in
the context of systemic survival via learning optimization.
My investigative strategy was to utilize
MS QUICKBASIC to create a model international system program. The results obtained from running this
program are employed to generate data for this SPI. These data are then evaluated in the context
of Levy’s data. This allows for falsifiable testing of the study’s complex
adaptive learning system hypothesis while validating extrapolation from model
system data to real world data. Building
upon my findings, I then evaluated the possible effects of conscious
observation upon the subsequent evolution of the world system.
The model system consists of ten mutually
interacting actor-elements which represent the nation-state constituents of the
world political system. This number of
actors is sufficient to permit the hypothesized systemic learning processes to
be detected, if existent, while still remaining a small enough value to allow
for maximal analytical clarity.
In conceptualizing the structure of
the world system I envisioned that its several
levels of analysis (i.e. individual/decision-maker, nation-state,
systemic) were, although analytically distinguishable, integrated together into
a coherent whole. Pursuant to this, I
incorporate two fundamental individual level cognitive precepts into the
computer model: 1) Awareness is
limited. It is a zero-sum variable (Simon, 1985, 302, Simon & Kaplan, 1991,
9). 2) Cognitive change occurs
discretely. This assumption follows logically from the realization that awareness
is limited. Given this attentional constraint,
information is stored associatively, probably in a schematic arrangement
(Schater, 1991, 692). This allows for
rapid associative retrieval of stored information. The human brain is limited to
performing no more than about 100 associational steps per second (Rumelhart, 1991, 135). Aggregate schemas for any given subject
comprise ‘memes.’ This form of
information storage allows for rapid information retrieval, and utilization,
within the limited attention, ‘100 step’ constraints imposed by the brain’s
architecture. Decisional outputs
of state level actors are assumed to conform to these individual level
cognitive constraints due to the concentration of inter-state decisional power
among relatively few leaders.
Possession of a common meta-meme
among individual level actors within a nation-state is represented in the
context of the model as comprising that actor’s culture. Culture thus acts as a constraint upon state
behavior towards other states. In particular
it determines a state’s propensity for reciprocal behavior. Reciprocity is the model’s indicator for
democracy. Given the theoretical (Doyle,
1983, 1986) as well as empirical (Levy, 1983) observation that democracies do
not wage war upon one another, level of democratization, is hypothesized to be
a key variable defining nation-state behavior.
My computer model of crisis-driven
learning is predicated upon the hypothesis that systemic level learning occurs
principally in response to crisis.
Crisis, in this context, is defined as any situation in which the
application of previously encoded knowledge fails to resolve some situation in a
manner in which harm, or the potential for harm, is averted. Because culture represents a common meta-meme
between citizens of a given nation-state, it follows that any change in its
configuration is likely to be resisted as it will entail significant changes
among a given cultural meme’s associative linkages. Thus, it follows that cultural change will
likely occur discontinuously, by discrete increments, which represent
replacement of entire memes within the greater cultural meta-meme. This change will most likely occur when
‘forced’, that is, in a crisis situation.
Models can be either ‘top-down’ or
‘bottom-up.’ That is, the model’s global properties can be pre-defined
algorithmically, or they can arise from unpredictable interactions between actors
executing instructions locally. I have
chosen a mixed approach in which certain global properties of the model are
algorithmically pre-specified, while local interactions among the system’s
actors determine its evolution. Thus,
one interesting feature of this model is its interplay between deterministic,
top-down, global rules, and non-predictable, bottom-up, actor interaction
effects. This effect is consciously
designed to mirror the interplay of deterministic and non-deterministic
phenomena which underlay actual reality.
The model consists of four global,
top-down, difference-equation-driven variables along with another variable
which sets an upper limit upon one of these variables. These variables are modified in value via
local, bottom-up, interactions among system actors, by operation of a WAR
subroutine, and a CRISIS procedure.
Figure 1, below presents this model schematically.
*EDITORS NOTE: PLACE
FIGURE 1 ABOUT HERE. PLACE CAPTION
BENEATH IT.*
*EDITORS NOTE:
CAPTION FOR FIGURE 1 APPEARS BELOW*
Figure 1
Information
Flows Between Model Variables and Procedures. Note:
The ECON module is utilized only in the later version of the model
discussed below, and then only for the “economically enabled model system”
runs.
Relation, R: This variable has a range of between +1.0 and
-1.0, for each of nation i’s relations with other actors comprising the model
international system. Equation 1 below
defines this variable in the context of the model for any two system actors i
and j.
Rij(t + 1) = Rij(t)
+ b;
where b = ((.25 * (Rci *
Rcj)/2)) * (Rij(t) - (Ri+j))
(N-1))
The “b” term is subtractive as the model
assumes that state actors possess limited quantities of attentional
capacity. It incorporates the assumption
that change occurs gradually under non-crisis circumstances. By including reciprocity, it assumes that
more reciprocal states possess greater adaptability.
Reciprocity.
Reciprocity, Rc: This variable represents level of
democratization, as articulated above.
It is defined by equation 2 below:
Rc(t + 1)i =
2 * ( å
+Ri(t)
)
å½Ri(t)
½)
Equation 2b) Equation for Reciprocity, “Rc” for “i” and
“j”.
Between any two states i and j:
Rcij = Rci * Rcj.
This equation assumes that positive values
of relation contribute to increasing power, for a given nation “i”, with
respect to negative relational values.
Power.
Power, P: This variable represents total power which a
given state, “i”, possesses at any time
t. Equation 3, below defines this
variable:
Pi’(t + 1) = Pi(t)
+ a,
where a = ½{(å+Ri(t))}½ + ½[.01
* {(å
+R(t))}]½
½{(å Ri(t))}
½ ½ {(å
R(t))} ½
P (t + 1)
The power equation
assumes that propensity towards reciprocal behavior is conducive to increasing
power.
Culture is represented by a variable,
“K”. K has a potential range of 0.00 to
1.00. Initial values of K are
arbitrarily assigned. (Five nations are placed into a “high bound” culture with
values of .75, while the remaining five are assigned to a “low bound” culture
with K values of .25) Subsequently, these values may be modified by application
of the CRISIS procedure, as detailed below.
K simply sets an upper limit for the potential range of state i’s Rc
values. Rc values are free to vary
within the upper bound set by this limit.
K varies by discrete increments of .05 because cognitive, or behavioral,
change is assumed to occur discretely.
Coupling.
Finally, the model incorporates a
coupling parameter, CF. CF provides a
measure of how closely coupled any two nations i and j are at any time t. In effect, it determines the extent to which
any nation i may interact with, and thus algorithmically influence, the
behavior of a given other nation, j.
Evaluated for the system as a whole (by computing its mean value) it
provides a measure of the degree of “openness” of the system. Equation 4 assumes that perfect coupling can
be closely approached, but never actually reached.
CF is calculated in one of three
ways. These are given below in equation
4:
If Rci
< MRc: CFi = MRc - ((Rci
- MRc)/2)
If Rci
> MRc: CFi = MRc + ((Rci
- MRc)/2)
If Rci =
MRc: CFi = MRc Where MRc = Rctotal /( N-1)
The model’s WAR subroutine is triggered
when the following criteria are met:
IF Rcij(t) * ((CFi + CFj) /
2) £ .25 AND Rij(t) £
0 THEN GOSUB WAR.
Each of the ten model system nations compares
itself with each of the other nine, sequentially to determine whether the WAR
criteria are met. Thus wars occur
dyadically in the model.
When
the model’s WAR subroutine is triggered, the power renormalization process is
reversed, or ‘denormalized’, for each of the participating nations. This means that if two nations i and j
trigger the WAR subroutine, their total, raw, power is calculated. This allows for the effects of increasing
power upon warfare to be modeled.
ECON Subroutine.
Like
the WAR subroutine, the ECON subroutine checks sequentially for the presence of
dyads which meet the ECON selection criteria.
The models ECON subroutine is triggered when the following conditions
are met:
IF Rcij(t) * ((CFi
+ CFj) / 2) £ .35 THEN GOSUB ECON.
Economic interactions are enabled only beginning
at moderately high values of Rc because the model assumes that systemically
meaningful levels of trade require the absence of outright systemic anarchy to
take hold and flourish. As the system
becomes more reciprocal, conditions for trade improve. The model does allow for asymmetric trading
gains when power is highly unequal and reciprocity is relatively low between
members of a given dyad. However under
high reciprocity conditions, it is possible for “smaller” (less powerful)
nations to actually benefit disproportionately from a given trade
interaction. Unlike the WAR subroutine,
nations which attempt and fail to establish a trade agreement, BOTH undergo the
CRISIS procedure. The reasoning behind
this is that both are aware of having forfeited potential gains. This allows crisis‑driven learning to
continue even at levels of reciprocity to great to permit war. However, such occurrences (trade failures)
are substantially less frequent than war, ceteris paribus.
Crisis Procedure.
Nations which lose at WAR, or fail
at trade, (ECON module) experience two outcomes, the first, a reduction in
their power, was mentioned above. The
second is that they undergo the CRISIS procedure. Here, their K values are randomly raised, or
lowered, by an increment of .05. The
probabilities for each outcome are equal.
This discrete value is selected in conformity with the models assumption
that cultural change occurs adaptively, by discrete intervals, corresponding to
memetic replacement, in response to crisis.
In essence, via operation of the
model’s WAR subroutine, and its CRISIS procedure, each nation compares itself
dyadically, with all other nations. This
comparison determines whether the two nations will engage in war, and if so,
which one will initiate it. The effects
of this process include substantial adjustments in power and, for losers at
war, culture, and hence reciprocity. It
is this local, interactive process, which allow for the system’s adaptive
learning behavior to emerge locally within the context of its difference
equation specified global structure.
Analysis.
The model system program was run 200
times to produce a sufficient N of cases for analysis of key analytical
parameters. Figure 2, below displays
this information.
*EDITORS NOTE: PLACE
FIGURE 2 ABOUT HERE. PLACE CAPTION
BENEATH IT.*
*EDITORS NOTE:
CAPTION FOR FIGURE 2 APPEARS BELOW*
Figure 2
Number of Program Iterations Required
for Attainment of Systemic No‑War Equilibrium.
Figure 2 represents the distribution of
learning times required for a model system to find a no-war equilibrium
solution. At this point two additional
systemic variables need to be introduced:
The first of these is termed “C”.
C varies uniquely for each system.
It has a maximum value of approximately 1.0, and a minimum value of
0.0. It provides an indicator of the
model system’s trajectory within the learning space matrix. The second is designated as “PI.” PI
represents the value of the ratio of instant systemic power to initial systemic
power. It provides an indicator of the
model system’s depth within the learning space matrix.
Given an initial mean Rc value of .5,
and a mean systemic no-war threshold Rc value of .775, as determined by
observation during the 200 model runs, each nation must learn through, on
average, 6 .05 K increments to reach the no‑war threshold. As the probability of a positive increment
learning result is equal to the corresponding probability of a negative
increment learning outcome, this yields a mean of 12 .05 Rc increments. As there are 10 nations involved, each of
which may influence the Rc values of the others, the total mean volume of
learning space must comprise 1210 discreet K points, assuming normal
distribution. Each of these points is
located .05 K from all other points.
These points are configured as a 10‑dimensional array. Each time the program is initiated a unique
model system is created de novo and positioned in an arbitrary location in this
space, consistent with a moderately low initial level of learning.
Direction in the learning space
matrix is provided by recalling that each of the 10 dimensions is, in
actuality, a range of discrete values.
Motion along this range towards a no-war equilibrium solution (that is
towards an attractor which represents a discrete solution to the problem of
attaining this condition) corresponds to motion along the C‑axis towards
its end-point, 0. Thus, lower C values,
and correspondingly, higher amounts of learning, (a closer approach to a no-war
equilibrium problem attractor solution) equate to motion “deeper” into the
learning array.
This visualization also allows for
the geometry of the learning array to be comprehended: As the end-point, or vertex, for C is common
to all 10 nations, and C contours the learning space of these nations, then C’s
common vertex is the common vertex of learning space. All 10 dimensions therefore posses a common
vertex. At this unique point, and only
at this point, they coincide. As I
visualize it, the learning space manifold, when mapped to conventional 3‑D
space, looks something like an inverted 10-sided step polygon, which steps
‘downwards’ towards this singularity in increments of .05 K. These 10 sides, or dimensions, correspond to
the 10 nations which comprise the model system.
Topographically, the model’s learning space, which lies within the
interior volume of this 10‑sided inverted step polygon, is contorted away
from exactly resembling a rectilinear grid by the presence of attractors, which
represent unique solutions to a system’s no-war equilibrium search. PI may be visualized as providing a measure
of the ‘steepness’ of a system’s trajectory through this learning space,
measured with respect to its vertex.
Thus, the real world system appears
to be learning how to deal with the crisis, caused by warfare, in the context
of rapidly increasing systemic power, which threatens its continued existence,
by increasingly seeking out non-warfare solutions to inter-state
conflicts. In effect, it appears to
evince a trajectory through its learning space manifold towards a no-war
equilibrium attractor, driven by selection pressure arising from this
endogenously generated crisis.
Overall, Levy’s war frequencies data
were found to correlate very closely with equivalent model system generated
data. Figures 3 and 4, below, illustrate
this correspondence. Figure 3 depicts
Levy’s data for all great-power wars, measured at 20 year increments. Figure 4 represents data generated by one of
the model system runs, (termed “model system 1”) for all wars, per program
iteration.
*EDITORS NOTE: PLACE
FIGURE 3 ABOUT HERE. PLACE CAPTION
BENEATH IT.*
*EDITORS NOTE:
CAPTION FOR FIGURE 3 APPEARS BELOW*
Figure 3.
Real World: Aggregate Warfare at 20 Year
Resolution.
Model System 1: All Wars.
Figure 5.
Model System 1: C.
Model system 1 evinced fairly typical
systemic parameters. It reached its
no-war equilibrium after 116 cycles.
This is less than 1 standard deviation from the mean value calculated
above in figure 2. Model system 1 has a
PI of 1,260. A glance at figure 5 which
graphs C, depicts model system 1’s trajectory through learning space. At cycle 13, the system ‘identifies’ a deep attractor,
representing a stable solution to its war-end boundary conditions. Its subsequent trajectory is towards this
attractor. Other, less deep attractors
are also encountered en route to this deep attractor‑solution at cycle 31, cycle 43, cycle 62, and cycle
65.
The topology of model system 1’s
learning space matrix is better evinced by figure 6. Figure 7 clearly portrays its
trajectory. One caveat is that as the
learning space matrix decreases in area towards 0 in increments of .05, while
the diagrams portray it as possessing equal area for all values of C, deeper
attractors are depicted multiply. Thus
in figure 7 what appears to be two deep attractors are in actuality, a single
point attractor.

Figure 6
Topology of Model System 1.
Figure 7
Trajectory of Model System 1 in
Learning Space.
Extrapolation from figures 5, 6, and 7,
above, to the real world system, implies that we are presently “locked on” to
and approaching, although along a non‑deterministic trajectory, a
corresponding point attractor somewhere in our own endogenously generated
learning space. The deep issue is
whether, or not, the length of our future systemic trajectory is sufficiently
short as to allow us to arrive at our no-war equilibrium attractor‑solution
prior to the crash of our world system.
More specifically, the issue is what effect(s), if any, our apprehension
of our position, and movement through the topography of learning space, can
have on this process.
It is important to note that since the
world system’s topology is endogenously generated, it follows that any
substantive restructuring of that system will likely cause a concomitant shift
in its learning space topology. Thus,
if for some reason, perhaps a life-or-death systemic crisis, a global process
of memetic replacement, leading to a fundamental replacement of one global
meta-meme by another were to occur, then, the world system would, necessarily,
find itself following some other trajectory, through a learning space imbued
with some other metric. Presumably it
would be seeking some other deep attractor‑solution.
It follows from this that conscious
intervention into the learning process of the system, arising from research
into this process, does possess the potential to substantially affect the
trajectory of the world system. If this
intervention is accepted by political decision‑makers, then this
potential can be actualized.
Given that the effects of fundamental
systemic change upon the system’s learning space matrix topology are
unknowable, with respect to its resultant, changed, configuration, in advance,
due to the stochastic nature of the systems bottom‑up interactions
discussed above, it follows that such intervention would only be reasonable if
it could be shown to be beneficial in all, or nearly all, possible cases. Is this possible?
Section
II: Extension of the Model
To answer this question, I next set out to
evaluate the possible effects of economic interactions, that is of trade, upon
the development of my model world systems.
This entailed the addition of a trade subroutine to the model’s
underlying BASIC program. As discussed
in greater detail above, this subroutine operates very similarly to that of the
WAR subroutine. Recall that the
principle differences are that it is triggered by a higher threshold value (Rcij
* ((CFi + CFj) / 2) £
.35 as opposed to the WAR subroutines threshold criteria of (Rcij *
((CFi + CFj) / 2) £
.25). Also, if failure to reach a trade
agreement occurs BOTH actors undergo the CRISIS procedure, as opposed to only
the loser doing so in the in WAR subroutine.
My reasoning is that if crisis is modeled
to be the primary driver of systemic learning, then any class of conditions,
the failure of which would likely be perceived by system actors as creating
harm to their well-being, ought to be a generator of crisis. As trade is unlikely to flourish in a
war-prone, anarchic systemic environment, I have a-priori set a higher
triggering threshold level for it than I have for inter-state war.
The effects of this change are
hypothesized to be an increase in the overall rate of learning for the model
system. My assumption is that such an
increase is desirable. As I envision it
such a process is equivalent, in the simplified context of my model, to Ernst
Hass’ (1990) concept of epistemic communities.
What are the consequences of my economic
assumptions upon systemic evolution, evaluated with respect to a “standard”
power-dominated model system (e.g. Gilpin, 1981) which does not incorporate
these economic effects? What is the
relationship between international trade and international warfare, in the
context of the model? Does this
difference, if existent, provide an answer to the question of the desirability
of consciously altering the trajectory of the world system?
To find out, I ran each type of model
system 80 times so as to establish a sufficiently large N for meaningful
determination of mean values of systemic variables. To determine the potential effects upon
war-instigated systemic collapse, I also incorporated a ‘collapse module’ into
the program. Basically, for each
iteration of system time, there is a very small possibility of collapse. This possibility increases as net systemic
power increases. As the collapse
probabilities are identical for each type of model system they provide a
standard metric against which the performance of the two types of systems can
be evaluated. Due to the continuing
presence of small, but persistent differences in these two resultant values, I
used a higher N of 240 to allow for statistically significant comparison
of these apparent differences, as we shall see below.
These two later variants of my basic
model, being developed following much testing and analysis of the initial model
incorporate several refinements over it.
Principally these are: 1) A
systemic collapse module is added. This
provides a small probability of collapse for any war prone model system. This probability increases as systemic power
increases. 2) The initial model’s power
difference equation utilized a fixed value (.01) in determining the quantity of
power by which the following program cycle of the model would be incremented. This represents an unrealistic
simplification. The later version
permits the model system to self‑regulate this value utilizing an
incremental value derived algorithmically from the mean systemic value of Rc
for that program iteration . Thus, the
below values cannot be directly compared to those obtained above. The above analysis represents proof of
concept, while the below findings build upon their foundation.
Table 1
Economic Interaction Enabled Model
System Runs.
Variable N Missing Mean Std.
Deviation*
Cycles 80 0 60.400 34.230
PI 80 0 103400.238 449154.836
C 80 0 0.244 0.130
MeanCF 80 0 0.871 0.353
MeanRc 80 0 0.826 0.330
Collapse 240 0 0.287 0.454
Table 2
Power-Dominated Model System Runs.
Variable N Missing Mean Std.
Deviation*
Cycles 80 0 93.412 49.863
PI 80 0 90449.663 433517.532
C 80 0 0.237 0.109
MeanCF 80 0 0.813 0.249
MeanRc 80 0 0.771