(PDF) Empirical characterisation of agent behaviours in socio-ecological systems

The integrated—environmental, economic and social—analysis of climate change calls for a paradigm shift as it is fundamentally a problem of complex, bottom-up and multi-agent human behaviour. There is a growing awareness that global environmental change dynamics and the related socio-economic implications involve a degree of complexity that requires an innovative modelling of combined social and ecological systems. Climate change policy can no longer be addressed separately from a broader context of adaptation and sustainability strategies. Past research on artificial intelligence and social simulation has developed a promising methodology. Literature on agent-based modelling (ABM) shows it’s potential to couple social and environmental models and incorporate the influence of micro-level decision making in the system dynamics and to study the emergence of collective responses to policies. However, there are few studies that concretely apply this methodology to the study of climate c...

The potential of Agent-Based Modeling is now well-established to address natural resource management and policy issues in social-ecological systems where a number of biophysical and anthropic processes interact. The selection of anthropic processes to be considered in a model according to the research question is not easy, due to modelers' difficulty to objectify and identify these processes. The paper proposes some criteria to identify anthropic processes, decide which ones to discard or retain and how to implement the latter. It is illustrated by the analysis of the governance sub-model of the MAELIA platform dedicated to the integrated assessment and simulation of water use and water management policy, in a context of land use / land cover and climate changes.

Interconnected social and environmental systems are the domain of ecological economics, and models can be used to explore feedbacks and adaptations inherent in these systems. Agent-based modeling (ABM) represents autonomous entities, each with dynamic behavior and heterogeneous characteristics. Agents interact with each other and their environment, resulting in emergent outcomes at the macroscale that can be used to quantitatively analyze complex systems. ABM is contributing to research questions in ecological economics in the areas of natural resource management and land-use change, urban systems modeling, market dynamics, changes in consumer attitudes, innovation, and diffusion of technology and management practices, commons dilemmas and self-governance, and psychological aspects to human decision making and behavior change. Frontiers for ABM research in ecological economics involve advancing the empirical calibration and validation of models through mixed methods, including surveys, interviews, participatory modeling, and, notably, experimental economics to test specific decision-making hypotheses. Linking ABM with other modeling techniques at the level of emergent properties will further advance efforts to understand dynamics of social-environmental systems.

Environmental Modelling & Software 26 (2011) 837e844 Contents lists available at ScienceDirect Environmental Modelling & Software journal homepage: www.elsevier.com/locate/envsoft Empirical characterisation of agent behaviours in socio-ecological systems Alex Smajgl a, *, Daniel G. Brown b, Diego Valbuena c, d, Marco G.A. Huigen e a CSIRO Ecosystems Sciences and Climate Adaptation, University Drive, Townsville 4810, Australia University of Michigan, School of Natural Resources and Environment, Ann Arbor, MI, USA c Wageningen University, Department of Environmental Sciences, P.O. Box 47, 6700 AA Wageningen, The Netherlands d International Livestock Research Institute, System-wide Livestock Programme, P.O. Box 5689, Addis Ababa, Ethiopia e Universität Hohenheim, Germany b a r t i c l e i n f o a b s t r a c t Article history: Received 29 July 2010 Received in revised form 14 February 2011 Accepted 17 February 2011 Agent-based modelling has become an important tool to investigate socio-ecological processes. Its use is partially driven by increasing demand from decision makers to provide support for understanding the potential implications of decisions in complex situations. While one of the advantages of agent-based modelling is the ability to simulate the implications of human decision-making processes explicitly, methods for providing empirical support for the representation of the behaviour of human agents have not been structured systematically. This paper develops a framework for the parameterisation of human behaviour in agent-based models and develops twelve distinct sequences for the characterisation and parameterisation of human behaviours. Examples are provided to illustrate the most important sequences. This framework is a ﬁrst step towards a guide for parameterisation of human behaviour in ABM. A structured discussion within the agent-based community is needed to achieve a more deﬁnitive guideline. Ó 2011 Elsevier Ltd. All rights reserved. Keyword: Agent-based modelling 1. Introduction Environmental policy and management increasingly demand the integration of cross-disciplinary knowledge of socio-ecological systems. Understanding human responses to environmental and/or policy changes is critical for understanding socio-ecological systems and the outcomes they produce. As agent-based modelling (ABM) allows for simulating actual decision-making processes of individuals or groups of individuals this modelling technique is gaining importance (Matthews et al., 2007; Parker et al., 2003). This paper is focused on ABM, in which (a) human agents are represented at the scale of households or individuals and (b) the main concern is the understanding of socio-ecological interactions. In empirical contexts, this class of ABM is mostly chosen for its ability to simulate explicit human decision-making processes (mostly described in discontinuous functions) in the context of changes in the bio-physical environment. However, the main strength of ABM can be a key weakness. As the implementation of human decision-making processes is the main strength of ABM, the agent attributes and behavioural response functions that represent these processes require knowledge support from qualitative and/or * Corresponding author. E-mail address: alex.smajgl@csiro.au (A. Smajgl). 1364-8152/$ e see front matter Ó 2011 Elsevier Ltd. All rights reserved. doi:10.1016/j.envsoft.2011.02.011 quantitative empirical sources. Unfortunately, there are no standard approaches to documenting and communicating the empirical support that underlies modelling and design decisions in agentbased models (Berger and Schreinemachers, 2006; Bohensky et al., 2007; Smajgl et al., 2007). Empirical characterisation and parameterisation of human decision-making processes can involve a range of methods that include: expert knowledge, surveys, interviews, and participant observation. We describe a set of speciﬁc methods here that, while it does not include all possible options, includes those that are most commonly used in practice and help deﬁne what we believe to be a robust set of sequences within which alternative speciﬁc methods can be applied. In many cases it is necessary to iteratively combine several empirical methods in a sequence to fully parameterise behavioural responses in ABM. The development of an ABM that aims to simulate behavioural responses of humans requires two fundamental steps in which empirical data are required: the development of behavioural categories and the scaling to the whole population of agents. The effectiveness of different methodological sequences depends on the modelling context (Janssen and Ostrom, 2006; Robinson et al., 2007). Most signiﬁcantly, the size of the human population in the system to be modelled helps to determine appropriateness of various methods, and can be anything from 20 individuals or households to many million households. Carrying out in-depth 838 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 interviews with 20 households is realistic. With larger populations and more complex socio-ecological interactions, it becomes necessary to work with a sample of the population. Sampling often requires characterisation of the existing heterogeneity of agent attributes and behavioural responses in order to provide a simpliﬁed description of behavioural realities that can guide sampling and the realistic parameterisation of model agents. The ﬁrst challenge for the parameterisation of ABMs that represent larger populations is the identiﬁcation of suitable empirical methods for eliciting behavioural data. Then, the sample-based data need to be translated into behavioural representations for the whole population (i.e. up-scaling). Up-scaling can be performed proportionally or disproportionally. Proportional up-scaling assumes that the sample is representative for the whole population. This can result in under-representation of behavioural minorities. Disproportional up-scaling is needed if these behavioural minorities are likely to determine system dynamics, such as free-riders in common pool situations. Disproportional up-scaling is based on the development of agent typologies. The inverse problem (i.e., down-scaling) occurs when data for a large collection of agents are acquired for aggregated sources (e.g., over census enumeration districts), and populations of agents are generated through some simulation technique that honours the aggregate data (Baker et al., 2010; Brown and Robinson, 2006; Wu et al., 2008). This paper aims to provide an initial structure that can be used within rigorous discussions of parameterisation methods for human behaviour in ABM. This is only a ﬁrst step towards a guide for parameterisation of human behaviour in ABM as a structured discussion within the agent-based community is needed to achieve a more deﬁnite guideline. Thus, we developed a framework for this process followed by a discussion of methods available for what we call the parameterisation sequence. Discussing sequences instead of individual methods allows for consideration of approaches that mix different kinds of methods, a systematic understanding of existing options and evaluation of suitability across a range of modelling contexts. Then, we give examples demonstrating the application of the framework. Most published examples of empirical agent-based models do not disclose methodological details for the parameterisation process that was used. Therefore, we believe that an initial outline of the dimensions of a useful framework along these lines is needed. The systematic application of the framework to a few different types of empirical agent-based models should guide readers in applying the framework and considering its applicability in a wider variety of cases. The paper concludes with a discussion of the need to further tests of the generality of the framework. 2. Deﬁnitions of empirical methods Several methods can be used to parameterise behavioural responses of humans empirically. This set of the most commonly used is not all inclusive, but we posit that it is complete enough to build a fairly comprehensive framework: Expert knowledge: This constitutes a variety of formal and informal methods for capturing the understanding of experts in a ﬁeld or region and representing that knowledge in ways that it can be used in models. Approaches range from conceptual mapping by experts themselves to informal conversations or focus groups from which expert knowledge can be elicited for subsequent model building. Expert knowledge can also be employed to quantify uncertainties associated with expert judgement (Cooke and Goossens, 2008). Participant observation: Becker (1958) deﬁnes participant observation as the process in which the scientist participates in and documents the daily life of communities. Social surveys: Survey instruments consist of a list of questions, each with pre-deﬁned sets of possible answers (Nichols, 1991). Responses are elicited via mail, email, in person, or via phone. Interviews: While a survey comprises of mostly closed questions, interviews are normally less structured (Jupp, 2006) and range from a list of open questions to unstructured dialogues. Census data: While surveys and interviews are conducted with a sample of a population census data is elicited for 100% of a population, normally within national boundaries (Rees et al., 2002). Aggregated census data summarise responses for groups of households within enumeration districts, while disaggregated census data show household-level responses. Field or lab experiments: Experiments are designed to observe how the change in an independent variable affects a dependent variable. Traditionally, this happened in laboratories, which allows for high degrees of control and hence clear causal links in observed outcomes. However, the high levels of control create artiﬁcial situations, which can lead to poor applicability of results to more realistic situations (Patzer, 1996). Field experiments aim for less artiﬁcialness by placing the experiment in the natural environment (Harrison and List, 2004; Smajgl et al., 2009e). While ﬁeld experiments allow for more realistic behaviour to be observed the levels of control decrease. Role-playing games: Barreteau (2003) deﬁnes Role-Playing Games as “group settings that determine the roles or behavioural patterns of players as well as an imaginary context.” Similar to experimental designs, people are given pre-deﬁned roles and tasks, which they have to perform in interaction with other role players in a pre-deﬁned setting. Cluster analysis: Clustering describes the grouping of subjects based on the similarity of attributes. Based on categorical data, algorithms calculate either (1) the Euclidean distance between the median and each value, or (2) between each pair of values (Rousseeuw, 1987). Depending on proximity, subjects are mapped into clusters. Dasymetric mapping: This method involves a combination of detailed spatial data with aggregated census data, usually about population, to create disaggregated representations of the spatial distribution of population characteristics (Mennis and Hultgren, 2006). Often, for example, land-cover maps based on satellite imagery are used to distribute populations within an area such that (a) population totals within census enumeration districts are preserved and (b) their spatial locations are estimated at a much ﬁner resolution, based on the locations of land-cover types with different population densities. Monte Carlo method: This approach refers to “experiments with algebraic models which involve a stochastic structure” (Martin, 1977). Monte Carlo runs with stochastic agent-based models allow developing uncertainty distributions of output variables. 3. Parameterisation framework ABM requires the systematic representation of three main phenomena: agents, their social networks and the agent environment (Fig. 1). This paper discusses only the parameterisation of human agents; we leave development of empirical frameworks for social networks and the agent environment to others. We acknowledge that any model builder will not be able to separate human agent attributes and behaviours from social networks and the (bio-physical) agent environment, but for the purpose of clarity in this paper we focused on human agents. To achieve agent parameterisation, six different methodological steps were identiﬁed (Fig. 1), each with multiple empirical methods available (Table 1). The ﬁrst step of the framework is to identify different classes of agents (Method 1, Fig. 1). We assume that agents that share the same 839 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Fig. 1. Framework for parameterisation of ABM. sequence of actions are deﬁned in an agent class. The sequences of actions represent the modelled behaviour of the agents and constitute the totality of their actions within the model. The identiﬁcation of distinct agent classes and their sequences of actions can be based on expert knowledge (EK), participant observation (PO), laboratory experiments (LE), interviews and role-playing games (RPG). We group these methods as Methods 1 (M1) as listed in Table 1. Within each agent class agents can be grouped into agent types according to their behavioural similarity. For instance, farmers would be a different class of agents from urban developers. Among farmers, different behavioural responses might be distinguished based on attributes like wealth or age, and therefore separate farmer types can be created. Types can be developed in three further steps employing methods 2e4. If no agent types are required, attributes and behavioural components can be assigned for the whole agent class employing methods 2 and 3. The second step of the framework can be either to specify the values for agent attributes (M2) or the parameters for the behavioural rules agents follow (M3). While agents in each class share the same sequence of actions, the parameters governing the magnitudes of the actions and the degree to which various sources of information inﬂuence those actions can vary. Agents with the same behaviours and similar or identical parameters of behaviour belong to the same agent type (i.e., types are the equivalent of sub-classes). Agent types can be developed from agent attributes or from data on agent behaviours. Thus, data have to be elicited to measure realworld attributes and behavioural responses. Behavioural responses can include observed relationships between qualitative or quantitative contextual variables and the actions or probabilities of actions carried out by agents. We group methods for obtaining agent attributes as Methods 2 (M2) and those for obtaining behavioural responses as Methods 3 (M3). Table 1 lists surveys and census data as M2 methods. Options for step M3 include surveys, interviews, ﬁeld experiments, participant observation, RPG, timeseries data, and expert knowledge. The sequence of steps within the proposed framework can vary (Fig. 1) because agent types can be developed from agent attributes or from observed behavioural responses. If types are developed from agent attributes, M4a methods can be employed, through the combination of (a) clustering and regression, (b) a mix of correlation and expert knowledge, (c) expert knowledge individually, or (d) dasymetric mapping. The ﬁrst three options can also be used if developing agent types from behavioural data. If methods M4a are not used, participant observation is an alternative (method group M4b). Once agent types are developed, and their attributes and behavioural response functions characterised, agents in the whole population need to be assigned to the appropriate agent type (M5). If data were elicited from the whole population, this step does not require any up- or down-scaling, as the sub-population (sample) and population are identical (Fig. 1). If there are some assurances that the sample is representative of the population, the scaling can be conducted proportionally. Under such assumptions each sample is assumed to represent a number of other agents, determined by the proportion of the population sampled, and is then cloned, accordingly. Cloning means that each respondent is replicated x times, with x ¼ (population size/sample size). Cloning does not mean that all agents of one clone have to be identical, as ranges of values or measures of variability can be used to introduce some level of heterogeneity. If representativeness of the sample cannot be assumed, agent types can be up- or down-scaled by means of census data or a Monte Carlo approach. These methods are grouped as M5 methods and listed in Table 1. 4. Parameterisation sequences What combinations of M1eM5 methods make sense will depend on the empirical context, including its purpose. We chose to simplify a description of the empirical context using three variables: the size of the population (N), high/low; the behavioural diversity (BD), high/ low; and the pragmatic possibility of creating representative samples. Behavioural diversity describes the relative number of different types of actions that agents can take in a given context. If the options are relatively limited, e.g., to grow rice or not, then behavioural diversity is low. If the agents choose among a large set of options, behavioural diversity is said to be high. Whether or not it is possible to create a representative sample of the population depends on, for instance, availability of funding for the modelling project and ease of accessing the people. We chose not to include model purpose as a dimension in the descriptions of context, though it may have some effect on the empirical choices, because the framework is predicated on the assumption that the interest of the modeller is in creating an empirically realistic model. That assumption necessarily includes some model purposes and excludes others. We have identiﬁed twelve different cases or sequences that combine the methods listed in Table 1. These cases can be followed to parameterise the behavioural responses of human agents in ABMs (Table 2). Depending on the actual context this combination of methods might require an iterative approach instead of a strict sequence. There may be other combinations, but we believe this set covers most empirical ABM contexts. Despite being a ﬁrst step, this is a reasonably comprehensive list of cases that should allow for a structured testing of methods to identify which sequences prove to be effective under various modelling conditions. Such testing could aim for quantifying uncertainty and guide the ABM community towards more robust empirical model development. The Table 1 Overview of methodologies relevant for the parameterisation of behavioural traits of human agents. M1 M2 M3 M4a ATB M4b BT M5 Expert knowledge ParticObser Lab experiment Interviews RPG Survey Census (incl. GIS data) Survey Interviews Field experiment ParticObser RPG Time-series data Expert knowledge Clustering and regression Correlation and expert knowledge Expert knowledge Dasymetric mapping Clustering and regression Correlation þ expert knowledge Expert knowledge ParticObser Proportional Census/GIS based assignment Monte Carlo 840 No up-Scaling CD/GIS based CD/GIS based EK Clustering & Regression Clustering & Regression Census Aggregated Census Disaggregated Census Census EK or PO or RPG EK or PO EK or PO Yes Less relevant Less relevant No Less relevant Less relevant 10 11 12 Less relevant Less relevant Less relevant No No 9 Less relevant Yes 8 Notes: N: agent population; BD: Behavioural diversity; EK: Expert Knowledge; RPG: Role-playing games; PO: Participant observation; CD: Census data; GIS: geographic information systems. EK EK Dasymetric mapping e No up-Scaling EK CD/GIS based No No 7 Less relevant Less relevant Yes 100% Yes No No Less relevant 4 5 6 Yes No access No Yes Yes 3 No Yes Yes 2 No Yes 1 Yes Less relevant EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or RPG EK or PO EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or RPG or RPG) or RPG) or RPG) Survey Census & Surveys (or PO) Disaggregated Census Survey or RPG) Survey OR Census Interviews (PO) or RPG Time-series data AND EK Interviews or Field Experiments or RPG Interviews or Field Experiments or RPG EK or Interviews or Field Experiments or RPG EK or Field Experiments or RPG EK or RPG Time-series data Time-series data e Statistics and EK e PO EK (EK or Clustering) and Regression e Clustering or (Statistics and EK) PO e e Interviews or RPG Survey Survey or RPG) or RPG) No up-scaling CD/GIS based CD/GIS based CD/GIS based CD/GIS based (EK or Clustering) and Regression e Interviews or Field Experiments or RPG or EK Interviews or RPG Survey M3 M2 M1 BD high? Sample representative? N high? When Case Table 2 Overview of techniques for the parameterisation of human agent behavioural responses. e e M4a e M4b M5 (Cloning or Monte Carlo) and Spatial references CD/GIS based A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 characterisation of sequences should facilitate description and documentation of ABM applications. While model description is clearly important (Grimm et al., 2006, 2010), we argue that good description of the empirical support for models in speciﬁc contexts is equally important in allowing scientiﬁc peers to make good judgements about the value of the model results. We describe the modelling contexts within which the different cases make sense and emphasise different routes through the parameterisation processes (Fig. 1) depending on these contextual variables. The ﬁrst case represents a modelling context with a large population and the possibility to develop representative samples (Case 1). An understanding of agents and their actions (M1) can be developed using expert knowledge (EK), participant observation (PO), or interviews. Detailed agent attribute data (M2) can be elicited by conducting sample surveys, while behavioural data (M3) can be obtained from interviews, ﬁeld experiments, role-playing games (RPG), or expert knowledge (EK). Assuming that the sample used to generate attributes and behavioural parameters is representative, proportional up-scaling can be carried out, in which each data point is cloned to generate the whole population (M5). This step needs to include spatial referencing if agent locations are an element of the model design. This sequence was implemented in an example that is explained in the following section. In cases where the behavioural diversity is high and assumptions on the representativeness of samples do not hold, variations to the ﬁrst sequence have to be introduced. While steps M1 and M2 can remain unchanged, the behavioural complexity makes ﬁeld experiments or expert knowledge less suitable in step M3. Instead, two alternative sequences that require the development of agent types (for disproportional up-scaling) can be used. Firstly, interviews can be conducted to generate behavioural data for development of agent types (Case 2). For each type, distributions of agent attribute values can be identiﬁed by surveys. This means that the actual sequence of this approach (following Fig. 1) is M1 / M3 / M4b / M2 / M5. Instead of developing agent types from behavioural data, types could be constructed from agent attributes (Case 3). This means that a survey could be conducted to elicit attributes and clustering techniques applied to develop agent types. Then, interviews can be targeted to people who match the proﬁle of agent types. The behavioural data would then be disproportionally up-scaled to the whole population based on a process that assigns each agent to a type by mapping type-speciﬁc characteristics against census data. This approach sequences methods as M1 / M2 / M4a / M3 / M5. Cases 2 and 3 require census data to allow for up-scaling (M5). This type of up-scaling requires the identiﬁcation of the agent types using census data, in order to determine the proportions of the population in each type. While both of these parameterisation sequences reduce uncertainty resulting from sample non-representativeness, both approaches introduce uncertainty from the development of agent types by clustering (i.e. disproportional upscaling). Below examples are given for these two approaches, the Dutch model and the SimPaSI model for East Kalimantan. In situations where representative samples can be developed in spite of high behavioural diversity (Case 4), expert knowledge or participant observation can provide broad systems understanding (M1). Participant observation allows for developing agent types based on attribute data (census or surveys) and behavioural data (interviews or role-playing games). The ﬁnal up-scaling could be guided by census data. The MamaLuke model is described below to give an example for this particular sequence. Case 5 describes a situation where the actors are not accessible for interviews or surveys (e.g., because they are not responsive to surveys or due to funding limitations). In such cases agent classes can be identiﬁed based on expert knowledge or participant observation. This sequence requires the availability of disaggregated data A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 to provide agent attributes. Assumptions on behavioural responses can be developed from statistical analysis of time-series data or expert knowledge. Experts could then guide the mapping of behavioural assumptions into the agent population. In the next section, the DEED model gives an example for such an approach. Case 6 is another variation of the ﬁrst sequence. This approach assumes that instead of a proportional sample, the development of agent types is based on behavioural data (M4b). Disproportional up-scaling could be performed with a census-based mapping, as described for Case 2. Case 7 involves relatively small populations. It is very similar to Case 1 but does not require any up-scaling (M5) because the whole population can be described, see for instance (Dray et al., 2006). Case 8 is another variation of Case 1 (and similar to Case 3). This case represents a large population with low behavioural diversity. It assumes that disaggregated census data are sufﬁciently available for providing agent attribute information. In such a case agent types could be statistically developed from census data or surveys (e.g., for disproportional up-scaling). Then, interviews could be conducted to obtain behavioural response data (M3) before ﬁnally behavioural data are mapped via census data into the agent population. The sequence would be M1 / M2 / M4a / M3 / M5. In cases with small populations and available census data, two different sequences can be used, depending on the behavioural diversity. In case of low behavioural diversity (Case 9) agent classes can be identiﬁed using expert knowledge, participant observation or role-playing games. Disaggregated census data would provide agent attributes. Agent attributes could further be characterised by adding expert knowledge (M4a). Then, behavioural data can be developed for each agent based on expert knowledge, ﬁeld experiments, or roleplaying games. Experts could guide the process of reﬁning behavioural assumptions for each agent. While publications on agent-based models mostly lack sufﬁcient detail on parameterisation methods it seems as if Case 9 is quite common (Barnaud et al., 2010; Dung et al., 2009; Mathevet et al., 2003; Smajgl et al., 2009e). The case of high behavioural diversity (Case 10) only differs from Case 9 in that ﬁeld experiments are likely to become unrealistic for M3 due to complications in designing experiments (Smajgl et al., 2008). The ﬁnal two cases start with expert knowledge or participant observation to design system properties. Behavioural parameters are then developed through analysis of time-series data (e.g., An et al., 2011), while attribute data are available in the form of aggregated or disaggregated census data. Statistical techniques such as clustering or regression methods can be employed to process the time-series data into agent types (M4b). Aggregated census data has to be disaggregated through dasymetric mapping (Case 11) before behavioural data can be assigned for the whole agent population (i.e., down-scaling). If disaggregated census data are available they can be directly used for proportional up-scaling assumptions on agent behaviour (Case 12). The following section provides empirical examples for the ﬁrst ﬁve cases listed in Table 2 5. Case examples 5.1. SimPaSI model for Central Java Purpose of the model: The SimPaSI model (Simulating Pathways to Sustainability in Indonesia) was implemented for the districts Demak, Jepara and Pati in Central Java. The goal was to provide capacity to analyse and foster a cross-scale dialogue across multiple levels of resource governance about the implications of plans to change fuel subsidies. Main model features: The SimPaSI model (Smajgl et al., 2009a,b) simulates in daily time steps livelihood related activities of about 841 3.2 million people. Within the model, these decisions take place in an environment that is created from best available (polygon-based) GIS data (elevation, land use) and with representations of hydrological dynamics (surface water, groundwater, soil moisture) as well as crop growth (trees, fruit, rice, tapioca, bamboo) and ecological elements (mangroves, corals). The model design document (including pseudo code) can be downloaded from http:// www.cse.csiro.au/downloads/DesignDoc_JaTeng.pdf. The user can vary the price for petrol, kerosene, and electricity, and the level of quarterly cash payments to people below the poverty line. Results are presented (in maps and charts) for poverty, unemployment, high water level risk, rice production, forest cover, coral area, and mangrove area. Sequence applied with details on approach: Empirical application of this model followed the sequence outlined in Case 1. Artiﬁcial households and their members were parameterised based on survey data. In total, 3500 households were surveyed to elicit demographic data, perceptions of importance of natural and social resources, and behavioural responses to a total of eight scenarios (i.e., fuel-price changes). The underlying assumption is that the stratiﬁed sample is representative and that behavioural responses can be proportionally up-scaled to initialise the total population. This means that each surveyed household is cloned about 350 times to create attributes for model agents. Technically, survey data is read during the initialisation process and instead of creating one agent from each entry line attributes of 350 agents are created and characterised. To assign values for many of these attributes (i.e. wages), value ranges (based on the minimum and maximum from the survey) were used instead of point values. As a result, households and their members are heterogeneous, even within the clusters of similar clones. 5.2. Dutch agriculture model Purpose of the model: the objective of this ABM was to analyse and explore how the response of farmers to changes in policy, expressed through socio-economic scenarios, affect the changes in the use, cover and structure of the land in a Dutch rural region (Valbuena et al., 2010a,b). In this region, farmers’ decisions included processes of farm cessation, farm expansion and protection of hedgerows and tree lines. Main features: In order to represent farm(er)s, 2700 agents were created on a grid that represented 1 ha. Each agent owned one or several ﬁelds and each ﬁeld was formed by one or several pixels. Agents’ decisions were parameterised based on a probabilistic rather than a deterministic approach, in which a probability for a certain choice was assigned to each agent. A run of the model consisted of 15 time steps each representing the actions taken in one year. Agents could stop farming, sell or buy land and protect or clear hedgerows (Valbuena et al., 2010a,b). The outputs include spatially explicit changes in the landscape structure of the rural region. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 2. Only one agent class was identiﬁed (i.e., farmers) based on expert knowledge (M1). A sample survey of 300 farmers was used to elicit agent attributes (e.g., farm size, agribusiness type and age) and behavioural components (i.e., agents’ willingness to make certain choices). Both attributes and components were combined to build an agent typology, to describe these differences (M4a and M4b). Although the typology was built based on expert knowledge, it was supported by quantitative analyses (e.g., cluster analysis and classiﬁcation regression trees). Based on this typology and on the census data, the attributes and the behavioural parameters were assigned to each agent of the whole population (Valbuena et al., 2008). 842 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 5.3. SimPaSI model for East Kalimantan Purpose of the model: In this case study, the SimPaSI model was implemented for six districts of the Indonesian province of East Kalimantan. The goal was to provide a simulation tool that facilitates a dialogue across governmental levels in the context of potential fuel subsidy reductions. The underlying idea was to simulate the impacts that central government interventions could have on poverty in different parts of Indonesia. The two selected regions were Central Java and East Kalimantan. Main model features: This implementation of the SimPaSI model (Smajgl et al., 2009c,d) is similar to the Central Java implementation described above. However, land uses (trees, rubber, rattan, fruit) and ecological variables (ﬁsh, honey, deer, dolphins, hornbill) differ. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 3. The development of a dataset for parameterising households and their members was based on the assumption that representative sample could not be gathered. While about 53% of the 1.8 million people to be included in the model live in (peri-)urban areas, the rest live in 99% of the area, which is mostly quite remote. Additionally, the complexity of the context required a larger set of data than for the ﬁrst case example, which translated into interviews of unrealistic length (up to 4 h based on pilot tests). Therefore, ﬁeld work was structured to identify all possible types in a ﬁrst step and elicit behavioural response functions for each type separately (Bohensky et al., 2007; Smajgl et al., 2007; Trébuil et al., 1997). In a ﬁrst step, a survey of 3000 people elicited data on demography, values, and livelihood strategies. In a second step, a K-means cluster analysis was performed based on regression trees, given that most of the variables were categorical. This enabled the development of a decision tree (with a limited set of variables), through which a type could be assigned to each household in the area. This step was followed by validation workshops to test the plausibility of clusters. Then, interviews with 530 households were carried out to elicit behavioural responses to eight scenarios (i.e. fuel prices). These households were identiﬁed based on the variables with highest discriminatory power for each cluster. The behavioural responses were scaled up disproportionally and conducted based on (disaggregated) household-level census data, with behavioural response data (deﬁned for types) assigned to each household (in the census). 5.4. MameLuke settlement model Purpose of the model: The MameLuke Settlement model, which has been constructed using the MameLuke (multi-actor modelling of land use and land-cover changes) framework for the development of agent-based models (Huigen, 2004), was implemented for the Isabela province in the Philippines. The aim of the model is to understand the settlement processes in the area. Understanding the settlement process and the spatial effects of population growth in the area are of pivotal importance for the future land use and land-cover change (Huigen, 2004; Huigen et al., 2006). Main model features: The settlement model for the Isabela province simulates all demographic processes relevant to population growth, including migration. The ethnicity is a major driver for the location of settlement of a household. Throughout the simulated years households settle following different rules deﬁned by their ethnic preference structure. Furthermore, the model simulates spatially explicit household land-use behaviour (including deforestation) over a period of 100 years (starting in 1900) and measures changes in land use and cover. Key outcomes of the model are measured using an ‘ethnic spread’ ﬁtness function. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 4. Participant observation was the dominant technique used for the parameterisation of human agents. Participant observation was used to create a principle systems understanding (M1), which includes the identiﬁcation of ethnic groups as a major driver for the creation of settlements. In total 44 unstructured qualitative interviews in combination with 7 focus group sessions were used to obtain empirical data. The interviews were stratiﬁed on age, gender and ethnicity. Attributes and behavioural traits of agents were deﬁned iteratively. While Fig. 1 could be interpreted as linear sequences of methods, the MameLuke implementation was conducted through iteration among steps M2, M3 and M4a/b, using participant observation. The iterative approach allowed for reﬁning the consistency of assumptions on agent behaviour and attributes. Unstructured interviews were conducted to elicit the base structure of an agent typology. This structure was then used as a reference point in focus group discussions. Up-scaling to the total 3000 households that live in the area has been done using the aforementioned agent-type identiﬁcation technique. Based on the attribute ethnicity in combination with the age of the household (at the speciﬁed simulated year) a rule-set for settlement decisions and for land use decisions were generated using the empirical datasets. 5.5. DEED model of ex-urban developments in South-eastern Michigan Purpose of the model: The Dynamic Ex-urban Ecological Development (DEED) was developed to understand the evolution of exurban landscapes in South-eastern Michigan, USA, and identify opportunities to introduce landscape designs that provide enhanced ecosystem services in these heterogeneous humandominated landscapes. Main model features: A key feature of the DEED model is that it represents the interacting behaviours of land developers and residential home buyers (Brown et al., 2008; Robinson and Brown, 2009; Zellner et al., 2009). Land developers are included because of their importance in structuring the supply of residential landscapes to residential land buyers. Each of three different developers builds one of three different development types. The development types are distinguished by their density, where they locate relative to roads and aesthetic features, and their effects on the amount of tree cover on the landscape, thereby providing both aesthetic value to residents and ecological services. Each time a developer creates a development, lots within the development become available to residents, who are created by the model at a user-speciﬁed rate per time step. A given developer cannot create a new development until a speciﬁed threshold percentage of lots in his existing developments are sold to residents. Sequence applied with details on approach: Empirical support for the developers in this model followed that sequence outline for Case 5. Empirical support for residents was described elsewhere (Brown and Robinson, 2006). The development types were created as a working hypothesis about differentiation among development types within South-eastern Michigan based on expert knowledge. Empirical information was needed to test the differentiation among the development types and to represent the developer behaviours related to selection of sites for developments. The ﬁrst type of empirical information was largely deﬁnitional (i.e., do these types exist?) and was observed by identifying multiple developments in parcel data, geographically overlaying the subdivisions on digital aerial imagery to observe differences in landscape characteristics, and accessing tax records for selected subdivisions to observe differences in house and price characteristics (Brown et al., 2008). A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Once the differences among development types were conﬁrmed, important behavioural information about the development types, and by inference the developers of those different types, was observed through analysis of time-series information on a sample of developments created between 1950 and 2000 (An and Brown, 2008). Subdivisions were identiﬁed in GIS-based parcel data and labelled for type based on parcel sizes and layouts as well as landscape characteristics observed in scoincident aerial photography. The dates of each development were determined using historical aerial photography, collected approximately decadally over the study period. Survival analysis was used to determine the locational characteristics that determined the likelihood of given farm being converted to each of the three different development types. The coefﬁcients estimated by survival analysis were used as parameters in the developer models (i.e., one for each developer type). 6. Concluding remarks This paper developed a framework for the parameterisation of attributes and behavioural response functions of human agents in agent-based models for a socio-ecological context. This is a ﬁrst step to allow for systematic testing, documentation and communication of empirical parameterisation methods that can be compared across various modelling contexts. We identiﬁed multiple methods that can be used at each of ﬁve iterative or sequential steps in the process of creating an empirically informed agent-based model. Until now, methodological choices have often been made in an ad hoc manner, depending on the researcher’s conceptual understanding, methodological training, and access to data. We classiﬁed the various modelling contexts according to the size of the agent population, their behavioural diversity, and the ability of researchers to acquire a representative sample of agent attributes or behaviours. This classiﬁcation provides a structure that can be used to evaluate the appropriateness of various combinations of methods in different contexts, and can formalize these combinations in a way that can guide choices and facilitate communication of those choices among modellers and to practitioners who use model results. Such an effort to systematically understand the effectiveness of methodologies employed in ABM for the parameterisation of behavioural aspects is critical because the (technical) strength of ABM, which is its ﬂexibility and capacity in representing human decision making, has to be based on a robust empirical support. Without translating this advantage into empirical contexts, the applied value of ABM remains limited and the focus remains on hypothetical and theoretical analyses. The framework for and describing different parameterisation sequences should also be tested in various modelling (Janssen and Ostrom, 2006; Robinson et al., 2007) and other application contexts (Matthews et al., 2007) such as the growing domain of Social Epidemiology in public health research (Kaplan, 2004) or agent-based economics (Luna and Perrone, 2002). Further reﬁnement and testing of this framework would allow for reducing model uncertainty, for provision of systematic guidelines for building and applying empirical ABMs, and for facilitation of the model comparisons (Parker et al., 2008a,b). Particular sequences could be assessed by comparing their performance, which links directly into the related topic of model validation (Grimm et al., 2005; Moss, 2008; Smajgl et al., 2011). The framework described in this paper is a ﬁrst step in this process. The community of ABM researchers needs to assess the framework and whether it is sufﬁcient to cover all existing or potential approaches for empirical agent-based models. That evaluation should include an assessment of situations in which the most appropriate sequence of methods can be determined at the beginning of a project, based on the characteristics of the problem at hand, and when appropriate sequencing might not 843 be clear at the beginning, but needs to emerge through iteration across the various steps. Opportunities for extension of the framework might appear as methods for the parameterisation for the social networks of agents and the agent environment are reﬁned (Fig.1). For example, the requirement for real-world data on social network might change the approach to obtaining behavioural data for individual agents, as individual behaviours might be dependent on how these individuals interact with each other. References An, L., Brown, D.G., 2008. Survival analysis in land-change science: integrating with GIS and remote sensing in Southeastern Michigan. Annals of the Association of American Geographers 98 (2), 323e344. An, L., Brown, D.G., Nassauer, J.I., Low, B., 2011. Variations in development of exurban residential landscapes: timing, location, and driving forces. Journal of Land Use Science 6 (1), 13e32. Baker, T.J., Miller, S.N., Prager, S.D., Legg, D.E., 2010. Disaggregating human population for improved land use management in Kenya. Journal of Land Use Science 5 (4), 237e257. Barnaud, C., Van Paassen, A., Trébuil, G., Promburom, T., Bousquet, F., 2010. Dealing with power games in a companion modelling process: lessons from community water management in Thailand Highlands. The Journal of Agricultural Education and Extension 16 (1), 55e74. Barreteau, O., 2003. The joint use of role-playing games and models regarding negotiation processes: characterization of associations. JASSS e The Journal of Artiﬁcial Societies and Social Simulation 6 (2). http://jasss.soc.surrey.ac.uk/6/2/ 3.html. Becker, H.S., 1958. Problems with inference and proof in participant observation. American Sociological Review 23 (6), 652e660. Berger, T., Schreinemachers, P., 2006. Creating agents and landscapes for multiagent systems from random samples. Ecology and Society 11 (2). Bohensky, E., Smajgl, A., Herr, A., 2007. Calibrating behavioural variables in agent-based models: insights from a case study in East Kalimantan, Indonesia. Paper Presented at the MODSIM 2007 International Congress on Modelling and Simulation. Brown, D.G., Robinson, D.T., 2006. Effects of heterogeneity in residential preferences on an agent-based model of Urban Sprawl. Ecology and Society 11 (2). Brown, D.G., Robinson, D.T., An, L., Nassauer, J.I., Zellner, M., Rand, W., et al., 2008. Exurbia from the bottom-up: confronting empirical challenges to characterizing complex systems. GeoForum 39 (2), 805e818. Cooke, R.M., Goossens, L.L.H.J., 2008. TU Delft expert judgment data base. Reliability Engineering & System Safety 93 (5), 657e674. Dray, A., Perez, P., Jones, N., Le Page, C., D’Aquino, P., White, I., et al., 2006. The AtollGame experience: from knowledge engineering to a computer-assisted role playing game. JASSS e The Journal of Artiﬁcial Societies and Social Simulation 9 (1), 6. http://jasss.soc.surrey.ac.uk/9/1/6.html. Dung, L.C., Hoanh, C.T., Le Page, C., Bousquet, F., Gajaseni, N., 2009. Facilitating dialogue between aquaculture and agriculture: lessons from role-playing games with farmers in the Mekong Delta. Vietnam Water Policy 11 (S1), 80e93. Grimm, V., Berger, U., Bastiansen, F., Eliassen, S., Ginot, V., Giske, J., et al., 2006. A standard protocol for describing individual-based and agent-based models. Ecological Modelling 198 (1e2), 115e126. Grimm, V., Berger, U., DeAngelis, D.L., Polhill, J.G., Giske, J., Railsback, S.F., 2010. The ODD protocol: a review and ﬁrst update. Ecological Modelling 221 (23), 2760e2768. Grimm, V., Revilla, E., Berger, U., Jeltsch, F., Mooij, W.M., Railsback, S.F., et al., 2005. Pattern-oriented modeling of agent-based complex systems: lessons from ecology. Science 310 (5750), 987e991. Harrison, G., List, J., 2004. Field experiments. Journal of Economic Literature 42 (4), 1009e1055. Huigen, M.G.A., 2004. First principles of the MameLuke multi-actor modelling framework for land use change, illustrated with a Philippine case study. Journal of Environmental Management 72 (1e2), 5e21. Huigen, M.G.A., Overmars, K.P., de Groot, W.T., 2006. Multiactor modeling of settling decisions and behavior in the San Mariano Watershed, the Philippines: a ﬁrst application with the MameLuke framework. Ecology and Society 11 (2). Janssen, M., Ostrom, E., 2006. Empirically based, agent-based models. Ecology and Society 11 (2). http://www.ecologyandsociety.org/vol11/iss32/art37/ Art. 37. Jupp, V., 2006. The Sage Dictionary of Social Research Methods. Sage Publications, London. Kaplan, G.A., 2004. What’s wrong with social epidemiology, and how can we make it better? Epidemiologic Reviews 26, 124e135. Luna, F., Perrone, A., 2002. Agent-based Modelling in Economics and Finance: Simulations in SWARM. Kluwer, Massachusetts, Dordrecht. Martin, L.R., 1977. Survey of Agriculture Economics Literature: Quantitative Methods in Agricultural Economics, 1940’se1970’s, vol. 4. University of Minnesota Press, Minneapolis. Mathevet, R., Bousquet, F., Le Page, C., Antona, M., 2003. Agent-based simulations of interactions between duck population, farming decisions and leasing of hunting rights in the Camargue (Southern France). Ecological Modelling 165 (2e3),107e126. Matthews, R., Gilbert, N., Roach, A., Polhill, J., Gotts, N., 2007. Agent-based land-use models: a review of applications. Landscape Ecology 22 (10), 1447e1459. 844 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Mennis, J., Hultgren, T., 2006. Intelligent dasymetric mapping and its application to areal interpolation. Cartography and Geographic Information Science 33 (3), 179e194. Moss, S., 2008. Alternative approaches to the empirical validation of agent-based models. Journal of Artiﬁcial Societies and Social Simulation 11 (1), 5. http:// jasss.soc.surrey.ac.uk/11/1/5.html. Nichols, P., 1991. Social Survey Methods: A Fieldguide for Development Workers. Oxfam, Oxford. Parker, D.C., Brown, D., Polhill, J., Deadman, P.J., Manson, S.M., 2008a. Illustrating a new ‘conceptual design pattern’ for agent-based models of land use via ﬁve case studies e the Mr. Potatohead framework. In: López Paredes, A., Hernández Iglesias, C. (Eds.), Agent-Based Modelling in Natural Resource Management. Pearson Education, Upper Saddle River, New Jersey, pp. 23e51. Parker, D.C., Entwisle, B., Rindfuss, R.R., Vanwey, L.K., Manson, S.M., Moran, E., et al., 2008b. Case studies, cross-site comparisons, and the challenge of generalization: comparing agent-based models of land-use change in frontier regions. Journal of Land Use Science 3 (1), 41e72. Parker, D.C., Manson, M.S., Janssen, M.A., Hoffmann, M.J., Deadman, P., 2003. Multiagent system models for the simulation of land-use and land-cover change: a review. Annals of the Association of American Geographers 93 (2). Patzer, G.L., 1996. Experiment-Research Methodology in Marketing: Types and Applications. Quorum Books, Westport, CT. Rees, P., Martin, D., Williamson, P., 2002. The Census Data System. John Wiley and Sons, New York. Robinson, D.T., Brown, D.G., 2009. Evaluating the effects of land-use development policies on ex-urban forest cover: an integrated agent-based GIS approach. International Journal of Geographical Information Science 23 (9), 1211e1232. Robinson, D.T., Brown, D.G., Schreinemachers, P., Janssen, M.A., Huigen, M., Wittmer, H., et al., 2007. Comparison of empirical methods for building agentbased models in land use science. Journal of Land Use Science 2 (1), 31e55. Rousseeuw, P.J., 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53e65. Smajgl, A., Bohensky, E., Bohnet, I., 2007. Capturing heterogeneity in empirical agent-based models: a guideline. Paper Presented at the Fifth ESSA Conference, Toulouse. Smajgl, A., Carlin, G., Bohensky, E., House, A., Kurnia, A.S., Butler, J., et al., 2009a. Assessing Impacts of Fuel Subsidy Decisions on Poverty and Fish Catch in Central Java, Indonesia: An Agent-based Analysis. CSIRO Sustainable Ecosystems, Townsville. Smajgl, A., Carlin, G., House, A., Kurnia, A.S., Butler, J., Sugiyanto, C., et al., 2009b. Design Document for Agent-based Model SimPaSI Jawa Tengah. CSIRO Sustainable Ecosystems, Townsville. http://www.cse.csiro.au/downloads/ DesignDoc_JaTeng.pdf. Smajgl, A., Carlin, G., House, A., Pambudhi, F., Butler, J., Bohensky, E., et al., 2009c. Simulating Pathways to Sustainability in Indonesia: Design Document for Agent-based Model SimPaSI East Kalimantan. CSIRO Sustainable Ecosystems, Townsville. http://www.cse.csiro.au/downloads/DesignDoc_JaTeng.pdf. Smajgl, A., Carlin, G., Pambudhi, F., Bohensky, E., House, A., Butler, J., et al., 2009d. Assessing Impacts of Fuel Subsidy Decisions on Poverty and Forest Depletion in East Kalimantan, Indonesia: An Agent-based Analysis, vol. 2. CSIRO. Smajgl, A., Heckbert, S., Ward, J., Straton, A., 2009e. Simulating impacts of water trading in an institutional perspective. Environmental Modelling and Software 24, 191e201. Smajgl, A., Heckbert, S., Bohnet, I., Carlin, G., Hartcher, M., McIvor, J., 2008. Simulating the Grazing System in the Bowen Broken Catchment: An Agent-based Modelling Approach. CSIRO, Townsville. Smajgl, A., House, A., Butler, J., 2011. Implications of ecological data constraints for integrated policy and livelihoods modelling: an example from East Kalimantan, Indonesia. Ecological Modelling 222, 888e896. Trébuil, G., Kam, S.P., TurkelBoom, F., Shinawatra, B., Teng, P.S., Kropff, M.J., et al., 1997. Systems diagnoses at ﬁeld, farm and watershed levels in diversifying upland agroecosystems: towards comprehensive solutions to farmers’ problems. In: Applications of Systems Approaches at the Farm and Regional Levels. Kluwer Academic, The Hague. 99e114. Valbuena, D., Verburg, P.H., Bregt, A.K., 2008. A method to deﬁne a typology for agent-based analysis in regional land-use research. Agriculture, Ecosystems & Environment 128 (1e2), 27e36. Valbuena, D., Verburg, P.H., Bregt, A.K., Ligtenberg, A., 2010a. An agent-based approach to model land-use change at a regional scale. Landscape Ecology 25, 185e199. Valbuena, D., Verburg, P.H., Veldkamp, A., Bregt, A.K., Ligtenberg, A., 2010b. Effects of farmers’ decisions on landscape structure of a Dutch rural region: an agentbased approach. Landscape Urban Planning 97, 98e110. Wu, B.M., Birkin, M.H., Rees, P.H., 2008. A spatial microsimulation model with student agents. Computers, Environment & Urban Systems 32 (6), 415e488. Zellner, M.L., Page, S.E., Rand, W., Brown, D.G., Robinson, D.T., Nassauer, J., et al., 2009. The emergence of zoning policy games in exurban jurisdictions: informing collective action theory. Land Use Policy 26 (2), 256e367.

RELATED PAPERS

RELATED TOPICS

Log In

Empirical characterisation of agent behaviours in socio-ecological systems

Empirical characterisation of agent behaviours in socio-ecological systems

Related Papers

RELATED PAPERS

RELATED TOPICS