www.fgks.org   »   [go: up one dir, main page]

Academia.eduAcademia.edu
Environmental Modelling & Software 26 (2011) 837e844 Contents lists available at ScienceDirect Environmental Modelling & Software journal homepage: www.elsevier.com/locate/envsoft Empirical characterisation of agent behaviours in socio-ecological systems Alex Smajgl a, *, Daniel G. Brown b, Diego Valbuena c, d, Marco G.A. Huigen e a CSIRO Ecosystems Sciences and Climate Adaptation, University Drive, Townsville 4810, Australia University of Michigan, School of Natural Resources and Environment, Ann Arbor, MI, USA c Wageningen University, Department of Environmental Sciences, P.O. Box 47, 6700 AA Wageningen, The Netherlands d International Livestock Research Institute, System-wide Livestock Programme, P.O. Box 5689, Addis Ababa, Ethiopia e Universität Hohenheim, Germany b a r t i c l e i n f o a b s t r a c t Article history: Received 29 July 2010 Received in revised form 14 February 2011 Accepted 17 February 2011 Agent-based modelling has become an important tool to investigate socio-ecological processes. Its use is partially driven by increasing demand from decision makers to provide support for understanding the potential implications of decisions in complex situations. While one of the advantages of agent-based modelling is the ability to simulate the implications of human decision-making processes explicitly, methods for providing empirical support for the representation of the behaviour of human agents have not been structured systematically. This paper develops a framework for the parameterisation of human behaviour in agent-based models and develops twelve distinct sequences for the characterisation and parameterisation of human behaviours. Examples are provided to illustrate the most important sequences. This framework is a first step towards a guide for parameterisation of human behaviour in ABM. A structured discussion within the agent-based community is needed to achieve a more definitive guideline. Ó 2011 Elsevier Ltd. All rights reserved. Keyword: Agent-based modelling 1. Introduction Environmental policy and management increasingly demand the integration of cross-disciplinary knowledge of socio-ecological systems. Understanding human responses to environmental and/or policy changes is critical for understanding socio-ecological systems and the outcomes they produce. As agent-based modelling (ABM) allows for simulating actual decision-making processes of individuals or groups of individuals this modelling technique is gaining importance (Matthews et al., 2007; Parker et al., 2003). This paper is focused on ABM, in which (a) human agents are represented at the scale of households or individuals and (b) the main concern is the understanding of socio-ecological interactions. In empirical contexts, this class of ABM is mostly chosen for its ability to simulate explicit human decision-making processes (mostly described in discontinuous functions) in the context of changes in the bio-physical environment. However, the main strength of ABM can be a key weakness. As the implementation of human decision-making processes is the main strength of ABM, the agent attributes and behavioural response functions that represent these processes require knowledge support from qualitative and/or * Corresponding author. E-mail address: alex.smajgl@csiro.au (A. Smajgl). 1364-8152/$ e see front matter Ó 2011 Elsevier Ltd. All rights reserved. doi:10.1016/j.envsoft.2011.02.011 quantitative empirical sources. Unfortunately, there are no standard approaches to documenting and communicating the empirical support that underlies modelling and design decisions in agentbased models (Berger and Schreinemachers, 2006; Bohensky et al., 2007; Smajgl et al., 2007). Empirical characterisation and parameterisation of human decision-making processes can involve a range of methods that include: expert knowledge, surveys, interviews, and participant observation. We describe a set of specific methods here that, while it does not include all possible options, includes those that are most commonly used in practice and help define what we believe to be a robust set of sequences within which alternative specific methods can be applied. In many cases it is necessary to iteratively combine several empirical methods in a sequence to fully parameterise behavioural responses in ABM. The development of an ABM that aims to simulate behavioural responses of humans requires two fundamental steps in which empirical data are required: the development of behavioural categories and the scaling to the whole population of agents. The effectiveness of different methodological sequences depends on the modelling context (Janssen and Ostrom, 2006; Robinson et al., 2007). Most significantly, the size of the human population in the system to be modelled helps to determine appropriateness of various methods, and can be anything from 20 individuals or households to many million households. Carrying out in-depth 838 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 interviews with 20 households is realistic. With larger populations and more complex socio-ecological interactions, it becomes necessary to work with a sample of the population. Sampling often requires characterisation of the existing heterogeneity of agent attributes and behavioural responses in order to provide a simplified description of behavioural realities that can guide sampling and the realistic parameterisation of model agents. The first challenge for the parameterisation of ABMs that represent larger populations is the identification of suitable empirical methods for eliciting behavioural data. Then, the sample-based data need to be translated into behavioural representations for the whole population (i.e. up-scaling). Up-scaling can be performed proportionally or disproportionally. Proportional up-scaling assumes that the sample is representative for the whole population. This can result in under-representation of behavioural minorities. Disproportional up-scaling is needed if these behavioural minorities are likely to determine system dynamics, such as free-riders in common pool situations. Disproportional up-scaling is based on the development of agent typologies. The inverse problem (i.e., down-scaling) occurs when data for a large collection of agents are acquired for aggregated sources (e.g., over census enumeration districts), and populations of agents are generated through some simulation technique that honours the aggregate data (Baker et al., 2010; Brown and Robinson, 2006; Wu et al., 2008). This paper aims to provide an initial structure that can be used within rigorous discussions of parameterisation methods for human behaviour in ABM. This is only a first step towards a guide for parameterisation of human behaviour in ABM as a structured discussion within the agent-based community is needed to achieve a more definite guideline. Thus, we developed a framework for this process followed by a discussion of methods available for what we call the parameterisation sequence. Discussing sequences instead of individual methods allows for consideration of approaches that mix different kinds of methods, a systematic understanding of existing options and evaluation of suitability across a range of modelling contexts. Then, we give examples demonstrating the application of the framework. Most published examples of empirical agent-based models do not disclose methodological details for the parameterisation process that was used. Therefore, we believe that an initial outline of the dimensions of a useful framework along these lines is needed. The systematic application of the framework to a few different types of empirical agent-based models should guide readers in applying the framework and considering its applicability in a wider variety of cases. The paper concludes with a discussion of the need to further tests of the generality of the framework. 2. Definitions of empirical methods Several methods can be used to parameterise behavioural responses of humans empirically. This set of the most commonly used is not all inclusive, but we posit that it is complete enough to build a fairly comprehensive framework: Expert knowledge: This constitutes a variety of formal and informal methods for capturing the understanding of experts in a field or region and representing that knowledge in ways that it can be used in models. Approaches range from conceptual mapping by experts themselves to informal conversations or focus groups from which expert knowledge can be elicited for subsequent model building. Expert knowledge can also be employed to quantify uncertainties associated with expert judgement (Cooke and Goossens, 2008). Participant observation: Becker (1958) defines participant observation as the process in which the scientist participates in and documents the daily life of communities. Social surveys: Survey instruments consist of a list of questions, each with pre-defined sets of possible answers (Nichols, 1991). Responses are elicited via mail, email, in person, or via phone. Interviews: While a survey comprises of mostly closed questions, interviews are normally less structured (Jupp, 2006) and range from a list of open questions to unstructured dialogues. Census data: While surveys and interviews are conducted with a sample of a population census data is elicited for 100% of a population, normally within national boundaries (Rees et al., 2002). Aggregated census data summarise responses for groups of households within enumeration districts, while disaggregated census data show household-level responses. Field or lab experiments: Experiments are designed to observe how the change in an independent variable affects a dependent variable. Traditionally, this happened in laboratories, which allows for high degrees of control and hence clear causal links in observed outcomes. However, the high levels of control create artificial situations, which can lead to poor applicability of results to more realistic situations (Patzer, 1996). Field experiments aim for less artificialness by placing the experiment in the natural environment (Harrison and List, 2004; Smajgl et al., 2009e). While field experiments allow for more realistic behaviour to be observed the levels of control decrease. Role-playing games: Barreteau (2003) defines Role-Playing Games as “group settings that determine the roles or behavioural patterns of players as well as an imaginary context.” Similar to experimental designs, people are given pre-defined roles and tasks, which they have to perform in interaction with other role players in a pre-defined setting. Cluster analysis: Clustering describes the grouping of subjects based on the similarity of attributes. Based on categorical data, algorithms calculate either (1) the Euclidean distance between the median and each value, or (2) between each pair of values (Rousseeuw, 1987). Depending on proximity, subjects are mapped into clusters. Dasymetric mapping: This method involves a combination of detailed spatial data with aggregated census data, usually about population, to create disaggregated representations of the spatial distribution of population characteristics (Mennis and Hultgren, 2006). Often, for example, land-cover maps based on satellite imagery are used to distribute populations within an area such that (a) population totals within census enumeration districts are preserved and (b) their spatial locations are estimated at a much finer resolution, based on the locations of land-cover types with different population densities. Monte Carlo method: This approach refers to “experiments with algebraic models which involve a stochastic structure” (Martin, 1977). Monte Carlo runs with stochastic agent-based models allow developing uncertainty distributions of output variables. 3. Parameterisation framework ABM requires the systematic representation of three main phenomena: agents, their social networks and the agent environment (Fig. 1). This paper discusses only the parameterisation of human agents; we leave development of empirical frameworks for social networks and the agent environment to others. We acknowledge that any model builder will not be able to separate human agent attributes and behaviours from social networks and the (bio-physical) agent environment, but for the purpose of clarity in this paper we focused on human agents. To achieve agent parameterisation, six different methodological steps were identified (Fig. 1), each with multiple empirical methods available (Table 1). The first step of the framework is to identify different classes of agents (Method 1, Fig. 1). We assume that agents that share the same 839 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Fig. 1. Framework for parameterisation of ABM. sequence of actions are defined in an agent class. The sequences of actions represent the modelled behaviour of the agents and constitute the totality of their actions within the model. The identification of distinct agent classes and their sequences of actions can be based on expert knowledge (EK), participant observation (PO), laboratory experiments (LE), interviews and role-playing games (RPG). We group these methods as Methods 1 (M1) as listed in Table 1. Within each agent class agents can be grouped into agent types according to their behavioural similarity. For instance, farmers would be a different class of agents from urban developers. Among farmers, different behavioural responses might be distinguished based on attributes like wealth or age, and therefore separate farmer types can be created. Types can be developed in three further steps employing methods 2e4. If no agent types are required, attributes and behavioural components can be assigned for the whole agent class employing methods 2 and 3. The second step of the framework can be either to specify the values for agent attributes (M2) or the parameters for the behavioural rules agents follow (M3). While agents in each class share the same sequence of actions, the parameters governing the magnitudes of the actions and the degree to which various sources of information influence those actions can vary. Agents with the same behaviours and similar or identical parameters of behaviour belong to the same agent type (i.e., types are the equivalent of sub-classes). Agent types can be developed from agent attributes or from data on agent behaviours. Thus, data have to be elicited to measure realworld attributes and behavioural responses. Behavioural responses can include observed relationships between qualitative or quantitative contextual variables and the actions or probabilities of actions carried out by agents. We group methods for obtaining agent attributes as Methods 2 (M2) and those for obtaining behavioural responses as Methods 3 (M3). Table 1 lists surveys and census data as M2 methods. Options for step M3 include surveys, interviews, field experiments, participant observation, RPG, timeseries data, and expert knowledge. The sequence of steps within the proposed framework can vary (Fig. 1) because agent types can be developed from agent attributes or from observed behavioural responses. If types are developed from agent attributes, M4a methods can be employed, through the combination of (a) clustering and regression, (b) a mix of correlation and expert knowledge, (c) expert knowledge individually, or (d) dasymetric mapping. The first three options can also be used if developing agent types from behavioural data. If methods M4a are not used, participant observation is an alternative (method group M4b). Once agent types are developed, and their attributes and behavioural response functions characterised, agents in the whole population need to be assigned to the appropriate agent type (M5). If data were elicited from the whole population, this step does not require any up- or down-scaling, as the sub-population (sample) and population are identical (Fig. 1). If there are some assurances that the sample is representative of the population, the scaling can be conducted proportionally. Under such assumptions each sample is assumed to represent a number of other agents, determined by the proportion of the population sampled, and is then cloned, accordingly. Cloning means that each respondent is replicated x times, with x ¼ (population size/sample size). Cloning does not mean that all agents of one clone have to be identical, as ranges of values or measures of variability can be used to introduce some level of heterogeneity. If representativeness of the sample cannot be assumed, agent types can be up- or down-scaled by means of census data or a Monte Carlo approach. These methods are grouped as M5 methods and listed in Table 1. 4. Parameterisation sequences What combinations of M1eM5 methods make sense will depend on the empirical context, including its purpose. We chose to simplify a description of the empirical context using three variables: the size of the population (N), high/low; the behavioural diversity (BD), high/ low; and the pragmatic possibility of creating representative samples. Behavioural diversity describes the relative number of different types of actions that agents can take in a given context. If the options are relatively limited, e.g., to grow rice or not, then behavioural diversity is low. If the agents choose among a large set of options, behavioural diversity is said to be high. Whether or not it is possible to create a representative sample of the population depends on, for instance, availability of funding for the modelling project and ease of accessing the people. We chose not to include model purpose as a dimension in the descriptions of context, though it may have some effect on the empirical choices, because the framework is predicated on the assumption that the interest of the modeller is in creating an empirically realistic model. That assumption necessarily includes some model purposes and excludes others. We have identified twelve different cases or sequences that combine the methods listed in Table 1. These cases can be followed to parameterise the behavioural responses of human agents in ABMs (Table 2). Depending on the actual context this combination of methods might require an iterative approach instead of a strict sequence. There may be other combinations, but we believe this set covers most empirical ABM contexts. Despite being a first step, this is a reasonably comprehensive list of cases that should allow for a structured testing of methods to identify which sequences prove to be effective under various modelling conditions. Such testing could aim for quantifying uncertainty and guide the ABM community towards more robust empirical model development. The Table 1 Overview of methodologies relevant for the parameterisation of behavioural traits of human agents. M1 M2 M3 M4a ATB M4b BT M5 Expert knowledge ParticObser Lab experiment Interviews RPG Survey Census (incl. GIS data) Survey Interviews Field experiment ParticObser RPG Time-series data Expert knowledge Clustering and regression Correlation and expert knowledge Expert knowledge Dasymetric mapping Clustering and regression Correlation þ expert knowledge Expert knowledge ParticObser Proportional Census/GIS based assignment Monte Carlo 840 No up-Scaling CD/GIS based CD/GIS based EK Clustering & Regression Clustering & Regression Census Aggregated Census Disaggregated Census Census EK or PO or RPG EK or PO EK or PO Yes Less relevant Less relevant No Less relevant Less relevant 10 11 12 Less relevant Less relevant Less relevant No No 9 Less relevant Yes 8 Notes: N: agent population; BD: Behavioural diversity; EK: Expert Knowledge; RPG: Role-playing games; PO: Participant observation; CD: Census data; GIS: geographic information systems. EK EK Dasymetric mapping e No up-Scaling EK CD/GIS based No No 7 Less relevant Less relevant Yes 100% Yes No No Less relevant 4 5 6 Yes No access No Yes Yes 3 No Yes Yes 2 No Yes 1 Yes Less relevant EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or RPG EK or PO EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or Interviews (optional: Experiments EK or PO or RPG or RPG) or RPG) or RPG) Survey Census & Surveys (or PO) Disaggregated Census Survey or RPG) Survey OR Census Interviews (PO) or RPG Time-series data AND EK Interviews or Field Experiments or RPG Interviews or Field Experiments or RPG EK or Interviews or Field Experiments or RPG EK or Field Experiments or RPG EK or RPG Time-series data Time-series data e Statistics and EK e PO EK (EK or Clustering) and Regression e Clustering or (Statistics and EK) PO e e Interviews or RPG Survey Survey or RPG) or RPG) No up-scaling CD/GIS based CD/GIS based CD/GIS based CD/GIS based (EK or Clustering) and Regression e Interviews or Field Experiments or RPG or EK Interviews or RPG Survey M3 M2 M1 BD high? Sample representative? N high? When Case Table 2 Overview of techniques for the parameterisation of human agent behavioural responses. e e M4a e M4b M5 (Cloning or Monte Carlo) and Spatial references CD/GIS based A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 characterisation of sequences should facilitate description and documentation of ABM applications. While model description is clearly important (Grimm et al., 2006, 2010), we argue that good description of the empirical support for models in specific contexts is equally important in allowing scientific peers to make good judgements about the value of the model results. We describe the modelling contexts within which the different cases make sense and emphasise different routes through the parameterisation processes (Fig. 1) depending on these contextual variables. The first case represents a modelling context with a large population and the possibility to develop representative samples (Case 1). An understanding of agents and their actions (M1) can be developed using expert knowledge (EK), participant observation (PO), or interviews. Detailed agent attribute data (M2) can be elicited by conducting sample surveys, while behavioural data (M3) can be obtained from interviews, field experiments, role-playing games (RPG), or expert knowledge (EK). Assuming that the sample used to generate attributes and behavioural parameters is representative, proportional up-scaling can be carried out, in which each data point is cloned to generate the whole population (M5). This step needs to include spatial referencing if agent locations are an element of the model design. This sequence was implemented in an example that is explained in the following section. In cases where the behavioural diversity is high and assumptions on the representativeness of samples do not hold, variations to the first sequence have to be introduced. While steps M1 and M2 can remain unchanged, the behavioural complexity makes field experiments or expert knowledge less suitable in step M3. Instead, two alternative sequences that require the development of agent types (for disproportional up-scaling) can be used. Firstly, interviews can be conducted to generate behavioural data for development of agent types (Case 2). For each type, distributions of agent attribute values can be identified by surveys. This means that the actual sequence of this approach (following Fig. 1) is M1 / M3 / M4b / M2 / M5. Instead of developing agent types from behavioural data, types could be constructed from agent attributes (Case 3). This means that a survey could be conducted to elicit attributes and clustering techniques applied to develop agent types. Then, interviews can be targeted to people who match the profile of agent types. The behavioural data would then be disproportionally up-scaled to the whole population based on a process that assigns each agent to a type by mapping type-specific characteristics against census data. This approach sequences methods as M1 / M2 / M4a / M3 / M5. Cases 2 and 3 require census data to allow for up-scaling (M5). This type of up-scaling requires the identification of the agent types using census data, in order to determine the proportions of the population in each type. While both of these parameterisation sequences reduce uncertainty resulting from sample non-representativeness, both approaches introduce uncertainty from the development of agent types by clustering (i.e. disproportional upscaling). Below examples are given for these two approaches, the Dutch model and the SimPaSI model for East Kalimantan. In situations where representative samples can be developed in spite of high behavioural diversity (Case 4), expert knowledge or participant observation can provide broad systems understanding (M1). Participant observation allows for developing agent types based on attribute data (census or surveys) and behavioural data (interviews or role-playing games). The final up-scaling could be guided by census data. The MamaLuke model is described below to give an example for this particular sequence. Case 5 describes a situation where the actors are not accessible for interviews or surveys (e.g., because they are not responsive to surveys or due to funding limitations). In such cases agent classes can be identified based on expert knowledge or participant observation. This sequence requires the availability of disaggregated data A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 to provide agent attributes. Assumptions on behavioural responses can be developed from statistical analysis of time-series data or expert knowledge. Experts could then guide the mapping of behavioural assumptions into the agent population. In the next section, the DEED model gives an example for such an approach. Case 6 is another variation of the first sequence. This approach assumes that instead of a proportional sample, the development of agent types is based on behavioural data (M4b). Disproportional up-scaling could be performed with a census-based mapping, as described for Case 2. Case 7 involves relatively small populations. It is very similar to Case 1 but does not require any up-scaling (M5) because the whole population can be described, see for instance (Dray et al., 2006). Case 8 is another variation of Case 1 (and similar to Case 3). This case represents a large population with low behavioural diversity. It assumes that disaggregated census data are sufficiently available for providing agent attribute information. In such a case agent types could be statistically developed from census data or surveys (e.g., for disproportional up-scaling). Then, interviews could be conducted to obtain behavioural response data (M3) before finally behavioural data are mapped via census data into the agent population. The sequence would be M1 / M2 / M4a / M3 / M5. In cases with small populations and available census data, two different sequences can be used, depending on the behavioural diversity. In case of low behavioural diversity (Case 9) agent classes can be identified using expert knowledge, participant observation or role-playing games. Disaggregated census data would provide agent attributes. Agent attributes could further be characterised by adding expert knowledge (M4a). Then, behavioural data can be developed for each agent based on expert knowledge, field experiments, or roleplaying games. Experts could guide the process of refining behavioural assumptions for each agent. While publications on agent-based models mostly lack sufficient detail on parameterisation methods it seems as if Case 9 is quite common (Barnaud et al., 2010; Dung et al., 2009; Mathevet et al., 2003; Smajgl et al., 2009e). The case of high behavioural diversity (Case 10) only differs from Case 9 in that field experiments are likely to become unrealistic for M3 due to complications in designing experiments (Smajgl et al., 2008). The final two cases start with expert knowledge or participant observation to design system properties. Behavioural parameters are then developed through analysis of time-series data (e.g., An et al., 2011), while attribute data are available in the form of aggregated or disaggregated census data. Statistical techniques such as clustering or regression methods can be employed to process the time-series data into agent types (M4b). Aggregated census data has to be disaggregated through dasymetric mapping (Case 11) before behavioural data can be assigned for the whole agent population (i.e., down-scaling). If disaggregated census data are available they can be directly used for proportional up-scaling assumptions on agent behaviour (Case 12). The following section provides empirical examples for the first five cases listed in Table 2 5. Case examples 5.1. SimPaSI model for Central Java Purpose of the model: The SimPaSI model (Simulating Pathways to Sustainability in Indonesia) was implemented for the districts Demak, Jepara and Pati in Central Java. The goal was to provide capacity to analyse and foster a cross-scale dialogue across multiple levels of resource governance about the implications of plans to change fuel subsidies. Main model features: The SimPaSI model (Smajgl et al., 2009a,b) simulates in daily time steps livelihood related activities of about 841 3.2 million people. Within the model, these decisions take place in an environment that is created from best available (polygon-based) GIS data (elevation, land use) and with representations of hydrological dynamics (surface water, groundwater, soil moisture) as well as crop growth (trees, fruit, rice, tapioca, bamboo) and ecological elements (mangroves, corals). The model design document (including pseudo code) can be downloaded from http:// www.cse.csiro.au/downloads/DesignDoc_JaTeng.pdf. The user can vary the price for petrol, kerosene, and electricity, and the level of quarterly cash payments to people below the poverty line. Results are presented (in maps and charts) for poverty, unemployment, high water level risk, rice production, forest cover, coral area, and mangrove area. Sequence applied with details on approach: Empirical application of this model followed the sequence outlined in Case 1. Artificial households and their members were parameterised based on survey data. In total, 3500 households were surveyed to elicit demographic data, perceptions of importance of natural and social resources, and behavioural responses to a total of eight scenarios (i.e., fuel-price changes). The underlying assumption is that the stratified sample is representative and that behavioural responses can be proportionally up-scaled to initialise the total population. This means that each surveyed household is cloned about 350 times to create attributes for model agents. Technically, survey data is read during the initialisation process and instead of creating one agent from each entry line attributes of 350 agents are created and characterised. To assign values for many of these attributes (i.e. wages), value ranges (based on the minimum and maximum from the survey) were used instead of point values. As a result, households and their members are heterogeneous, even within the clusters of similar clones. 5.2. Dutch agriculture model Purpose of the model: the objective of this ABM was to analyse and explore how the response of farmers to changes in policy, expressed through socio-economic scenarios, affect the changes in the use, cover and structure of the land in a Dutch rural region (Valbuena et al., 2010a,b). In this region, farmers’ decisions included processes of farm cessation, farm expansion and protection of hedgerows and tree lines. Main features: In order to represent farm(er)s, 2700 agents were created on a grid that represented 1 ha. Each agent owned one or several fields and each field was formed by one or several pixels. Agents’ decisions were parameterised based on a probabilistic rather than a deterministic approach, in which a probability for a certain choice was assigned to each agent. A run of the model consisted of 15 time steps each representing the actions taken in one year. Agents could stop farming, sell or buy land and protect or clear hedgerows (Valbuena et al., 2010a,b). The outputs include spatially explicit changes in the landscape structure of the rural region. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 2. Only one agent class was identified (i.e., farmers) based on expert knowledge (M1). A sample survey of 300 farmers was used to elicit agent attributes (e.g., farm size, agribusiness type and age) and behavioural components (i.e., agents’ willingness to make certain choices). Both attributes and components were combined to build an agent typology, to describe these differences (M4a and M4b). Although the typology was built based on expert knowledge, it was supported by quantitative analyses (e.g., cluster analysis and classification regression trees). Based on this typology and on the census data, the attributes and the behavioural parameters were assigned to each agent of the whole population (Valbuena et al., 2008). 842 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 5.3. SimPaSI model for East Kalimantan Purpose of the model: In this case study, the SimPaSI model was implemented for six districts of the Indonesian province of East Kalimantan. The goal was to provide a simulation tool that facilitates a dialogue across governmental levels in the context of potential fuel subsidy reductions. The underlying idea was to simulate the impacts that central government interventions could have on poverty in different parts of Indonesia. The two selected regions were Central Java and East Kalimantan. Main model features: This implementation of the SimPaSI model (Smajgl et al., 2009c,d) is similar to the Central Java implementation described above. However, land uses (trees, rubber, rattan, fruit) and ecological variables (fish, honey, deer, dolphins, hornbill) differ. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 3. The development of a dataset for parameterising households and their members was based on the assumption that representative sample could not be gathered. While about 53% of the 1.8 million people to be included in the model live in (peri-)urban areas, the rest live in 99% of the area, which is mostly quite remote. Additionally, the complexity of the context required a larger set of data than for the first case example, which translated into interviews of unrealistic length (up to 4 h based on pilot tests). Therefore, field work was structured to identify all possible types in a first step and elicit behavioural response functions for each type separately (Bohensky et al., 2007; Smajgl et al., 2007; Trébuil et al., 1997). In a first step, a survey of 3000 people elicited data on demography, values, and livelihood strategies. In a second step, a K-means cluster analysis was performed based on regression trees, given that most of the variables were categorical. This enabled the development of a decision tree (with a limited set of variables), through which a type could be assigned to each household in the area. This step was followed by validation workshops to test the plausibility of clusters. Then, interviews with 530 households were carried out to elicit behavioural responses to eight scenarios (i.e. fuel prices). These households were identified based on the variables with highest discriminatory power for each cluster. The behavioural responses were scaled up disproportionally and conducted based on (disaggregated) household-level census data, with behavioural response data (defined for types) assigned to each household (in the census). 5.4. MameLuke settlement model Purpose of the model: The MameLuke Settlement model, which has been constructed using the MameLuke (multi-actor modelling of land use and land-cover changes) framework for the development of agent-based models (Huigen, 2004), was implemented for the Isabela province in the Philippines. The aim of the model is to understand the settlement processes in the area. Understanding the settlement process and the spatial effects of population growth in the area are of pivotal importance for the future land use and land-cover change (Huigen, 2004; Huigen et al., 2006). Main model features: The settlement model for the Isabela province simulates all demographic processes relevant to population growth, including migration. The ethnicity is a major driver for the location of settlement of a household. Throughout the simulated years households settle following different rules defined by their ethnic preference structure. Furthermore, the model simulates spatially explicit household land-use behaviour (including deforestation) over a period of 100 years (starting in 1900) and measures changes in land use and cover. Key outcomes of the model are measured using an ‘ethnic spread’ fitness function. Sequence applied with details on approach: Empirical application of this model followed that sequence outline for Case 4. Participant observation was the dominant technique used for the parameterisation of human agents. Participant observation was used to create a principle systems understanding (M1), which includes the identification of ethnic groups as a major driver for the creation of settlements. In total 44 unstructured qualitative interviews in combination with 7 focus group sessions were used to obtain empirical data. The interviews were stratified on age, gender and ethnicity. Attributes and behavioural traits of agents were defined iteratively. While Fig. 1 could be interpreted as linear sequences of methods, the MameLuke implementation was conducted through iteration among steps M2, M3 and M4a/b, using participant observation. The iterative approach allowed for refining the consistency of assumptions on agent behaviour and attributes. Unstructured interviews were conducted to elicit the base structure of an agent typology. This structure was then used as a reference point in focus group discussions. Up-scaling to the total 3000 households that live in the area has been done using the aforementioned agent-type identification technique. Based on the attribute ethnicity in combination with the age of the household (at the specified simulated year) a rule-set for settlement decisions and for land use decisions were generated using the empirical datasets. 5.5. DEED model of ex-urban developments in South-eastern Michigan Purpose of the model: The Dynamic Ex-urban Ecological Development (DEED) was developed to understand the evolution of exurban landscapes in South-eastern Michigan, USA, and identify opportunities to introduce landscape designs that provide enhanced ecosystem services in these heterogeneous humandominated landscapes. Main model features: A key feature of the DEED model is that it represents the interacting behaviours of land developers and residential home buyers (Brown et al., 2008; Robinson and Brown, 2009; Zellner et al., 2009). Land developers are included because of their importance in structuring the supply of residential landscapes to residential land buyers. Each of three different developers builds one of three different development types. The development types are distinguished by their density, where they locate relative to roads and aesthetic features, and their effects on the amount of tree cover on the landscape, thereby providing both aesthetic value to residents and ecological services. Each time a developer creates a development, lots within the development become available to residents, who are created by the model at a user-specified rate per time step. A given developer cannot create a new development until a specified threshold percentage of lots in his existing developments are sold to residents. Sequence applied with details on approach: Empirical support for the developers in this model followed that sequence outline for Case 5. Empirical support for residents was described elsewhere (Brown and Robinson, 2006). The development types were created as a working hypothesis about differentiation among development types within South-eastern Michigan based on expert knowledge. Empirical information was needed to test the differentiation among the development types and to represent the developer behaviours related to selection of sites for developments. The first type of empirical information was largely definitional (i.e., do these types exist?) and was observed by identifying multiple developments in parcel data, geographically overlaying the subdivisions on digital aerial imagery to observe differences in landscape characteristics, and accessing tax records for selected subdivisions to observe differences in house and price characteristics (Brown et al., 2008). A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Once the differences among development types were confirmed, important behavioural information about the development types, and by inference the developers of those different types, was observed through analysis of time-series information on a sample of developments created between 1950 and 2000 (An and Brown, 2008). Subdivisions were identified in GIS-based parcel data and labelled for type based on parcel sizes and layouts as well as landscape characteristics observed in scoincident aerial photography. The dates of each development were determined using historical aerial photography, collected approximately decadally over the study period. Survival analysis was used to determine the locational characteristics that determined the likelihood of given farm being converted to each of the three different development types. The coefficients estimated by survival analysis were used as parameters in the developer models (i.e., one for each developer type). 6. Concluding remarks This paper developed a framework for the parameterisation of attributes and behavioural response functions of human agents in agent-based models for a socio-ecological context. This is a first step to allow for systematic testing, documentation and communication of empirical parameterisation methods that can be compared across various modelling contexts. We identified multiple methods that can be used at each of five iterative or sequential steps in the process of creating an empirically informed agent-based model. Until now, methodological choices have often been made in an ad hoc manner, depending on the researcher’s conceptual understanding, methodological training, and access to data. We classified the various modelling contexts according to the size of the agent population, their behavioural diversity, and the ability of researchers to acquire a representative sample of agent attributes or behaviours. This classification provides a structure that can be used to evaluate the appropriateness of various combinations of methods in different contexts, and can formalize these combinations in a way that can guide choices and facilitate communication of those choices among modellers and to practitioners who use model results. Such an effort to systematically understand the effectiveness of methodologies employed in ABM for the parameterisation of behavioural aspects is critical because the (technical) strength of ABM, which is its flexibility and capacity in representing human decision making, has to be based on a robust empirical support. Without translating this advantage into empirical contexts, the applied value of ABM remains limited and the focus remains on hypothetical and theoretical analyses. The framework for and describing different parameterisation sequences should also be tested in various modelling (Janssen and Ostrom, 2006; Robinson et al., 2007) and other application contexts (Matthews et al., 2007) such as the growing domain of Social Epidemiology in public health research (Kaplan, 2004) or agent-based economics (Luna and Perrone, 2002). Further refinement and testing of this framework would allow for reducing model uncertainty, for provision of systematic guidelines for building and applying empirical ABMs, and for facilitation of the model comparisons (Parker et al., 2008a,b). Particular sequences could be assessed by comparing their performance, which links directly into the related topic of model validation (Grimm et al., 2005; Moss, 2008; Smajgl et al., 2011). The framework described in this paper is a first step in this process. The community of ABM researchers needs to assess the framework and whether it is sufficient to cover all existing or potential approaches for empirical agent-based models. That evaluation should include an assessment of situations in which the most appropriate sequence of methods can be determined at the beginning of a project, based on the characteristics of the problem at hand, and when appropriate sequencing might not 843 be clear at the beginning, but needs to emerge through iteration across the various steps. Opportunities for extension of the framework might appear as methods for the parameterisation for the social networks of agents and the agent environment are refined (Fig.1). For example, the requirement for real-world data on social network might change the approach to obtaining behavioural data for individual agents, as individual behaviours might be dependent on how these individuals interact with each other. References An, L., Brown, D.G., 2008. Survival analysis in land-change science: integrating with GIS and remote sensing in Southeastern Michigan. Annals of the Association of American Geographers 98 (2), 323e344. An, L., Brown, D.G., Nassauer, J.I., Low, B., 2011. Variations in development of exurban residential landscapes: timing, location, and driving forces. Journal of Land Use Science 6 (1), 13e32. Baker, T.J., Miller, S.N., Prager, S.D., Legg, D.E., 2010. Disaggregating human population for improved land use management in Kenya. Journal of Land Use Science 5 (4), 237e257. Barnaud, C., Van Paassen, A., Trébuil, G., Promburom, T., Bousquet, F., 2010. Dealing with power games in a companion modelling process: lessons from community water management in Thailand Highlands. The Journal of Agricultural Education and Extension 16 (1), 55e74. Barreteau, O., 2003. The joint use of role-playing games and models regarding negotiation processes: characterization of associations. JASSS e The Journal of Artificial Societies and Social Simulation 6 (2). http://jasss.soc.surrey.ac.uk/6/2/ 3.html. Becker, H.S., 1958. Problems with inference and proof in participant observation. American Sociological Review 23 (6), 652e660. Berger, T., Schreinemachers, P., 2006. Creating agents and landscapes for multiagent systems from random samples. Ecology and Society 11 (2). Bohensky, E., Smajgl, A., Herr, A., 2007. Calibrating behavioural variables in agent-based models: insights from a case study in East Kalimantan, Indonesia. Paper Presented at the MODSIM 2007 International Congress on Modelling and Simulation. Brown, D.G., Robinson, D.T., 2006. Effects of heterogeneity in residential preferences on an agent-based model of Urban Sprawl. Ecology and Society 11 (2). Brown, D.G., Robinson, D.T., An, L., Nassauer, J.I., Zellner, M., Rand, W., et al., 2008. Exurbia from the bottom-up: confronting empirical challenges to characterizing complex systems. GeoForum 39 (2), 805e818. Cooke, R.M., Goossens, L.L.H.J., 2008. TU Delft expert judgment data base. Reliability Engineering & System Safety 93 (5), 657e674. Dray, A., Perez, P., Jones, N., Le Page, C., D’Aquino, P., White, I., et al., 2006. The AtollGame experience: from knowledge engineering to a computer-assisted role playing game. JASSS e The Journal of Artificial Societies and Social Simulation 9 (1), 6. http://jasss.soc.surrey.ac.uk/9/1/6.html. Dung, L.C., Hoanh, C.T., Le Page, C., Bousquet, F., Gajaseni, N., 2009. Facilitating dialogue between aquaculture and agriculture: lessons from role-playing games with farmers in the Mekong Delta. Vietnam Water Policy 11 (S1), 80e93. Grimm, V., Berger, U., Bastiansen, F., Eliassen, S., Ginot, V., Giske, J., et al., 2006. A standard protocol for describing individual-based and agent-based models. Ecological Modelling 198 (1e2), 115e126. Grimm, V., Berger, U., DeAngelis, D.L., Polhill, J.G., Giske, J., Railsback, S.F., 2010. The ODD protocol: a review and first update. Ecological Modelling 221 (23), 2760e2768. Grimm, V., Revilla, E., Berger, U., Jeltsch, F., Mooij, W.M., Railsback, S.F., et al., 2005. Pattern-oriented modeling of agent-based complex systems: lessons from ecology. Science 310 (5750), 987e991. Harrison, G., List, J., 2004. Field experiments. Journal of Economic Literature 42 (4), 1009e1055. Huigen, M.G.A., 2004. First principles of the MameLuke multi-actor modelling framework for land use change, illustrated with a Philippine case study. Journal of Environmental Management 72 (1e2), 5e21. Huigen, M.G.A., Overmars, K.P., de Groot, W.T., 2006. Multiactor modeling of settling decisions and behavior in the San Mariano Watershed, the Philippines: a first application with the MameLuke framework. Ecology and Society 11 (2). Janssen, M., Ostrom, E., 2006. Empirically based, agent-based models. Ecology and Society 11 (2). http://www.ecologyandsociety.org/vol11/iss32/art37/ Art. 37. Jupp, V., 2006. The Sage Dictionary of Social Research Methods. Sage Publications, London. Kaplan, G.A., 2004. What’s wrong with social epidemiology, and how can we make it better? Epidemiologic Reviews 26, 124e135. Luna, F., Perrone, A., 2002. Agent-based Modelling in Economics and Finance: Simulations in SWARM. Kluwer, Massachusetts, Dordrecht. Martin, L.R., 1977. Survey of Agriculture Economics Literature: Quantitative Methods in Agricultural Economics, 1940’se1970’s, vol. 4. University of Minnesota Press, Minneapolis. Mathevet, R., Bousquet, F., Le Page, C., Antona, M., 2003. Agent-based simulations of interactions between duck population, farming decisions and leasing of hunting rights in the Camargue (Southern France). Ecological Modelling 165 (2e3),107e126. Matthews, R., Gilbert, N., Roach, A., Polhill, J., Gotts, N., 2007. Agent-based land-use models: a review of applications. Landscape Ecology 22 (10), 1447e1459. 844 A. Smajgl et al. / Environmental Modelling & Software 26 (2011) 837e844 Mennis, J., Hultgren, T., 2006. Intelligent dasymetric mapping and its application to areal interpolation. Cartography and Geographic Information Science 33 (3), 179e194. Moss, S., 2008. Alternative approaches to the empirical validation of agent-based models. Journal of Artificial Societies and Social Simulation 11 (1), 5. http:// jasss.soc.surrey.ac.uk/11/1/5.html. Nichols, P., 1991. Social Survey Methods: A Fieldguide for Development Workers. Oxfam, Oxford. Parker, D.C., Brown, D., Polhill, J., Deadman, P.J., Manson, S.M., 2008a. Illustrating a new ‘conceptual design pattern’ for agent-based models of land use via five case studies e the Mr. Potatohead framework. In: López Paredes, A., Hernández Iglesias, C. (Eds.), Agent-Based Modelling in Natural Resource Management. Pearson Education, Upper Saddle River, New Jersey, pp. 23e51. Parker, D.C., Entwisle, B., Rindfuss, R.R., Vanwey, L.K., Manson, S.M., Moran, E., et al., 2008b. Case studies, cross-site comparisons, and the challenge of generalization: comparing agent-based models of land-use change in frontier regions. Journal of Land Use Science 3 (1), 41e72. Parker, D.C., Manson, M.S., Janssen, M.A., Hoffmann, M.J., Deadman, P., 2003. Multiagent system models for the simulation of land-use and land-cover change: a review. Annals of the Association of American Geographers 93 (2). Patzer, G.L., 1996. Experiment-Research Methodology in Marketing: Types and Applications. Quorum Books, Westport, CT. Rees, P., Martin, D., Williamson, P., 2002. The Census Data System. John Wiley and Sons, New York. Robinson, D.T., Brown, D.G., 2009. Evaluating the effects of land-use development policies on ex-urban forest cover: an integrated agent-based GIS approach. International Journal of Geographical Information Science 23 (9), 1211e1232. Robinson, D.T., Brown, D.G., Schreinemachers, P., Janssen, M.A., Huigen, M., Wittmer, H., et al., 2007. Comparison of empirical methods for building agentbased models in land use science. Journal of Land Use Science 2 (1), 31e55. Rousseeuw, P.J., 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53e65. Smajgl, A., Bohensky, E., Bohnet, I., 2007. Capturing heterogeneity in empirical agent-based models: a guideline. Paper Presented at the Fifth ESSA Conference, Toulouse. Smajgl, A., Carlin, G., Bohensky, E., House, A., Kurnia, A.S., Butler, J., et al., 2009a. Assessing Impacts of Fuel Subsidy Decisions on Poverty and Fish Catch in Central Java, Indonesia: An Agent-based Analysis. CSIRO Sustainable Ecosystems, Townsville. Smajgl, A., Carlin, G., House, A., Kurnia, A.S., Butler, J., Sugiyanto, C., et al., 2009b. Design Document for Agent-based Model SimPaSI Jawa Tengah. CSIRO Sustainable Ecosystems, Townsville. http://www.cse.csiro.au/downloads/ DesignDoc_JaTeng.pdf. Smajgl, A., Carlin, G., House, A., Pambudhi, F., Butler, J., Bohensky, E., et al., 2009c. Simulating Pathways to Sustainability in Indonesia: Design Document for Agent-based Model SimPaSI East Kalimantan. CSIRO Sustainable Ecosystems, Townsville. http://www.cse.csiro.au/downloads/DesignDoc_JaTeng.pdf. Smajgl, A., Carlin, G., Pambudhi, F., Bohensky, E., House, A., Butler, J., et al., 2009d. Assessing Impacts of Fuel Subsidy Decisions on Poverty and Forest Depletion in East Kalimantan, Indonesia: An Agent-based Analysis, vol. 2. CSIRO. Smajgl, A., Heckbert, S., Ward, J., Straton, A., 2009e. Simulating impacts of water trading in an institutional perspective. Environmental Modelling and Software 24, 191e201. Smajgl, A., Heckbert, S., Bohnet, I., Carlin, G., Hartcher, M., McIvor, J., 2008. Simulating the Grazing System in the Bowen Broken Catchment: An Agent-based Modelling Approach. CSIRO, Townsville. Smajgl, A., House, A., Butler, J., 2011. Implications of ecological data constraints for integrated policy and livelihoods modelling: an example from East Kalimantan, Indonesia. Ecological Modelling 222, 888e896. Trébuil, G., Kam, S.P., TurkelBoom, F., Shinawatra, B., Teng, P.S., Kropff, M.J., et al., 1997. Systems diagnoses at field, farm and watershed levels in diversifying upland agroecosystems: towards comprehensive solutions to farmers’ problems. In: Applications of Systems Approaches at the Farm and Regional Levels. Kluwer Academic, The Hague. 99e114. Valbuena, D., Verburg, P.H., Bregt, A.K., 2008. A method to define a typology for agent-based analysis in regional land-use research. Agriculture, Ecosystems & Environment 128 (1e2), 27e36. Valbuena, D., Verburg, P.H., Bregt, A.K., Ligtenberg, A., 2010a. An agent-based approach to model land-use change at a regional scale. Landscape Ecology 25, 185e199. Valbuena, D., Verburg, P.H., Veldkamp, A., Bregt, A.K., Ligtenberg, A., 2010b. Effects of farmers’ decisions on landscape structure of a Dutch rural region: an agentbased approach. Landscape Urban Planning 97, 98e110. Wu, B.M., Birkin, M.H., Rees, P.H., 2008. A spatial microsimulation model with student agents. Computers, Environment & Urban Systems 32 (6), 415e488. Zellner, M.L., Page, S.E., Rand, W., Brown, D.G., Robinson, D.T., Nassauer, J., et al., 2009. The emergence of zoning policy games in exurban jurisdictions: informing collective action theory. Land Use Policy 26 (2), 256e367.