The cause-effect associations between geographical phenomena are an important focus in ecological research. Recent studies in structural equation modeling (SEM) demonstrated the potential for analyzing such associations. We applied the variance-based partial least squares SEM (PLS-SEM) and geographically-weighted regression (GWR) modeling to assess the human-climate impact on grassland productivity represented by above-ground biomass (AGB). The human and climate factors and their interaction were taken to explain the AGB variance by a PLS-SEM developed for the grassland ecosystem in Inner Mongolia, China. Results indicated that 65.5% of the AGB variance could be explained by the human and climate factors and their interaction. The case study showed that the human and climate factors imposed a significant and negative impact on the AGB and that their interaction alleviated to some extent the threat from the intensified human-climate pressure. The alleviation may be attributable to vegetation adaptation to high human-climate stresses, to human adaptation to climate conditions or/and to recent vegetation restoration programs in the highly degraded areas. Furthermore, the AGB response to the human and climate factors modeled by GWR exhibited significant spatial variations. This study demonstrated that the combination of PLS-SEM and GWR model is feasible to investigate the cause-effect relation in socio-ecological systems.
Natural grasslands and savannas occupy nearly half of the terrestrial area of the globe (Moore, 1966; Chapin et al., 2001) and provide important services to human societies. Grassland vegetation not only provides food for livestock but also contributes considerably to the global carbon recycle. Grasslands are sensitive to human activities and weather variations (Lauenroth et al., 1999; Gang et al., 2014). Climate change and human interference are commonly recognized as the two broad underlying drivers that lead to grassland degradation (Gang et al., 2014; Zhang et al., 2016). Understanding the factors affecting grassland productivity is increasingly critical to the conservation and, in some cases, to the restoration of these fragile ecosystems (De Luis et al., 2006; Cutler et al., 2008). The grassland ecosystem is essentially a paired socio-ecological system (SES) that involves complicated interactions related to both socio-economic activities and natural factors. Plenty of studies trying to examine the impact of human activities or climate changes on vegetation growth have been reported (e.g., Li et al., 2013). There are three levels of models in studying SES, namely, loose coupling, tight coupling, and integrated modeling (Antle et al., 2001). While much attention has been paid to studying systemic changes of ecological and economic systems independently, the attempt for taking a coupled SES approach is still rare (Stannard and Aspinall, 2011; Filatova et al., 2013; Polhill et al., 2016). Two strategies, i.e., bottom-up and top-down, are usually taken in the integrated SES modeling. The bottom-up strategy starts from decomposed elements/components and models their relationships based on some selected conceptual frameworks. On the contrary, the top-down modeling starts from the overall performance of a socio-ecological system and attributes the performance to individual elements and/or to their interaction. A systematic study frame for better discovering casual relationship in SES is still required.
Over the last decade, structural equation modeling (SEM) has been applied in many scientific disciplines (Astrachan et al., 2014; Sarstedt et al., 2014; Surridge et al., 2014). As a second-generation multivariate statistical approach, SEM has played a prominent role in the SES (Liu et al., 2014; Lowry and Gaskin, 2014). SEM possesses substantial advantages over the first generation techniques (Chin, 1998; Banerjee and Hine, 2016). SEM model, which is usually composed of a measurement model and a structural model, can reveal causal relationships through introducing latent variables that relate observable variables (indicators). The latent variables, as opposed to the observable variables, are not directly observable but inferred from the observed ones. In the measurement model (or outer model), a set of latent variables are aggregated from the observable data to form an underlying concept, which greatly reduces the dimensionality of the original variables and makes the data easier to understand (Haenlein and Kaplan, 2004; Hair et al., 2014). In the structural model (or inner model), the latent variables are connected with each other according to a substantive theory in the form of simple connected digraphs (Monecke and Leisch, 2012). The latent variables in the SEM model are divided into two classes, exogenous (without any predecessors) and endogenous (having at least a predecessor). Designed for working with multiple equations simultaneously, SEM offers a number of advantages over the traditional statistics such as regression analysis or factor analysis (Sarstedt et al., 2014). SEM has been popular in many disciplines and allows great flexibility on equation specification (Hair et al., 2009; Monecke and Leisch, 2012). Overall, SEM model provides an effective tool to assess latent variables at the measurement model and test the relationships between the latent variables on the structural model (Bollen, 1989; Hair et al., 2012).
Two most popular types of SEM modeling were identified, i.e., the covariance-based technique or CB-SEM (Jö reskog, 1993) and the variance-based partial least square or PLS-SEM (Wold, 1982, 1985). Although both modeling types share the same roots (Jö reskog and Wold, 1982), earlier studies have focused primarily on CB-SEM. In recent years, PLS-SEM has expanded rapidly with recognition of being a distinctive and innovative multivariate analysis method (Henseler et al., 2009). PLS-SEM is a soft-modeling-technique with minimum demands regarding the measurement scales, sample sizes, and residual distributions (Tenenhaus et al., 2005). It is capable of fully revealing the inner characteristics of the observed data (Bagozzi and Yi, 2012). While CB-SEM estimates model parameters so that the discrepancy between the estimated and the sample covariance matrices is minimized, PLS-SEM maximizes the explained variance of the endogenous latent variables by estimating partial model relationships in an iterative sequence of ordinary least squares (OLS) regressions (Astrachan et al., 2014). PLS-SEM is essentially a regression-based modeling approach, which uses a component-based technique in analyzing path models (Hair et al., 2012). In addition, Bayesian SEM (or simply BSEM) and hierarchical SEM (HSEM) have also been proposed to empower multiple variance analysis over the last few years. BSEM has received escalating attention in SEM applications due to its flexibility in use; HSEM, also known as multilevel SEM, analyzes hierarchically clustered data by specifying the direct and indirect causal effect between clusters (Eisenhauer et al., 2015; Fan et al., 2016). Further reading on the definitions and applications of the SEMs can be found in the literatures (e.g., Chin, 2010; Hair et al., 2011; Astrachan et al., 2014). Detailed descriptions of the development and use of PLS-SEM models are given by Hair et al. (2011, 2014) and Henseler et al. (2016).
In this paper, a PLS-SEM model was developed to assess the influence from human and climate stresses on grassland vegetation productivity in Inner Mongolia, China. As the local economy depends heavily on livestock industry (through forage supply), it is thus necessary to understand the associations between the vegetation productivity and the influencing factors. Instead of using CB-SEM, the selection of PLS-SEM is based on the following considerations. First, unlike CB-SEM that requires the multivariate normal distribution for the input data, PLS-SEM relaxes the assumption of normal distribution and can obtain explicitly the estimated latent variable scores in the process of parameter estimation (Shen et al., 2016). For most socio-ecological systems including the grassland ecosystem examined in the study, this normality assumption cannot be guaranteed. Second, PLS-SEM has the advantage of the flexibility for the use of a single measurement construct (Hair et al., 2014). In this study, we were only interested in the vegetation productivity using above-ground biomass (AGB) as the proxy, AGB being the only indicator for the endogenous variable. Third, PLS-SEM is suitable for predictive research, which is usually not the case for CB-SEM (Surridge et al., 2014; Henseler et al., 2016). We expected that the result would be helpful in predicting forage quantity for livestock industry.
The main objectives of the study were to (1) develop a PLS-SEM model for exploring the cause-effect relationships in SESs; (2) quantitatively analyze the impact of the major human and climate factors (human-climate factors hereafter) and their interactions on the grassland productivity based on the PLS-SEM model; and (3) unravel the sensitivity and spatial variability of AGB response to the human-climate factors using geographically weighted regression (GWR) modeling.
The Inner Mongolia Autonomous Region (IMAR) of China was selected as the study area (Fig. 1). As a typical grassland region of the Eurasian continent, it has an area of 1.2× 106 km2, of which nearly 60% is covered with grassland vegetation. The annual average temperature in the growing season (April-September) ranges from 12.1° C in the north to 25.2° C in the south. The annual precipitation varies spatially from < 200 mm in the west to nearly 600 mm in the east, most of which is concentrated in July and August. The western section of the study area is an arid region (annual precipitation < 200 mm), the middle and the east is a semi-arid region (annual precipitation in the ranges of 200-400 and 400-600 mm, respectively) (Sha et al., 2016). Previous studies revealed that grassland productivity was affected by human activities and climate fluctuations (Mu et al., 2013; Sun et al., 2015). Specifically, the increasing human population and intensified grazing coupled with arid climate condition were identified as the primary threats to the sustainable development of the grassland ecology (Christensen et al., 2004). To the best of our knowledge, no quantitative assessment on the coupled impacts from the human activities and climate variations was conducted for the grassland system so far.
To evaluate the impact on vegetation productivity from human activities, we derived three indices, i.e., grazing intensity, road density, and urban density. A systematic field sampling of grassland was conducted in the summers of 2011-2013 and 676 samples were collected for AGB and the corresponding grazing intensities. The 676 sampling locations are shown in Figure 1. The exact sample locations were recorded using a handheld global position system (GPS) unit with an accuracy of ± 15 m. Within each sample site, five evenly distributed 1 m× 1 m quadrats were harvested for AGB. All plants in the quadrats were clipped at the ground surface and were oven-dried at 80° C for 24-48 h, or until a constant dry biomass was recorded. The grazing intensity was categorized into 6 levels, namely, extreme heavy (5), heavy (4), medium (3), moderate grazing (2), light grazing (1), and no grazing (0). Two vector layers about the road network and urbanized areas (including cities and villages) were extracted from the publicly available OpenStreetMap dataset (Zhang et al., 2015). Maps of road density and urban density were processed by the population weighted “ Point Density” and “ Line Density” analysis, respectively, and rasterized as grid layers with a spatial resolution at 1 km× 1 km to denote the accessibility to transportation and urbanized areas. The resultant pixel values indicated the influential levels from the human activities due to the accessibility to road network and urbanized areas.
To evaluate the climate impact on vegetation productivity, we regarded temperature and climate dryness as the most important indices. Annually, averaged temperature and total precipitation variables were collected from 680 meteorological stations across the entire China (http://cdc.cma.gov.cn). Those climate variables were spatially interpolated as the grid maps at 1 km× 1 km using an ANUSPLIN approach (Hutchinsom, 1999). This interpolation approach calculates and optimizes thin plate smoothing splines fitted to data sets distributed across an unlimited number of climate station locations. A general model for a thin plate spline function f(x) fitted to n data values zi at positions xi is given by,
zi=f(xi)+ε i, (1)
where, i=1, 2, ..., n, and xi represent the longitude, latitude and scaled elevation. The values of ε i are zero mean random errors and have a covariance matrix Vσ 2, where V is a known positive definite n× n matrix, usually diagonal, while σ 2 is usually unknown. The function f is estimated by minimizing:
((Z-F)TV-1Z-F)+ρ Jm(F), (2)
where, Z=(z1, …, zn)T, F=(f1, …, fn)T, and T denotes the matrix transpose, with fi=f(xi) and Jm(F) being a measure of the roughness of the spline function fdefined in terms of mth order partial derivatives of f. Digital elevation model (SRTM30 from http://srtm.usgs.gov) was resampled to the same spatial resolution as that of the climate variables and used during the interpolation process. Error analysis of the interpolated result was conducted by a comparison with the actual measurements using 53 meteorological stations located within IMAR. The climate maps were cropped to the boundary of IMAR.
From the above steps, an integrated GIS dataset was created to develop the PLS-SEM model. Climate dryness (CD_A) index was defined as the negative value of the total precipitation (i.e., precipitation), meaning that higher precipitation corresponded to lower climate dryness and vice versa. The descriptive statistics of the dataset were summarized in Table 1. As the original AGB variables were found to be highly skewed, it was square-rooted to derive a new variable AGBsqrt (square root of AGB), which was close to a normal distribution.
![]() | Table 1 Parameters of partial least-square structural equation modelling (PLS-SEM) in this study |
The objective of the study was to examine the effects of the human-climate factors on AGB. PLS-SEM assumes a minimum of data requirement and is an excellent tool for studying the causal relationships between multiple variables in ecological systems (Hair et al., 2009). It is worth pointing out that recent studies have suggested that climate changes and socio-economic activities could interact to create a combined or coupled effect on the grassland ecology, and thus their interactions should be taken into consideration when studying the coupled impacts of the human-climate factors for most socio-ecological systems. Accordingly, human activity (Human_A) and climate factor (Climate) along with their interaction (Interaction) were taken as the exogenous latent variables and vegetation productivity was set as the endogenous latent variable in the model specification (Fig. 2). The three variables from the previously processed dataset, including grazing intensity (Grazing hereafter), the density of urbanized area (Urban_D) and the road density (Road_D), were taken as indicators for Human_A, and the two indicators of the annual average temperature (Temp_A) and the climate dryness (CD_A) were used for Climate.
A few approaches have been proposed to analyze the interaction effect between multiple latent variables for PLS-SEM (Henseler and Chin, 2010). For instance, the product indicator approach (PIA) has been developed to quantitatively reflect the magnitude of such a coupled effect (Chin et al., 2003). In PIA, a new latent variable, or the product indicator, namely Interij, through all pairwise products of the indicators from the human-climate factors, was introduced as the interaction term,
Interij = Human_Ai× Climatej, (3)
whereiis one of the climate variables (CD_A or Temp_A), and j is one of the human activity variables (Grazing, Urban_D or Road_D). Interij is then taken as the interaction of human activities (Human_A) and climate conditions (Climate). While the endogenous score had a single indicator (i.e., AGBsqrt), the exogenous variable scores were estimated as the exact linear combinations of their associated indicators (observed). Therefore, they were treated as error free substitutes for the indicators (Tenenhaus et al., 2005; Li et al., 2014). In this PLS-SEM model specification, the reflective type was specified because the constructs were reflectively measured latent variables that did not share a common cause but rather formed a general concept that mediated the influence on subsequent endogenous variables (Chin, 1998). This model specification was analyzed in SmartPLS 2.0, a stand-alone software specialized in PLS path models (http://www.smartpls.de).
The process entailed two stages: the establishment of a measurement model and the building of a structural model (Fig. 2). The measurement model establishment involved the analysis of the loadings of indicators for the human-climate factors. The structural model building included computing the hypothesized relationships between the exogenous variables and endogenous variable (vegetation productivity in this study). After the process was implemented, the loadings of observable indicators, the path coefficients between exogenous variables and endogenous variable and the coefficient of determination would be available for evaluation.
Regression analysis, which can be largely classified as global model (e.g., based on ordinary least square or OLS) and local model, is a good tool to identify the relationships between multiple variables. Multicollinearity between input variables was proved to be a problem in regression analysis and needs to be checked through variance inflation factor (VIF) and condition number, where VIF values greater than 10 indicated presence of multicollinearity (Wabiri et al., 2016). The adjusted coefficient of determination (Adjusted R2) and corrected Akaike information criterion (AICc) usually serve as a measure of goodness of fitting to determine whether global or local model should be applied. The concept for applying AICc here is to determine which model (global or local) could interpret data better. GWR modeling is an exploratory technique mainly intended to identify where non-stationary takes place on the map, namely exploring spatial heterogeneity (Brunsdon et al., 1998). In this study, the local spatial regression model of AGB against the human-climate factors was developed in the form of GWR, whose regression coefficients were expected to reveal local spatial variation and whose standard errors of coefficients were expected to indicate the reliability of the estimated coefficients. The GWR model was specified as,
AGB=a× Human_A+b× Climate+c× Interaction+e, (4)
where a, b and c are the regression coefficients of the factor scores and e is the error term. The spatial distribution of the regression coefficients could indicate how sensitively the AGB responded to any change in the corresponding variables (Brunsdon et al., 1998). ArcGIS (ArcGIS 10.3; ESRI Inc., Redlands, CA, USA) was used to implement the GWR model and create the maps. The Natural Breaks (Jenks) method was used to classify the regression coefficients of the independent variables and their standard errors.
A three-stage model assessment, namely the measurement model testing, the structural model testing, and the significant test, was performed. In the first stage, the reliability and validity tests were performed for the outer model (Hulland, 1999; Gefen et al., 2000). In the second stage, the coefficient of determination (R2) for the structural model was computed to evaluate the explanatory power of the model about the relationship between the exogenous and the endogenous latent variables in respect of the target variable (i.e., AGBsqrt). In addition, the standardized path coefficients (β ) were also examined because they indicated the strengths of relationships between the exogenous and the endogenous variables (Chin, 1998). Lastly, the significance of the outer model loadings for the indicators and the inner path coefficients indicating the relationship between the latent variables were tested by using the bootstrapping function (Hair et al., 2011). Detail information regarding the model assessment result is delineated as follows.
2.4.1 Measurement model evaluation
Evaluation of the measurement model is to examine the reliability and validity of the latent variables (constructs) in the model. It determines how well the indicators (input variables) load on the theoretically defined constructs. This can be carried out in the following four stages, i.e., reliability analysis of each individual indicator; internal consistency reliability; convergent validity; and discriminant validity of each construct (Table 2).
Individual indicator reliability is the extent to which measurements of the constructs measured with multiple indicators reflects the true score of the constructs relative to the error. It is assessed by calculating square of the standardized loading (SSL) of each indicator and that value should be over 0.4, in a reflective model, to show that the construct is able to explain the indicator (Hulland, 1999).
Internal consistency reliability is used to check how well a construct is measured by its assigned indicators. Internal consistency reliability can be evaluated using Joreskog’ s composite (construct) reliability (CR) (Jö reskog, 1971). It is suggested that 0.7 is used as a benchmark for modest CR (Chin, 1998; Hair et al., 2011).
Convergent validity is the measure of the internal consistency. It is estimated to ensure that the items assumed to measure the construct are correct while do not actually measure other constructs (Hulland, 1999). Convergent validity of the construct is determined by average variance extracted (AVE) (Gefen et al., 2000). AVE higher than 0.5 is usually viewed as acceptable (Hair et al., 2011).
Discriminant validity indicates the extent to which a given construct is different from other constructs (Hulland, 1999). It is tested through analysis of average variance extracted by using the criteria that a construct should share more variance with its measures than that with other constructs in the model (Fornell and Larcker, 1981). This is examined by comparing the AVE of construct shared on itself and other constructs. For valid discriminant of construct, AVE shared on itself should be higher than that with other constructs (Chin, 1998).
![]() | Table 2 Reliability and validity test for the measurement model |
2.4.2 Structural model evaluation
Provided that the measurement model assessment indicates that the measurement model meets all requirements, the next step is the assessment of the structural model. PLS-SEM does not have a standard goodness-of-fitting statistic (Henseler and Sarstedt, 2013). Instead, the structural model evaluation is carried out to assess the relationship between constructs and endogenous latent variables in respect of variance accounted (Hulland, 1999). Evaluating squared multiple correlations (R2) and path coefficient (β ) values can also determine the explanatory power of the model, where R2 indicates the percentage of a construct’ s variance and the β values indicate the strengths of relationships between constructs (Chin, 1998). Since the estimation of the β is based on OLS regression, the β might be biased if the estimation involves multicollinearity between the studied constructs. Therefore, potential collinearity between predictor constructs in the structural model should be tested (Hair et al., 2014). R² of endogenous variable can be categorized as substantial=0.75, moderate=0.50 and weak=0.25 (Hair et al., 2014). For path co-efficient assessment, β value of all structural paths is compared. Highest β value indicates that effect of the construct is most significant and lowest value shows the lowest effect of construct on endogenous variable. Besides evaluating the magnitude of the R² , the Stone-Geisser’ s Q² value could also be calculated as a criterion of predictive relevance (Geisser, 1974; Stone, 1974).
SmartPLS can generate T-statistics for significance testing of both the inner and outer models, using a bootstrapping procedure. In this procedure, a large number of subsamples (e.g., 5000) are taken from the original samples with replacement to give bootstrap standard errors, which in turn gave approximate T-values for significance testing of the structural path. Next, the strength and significance of the β values are evaluated for the relationships (structural paths) hypothesized between the constructs. The significance assessment builds on bootstrapping standard errors as a basis for calculating t-values for the β . In terms of relevance, β values are standardized on a range from -1 to 1, with coefficients closer to 1 representing strong positive relationships and coefficients closer to -1 indicating strong negative relationships.
2.4.3 Model results
First, the measurement model is evaluated for their reliability and validity (Table 3). Individual indicator reliability is acceptable since all the SSL showed to be over 0.4. The values of the composite reliability for both human activity and climate variability are shown to be larger than 0.6. All of the AVE values are greater than the acceptable threshold of 0.5, so convergent validity is confirmed. The square root of AVE of each latent variable is larger than their correlation (0.458), meaning that discriminant validity is well established. Low VIFs mean that multicollinearity is not an issue.
After the reliability and validity of the construct measures have been confirmed, the next step is to assess the structural model results. The VIF values for the exogenous variables, Climate, Human_A and Interaction, are 1.001, 1.061 and 1.074, respectively, suggesting that the structural model results are not negatively affected by multicollinearity. R² of endogenous equals to 0.655, indicating moderate explanatory power of the model. The β value of all structural paths to AGB is -0.540 for Human_A, -0.453 for Climate, and 0.183 for Interaction, respectively. Further, the Stone-Geisser’ s Q² value using the blindfolding technique (default omission distance 7) is 0.236 (Table 4), proving predictive relevance for the model.
![]() | Table 3 Analysis of model reliability and validity |
![]() | Table 4 Prediction relevance test |
Table 5 shows the bootstrapping test results for outer loading of the observed variables and the path coefficients of the latent variables (125 cases, 5000 samples, no sign changes option). All of the T-statistics of the outer loadings are larger than 1.96 so the outer model loadings are highly significant (P< 0.001). The β values of human activities, climate factors, and their interaction in the inner model are also statistically significant. The test proves that the developed model is effective and valid.
![]() | Table 5 Significant test of structural paths based on bootstrapping |
The coefficients from Pearson’ s correlation analysis revealed that AGB and the square root of AGB (AGBsqrt) were negatively and significantly correlated with all the 5 human and climate variables (P=0.01; Table 6), suggesting that intensive human activities (high grazing intensity and high density of road network and urbanized area) and high climate value (high temperature and high climate dryness) were likely to impose negative effect on the grassland productivity. As compared with AGB, AGBsqrt values were even more significantly and more negatively correlated to the human and climate variables. Furthermore, the indicators (Grazing, Urban_D and Road_D) reflecting the human activities or the indicators (CD_A and Temp_A) reflecting climate factors also presented significant and positive correlations with AGB and AGBsqrt.
![]() | Table 6 Coefficient of Pearson’ s correlation between AGB (AGBsqrt) and the human-climate variables |
To assess the main effects and the interaction effect, we used the indicators representing human activities (Human_A) and climate factors (Climate) in the measurement model, while the latent variables, including three exogenous variables (Human_A, Climate, and Interaction) and the target endogenous variable (AGBsqrt-M), were included in the structure model. The PLS-SEM modeling results provided several key findings regarding the causal relationships between the AGB (AGBsqrt) and the human-climate factors (Fig. 3).
The indicators reflected by either human activities or climate conditions showed positive loadings, ranging from the lowest 0.634 for Grazing to the highest 0.853 for Road_D. The main effects from the human activities and climate factors had negative path coefficients to the endogenous variable (i.e., the modeled AGBsqrt-M) with β = -0.540 and -0.453, respectively. On the contrary, their interaction path coefficient (i.e., Interaction) showed a positive value (β =0.183). This statistic showed that the vegetation was adapting to the human and climate stresses although either higher human activities (Human_A) or climate factors (Climate) alone could result in poor AGB. The interaction term confirmed the complex relationships between the response of AGB and the adaptation of vegetation to human pressure or/and climate pressure. The overall coefficient of determination (R2) for the model reached 0.655, explaining 65.5% of the variance in AGBsqrt from the combination of the main, and interaction of human-climate factors.
![]() | Fig. 3 PLS-SEM results with loadings and path coefficients. AGBsqrt-M means the modeled endogenous variable for AGBsqrt and the symbol [+] represents the pairwise product term between the 2 indicators of Climate and the 3 indicators of Human_A (Henseler and Chin, 2010). |
Model estimation began with an examination of the linear relationship between AGB and the human-climate factors. The VIF values (less than 7.5) indicated that the OLS estimates were not biased from global multicollinearity. The condition number reported from GWR (with “ adaptive” Kernel) suggested a lack of local multicollinearity. The OLS and GWR models were further compared by two evaluation parameters (Adjusted R2 and AICc). The comparison indicated that R2 in GWR had been larger than that in OLS (0.681 vs 0.655), and that AICc values in GWR were smaller than that in OLS (590 vs 595), suggesting that the localized impact from the human-climate factors could be identified by GWR modeling. Figure 4 summarizes the distributions (histograms) of the coefficients and the statistical indices of the GWR model between AGBsqrt and the three latent variables (i.e., Climate, Human_A, and Interaction). The distribution histograms of the coefficients indicated that there existed extensive spatial variations in the coefficients estimated at different locations. The coefficient parameter from Human_A had the highest variation (with standard deviation (SD)=0.56) while that from Climate and Interaction varied less (SD=0.15 and 0.21, respectively). From the overall statistical indices in the GWR model, R2 reached 0.718, indicating that the model accounted for 71.8% of the variance for AGBsqrt. Moran’ s I for residual equaled to 0.052 and showed no significance at the 0.05 level (Z-score=0.093 and P=0.925), suggesting that the residuals were spatially random.
The results in Figure 5 show the contour maps of the factor scores (Figs. 5a-c) and the corresponding regression coefficients by the GWR modeling (Figs. 5d-f). The climate factors (Climate) increased along the direction from the northeast to the southwest, displaying the extremely dry condition combined with relatively high temperature in the southwestern part of IMAR. The human activities (Human_A) showed higher intensities in the middle part where more cities and more populations were located and decreased both eastward and westward. The interaction score decreased along the gradient from the northwest to the southeast. The main effects from the human-climate factors and their interaction effect driving grassland degradation varied greatly across the study area as evidenced from the spatial distributions of GWR coefficients. In the western and northeastern parts, grassland productivity had the most sensitive and negative response to Climate; the southeastern part of the study area observed less sensitive response to Climate. The most sensitive region of vegetation productivity to Human_A was in the western and southeastern areas (with most negative coefficient value) while the least sensitive area was located in the northeast where both Human_A and Climate had relatively low values. Moreover, the interaction coefficient showed spatial variation with the most prominent interaction effect observed in the western and southeastern parts where relatively fewer human settlements were found. This finding may suggest that the vegetation might have adapted to human and climate conditions to some extent (see Section 4.2 for detailed discussion).
![]() | Fig. 4 Coefficient histograms and statistical indices from geographically-weighted regression (GWR) modeling |
A basic framework for applying PLS-SEM model in assessing the cause-effect relationships between the human-climate factors and the grassland productivity was presented. PLS-SEM combines features of multivariate statistical methods such as principal component analysis with multiple regression using iterative regression processes and bootstrapping to maximize the explained variance of the endogenous latent factors. It is thus able to overcome limitations of the first generation analysis tools (Haenlein and Kaplan, 2004; Abdi, 2010). When building such models, the general steps involve conceptualization of potential contribution factors (latent variables), creation and testing of the measurement model (the selection of indicators and the calculation of indicators’ loadings), and fitting and testing of the structure model (path coefficients between the exogenous and endogenous variables). Depending on actual scenarios, the selected factors and the structure of the models could be different. Assisted by the available software package such as SmartPLS, those steps can be readily achieved.
Methodologically, this study has demonstrated the usefulness of using PLS-SEM model to analyze the cause-effect relationship for the grassland ecosystem. At least two advantages were verified in this study. First, PLS-SEM model is useful for analyzing non-normal, multidimensional, and multi-collinear dataset (Henseler et al., 2016; Sarstedt et al., 2016). When a dataset with variables shows high skewness and kurtosis (e.g., the road density in Table 1), PLS-SEM is a more appropriate tool than CB-SEM to test the hypothesis for the multivariate relationships (Hair et al., 2014). Normalization of input indicators is unnecessary as shown by prior researches (e.g., Cassel et al., 1999; Reinartz et al., 2009). However, normalized input could increase the explanation power of the established models and thus researchers should nevertheless consider data distribution since highly skewed data inflate errors and reduce statistical power (Chernick, 2008). In the current study, we used AGBsqrt to get better explanation for its variance from the human-climate factors and their interaction. Second, PLS-SEM allows for a flexible handling of more advanced model elements such as moderator variables or hierarchical component models (Henseler and Chin, 2010; Becker et al., 2012), and it is thus useful to study the interaction effect between multiple factors. The influence of the human-climate factors on AGB was revealed from both the main effects and the interaction effect.
Particular attention should be paid to modeling the interaction effect of multiple factors in PLS-SEM. Henseler and Chin (2010) summarized the four common ways, including the product indicator approach (PIA), 2-stage approach, hybrid approach, and orthogonalizing approach, and they also compared their pros and cons. The PIA, which was first introduced by Busemeyer and Jones (1983) and Kenny and Judd (1984), was adopted in this study through building the product terms with the indicators of the latent independent variable (Climate) and the latent moderator variable (Human_A). Compared to other methods, the PIA and the orthogonalizing approach can provide more accurate predictions (Henseler and Chin, 2010). However, readers should refer to the advantages and limitations of each approach and select the suitable one to better reveal the interaction effect under different contexts. Furthermore, the sample size is also a critical factor. It was recommended that if the sample size is medium to large (which is the case in the current study), PIA should be used (Henseler and Chin, 2010).
Previous studies have confirmed that grassland vegetation productivity could be modified by climate variations and human activities (Li et al., 2013; Gang et al., 2014). The relationship between vegetation productivity and the influencing factors may be complex and nonlinear (Tanţ ă u et al., 2014; Liu et al., 2015). Some studies have been conducted to investigate the impacts of climate variations or human activities on vegetation productivity (Wang et al., 2012; Li et al., 2013), but the quantitative assessment of the interaction effect (between human activities and climate factors) is not yet convincingly reported. PLS-SEM provides a promising tool to evaluate both the main effects (from human and climate stresses) and the interaction effect (between human activities and climate factors) on grassland productivity. In the case study, the indicators selected for human activities included the grazing intensity, urbanization intensity and road network intensity while the annually average temperature and climate dryness were selected for the climate factors. The model testing statistics showed that all the indicators were reliable and valid and that the path coefficients were significant. The main effects contributed significantly but negatively to AGBsqrt, suggesting that intensive grazing, dense road network and urban development, or/and high temperature and dry climate in arid regions could lead to poor vegetation productivity, being consistent with the previous studies (Li et al., 2013). More importantly, the interaction effect imposed a significant and positive effect on AGB, revealing that the impact from the climate factors could be mediated by the human activities or vice versa. There are a few possible explanations for such an interaction phenomenon, though further investigation may be required to confirm the explanations. First, the negative effect on vegetation growth could be partially moderated due to vegetation adaptation to intensified human and climate stresses, as vegetation adaptation to human and natural causes has long been reported (Ringrose et al., 2002). Second, recent vegetation restoration programs, e.g., the fragile area protection (protecting vegetation by fencing) introduced mostly in vegetation degraded regions (co-located with intensive human activities) by the Chinese government helped vegetation recovery (Mu et al., 2013). Third, vegetation protection is an important way for human adaptation to climate conditions (O’ Brien et al., 2004) and might have also help to alleviate the negative impact on AGB from high human and climate pressure.
Although PLS-SEM modeling results show that the main effects (from human and climate stresses) imposed a negative impact on grassland productivity whereas the interaction effect (between human activities and climate factors) showed a positive effect, the heterogeneous spatial distribution of the regression coefficients by the GWR modeling suggested that the sensitivity of AGB response to those factors was location-specific. With the recent development of the economy, urban expansion, transportation development, and grazing might have resulted in a significant impact on grassland vegetation (Li et al., 2016). This significant impact was reflected by the factor score of human activities that was higher in the middle part where most key cities (including the capital city Hohhot) were located. Furthermore, the spatial distribution of the GWR coefficients suggest that human activities had substantial but local impact on vegetation productivity, as the coefficient of Human_A showed highest deviation (Figs. 4b and 5e). For example, in the western IMAR, the vegetation showed the most fragile (highest regression coefficient) in response to the pressure from the human activities. Conversely, the climate factors modulated the vegetation productivity at a more uniform or global scale, as indicated by the lower variation of the regression coefficient (Figs. 4a and 5d). It deserves to point out that, from the comparison of the distribution of the climate factors and human activities (Figs. 5a and b), it is likely that humans intended to reduce their activities in the climatically stressed areas. That is, human may adjust its behaviors under unfavorable climate and this adjustment is known as human adaptation (Palmer and Smith, 2014). The GWR modeling also unveiled a spatially varying interaction effect between the human activities and climate factors.
This work demonstrated the usefulness of applying PLS-SEM and GWR to evaluating the impact of the human-climate factors on the grassland vegetation productivity in IMAR. The combination of the PLS-SEM and GWR modeling allowed us to explore both the main and interaction effects from the human-climate factors and to understand the spatial variance in the sensitivity of the AGB response to those factors. On-site field data along with other ancillary dataset collected for the typical grassland provided an excellent case study. The framework for developing, testing and evaluating the PLS-SEM model for IMAR was presented. Though only limited variables were taken in the case study, the results from the PLS-SEM did indicate that intensive human activities (including intense grazing, dense road network, and vicinity to urbanized areas) or/and unfavorable climate conditions (i.e., high annual temperature and less water availability) were contributing factors to low AGB. More importantly, the result revealed that the impact on vegetation productivity could be partially alleviated through the interaction effect (between the human activities and climate factors). In addition, the spatially varied sensitivity of the AGB response to the human-climate factors identified by the GWR modeling provided further evidence of the complicated cause-effective relationships existing in the ecosystem. Those findings can serve as critical reference for grassland practitioners to make informed decisions on grassland management.
Acknowledgements
This study was supported by the National Natural Science Foundation of China (41371371) and the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA05050402). We also thank the anonymous reviewers and editors for their constructive comments and suggestions.
The authors have declared that no competing interests exist.
1 |
|
2 |
|
3 |
|
4 |
|
5 |
|
6 |
|
7 |
|
8 |
|
9 |
|
10 |
|
11 |
|
12 |
|
13 |
|
14 |
|
15 |
|
16 |
|
17 |
|
18 |
|
19 |
|
20 |
|
21 |
|
22 |
|
23 |
|
24 |
|
25 |
|
26 |
|
27 |
|
28 |
|
29 |
|
30 |
|
31 |
|
32 |
|
33 |
|
34 |
|
35 |
|
36 |
|
37 |
|
38 |
|
39 |
|
40 |
|
41 |
|
42 |
|
43 |
|
44 |
|
45 |
|
46 |
|
47 |
|
48 |
|
49 |
|
50 |
|
51 |
|
52 |
|
53 |
|
54 |
|
55 |
|
56 |
|
57 |
|
58 |
|
59 |
|
60 |
|
61 |
|
62 |
|
63 |
|
64 |
|
65 |
|
66 |
|
67 |
|
68 |
|
69 |
|
70 |
|
71 |
|
72 |
|