Skip to main content

Using fuzzy logic to compare species distribution models developed on the basis of expert knowledge and sampling records

Expert knowledge versus sampling in species distribution modelling

Abstract

Background

Experts use knowledge to infer the distribution of species based on fuzzy logical assumptions about the relationship between species and the environment. Thus, expert knowledge is amenable to fuzzy logic modelling, which give to propositions a continuous truth value between 0 and 1. In species distribution modelling, fuzzy logic may also be used to model, from a number of records, the degree to which conditions are favourable to the occurrence of a species. Therefore, fuzzy logic operations can be used to compare and combine models based on expert knowledge and species records. Here, we applied fuzzy logic modelling to the distribution of amphibians in Uruguay as inferred from expert knowledge and from observed records to infer favourable locations, with favourability being the commensurable unit for both kinds of data sources. We compared the results for threatened species, species considered by experts to be ubiquitous, and non-threatened, non-ubiquitous species. We calculated the fuzzy intersection of models based on both knowledge sources to obtain a unified prediction of favourable locations.

Results

Models based on expert knowledge involved a larger number of variables and were less affected by sampling bias. Models based on experts had the same overprediction rate for the three types of species, whereas models based on species records had a lower prediction rate for ubiquitous species. Models based on expert knowledge performed equally as well or better than corresponding models based on species records for threatened species, even when they had to discriminate and classify the same set of records used to build the models based on species records. For threatened species, expert models predicted more restrictive favourable territories than those predicted based on records. Observed records generated the best-fitted models for non-threatened non-ubiquitous species, and ubiquitous species.

Conclusions

Fuzzy modelling permitted the objective comparison of the potential of expert knowledge and incomplete distribution records to infer the territories favourable for different species. Distribution of threatened species was able to be better explained by subjective expert knowledge, while for generalist species models based on observed data were more accurate. These results have implications for the correct use of expert knowledge in conservation planning.

Background

The status of species presence in a territory is usually subject to doubt [1]. This is mainly due to poor conducted sampling and to the dynamic nature of species distributions [2]. Imprecise knowledge of ranges occupied by species can lead to erroneous inferences about the species-environment relationship [3], which may have a long-term impact on biodiversity conservation [4]. However, at least part of the response of species to environmental conditions can be analysed based on the locations where they have been recorded; such results may then be used to infer the possible distribution of species in other locations with similar conditions [5, 6].

Species distribution modelling has been used to objectively evaluate the potential of areas to be occupied by species [5, 7]. This potential is typically established according to the species-environment relationship, which can be inferred from known species occurrence data [2, 5, 8]. Experts on a species in a particular territory also tend to use their expertise to subjectively infer the wider distribution of species in a territory, typically at the country or regional level. Experts’ experience and knowledge of the biology of species are used to define and review the distribution of a species, thus improving the available information [9]. Expert-derived distribution maps may be compared and combined with distribution models based on recorded data to improve model outputs, although procedures through which to do this have only recently begun to be explored [9,10,11].

In Uruguay, recorded species distribution data, although generally incomplete, is mainly available in scientific collections [12, 13], scientific publications [14,15,16,17,18,19], in the Global Biodiversity Information Facility [20], or in the Biodiversidata database [13, 21]. Amphibians are one of the taxa that remain poorly characterized in Uruguay [18] and are one of the most threatened groups on the planet [22, 23]. It is challenging to perform effective sampling of amphibians because of their two-phase life cycle, both phases of which are affected to varying degrees by local environmental fluctuations. Many of these species also have a crepuscular lifestyle, have cryptic colour patterns, live underground, or can only be found during specific seasons or under particular weather conditions [24]. Maneyro & Carreira [17] published a field guide to Uruguayan amphibians in which they presented the areas they considered to be occupied by each species according to their expert knowledge. These expert-defined areas are implicitly derived from fuzzy hypotheses about the relationship between species and the environment inferred from direct observation of species in their habitats. Thus, fuzzy logic may be appropriate for formally analysing these inferences.

Fuzzy logic was introduced by Zadeh [25] as a many-valued logic in which propositions have a truth value that varies in degree from 0 (completely false) to 1 (completely true). This logic is particularly useful for formally analysing the kinds of messages involved in human language and thinking [26]. Consequently, fuzzy logic may be used to make experts' assumptions more formally explicit, which could help to more objectively define environmental conditions that are considered as favourable for a species according to expert knowledge. Fuzzy logic may also be used to objectively model on the basis of an incomplete number of records, the degree to which conditions are favourable for the presence of a species in a territory in which the species has not yet been found [5, 27]. Specifically, favourability is a mathematical function that estimates, based on the probability of occurrence and species prevalence, the degree to which environmental conditions are favourable for a species’ occurrence in an area. With these fuzzy favourability values, it is possible to perform fuzzy logic operations that can be used account for the uncertainty inherent in nature, in which conditions are only ever partially favourable for the occurrence of a species, into species distribution models [6, 8, 28].

The application of fuzzy logic modelling to experts’ inferences and to occurrence records places both kinds of data within the same conceptual framework. It also yields commensurable degrees of affiliation with a fuzzy set of locations with favourable conditions for a species according to the two datasets. In addition, fuzzy logic operations may be used to compare and combine both kinds of models once they are expressed in commensurable favourability units.

The aim of this study was to test the capability of fuzzy logic tools to compare and combine the distribution of species inferred from expert criteria and from incomplete records of species distribution. This may have implications for the better use of expert knowledge in conservation planning and for identifying the most favourable conditions and territories for species.

Materials and methods

Study area and species distribution data

This study was carried out in Uruguay, which is located between 30° to 35° south latitude and 53° to 55° west longitude, lies entirely within the temperate zone and, according to Olson et al. [29], is included in the Uruguayan savanna ecoregion. It has low orography, with average altitudes not exceeding 514 m above sea level, and a wide coastline on the Atlantic Ocean (more than 150 km; see average altitude map in Fig. 1). The proximity to the sea regulates the average annual temperature (approximately 17.5 °C) and precipitation varies annually between 1000 and 1700 mm [30] (see annual precipitation map in Fig. 1). Uruguay is located at a biogeographic crossroads created by the influence of various biotas [31].

Fig. 1
figure 1

10-km × 10-km grid cells of Uruguay and situation of Uruguay within South America. As examples, the display of the values of two explanatory variables on this grid is shown

The country has high amphibian species richness, with 48 native species (one gymnophion, 47 anurans), and an invasive exotic anuran (Lithobates catesbeianus). As in other regions of the world, amphibians are in decline [22, 23], with 12 species under some category of threat [18] (Table S1 in Additional file 1) and seven considered as almost threatened, not evaluated, or with insufficient data [18, 32]. Recently, amphibian species nomenclature was revised and updated (see [33] for details).

To compare species records and maps derived from expert knowledge, an operational scale of resolution capable of reconciling the different precisions of species records and expert distribution maps must be used. The spatial precision of expert maps is usually unknown [34, 35], but generally considered to be broad, in the range of 100–200 km [36]. Species distribution records have a higher precision, but a common intermediate precision scale to assess the two types of data in a commensurate had to be defined. We used a grid divided Uruguay country into 10 km × 10 km cells (total: 1887 grid cells; Fig. 1). This is in line with methods used in biogeography to identify the best grid for biogeographical studies [37]. We verified the recorded or expert-inferred presence of each amphibian species in each grid cell. To do this, we reviewed two types of information sources: distribution data recorded in the databases of scientific collections, and expert criteria, understood as the distribution maps derived from expert knowledge of the species' natural history. Regarding the recorded distribution, information on the geo-referenced occurrences of the species was extracted from the amphibian collection database of the Faculty of Sciences of the University of the Republic of Uruguay [21] and from the work of Núñez et al. [12]. Thus, grid cells with at least one record were considered as indicating species presence and the rest as absence. Distribution according to expert criteria were taken from the guide to amphibians in Uruguay [17]. In this manner, we considered the grid cells totally or partially contained in the range established by the experts for each species as grid cells with presence of the species. The remaining grid cells were treated as absences. We divided species into three categories to compare how models based on species records or expert knowledge performed in each category: threatened species (hereafter T), species considered by experts to be present throughout the study area (ubiquitous species, hereafter U), and non-threatened, non-ubiquitous species (hereafter NtNu).

Environmental predictor variables

For predictors, we used a set of 44 variables linked to spatial location (combination of longitude and latitude) and to the following eight environmental factors: topography (four variables), climate (22 variables), vegetation (one variable), geography (one variable), hydrology (two variables), land cover (five variables), lithology (four variables) and human activity (four variables). These are the factors that, according to theory, more likely affect the species analysed at the spatial resolution of our study [38]. Table 1 shows the details of variables and their sources. Presence/absence of each amphibian species and the values of the predictors variables were extracted for each 10-km × 10-km grid cell and stored in a geo-referenced database implemented in a Geographical Information System using QGIS software [39].

Table 1 Explanatory variables analysed during the modelling process and their source. Variable names (second and forth columns), their codes (first and third columns) and factors grouping them (inserted sections). The resolution of the variables when coming from a raster are indicated in brackets next to the factor in which are grouped, in italics. The codes of the variables selected after the multicollinearity evaluation are shown in bold

Regarding the spatial structure of the distribution data, a polynomial trend-surface analysis [40, 41] was applied using the longitude (Lo), latitude (La), the quadratic and cubic effect of both, and a combination of the above (Lo; La; Lo2; Lo3; La2; La3; LaLo; La2Lo; LaLo2). The spatial variable included in the models for each species was a linear multifactorial combination (hereafter spatial logit) resulting from a linear logistic regression of the species presence/absence on the combination of these nine spatial terms of latitude and longitude. This spatial analysis determined whether the distribution of a species responded to spatial geographic trends related to historical events or more recent population dynamics, rather than to the environmental conditions of the habitat they occupy [41, 42]. For the other variables, we avoided the use of Principal Component Analysis or quadratic terms to keep the models simple and facilitate comparison with models based on expert knowledge, as such models are usually based on a simpler relationship between the species distribution and environmental conditions.

Favourability models

To avoid excessive multicollinearity in the models, we tested pair-wise correlations between non-spatial variables using the Spearman coefficient r. When two variables had a correlation value of r > 0.75, we only retained the variable that the expert considered more relevant regarding amphibians, as this option facilitated the comparison of the explanatory capacity of models based on expert knowledge and on species records. This procedure reduced the number of potential predictor variables to 30 (Table 1). For each species, we then performed bivariate logistic regression models of the species distribution with each of these variables to obtain a yet smaller subset of significant predictors, according to the Rao’s score test, for each species. If no predictor had a significant relationship with the distribution of the species, then no model was produced for the species and source of data.

To minimize the risk of Type I errors in the modelling process due to the number of predictors, we also controlled for the false discovery rate (FDR) [43] for each species, retaining only those predictor variables that were significant under an FDR of q < 0.05.

Then, we performed an ensemble multivariate forward–backward stepwise logistic regression of presences/absences of each species on their corresponding spatial variable and reduced set of environmental variables to obtain probability values (P) of the species’ presence in every grid cell according to the formula:

$${\text{P}} = {\text{e}}^{{\text{y}}} /\left( {{1} + {\text{ e}}^{{\text{y}}} } \right),$$

with e being the base of the natural logarithms and y the logit function

$${\text{y}} = \alpha + \beta_{{1}} {\text{x}}_{{1}} + \beta_{{2}} {\text{x}}_{{2}} + \beta_{{3}} {\text{x}}_{{3}} + \cdots + \beta_{{\text{n}}} {\text{x}}_{{\text{n}}}$$

where, α is a constant and β1, β2,…, βn are the coefficients of the n environmental predictor variables x1, x2,…, xn (see details in Acevedo & Real [5] and Real et al. [27]). Variables were included according to the importance of their relationship with species distribution, and redundancy was avoided by verifying at each step that each new variable added significant new information.

Subsequently, favourability values were calculated from the probability values obtained by logistic regression according to the formula:

$${\text{F}} = \left[ {{\text{P}}/\left( {{1} - {\text{P}}} \right)} \right]/\left[ {\left( {{\text{n1}}/{\text{n}}0} \right) + ({\text{P}}/} \right[{1} - {\text{P}}\left] ) \right]$$

where F is the environmental favourability (between 0 and 1), P is the probability of occurrence given by the multivariate logistic regression, n1 is the number of presences, and n0 is the number of absences [27]. Thus, when the number of presences and absences is the same, F = P, whereas when the number of absences is greater than the number of presences, F > P, and when the number of absences is lower than presences F < P (which is rarely the case). The favourability value F = 0.5 corresponds to a probability value equal to the prevalence of the species in the sample, i.e., the value expected under neutral conditions over the entire area of study. Thus, the favourability values represent the contribution of the response of the species to the local environmental conditions and of its particular history and population dynamics to the probability of occurrence. It should be noted that local probability depends on local favourability and on the overall prevalence of species in the dataset. Therefore, favourability is a commensurate unit that can be used to compare and combine models of species that differ in their prevalence [44, 45].

A favourability function can be considered to be a membership function of each locality in the fuzzy set of localities favourable for the occurrence of species. This matches the definition of the membership function typical of fuzzy sets, which is the basis of fuzzy logic [25]. This logic asserts that the membership of any element in a set is neither completely false nor completely true; instead, each element is assigned a real number from 0 to 1 representing the degree of membership in the fuzzy set. After applying the favourability function, each grid cell corresponds to a degree of membership, ranging from 0 to 1, to the set of favourable localities for the presence of each amphibian species. For each grid cell, the favourability function can identify dark biodiversity or the degree of favourability for non-observed species [6]. It can also combine favourability models using fuzzy logic operations [44,45,46,47].

The above-mentioned procedure was applied to the presence/absence datasets derived from the species ranges proposed by experts and to those derived from recorded distributions. In this manner, favourability values were obtained based on expert knowledge (FE) and recorded distributions (FR).

All statistical analyses were conducted using the IBM SPSS Statistics V25 software. Finally, a cartographic favourability model was generated for each amphibian species according to both sources of data using ArcMap 10.8 software.

Model evaluation

We used the area under the curve (AUC) of the receiving operating characteristic (ROC) [48, 49] to evaluate the ability of each favourability model, either based on species records or on expert knowledge, to discriminate grid cells with at least one record from those with no record of the species. We evaluated the ability of each model to classify this dataset of recorded presences and absences based on a favourability threshold of 0.5 using sensitivity, specificity, under-prediction rate (UPR), over-prediction rate (OPR) [50] and correct classification rate (CCR) [51]. We also compared the discrimination and classification ability of the models for T species (N = 12), NtNu (N = 22), and U (N = 14), using the Kruskal–Wallis Test to separately test models based on species records and on expert knowledge. We also compared the number of variables and factors involved in the models, as well as the discrimination and classification ability of models based on species records and expert criteria using the Wilcoxon signed-rank test for matched samples for the three groups of species separately.

Favourability values (F) represent the degree of membership in the fuzzy set of grid cells favourable for the occurrence of the species. Consequently, we obtained for each species a fuzzy set of grid cells favourable for the occurrence of the species according to expert criteria (E) with favourability values FE, and another set according to the direct modelling of the recorded distribution data (R) with favourability values FR. For each of these fuzzy sets, we computed cardinal, card (E) and card (R), which is the sum of all favourability values representing the size of the fuzzy set. We then computed the fuzzy entropy of each fuzzy set [26] as follows:

$$\begin{gathered} {\text{S}}\left( {\text{E}} \right) \, = {\text{ card }}({\text{E}} \cap {\text{E}}^\prime /{\text{card }}({\text{E}} \cup {\text{E}}^\prime ) \hfill \\ {\text{S}}\left( {\text{R}} \right) \, = {\text{ card }}\left( {{\text{R}} \cap {\text{R}}^\prime } \right)/{\text{card }}({\text{R}} \cup {\text{R}}^\prime ) \hfill \\ \end{gathered}$$

where S(E) is the entropy of E, S(R) is the entropy of R, and E′ and R′ are the complementary values of E and R, respectively, whose membership values in each grid cell are 1-FE and 1-FR, respectively. Fuzzy entropy is a value between 0 (minimum entropy) and 1 (maximum entropy) that represents the uncertainty associated with the species distribution. Using the Wilcoxon signed-rank test for matched samples, we compared the fuzzy entropy of models based on species records and expert criteria for the three groups of species separately.

Combining fuzzy sets derived from records and expert criteria using fuzzy logic

These two fuzzy sets were combined using fuzzy logic tools [25]. Then, for each species, we calculated in each grid cell the degree of membership in the fuzzy union E  R (maximum of FE or FR) and the fuzzy intersection E ∩ R (minimum of FE and FR) of the two fuzzy sets as the consensus values of both models. E  R represents the degree to which a location is considered favourable according to either expert criteria or the direct modelling of the records, whereas E ∩ R represents the degree to which a location is considered favourable using both expert criteria and direct modelling of the records. E  R and E ∩ R were also fuzzy sets and we represented them cartographically and as computed card (E  R) and card (E ∩ R).

We also obtained the fuzzy overlap values between E and R, which were values between 0 and 1 computed as:

$${\text{O }}\left( {{\text{E}},{\text{R}}} \right) \, = {\text{ card }}\left( {{\text{E}} \cap {\text{R}}} \right)/{\text{card }}({\text{E}} \cup {\text{R}})$$

To assess how much information R held about E, we used the fuzzy inclusion of E into R according to the equation provided by [52]:

$${\text{I }}\left( {{\text{E}},{\text{ R}}} \right) \, = {\text{ card }}\left( {{\text{E}} \cap {\text{R}}} \right)|/{\text{card }}\left( {\text{E}} \right)$$

which is a value between 1 and 0 that indicates the extent to which the set E is included in the set R. In this way, we computed I (R, E).

In all methodological procedures, we followed the recommendations of Sillero et al. [53].

Results

After accounting for multicollinearity among the original variables and controlling for FDR, a total of thirty predictor variables belonging to the nine explanatory factors remained available for building the models. We did not obtain significant favourability models for species with ubiquitous distributions using expert inference, as the experts considered that these distributions covered all of Uruguay. We obtained significant favourability models for the rest of the included species using both expert criteria and recorded distributions (see the geographical representation of the models in maps of Figure S3 in Additional file 3).

Explanatory factor and variables in the models

The thirty variables available to build the models, and the nine consequent factors, were included in at least one significant distribution model. Relevant variables differed according to the two sources of information analysed. The expert-criteria-based models were significantly more complex, as they included between one and 16 variables, with a mean of 5.1 and 8.8 variables per model for threatened and NtNu species, respectively, whereas the models based on recorded distributions included between one and eight variables, with a mean of 1.9 and 2.7 variables per model for T and NtNu species, respectively (Wilcoxon signed-rank test significance was p < 0.05 for T species and p < 0.001 for NtNu species), and 5.8 variables per model for U species. Consequently, expert-criteria-based models included more explanatory factors than record-based models.

According to the expert criteria, the three most representative factors in the models were climatic (85% of models), topographic (74% of models) and spatial (62% of models), whereas according to the recorded distribution, the most representative factors were spatial (54% of models), land use (54% of models) and climatic (52% of models).

In models based on the recorded distributions, distances to roads and urban centres were included 24 times, and only once with a positive value, whereas in models based on expert criteria, these variables were included 20 times, 10 which had positive values (χ2 = 9.147, p < 0.01).

Discrimination capacity of the models

In general, the models were able to discriminate presences from absences (see mean values of AUC in Table 2). Using both sources of data, all models of the 12 threatened species showed outstanding discrimination according to Hosmer & Lemeshow [48] (AUC > 0.9). Wilcoxon signed-rank test results showed no significant differences between them (Table 3), as both sources of data performed equally well. However, some outlier models had lower discrimination in models based on species records, meaning that models based on expert knowledge discriminated equally as well as or better than corresponding models based on species records for T species (Figure S1 in Additional file 2). In fact, the Kruskal–Wallis test showed that three significantly different levels of discrimination ability could be distinguished for models based on expert knowledge, while only two levels could be distinguished from models based on species records (one for ubiquitous species and one for the rest of species) (see Table 4, and Figure S1 in Additional file 2).

Table 2 Number of species (N), average discrimination (AUC or Area Under the Curve), and classification values (Sensitivity, Specificity, CCR or Correct Classification Rate, Und or Under Prediction Rate and Ove or Over Prediction Rate) for all species, and separately for Threatened species (T), non-threatened non-ubiquitous species (NtNu), and ubiquitous species (U) of the models based on expert criteria and on species records
Table 3 Results of the Wilcoxon signed-rank test for matched samples comparing the discrimination (AUC: Area Under the Curve) and classification (Sensitivity, Specificity, CCR or Correct Classification Rate, Und or Under Prediction Rate and Ove or Over Prediction Rate) performance of models based on expert knowledge and models based on species records, for the three groups of species: threatened (T), non-endangered and non-ubiquitous (NtNu), and ubiquitous (U) species separately. St. = Standardized Wilcoxon signed-rank statistic. Sig. = Significance. Negative values of the statistics mean that models based on species records had higher values than those based on expert knowledge
Table 4 Results of the Kruskal–Wallis test comparing the discrimination and classification abilities for threatened species (T), non-threatened and non-ubiquitous species (NtNu), and ubiquitous species (U), as well as for all groups together, of the models based on species records and on expert knowledge. AUC: Area Under the ROC Curve. Sens: Sensitivity. Spe: Specificity. CCR: Correct Classification Rate. Und: Underprediction Rate. Ove: Overprediction Rate

For the 22 NtNu species, 14 models based on species records showed outstanding discrimination capacity, with another five models showing excellent discrimination (0.8 < AUC < 0.9), while for models based on expert knowledge, only eight models demonstrated outstanding discrimination and six others excellent discrimination. This indicates that models based on species records outperformed those based on expert knowledge (Wilcoxon signed-rank test significance p < 0.001).

Experts did not discriminate between the 14 ubiquitous species (AUC = 0.5), precisely because these species were considered by them to be present throughout Uruguay. However, 10 models based on species records for these species showed acceptable discrimination according to Hosmer & Lemeshow [48] (0.7 < AUC < 0.8) and one showed excellent discrimination (Wilcoxon signed-rank test significance p < 0.001, i.e.); this indicates that models based on species records outperformed those based on expert knowledge).

Classification capacity of the models

Models, whether based on species records or expert knowledge, had three significantly different levels of classification ability, as measured with the CCR, and performed best for threatened species and worst for ubiquitous species (Table 4, and Figure S1 in Additional file 2). The CCRs of models based on species records were significantly higher than those of models based on expert knowledge for all but T species (Table 3). In fact, for T species, the mean CCR of models based on expert knowledge had a higher value, although not significantly higher, than that of models based on species records, even when there was a low outlier for a species (Tables 2 and 3, and Figure S1 in Additional file 2).

Sensitivity values were generally high for all models (Table 2), but were highest for expert knowledge-based models in all three groups of species, with these differences being significant only for U species (Table 3). However, while this higher sensitivity was obtained at the expense of a higher overprediction rate and a lower specificity for NtNu species, for T species, the higher sensitivity of expert knowledge-based models was obtained with (non-significantly) lower overprediction and higher specificity. Models based on species records had the same sensitivity level for T and NtNu species, with a lower level of sensitivity for U species (Table 4, and Figure S1 in Additional file 2). However, models based on expert inferences showed the same level of sensitivity for T and U species, which is notable given that sensitivity for U species was always 1, with a lower level of sensitivity for the other species (Table 4, and Figure S1 in Additional file 2).

Mean specificity values were always lower than those of corresponding mean sensitivity (Table 2). Additionally, specificity values were significantly higher in record-based models than in expert-based models, except for threatened species (Table 3). Both types of models had three significantly different levels of specificity, being best for threatened species and worst for ubiquitous species (Table 4, and Figure S1 in Additional file 2).

Record-based models underpredicted equally for T and NtNu species, but significantly worse for U species, although always with low underprediction values. However, these models overpredicted less for U species than for the other two groups, which performed equally in this regard. Expert-criteria-based models demonstrated no underprediction rate for ubiquitous species (experts did not predict absence in any location) and underpredicted significantly better for T species than for NtNu species. Models based on expert criteria displayed the same overprediction rate for the three groups of species (Table 4, and Figure S1 in Additional file 2).

Entropy of the models

The results of the Wilcoxon signed-rank test for matched samples showed that the mean entropy values were significantly lower in models based on expert criteria for threatened species (0.009 vs 0.091, p < 0.01) and for NtNu species (0.035 vs 0.19, p < 0.001). The distribution models of the NtNu species had higher entropy values than those of T species with either the species records or the expert criteria. The models produced using species records for U species had the highest entropy values (0.518); individual entropy values can be seen in the maps of Figure S3 in Additional file 3.

Combination of the fuzzy sets derived from the models based on records and expert criteria

The results of combining the fuzzy set E and R derived from the models based on expert criteria and on recorded distribution, respectively, using their fuzzy union and intersection appear in maps by species of Figure S3 in Additional file 3. Figure 2 shows these results using three species as an example: Dendrosophus nanus as a T species, Scinax uruguayus as an NtNu species and Scinax squalirostris as a U species.

Fig. 2
figure 2

General scheme of the procedure used in three species as examples: Dendropsophus nanus (upper part) as a threatened species, Scinax uruguayus (middle part) as a non-threatened non-ubiquitous species, and Scinax squalirostris (lower part) as a ubiquitous species. From left to right: grid cells with presence according to both sources of information (upper maps: according to expert-criteria; lower maps: according to species records); the favourability models based on expert knowledge (E); the favourability models based on species records (R); their fuzzy union (E  R) and their fuzzy intersection (E ∩ R)

Mean overlap values between fuzzy sets E and R were similar for the three groups of species: 0.34 for T species, 0.37 for NtNu species and 0.42 for U species. For 17% of species, the spatial overlap between both alternative models was higher than 50%. Individual overlap value can be seen in the maps of Figure S3 in Additional file 3.

For threatened species, the mean inclusion of the fuzzy set E in the fuzzy set R was 0.66, whereas the mean inclusion of R in E was 0.40. This suggests that expected distribution according to recorded data was larger than that expected according to expert knowledge. However, for NtNu species, the mean inclusion of E in R was 0.51, whereas the mean inclusion of R into E was 0.67, meaning that, for these species, the distribution expected based on expert knowledge was larger. For U species the mean inclusion of E into R was 0.42, whereas the mean inclusion of R into E was 1, which was expected given that the distribution expected according to expert criteria included all of Uruguay, according to the experts.

Discussion

When studying the distribution of species, researchers attempt not only to describe, but also to understand and explain the distribution patterns of species. Classical mathematics can be used to this end through the application of statistical methods and precise mathematical models in the study of species distributions [53]. Methods such as regression analysis, cluster analysis and probability theory, among others, can be used to analyse species presence and absence data, assess distribution patterns, and predict future distribution based on environmental and geographic variables. The inclusion of fuzzy logic, which is derived from fuzzy set theory, provides species distribution models with a malleability particularly useful for dealing with the intrinsic uncertainty and vagueness of species distribution data [25, 26]. This is the case because information on the presence or absence of species can often be ambiguous due to the lack of precise data or natural variability in the distribution of the species. [3, 6]. Fuzzy logic allows this kind of ambiguity to be addressed by working with partial truth values (about the species being present at a locality) or degrees of membership in a category (in a species’ range) [5, 25].

Human rationale, when used in the form of expert opinions on distribution data, for example, often yields uncertainty and incomplete information. As a result, it is common to work using degrees of confidence with the information provided. In this manner, although expert knowledge is grounded in empirical observations of species distribution, including both presences and absences recorded in field samples, experts also implicitly use informal fuzzy logic when constructing or completing their mental model of a species’ occupied territory [25, 26]. Fuzzy logic allows formal handling of this uncertainty in human thinking, thus reflecting and making more explicit the lack of certainty in our statements and beliefs. Consequently, fuzzy logic allows work with commensurable categories of the uncertainty and vagueness inherent in the incomplete nature of species distribution data (species records) and in human thinking and beliefs (expert knowledge). Our work suggests that the subjective representation of species distribution by experts may reflect, perhaps unconsciously, perceived relationships between observed occurrences of species and environmental conditions, and that these relationships may be objectively assessed using the formal analysis of fuzzy logic.

Our approach compared models based on both expert knowledge and species records by evaluating their capacity to discriminate and correctly classify species records; this differs from other previous approaches, which have evaluated the correctness of expert-defined ranges on the basis of their similarity with ranges derived from species distribution modelling [34]. The latter approach precludes the possibility that a model based on expert opinion can be better than one produced by mathematical modelling, whereas this is one of the main results in this work.

General assessments of the models

Although both kinds of models performed well, discriminating and classifying the amphibian occurrences in Uruguay in an acceptable manner, models based on expert knowledge discriminated equally as well as or even better than corresponding models based on species records for threatened species [12, 16,17,18]. This finding is particularly remarkable because models based on expert knowledge had to discriminate the same set of records that were used to build the models based on species records; it is striking that the former sometimes outperformed the latter. However, experts also used the same overprediction rate for the three types of species, i.e., when experts filled in the gaps within their mental fuzzy model of species distribution [34, 36], they did so equally for all types of species. Nonetheless, this kind of generalisation is useful for discriminating the distribution range of threatened species, while it yields uninformative models for widespread species. In other words, expert judgment, although consistent, is more useful when dealing with threatened, typically specialist species and less useful when dealing with more generalist species. Conversely, objective species distribution models were more valuable when dealing with generalist species than when approaching specialist species that may be well understood by experts.

Drivers of the geographical range of amphibians

These models, built on the basis of two different sources of information, incorporated factors that have been identified in the literature as significant drivers of amphibian distributions in various regions [23, 54, 55], including Uruguay [17, 56,57,58]. Furthermore, the factors better represented in the models from both sources of information were those that exerted influence on species distribution at large spatial scales (climate, topography, land uses and spatial location), ranging from landscape (10–200 km) to regional and national scales (from 200 km to greater than 2000 km) [38]. However, models based on the two sources of information differed in the number of factors and variables invoked to explain the distribution of amphibian species, as the models based on the knowledge of the experts were more complex. This was unexpected, given that the experts produced distribution ranges apparently simpler and more homogeneous than the distribution pattern of species records. However, despite the higher complexity of expert-based models, they resulted in models with lower entropy. Consequently, these models proposed more structured and organized patterns of favourable territories for each species, in comparison to models based on recorded distributions.

It is worth noting that human activity was identified as an explanatory factor in many of the models based on both information sources. When the human variables in the models indicated a higher favourably with proximity to roads and urban centres, it mostly revealed biases in the sampled records. This was the case because researchers tend to sample more frequently in accessible and well-connected territories [12, 13, 59, 60]. Expert-based models were less affected by this kind of sampling biases [59], which may be related to the generalization capacity of the experts allowing them to dodge the effects of the sampling bias. Consequently, expert knowledge may be most valuable in countries with high biodiversity, which are often undersampled, especially for taxa that are generally underestimated, such as amphibians [61].

Comparing the extent of models based on expert criteria and field records

Fuzzy overlap between the two kinds of models was notably low, in the range 0.34–0.42. This suggests that the areas considered as favourable for the species according to the two types of information differed substantially. The inclusion of one type of model int the other, i.e., the degree to which the model based on expert knowledge was a subset of the model based on species records and vice versa, differed according to the types of species analysed. For threatened species, the models based on expert knowledge were mostly a subset of the corresponding models based on species records, while the opposite occurred for non-threatened species.

Some authors have highlighted the tendency of experts’ fuzzy thinking to incorporate absences areas as favourable territories [34], rather than excluding presence records outside the predicted favourable areas. However, our results indicate that both types of models, whether based on expert thinking or on species records, did this, and that experts were less prone to this kind of overprediction for threatened species, as specificity values were higher for expert-derived models of threatened species. The fuzzy set of favourable territories derived from expert-based models encompasses the favourable territories indicated by field records only for species with records extended over wide territories (NtNu and U). Although some authors have suggested that experts tend to think of the environmental territories occupied by these species as polygons that should include all known records [34, 36, 62,63,64], and that the set of favourability territories derived from records is more fitted to the specific environmental conditions highlighted by punctual records [34, 36], our results suggest that this rationale is only valid for the two types of non-threatened species. For threatened species (see species in Table S1 in Additional file 1), expert models predicted more restrictive fuzzy sets of favourable territories than those predicted by the records themselves. This could be attributed to the fact that threatened species generally exhibit a narrower distribution and, thus, occupy narrower environmental ranges [18, 19, 32, 33], as well as the fact that greater research effort and interest is typically dedicated to acquiring information on threatened species, which are usually better known by experts.

Bridging distribution gaps with expert knowledge and distribution modelling

For both kinds of models, the patterns of favourable zones completed the absence gaps between zones with presences [65]. Thus, the favourable zones were indicative of a distribution that is less dispersed and discontinuous than that suggested by the known records, and which fills gaps in the currently known distribution. This lack of knowledge concerning distributions could affect the conservation of these fauna groups. Therefore, although knowledge of species distributions has increased in recent decades [66], sampling efforts need to be strengthened in relation to large groups of fauna to obtain a more complete range of sampling-confirmed records and to be able to better define the territories occupied by species [65,66,67]. Even so, our results suggest that expert knowledge is particularly useful for assessing the distribution of threatened species, which are of greatest conservation concern, and should be more appreciated in areas with fewer funds devoted to sampling efforts.

On the other hand, according to expert criteria about thirty percent of Uruguay amphibian species occupy territories throughout all of the country [17]. However, favourable zones identified by models based on sampling records made it clear that, for these widely distributed amphibian species, there are zones of low or intermediate environmental favourability in which these species are probably absent or present in lower abundances [9, 68]. Although experts overestimate the distributions of species in the same way irrespective of species status as threatened or not, for widespread species, this overprediction leads to uninformative prediction (ubiquity of the distribution). In contrast, models based on observed records had a significantly lower overprediction rate for ubiquitous species than for the other groups for species, helping them to provide better information on the nuances of the distribution patterns of widespread species [5, 9, 55, 69].

Combination of models based on experts and on records

The use of fuzzy methods, such as favourability functions, to analyse species distributions provides fuzzy degree values to interpret the occurrence of species, yielding more dynamic results than those provided by classical modelling techniques [5, 6]. One new possibility is to combine models based on records and on expert knowledge. Some researchers have integrated recorded occurrences and expert data into the same modelling approach [36], or compared models resulting from both approaches [9, 34]. Fuzzy logic, rather, allows the combination of the models based on the two separate information sources [5, 44]. The application of fuzzy intersection defined areas that were consider favourable in the two types of models. This generally resulted in distribution limits mainly established by expert knowledge for threatened species, whereas they were more similar to the models based on recorded data for the rest of species (Fig. 2). Thus, this method of achieving a consensus between models based on the two kinds of information is able to give a higher weight to the best model for each species.

Conclusions

In line with the findings of other authors [9, 11, 34, 36], this study emphasizes the crucial role of incorporating expert judgment in distribution models. In the case of threatened species, the expert opinion on these species allowed for a more nuanced and accurate representation of the habitat requirements of the species and the factors that influence their distribution [57]. The use of fuzzy logic provides more dynamic tools that can be used to study the uncertain distribution of biodiversity, allowing extraction of the best information from observation records and expert knowledge. Our analysis demonstrated quantitatively that the fuzzy set of favourable areas predicted from expert knowledge was as accurate as that predicted by field records themselves, and even better in the case of threatened species. In addition, the models based on expert criteria encompassed a greater explanatory complexity to define the requirements of these species. For generalist species, models based on observed data were more accurate.

This approach provides relevant information when planning territories to be protected, and highlights territories in which sampling efforts should be improved to assess the state of conservation of the biodiversity of these areas. We propose that the fuzzy union between the models based on record data and on maps provided by experts can be used to design a national sampling strategy in Uruguay. This would validate the applicability of this new methodology for other fauna groups and regions.

Availability of data and materials

The dataset supporting the conclusions of this article will be made available through a data repository.

References

  1. Riddle BR, Ladle RJ, Lourie SA, Whittaker RJ. Basic biogeography: estimating biodiversity and mapping nature. In: Ladle RJ, Whittaker RJ, editors. Conservation Biogeography. Oxford: Willey Blackwell; 2011. p. 47–92.

    Google Scholar 

  2. Lomolino MV, Riddle BR, Brown JH. Distribution of species. In: Lomolino MV, Riddle BR, Brown JH, editors. Biogeography. 3rd ed. Sunderland: Sinauer Associates; 2006. p. 65–96.

    Google Scholar 

  3. Fei SL, Yu F. Quality of presence data determines species distribution model performance: a novel index to evaluate data quality. Landsc Ecol. 2016;31:31–42.

    Article  Google Scholar 

  4. Whittaker RJ, Araújo MB, Jepson P, Ladle RJ, Watson JEM, Willis KJ. Conservation biogeography: assessment and prospect. Divers Distrib. 2005;11:3–23.

    Article  Google Scholar 

  5. Acevedo P, Real R. Favourability: concept, distinctive characteristics and potential usefulness. Naturwissenschaften. 2012;99:515–22.

    Article  CAS  PubMed  Google Scholar 

  6. Real R, Barbosa AM, Bull JW. Species distributions, quantum theory, and the enhancement of biodiversity measures. Syst Biol. 2017;66:453–62.

    PubMed  Google Scholar 

  7. Guisan A, Zimmermann NE. Predictive habitat distribution models in ecology. Ecol Modell. 2000;135:147–86.

    Article  Google Scholar 

  8. Romero D, Olivero J, Márquez AL, Báez JC, Real R. Uncertainty in distribution forecasts caused by taxonomic ambiguity under climate change scenarios: a case study with two newt species in mainland Spain. J Biogeogr. 2013;41:111–21.

    Article  Google Scholar 

  9. Fourcade Y. Comparing species distribution modelled from occurrence data and from expert-based range maps. Implication for predicting range shifts with climate change. Ecol Inform. 2016;36:8–14.

    Article  Google Scholar 

  10. Niamir A, Skidmore AK, Toxopeus AG, Muñoz AR, Real R. Finessing atlas data for species distribution models. Divers Distrib. 2011;17:1173–85.

    Article  Google Scholar 

  11. Fourcade Y, Engler JO, Besnard AG, Rödder D, Secondi J. Confronting expert-based and modelled distributions for species with uncertain conservation status: a case study from the corncrake (Crex crex). Biol Conserv. 2013;167:161–71.

    Article  Google Scholar 

  12. Núñez D, Maneyro R, Langone J, de Sá RO. Distribución geográfica de la fauna de Anfibios del Uruguay. In: Smithsonian Herpetological Information Service. 2004.

  13. Grattarola F, Botto G, da Rosa I, Gobel N, González E, González J, Hernández D, Laufer G, Maneyro R, Martínez-Lanfranco J, Naya D, Rodales A, Ziegler L, Pincheira-Donoso D. Biodiversidata: an open-access biodiversity database for uruguay. Biodivers Data J. 2019;7:e36226.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Borteiro C, Nieto C, Kolenc F. Amphibia, Anura, Hylidae, Scinax aromothyella: Distribution extension and habitat. Check List. 2007;3:98–9.

    Article  Google Scholar 

  15. Maneyro R, Beheregaray M. First record of Physalaemus cuvieri Fitzinger, 1826 (Anura, Leptodactylidae) in Uruguay, with comments on the anuran fauna along the borderline Uruguay-Brazil. Bol Soc Zool Urug. 2007;16:36–41.

    Google Scholar 

  16. Maneyro R, Kwet A. Amphibians in the border region between Uruguay and Brazil: updated species list with comments on taxonomy and natural history (Part I: Bufonidae). Stuttgarter Beiträge zur Naturkunde. 2008;1:95–121.

    Google Scholar 

  17. Maneyro R, Carreira S. Guía de anfibios del Uruguay. In: Colección Ciencia Amiga. Uruguay; 2012.

  18. Maneyro R, Langone J, Carreira S. Anfibios. In: Carreria S, Maneyro R, editors. Libro Rojo de los Anfibios y Reptiles del Uruguay. Biología y conservación de los Anfibios y Reptiles en peligro de extinción a nivel nacional. Montevideo: DINAMA; 2019. p. 17–106.

    Google Scholar 

  19. Laufer G, Gobel N, Kacevas N, Lado I, Cortizas S, Carabio M, Arrieta D, Prigioni C, Borteiro C, Kolenc F. Updating the distributions of four Uruguayan hylids (Anura: Hylidae): recent expansions or lack of sampling effort? Amphib Reptile Conserv. 2021;15:228–37.

    Google Scholar 

  20. GBIF.org (29 February 2016). GBIF Occurrence Download; 2016. https://doi.org/10.15468/dl.ywhpmz

  21. Grattarola F, Martínez-Lanfranco JA, Botto G, Naya DE, Maneyro R, Mai P, Hernández D, Laufer G, Ziegler L, González EM, da Rosa I, Gobel N, González A, González J, Rodales AL, Pincheira-Donoso D. Multiple forms of hotspots of tetrapod Biodiversity and the challenges of open-access data scarcity. Sci Rep. 2020;10:22045.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Stuart SN, Chanson JS, Cox NA, Young BE, Rodrigues ASL, Fischman DL, Waller RW. Status and trends of amphibian declines and extinctions worldwide. Science. 2004;306:1783–6.

    Article  CAS  PubMed  Google Scholar 

  23. Araújo MB, Thuiller W, Pearson G. Climate warming and the decline of amphibians and reptiles in Europe. J Biogeogr. 2006;33:1712–28.

    Article  Google Scholar 

  24. Mazerolle MJ, Bailey LL, Kendall WL, Royle JA, Converse SJ, Nichols JD. Making great leaps forward: accounting for detectability in herpetological field studies. J Herpetol. 2007;41:672–89.

    Article  Google Scholar 

  25. Zadeh LA. Fuzzy sets. Info Control. 1965;8:338–53.

    Article  Google Scholar 

  26. Kosko B. Fuzzy entropy and conditioning. Inf Sci. 1986;40:165–74.

    Article  Google Scholar 

  27. Real R, Barbosa AM, Vargas JM. Obtaining environmental favourability functions from logistic regression. Environ Ecol Stat. 2006;12:237–45.

    Article  Google Scholar 

  28. Salski A. Ecological applications of fuzzy logic. In: Recknagel F, editor. Ecological informatics: scope, techniques and applications. Berlin: Springer; 2006. p. 3–14.

    Chapter  Google Scholar 

  29. Olson DM, Dinerstein E, Wikramanayake ED, Burgess ND, Powell GVN, Underwood EC, D’Amico JA, Itoua I, Strand HE, Morrison JC, Loucks CJ, Allnutt TF, Ricketts TH, Kura Y, Lamoreux JF, Wettengel WW, Hedao P, Kassem KR. Terrestrial ecoregions of the world: a new map of life on earth: a new global map of terrestrial ecoregions provides an innovative tool for conserving biodiversity. Bio Sci. 2001;51:933–8.

    Google Scholar 

  30. Achkar M, Díaz I, Domínguez A, Pesce F. Uruguay. Naturaleza, Sociedad, Economía. In: Una visión desde la Geografía. Montevideo: Banda Oriental SRL; 2016.

  31. Simó M, Guerrero JC, Giuliani L, Castellano I, Acosta L. A predictive modelling approach to test distributional uniformity of Uruguayan harvestmen (Arachnida: Opiliones). Zool Stud. 2014;53:1–13.

    Article  Google Scholar 

  32. Carreira S, Maneyro R. Una evaluación del estado de conservación de la herpetofauna de Uruguay sobre la base de los criterios de la Unión Internacional para la Conservación de la Naturaleza. In: Lista Roja de los Anfibios y Reptiles del Uruguay. Montevideo (Uruguay): Dirección Nacional de Medio Ambiente. 2015.

  33. Frost DR. Amphibian Species of the World: an Online Reference. Version 6.0 (25 February 2020). Electronic Database. http://research.amnh.org/herpetology/amphibia/index.html. New York: American Museum of Natural History. 2020.

  34. Mainali K, Hefley T, Ries L, Fagan WF. Matching expert range maps with species distribution model predictions. Conserv Biol. 2020;34:1292–304.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Hurlbert AH, Jetz W. Species richness, hotspots, and the scale dependence of range maps in ecology and conservation. Proc Natl Acad Sci. 2007;104:13384–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Merow C, Wilson AM, Jetz W. Integrating occurrence data and expert maps for improved species range predictions. Glob Ecol Biogeogr. 2017;26:243–58.

    Article  Google Scholar 

  37. Phipps JB. Bestblock: optimizing grid size in biogeographic studies. Can J Bot. 1975;53:1447–52.

    Article  Google Scholar 

  38. Pearson RG, Dawson TP. Predicting the impacts of climate change on the distribution of species: are bioclimate envelope models useful? Glob Ecol Biogeogr. 2003;12:361–71.

    Article  Google Scholar 

  39. QGIS Development Team. QGIS Geographic Information System. Open Source Geospatial Foundation Project. 2016.

  40. Legendre P, Legendre L. Numerical ecology. 2nd English. Amsterdam: Elsevier Science; 1998.

    Google Scholar 

  41. Barbosa AM, Real R, Vargas JM. Use of coarse-resolution models of species’ distributions to guide local conservation inferences. Conserv Biol. 2010;24:1378–87.

    Article  PubMed  Google Scholar 

  42. Legendre P. Spatial autocorrelation: trouble or new paradigm? Ecology. 1993;74:1659–73.

    Article  Google Scholar 

  43. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 1995;57:289–300.

    Google Scholar 

  44. Romero D, Olivero J, Brito JC, Real R. Comparison of approaches to combine species distribution models based on different sets of predictors. Ecography. 2016;39:561–71.

    Article  Google Scholar 

  45. Romero D, Olivero J, Real R. Accounting for uncertainty in assessing the impact of climate change on biodiversity hotspots in Spain. Anim Biodivers Conserv. 2019;42:355–67.

    Article  Google Scholar 

  46. Olivero J, Real R, Márquez AL. Fuzzy Chorotypes as a conceptual tool to improve insight into biogeographic patterns. Syst Biol. 2011;60:645–60.

    Article  PubMed  Google Scholar 

  47. Barbosa AM. fuzzySim: applying fuzzy logic to binary similarity indices in ecology. Methods Ecol Evol. 2015;6:853–8.

    Article  Google Scholar 

  48. Hosmer DW, Lemeshow S. Applied logistic regression. 2nd ed. New York: Wiley; 2000.

    Book  Google Scholar 

  49. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982;143:29–36.

    Article  CAS  PubMed  Google Scholar 

  50. Barbosa M, Real R, Muñoz AR, Brown JA. New measures for assessing model equilibrium and prediction mismatch in species distribution models. Divers Distrib. 2013;19:1333–8.

    Article  Google Scholar 

  51. Fielding A, Bell J. A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ Conserv. 1997;24:38–49.

    Article  Google Scholar 

  52. Kunchenva LI. Using measures of similarity and inclusion for multiple classifier fusion by decision templates. Fuzzy Sets Syst. 2001;122:401–7.

    Article  Google Scholar 

  53. Sillero N, Arenas-Castro S, Enriquez-Urzelai U, Vale GG, Sousa-Guedes D, Martínez-Freiría F, Real R, Barbosa AM. Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling. Ecol Modell. 2021; 456.

  54. Arntzen JW. From descriptive to predictive distribution models: a working example with Iberian amphibians and reptiles. Front Zool. 2006;3:8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Romero D, Real R, Olivero J. Efectos previsibles del cambio climático en la distribución de los vertebrados amenazados de España peninsular. In: Tesis doctoral. Servicio de Publicaciones y Divulgación Científica. Universidad de Málaga: RIUMA; 2014.

  56. Toranza C, Maneyro R. Potential effects of climate change on the distribution of an endangered species: Melanophryniscus montevidensis (Anura: Bufonidae). Phyllomedusa. 2013;12:97–106.

    Article  Google Scholar 

  57. Carreira S, Maneyro R. Biología y conservación de los Anfibios y Reptiles en peligro de extinción a nivel nacional. In: Carreira S, Maneyro R, editors. Libro Rojo de los Anfibios y Reptiles del Uruguay. Montevideo: DINAMA; 2019.

    Google Scholar 

  58. Toranza C, Brazeiro A, Maneyro R. Anfibios amenazados de Uruguay: efectividad de las áreas protegidas ante el cambio climático. Ecol Austral. 2016;26:138–49.

    Article  Google Scholar 

  59. Hughes AC, Orr MC, Ma K, Costello MJ, Waller J, Provoost P, Zhu C, Qiao H. Sampling biases shape our view of the natural world. Authorea. 2021.

  60. Duarte J, Romero D, Rubio PJ, Farfán MA, Fa JE. Implications for conservation and game management of the roadkill levels of the endemic Iberian hare (Lepus granatensis). Sci Rep. 2021;11:20641.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Troudet J, Grandcolas P, Blin A, Vignes-Lebbe R, Legendre F. Taxonomic bias in biodiversity data and societal preferences. Sci Rep. 2017;7:9132.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Domisch S, Wilson AM, Jetz W. Model-based integration of observed and expert-based information for assessing the geographic and environmental distribution of freshwater species. Ecography. 2016;39:1078–88.

    Article  Google Scholar 

  63. Fourcade Y. Comparing species distributions modelled from occurrence data and from expert-based range maps. Implication for predicting range shifts with climate change. Ecol Inform. 2016;36:8–14.

    Article  Google Scholar 

  64. Alhajeri BH, Fourcade Y. High correlation between species-level environmental data estimates extracted from IUCN expert range maps and from GBIF occurrence data. J Biogeogr. 2018;46:1329–41.

    Article  Google Scholar 

  65. Estrada A, Real R. Assessment of the National Park network of mainland Spain by the Insecurity Index of vertebrate species. PLoS ONE. 2018;13: e0197496.

    Article  PubMed  PubMed Central  Google Scholar 

  66. Soberón J, Peterson AT. Monitoring biodiversity loss with primary species-occurrence data: toward national-level indicators for the 2010 target of the convention on biological diversity. Ambio. 2008;38:29–34.

    Article  Google Scholar 

  67. Underwood JG, D’Agrosa C, Gerber LR. Identifying conservation areas on the basis of alternative distribution data sets. Conserv Biol. 2010;24:162–70.

    Article  PubMed  Google Scholar 

  68. Muñoz AR, Jiménez-Valverde A, Márquez AL, Real R. Environmental favourability as a cost-efficient tool to estimate carrying capacity. Divers Distrib. 2015;21:1388–400.

    Article  Google Scholar 

  69. Bini LM, Diniz-Filho JAF, Rangel TFLV, Bastos RP, Pinto MP. Challeging Wallacean and Linnean shortfalls: knowledge gradients and conservation planning in a biodiversity hotspot. Divers Distrib. 2006;12:475–82.

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank the Programa de Desarrollo de las Ciencias Básicas (PEDECIBA) and the Sistema Nacional de Investigadores (SNI) of the Agencia Nacional de Investigación e Innovación (ANII) of Uruguay.

Funding

This article was supported by the PD_NAC_2015_1_108393 project from the Professor Dr. Roberto Caldeyro Barcia fund (the National Research and Innovation Agency ANII (2016-2018)). It was also supported as part of the David Romero national postdoctoral support project 2018 from the Comisión Académica de Posgrado (CAP) (Universidad de la República, Uruguay). To the group of the Andalusian Plan for Research, Development and Innovation (PAIDI) RNM-262 of the Junta de Andalucía.

Author information

Authors and Affiliations

Authors

Contributions

DR processed and analysed the data, wrote the first draft of the paper and carried out the revision of the manuscript; RM conducted the fieldwork, curated the data, discuss methodology and participate in discussion of results; JCG and RR participated in the team building, and supervised the discussion section and the final version of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to David Romero.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors have no competing interests to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1

shows the list of amphibian species analysed in Uruguay, including the IUCN threat category.

Additional file 2: Figure S1

shows the results of the Kruskal–Wallis test comparing the discrimination and classification abilities for threatened (T), non-threatened and non-ubiquitous (NtNu) and ubiquitous species (U) of the models based on expert knowledge and species records, respectively. Figure S2 compares the average number of factors, average number of variables and entropy values of the models based on expert knowledge and species records.

Additional file 3: Figure S3

represents cartographically the occurrences and favourability model values for all amphibian species analyzed in Uruguay according to expert criteria and to species records.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Romero, D., Maneyro, R., Guerrero, J.C. et al. Using fuzzy logic to compare species distribution models developed on the basis of expert knowledge and sampling records. Front Zool 20, 38 (2023). https://doi.org/10.1186/s12983-023-00515-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12983-023-00515-x

Keywords