An objective criterion for weighting taxonomic characters

doi:10.15406/mojbm.2024.09.00218

MOJ

eISSN: 2574-9722

Biology and Medicine

Research Article Volume 9 Issue 2

An objective criterion for weighting taxonomic characters

Joaquim Carlos Sena Maia

Verify Captcha

Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.

Adjunct Professor (retired) of the Department of Statistics at UFPR, Brazil

Correspondence: Joaquim Carlos Sena Maia, Adjunct Professor (retired) of the Department of Statistics at UFPR, Brazil

Received: April 12, 2024 | Published: May 22, 2024

Citation: Maia JCS. Taxonomic weighting. MOJ Biol Med. 2024;9(2):67-71. DOI: 10.15406/mojbm.2024.09.00218

Download PDF

Abstract

In this work we present the programs developed to calculate, through a weighting system, the contributions of taxonomic characters to aggregate species in higher taxa or to individualize them in identification processes. The concepts established in previous works are presented graphically in order to facilitate the understanding of the concepts of aggregative and discriminative potentials of a character. A step-by-step tutorial is presented to facilitate the use of the programs developed by the author to calculate the weighting of each character, numerically translating the ability to aggregate and discriminate taxonomic units, by means of statistical analysis.

Keywords: weighting, aggregating, taxonomic characters, potential

Introduction

The use of weights to emphasize the greater or lesser taxonomic importance of a biological character was very prominent with the emergence of Numerical Taxonomy in the 1950s. If, on the one hand, the use of weights is very appropriate for numerical methods, on the other hand, whatever method is employed, it is important to know the value of the participation of each character in the formation of the taxons studied. As the determination of weights involves calculations, including statistical analysis, we present in this work a program developed in FORTRAN 90 that contemplates all the routines of the necessary calculations.

Material and methods

We will use the same material published by Maia et al.¹ used later in the publications that described the methodologies of the calculations of the weights for discrete variables (counts, coded attributes, etc.), Maia et al.² and for continuous variables (measurements), Maia et al.³ in order to facilitate the joint analysis of these three works that complement each other in the study of the theme. In the graphic demonstrations we will use meristic data (discrete variables) to detail the structure of the groups formed, due to the ease of formatting the examples of Figures 1–4. The results and conclusions, however, also apply to continuous variables (weights and measures), whose groupings are formed from the results of analysis of variance (ANOVA).

Figure 1 Subgroups formed by pronotum color.

Figure 2 Subgroups formed by propodium punctuation.

Figure 3 Subgroups formed by labrum color.

Figure 4 Subgroups formed by scapum color.

Results and discussions

The species Plebeia juliani, Plebeia meridionalis, Plebeia droryana, Plebeia emerina, Plebeia remota and Plebeia saiqui are also identified by the letters J, M, D, E, R and S in the figures that illustrate the presentation of the methodology. We also modified the scale of weights, which initially ranged from zero to 5, to a scale from 1 to 6. This modification consisted of increasing a unit on the original scale making it more understandable in the interpretation of the relation ‘contribution of character/value of weighting. The use of weights to hierarchize the contributions of taxonomic characters is an old discussion that was potentiated with the emergence of Numerical Taxonomy in the 1950s, due to the lack of a rational and objective criterion of weighting. Michener et al.⁴ advocated the use of equal weights for all characters as an alternative to work around the problem. Other solutions were presented by Burtt et al.⁵ Farris et al.⁶ Goodman et al.⁷ Sneath et al.⁸ recognized that the controversies over the use of weighting in Numerical Taxonomy were responsible for the difficulties that slowed its progress. The importance of a taxonomic character depends on the objectives of the research and its behavior, discriminating or aggregating the members of the studied group. If the study aims to individualize species, the characters capable of discriminating are particularly important. However, if the study aims to structure taxons at higher levels, formed by the aggregation of similar individuals from lower taxons, aggregating characters play a relevant role in this task. This importance can be assessed by calculating the character weights in the formations of the taxons and in the identifications of the species. Maia et al.¹ used discrete and continuous variables in the calculation of similarity coefficients using analysis of variance, without, however, creating a weighting criterion.

Maia et al.² proposed the creation of a weighting scale for discrete variables (counts and encodings), formed by 5 categories, whose value of D, calculated through the formula D = 5(N – 2G) / (N- 2), varies from zero to 5. For the continuous variables (measurements), Maia et al.³presented a formula for the calculation of D = 5(1- 2Z/(n(n-1)), where Z is equal to the number of comparisons whose means do not differ significantly by Tukey test. In both cases the values of D vary from zero to 5, where zero represents the minimum degree of discrimination and 5 the maximum degree. Thus, two scales of weights were formatted, one to evaluate the potential for discrimination (D) and the other to evaluate the potential for aggregation (D’). The values are complementary: D’ = 5 – D. In the formatting proposed here we exclude the weight = 0, and the 5 categories have weights ranging from 1 to 6 due to the addition of a unit (+ 1) in the value of D. Thus, the weighting for discriminative characters is calculated as WD = D + 1, where WD stands for weight of the discriminative potential of the character. The weight of the aggregative potential (WA) is calculated as WA = 7 – WD. Most characters both aggregate and discriminate, that is, taxonomic characters have aggregative potential and discriminative potential, whose proportions may vary according to the case. The use of WD or WA values depends on the objectives of the study and should be done by framing the calculated value within the corresponding limits of Table 1 & 2.

1 <= WA <= 2	- Very little discriminative character
2 < WA <= 3	- Little discriminative character
3 < WA <= 4	- Moderate discriminative character
4 < WA <= 5	- discriminative character
5 < WA <= 6	- Very discriminative character

Table 1 Weight limits for weighting (WD) for discriminative taxonomic characters

1 <= WA <= 2	- Very little aggregative character
2 < WA <= 3	- Little aggregative character
3 < WA <= 4	- Moderate aggregative character
4 < WA <= 5	- Aggregative character
5 < WA <= 6	- Very aggregative character

Table 2 Weight limits for weighting (WA) for aggregative taxonomic characters

For a better understanding of the weighting criteria, we are going to analyze how the species are organized within the studied group and the respective values of WD and WA. In Figure 1 we analyze the character ‘color of the pronotum’ that presents a maximum degree of discrimination. This happens when the status number of the character is equal to the number of species that make up the analyzed group. These characters are suitable for studies aimed at the individualization of taxonomic units. In Figure 2 we will analyze the ‘propodium punctuation’. This character has the smallest grouping of species that a variable character can form: 2 subgroups of equal size brought together by 2 different statuses. Thus, we observed that the weight of the aggregation potential reaches the maximum value (WA = 6 .0) while the weight of discrimination reaches. The Figure 3 shows a situation in which the character ‘color of the labrum’ can aggregate and discriminate minimum value (WD = 1 .0). Characters with these qualities are suitable for cladistic studies, as they can form larger groups from the gathering of similar smaller groups in a similar way, forming homogeneous subgroups that differ from each other. When the WD and WA values are close, the characters are classified as intermediate (or moderate) and sometimes the WD and WA values are exactly the same as shown in Figure 3. In Figure 4 we analyze the taxonomic character ‘color of the scapum’ in which the aggregative potential (WA = 4.0) is greater than the discriminative potential (WD = 3.0), or, in other words, this character aggregates more than discriminates.⁹

Automation of calculations

D, WD and WA values can be easily calculated through the GADVDVC. F application, developed by the author, which includes the calculation routines for discrete and continuous variables. There are two versions, in Portuguese and in English.

Using the application

The GADVDVC. F application was developed to calculate the degree of aggregation/discrimination (D) of a taxonomic character and also to calculate the weights of a weighting system (on a scale from 1 to 6) that informs the contribution of the character to the formation of subgroups in a group (that brings together 3 or more species) (WA) or for individualization of these same species (WD). The term ‘group’ is being used in cases where the studied species do not represent the whole of a genus, but only part of it. Thus, the conclusions will be valid only for the group includes 16 other statistical tests of current use in biological research.

The tutorial

BIOESTAT can be obtained from the website of the Department of Statistics of UFPR, by downloading it at the address http://est.ufpr.br / ‘Recursos, Software’ / ‘Projetos Ativos, Bioestat’). It is also important to download a tutorial extensively illustrated with the prints of the screens, exemplifying in a very didactic way the routines of the calculations through examples developed for each test studied and should not be extended to the corresponding genus. The methodological foundation for these calculations can be found in the bibliography cited at the end of this chapter.

These are two works that deal with the theme. Maia et al.² presents the methodology for discrete variables (counts and codifications), based on the calculation of the geometric average. Maia et al.³ deals with the continuous variables formed by measurements using Analysis of Variance, F Test (one classification criterion), complemented by the Tukey test.⁹

Conclusion

Every variable taxonomic character has aggregative and discriminative potentials, whose values may vary according to the number of species and their respective status that make up the analyzed group.
The discriminative potential of a character can be represented numerically on a scale from zero to 5.
The discriminative and aggregative potentials are complementary, that is, a ‘very discriminative’ character is also ‘very little aggregative’ and vice versa.
The values of WD and WA represent the weights (participations) that the characters have in the formation of taxonomic groups and subgroups on a scale from 1 to 6.
WD and WA values can also be used as a criterion for prior character selection for studies employing more sophisticated or more costly methodologies.