Energy and Human Performance

### **Chapter 6**

## Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data in Football

*José E. Teixeira, Pedro Forte, Ricardo Ferraz, Luís Branquinho, António J. Silva,Tiago M. Barbosa and António M. Monteiro*

### **Abstract**

Complex and dynamic systems are characterised by emergent behaviour, self-similarity, self-organisation and a chaotic component. In team sports as football, complexity and non-linear dynamics includes understanding the mechanisms underlying human movement and collective behaviour. Linear systems approaches in this kind of sports may limit performance understanding due to the fact that small changes in the inputs may not represent proportional and quantifiable changes in the output. Thus, non-linear approaches have been applied to assess training and match outcomes in football. The increasing access to wearable and tracking technology provides large datasets, enabling the analyses of time-series related to different performance indicators such as physiological and positional parameters. However, it is important to frame the theoretical concepts, mathematical models and procedures to determine metrics with physiological and behavioural significance. Additionally, physiological and behavioural data should be considered to determine the complexity and non-linearity of the system in football. Thus, the current chapter summarises the main methodological procedures to extract positional data using non-linear analyses such as entropy scales, relative phase transforms, non-linear indexes, cross correlation, fractals and clustering methods.

**Keywords:** complex systems, positioning, physiology, performance analysis, soccer

#### **1. Introduction**

Applying non-linear theory and approaches has been a growing research interest in sports sciences fields such as performance analysis [1–4]. It is assumed that timebased and team sports display non-linear characteristics [5–8]. Football is deemed as a complex and dynamic system where players perform intermittent movements in time-space coordination [9–11]. The prominence of this research topic is due to several factors, amongst which the shift in the paradigm from linear to non-linear frameworks that has been applied to a wide variety of fields and settings, besided the ready access to technology (e.g., tracking systems) providing large datasets, time series outputs and new time-motion approaches [12–14].

Nonlinear theory and complex sciences are disruptive of linear frameworks [15, 16]. Linear systems assume a linearity on time-varying case, an input-output statistic and a linear state feedback [17]. Considering the linear system theory, an internal and external structure developing feedback control strategies for simultaneous stabilisation of the system [16]. Based on this, theoretical models quantify the relationship between human movement (input) and performance (output), considering the athlete as a linear system [18, 19]. Desynchronization between internal (such as heart rate measure, perceived exertion and biochemical procedures) and external components (i.e., movement speed, body impacts, metabolic power, accelerations and decelerations) may affect the performance [18, 20]. Small changes in the inputs determine proportional and measurable changes in the output, reporting linearity characteristics such as controllability, observability and canonical structure [21]. These assumptions determines approaches focused on the linearity of the system, reporting an fitness-fatigue binomial with a related dependence on dose-response relationships [22, 23]. However, the accuracy of these theoretical models has been challenged for being feeble and for the lack of individualised measurement [23]. Moreover, the ecological dynamisms of informational contexts, social relations and human movement variability are not considered in linear analyses [24–26]. Human movement and collective behaviour are not characterised by the linearity of the systems (as in team sports, like football) and the linear theory could be deemed as a reductionist approach to the problem [5, 6, 26]. Thus, the individual and collective performance has been reported using a complex and dynamic perspective [26–31]. Under these assumptions, biological systems are characterised by non-linearity, interaction-dominant dynamics, emergent behaviour, self-similarity, self-organisation and a chaotic component [32]. Literature reports several nomenclatures for the topic as complex adaptive systems [8, 33, 34], complex and dynamic systems [6, 26] or non-linear and dynamical self-organisation systems [27, 34, 35].

The ready access to cutting-edge technology was another reason for this field of research to increase. Such technology eventually became more affordable and user friendly. The use of tracking data started by assessing the individual players' movement, and later integrated spatial-temporal patterns based on Cartesian and Euclidean references [13, 36, 37]. Over the last two decades, positional data has been verified in football training and match-play to assess the complexity of the systems inherent to the individual movements and collective coordination [31, 38, 39]. Positional dataset can be applied to measure both physical and tactical measures [10, 40–45]. However, analysis do not always integrate different performance indicators [10, 46, 47]. Usually, studies focused only in a single performance dimension, however, football is a multifaceted sport with the physical, tactical, and technical factors amalgamating to influence performance with each factor not mutually exclusive of another [47]. Integrating performance metrics remain rarely described in current literature, concerning football environments [36]. That creates issues in the performance analysis, leading to the fact that the integrative approaches remain understudied. This research gap may be a very important topic to enhance knowledge about the theoretical concepts, mathematical models and methodological procedures of the non-linear approaches to integrate physiological and behavioural data in football.

#### **2. Theoretical concepts of the non-linear approaches**

#### **2.1 Football as a complex and dynamic Cartesian coordinates**

Football is an invasion game characterised as a complex and dynamical systems with a goal-oriented adaptation amongst teammate and opponents [9, 48].

*Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

Previously, to measure and tracking player's movements, mapping tactical actions and modelling collective behaviour were time-consuming processes [2, 4, 49]. Observational and notational analysis had scarce technological and procedural means to support the occurrence of the large number of physical, technical and tactical actions of the football game [49–51]. The wearable technologies as tracking systems allowed real-time access the players' position on the field during training and competition [31, 38, 39]. Positional dataset can be captured at different frequencies by using tracking systems as global positioning systems (GPS) tracking systems [52, 53], micro-electromechanical systems (MEMS) [36, 54], local radiobased local positioning systems (LPM) [55, 56], computerised-video systems [57, 58] and tracking system [59, 60]. This is largely due to the high cost associated with its use, which restricts its use almost exclusively to professional settings in male players [20, 61]. The validity and accuracy of these time-motion methodologies is well documented with an excellent reliability (coefficient of variation, CV: 1.02–1.04%) [52, 53]. However, the integration of the different devices still needs further studies [60, 62, 63]. Using this techniques to collect data, the players' movements are possible to be framed in a Cartesian referential (football field), represented by time series of Cartesian coordinates (*xx*, *yy*) [36]. Also, this approach allows spatiotemporal patterns to be assessed with a physical and tactical significance for coaches, performance analysts and researchers [30, 36]. Capturing collective and tactical performance also requires knowledge of the tactical-strategic variables that mediate intra- and inter-team behaviour [40, 64, 65]. According to Duarte et al. [11], inter-player and team coordination reports mutual influence of each player on the behaviour of dyadic systems shaped emergent performance outcomes. Sampaio and Vitor [64] applied positional data to calculate mean position in relation to the geometric centre of the team. Thus, longitudinal and lateral directions are established by the players' dyads and geometrical centre of the team (i.e., team centroid) [40, 45]. Movement patterns and inter-player coordination as a key issue in non-linear signal processing method [6, 64, 66]. Length, width and centroid distance as measures of team's tactical performance were also applied in youth football by Folgado et al. [66]. Synchronisation and synergy are theoretical terms recurrently used to assess intra-team and inter-team coordination and assess spatial-temporal displacement for goal orientation and team success [11, 65]. Typically, this specialtemporal displacement has been assessed by the distance between player's dyads [42, 45]:

$$D\left(a\_{\mathbf{x}\_{(t)},\mathbf{y}\_{(t)}},b\_{\mathbf{x}\_{(t)},\mathbf{y}\_{(t)}}\right) = \sqrt{\left(a\_{\mathbf{x}\_{(t)},\mathbf{y}\_{(t)}}\right)^2 + \left(a\_{\mathbf{x}\_{(t)},\mathbf{y}\_{(t)}}\right)^2} \tag{1}$$

where *D* is the distance, *a* is the player, *x* and *y* are the coordinates, *t* is the time, and *b* is the teammate or opponent. Team dispersion can be measured using effective playing position (EPS), surface area or spatial exploratory indexes using Euclidean (planar) coordinates [67–69]. For their measurement dyads players are achieved to measure the magnitude of the variability, predictability, stability and/or regularity in the distance between players [11, 40]. However, the individual preponderance within the synergies and synchronicities of the team remains somehow challenging to measure [70]. Additionally, ball possession is a critical issue in this positional data modelling given the importance of distinguishing phases of play [44, 64]. Due to this fact, different levels of analysis must be considered such as micro, meso and macro [13, 71, 72]. Moreover, positional and behavioural data cannot be interpreted separately, because football performance is a multifactorial phenomenon [10, 46, 47].

#### **2.2 Current performance metrics to measure physiological demands under a linear framework**

The training process requires a systematic physiological and biomechanical stimulus to ensure optimal adaptations and an adequate performance [19]. Several theoretical frameworks have been developed to assess the quantity and quality of the training and competition demands [18, 19]. These training load-based consider the linear system theory, likewise dose-response relationship and fitness-fatigue binomial. Fitness-fatigue model approach was originally proposed by Bannister [73]:

$$p(k) = p^\* + c\_1 \sum\_{i=1}^{k-1} u(i) \exp^{\frac{-(k-1)}{\eta\_1}} - \sum\_{i=1}^{k-1} c\_2(i)u(i) \exp^{\frac{-(k-i)}{\eta\_2}} \tag{2}$$

were *p k*ð Þ is the measured fitness (or performance) with gain term (*k*) and time constant ð*τ*1) [23]. Cumulative effect of training as a key guidance to the individual athlete's performance [20, 74]. Training load concept has been developed under the assumption that the athlete is a linear system [74, 75]. It is possible to breakdown training load into external and internal load. The external load describes the work rate (i.e., physical or biomechanical output), regardless of the biochemical and psychophysiological response [20, 76]. In training environments, it is mainstream to reports the work rate [49, 77], workload [78, 79] and training load [18, 19]. When load measures are also applied to analyse match performance, quite often is noted the physical performance [47], activity profile [80, 81] or match running performance [82–84]. Wherefore, integrating load measure with other performance factors is a current research-practice gap when determining metrics with physiological and behavioural significance [10].

#### **2.3 Physiological and behaviour dataset in football environments: a integration approach using non-linear procedures**

Football performance, a multifactorial phenomenon, dependant on a variety of factors such as environmental, contextual, physical, technical, tactical and psychophysiological [46, 47, 83]. These factor are not mutually exclusive of one another, what makes relevant an integrated approach to provide holistic insights about performance analysis [47, 83]. On regular basis, each of these factors are analysed in isolation without taking the others into account, leading to 1-dimensional insights [83]. Bradley and Ade [47] proposed a theoretical model emphasising on high-intensity running efforts during match-play advocating a contextualization of these running-based actions amongst technical and tactical activities [47]. This becomes of utmost relevance considering the football game [9, 65]. This is what mediates the players' decision making throughout the game according a team strategy previously defined [13]. Several authors have tried to establish ecological approaches to evaluate training and match outcomes, including non-linear approaches [11, 65]. Non-linear analyses were fundamentally performed on competitive game [41, 85] and limited training tasks as small and large-side games [42–45, 67, 69]. It is therefore important to understand mathematical models and methodological procedures underpinning non-linear analyses to assess their significance in applied research and applied settings, identify possible research gaps to be explored and, be aware of potential limitations and criticisms (**Figure 1**).

*Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

**Figure 1.** *Physiological and behavioural dataset in football environments.*

#### **3. Mathematical models and methodological procedures of the non-linear approaches**

Non-linear approaches have been recurrently applied in football using complexity principles [14, 31, 36]. The informational context and spatiotemporal determinants that mediate players' perception and action are analysed by nonlinear and dynamic proprieties of the football game [6, 28]. It is assumed that environmental, task and organismic constraint influence individual and collective behaviour [5, 28]. This behaviour has a physiological cost over time that must be measured [18, 20, 76]. Informative content can be classified as different domains of variability, namely the frequency domain, the entropy domain and the scale-invariant domain [5, 86]. In biological systems, sequential time-series have become outstanding data analysis in multifaceted context [87–90]. According to Bravi et al. [86] the time-series data can be described through five different domains of variability: (1) statistical (i.e., statistical properties of the distribution in a stochastic process); (2) geometric (i.e., properties of the dataset shaped in a certain space); (3) energetic (i.e., energy or power of the time-series); (4) informational (i.e., degree of irregularity/complexity inherent to the order of the elements in a time-series); invariant (i.e., fractality or unchanging attributes over time or space). In football, time-series data application has been widely applied [14, 36]. Low et al. [36] organised the non-linear methods into measures of the regularity (or predictability) and synchronisation. Geometrical centre and periodic phase oscillators has been considered to analyse players'synchronisation and modelling the coordination of a team [14]. However, remains unclear the application of time-series data from an integrated approach perspective [10, 47, 83]. Thus, it is paramount to determine mathematical models and methodological procedures for non-linear time-series data analysis, bearing in mind an integrative approach. Therefore, the following subsections elaborate on the different non-linear mathematical models possible to apply in football.

#### **3.1 Entropy**

Entropy is a non-linear and informational parameter applied to describe variability, regularity or predictability of the movement/performance uncover the inter-player's interactions [86, 89, 91]. That, is entropy parameters describes the degree of irregularity/complexity inherent to the order of the elements in a timeseries [86]. There are several types of entropy reported in the literature and applied in football research from the integrative perspective, amongst which Approximate Entropy (ApEn) [6, 11, 42], Sample Entropy (SampEn) [44, 92], Cross-sample Entropy (Cross-SampEn) [92] and Boltzmann-Gibbs-Shannon Entropy (ShannonEn) [43]. ApEn expresses the probability that the sequence configuration in a time-series data allows the prediction of the configuration from another sequence from a distance apart [89, 91]. ApEn was derivate from Kolmogorov-Sinai entropy and ranged amongst 0�2 where lower values correspond to more predictable and higher values stands more unpredicted patterns within time-series (0 ≤ ApEn ≤2) [89].

$$\text{ApEn}(m, r, N \ ) = \phi^m(r) - \phi^{m+1}(r) \tag{3}$$

where *m* is the window length distance amongst comparting time-series points, *r* is the similarity radius, *N* is the time-series length, and *ϕ* is the probability that points *m* distance within a tolerance level (*r*) [36, 90]. *C<sup>m</sup> <sup>i</sup>* ð Þ*r* measures how similar are the regularity of the data points in the window length (*m)* having regard to the following *<sup>ϕ</sup><sup>m</sup>*ð Þ*<sup>r</sup>* [6, 36]:

$$\phi^m(r) = \left(N - m + 1\right)^{-1} \sum\_{i=1}^{N-m+1} \ln C\_i^m(r) \tag{4}$$

From a practical point of view, the imputed ApEn values should be computed with 2 to vector length (*m*) and 0.2 \* standard deviation to the tolerance (*r*) [41, 45, 93]. ApEn reliability for short time series is low, providing relative consistency during changes in input parameters (*m*, *r* and *N*) [1, 89, 91]. For this reason, Richman and Moorman [89] developed SampEn where their logarithm is simpler with shorter time-series records than ApEn that is heavily dependent on the length record causing lacks of relative consistency. SampleEn logarithm has been was developed on the basis Grassberger et al. [94] reporting a larger window length and a greater relative consistency than ApEn [89–91]. Likewise, SampEn values close to zero indicates a regular or near-periodic time-series sequence, while the higher values reports a more unpredictable pattern (0 ≤ SampEn < ∞) [88, 91, 92]. SampEn measures the negative logarithmic probability that two similar sequences of m points extracted within the tolerance limits (*r*) for a window length (*m* + 1) [36, 92].

$$\text{SampEn}(m, r, N) = -\ln \frac{\phi^{m+1}(r)}{\phi^m(r)}\tag{5}$$

Where, *m*, *r*, *N*, and *ϕ* retain the meaning from ApEn equation, whereby and *m* + 1 windows are compared for eliminating the self-matching bias in the ApEn [90]. *ϕ<sup>m</sup>*þ**<sup>1</sup>** ð Þ*r* reports the total number of time-series sequences in a window length *<sup>m</sup>* + 1 as far the *<sup>ϕ</sup><sup>m</sup>*ð Þ*<sup>r</sup>* expresses the total template in a length *<sup>m</sup>* within the aforementioned tolerance level (*r*) [36]*.*

Multiscale entropy (MSE) as Cross-ApEn and Cross-SampEn was recently introduced from the primary entropy procedures (i.e. ApEn and Cross-ApEn) [90, 95]. Therefore, cross-entropy methods quantify the degree or complexity of coupling between two cross-sequences while the primary entropy techniques evaluated the asynchronism between two time series [87, 90]. Cross-SampEn remain a *Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

greater relative consistent than Cross-ApEn, being defined as long as one template finds a time-series sequence [89]. Mostly, Cross-SampEn has been recurrently used in football settings to measure players'synchrony [36, 92]. Cross-SampEn, the templates are chosen from the series *u* and compared with vectors from *v*, while the negative logarithm accounts the ratio of two average between outputs [36, 89]:

$$\text{Cross} - \text{SampleEn}(m, r, N) = \left\{-\ln \frac{\phi^{m+1}(r)}{\phi^m(r)}\right\} \tag{6}$$

Boltzmann-Gibbs-Shannon entropy was applied by Ric et al. [43] to measure temporal diversity and structural flexibility of the players. This entropy-based technique was originally applied by Balescu [96], reporting the configuration's probabilities as the large *N* in a relative frequency occurring in a stationary distributions described by Shannon [97]:

$$H(\mathbf{x}) = \sum\_{i=1}^{n} p\_i \log\_b p\_i \tag{7}$$

where *pi* = *ni*/*N*, where *ni* and *N* is the frequency and total number of the configuration, respectively [1, 43]. Predictable and unpredictable patterns were also reported as lower and higher entropy values, which is presented by absolute or normalised forma (0 ≤ *H x*ð Þ ≤ 1) [36].

MSE techniques was applied in football by Canton et al. [44] to identify how positioning the goals in diagonal configurations on the pitch modifies the external training load and the tactical behaviour in youth football environments (i.e., smallsided games). The authors applied a SampleEn algorithm to compute entropy values in different timescales, calculating the area under for complexity index as reported in multiple entropy analysis for time-series [90, 98]. MSE techniques reports the point-to-point fluctuations over a time-series range [44, 90, 98] as:

$$\mathbf{y}\_j^{\mathbf{r}} = \frac{\mathbf{1}}{\mathbf{1}} \sum\_{i=(j-1)\mathbf{r}+\mathbf{1}}^{j\mathbf{r}} \mathbf{x}\_i \tag{8}$$

Where, timescales is *τ*, *yi* is the data point in the constructed time-series and *xi* is a data point in the original time-series through a window length (*N)*. Complex index (*C*1) is calculated using the area under the constructed time-series and the original time scale curve [90, 98]:

$$\mathbf{C}\_{1} = \sum\_{i=1}^{N} \mathbf{E}n(i), \mathbf{1} \le \mathbf{y}\_{j} \le \frac{N}{\tau} \tag{9}$$

Where, En is the reported entropy parameter at the time scale *i* and *N* is the total number of time scale used for the *C*<sup>1</sup> calculation [90].

#### **3.2 Relative phase (Hilbert transform)**

Relative phase was extensity reported in football within an integrative framework [40, 41, 67]. Using a Hilbert transform ½ � *ϕ*ð Þ*t* , relative phase computes the difference between two signals reporting coupled oscillators and stable patterns of synchronisation, in-phase and anti-phase, when players moved in the same or opposite directions [36, 99]. Hilbert transform was originally introduce by Gabor

[100] with aim to measure the phase and amplitude for a signal. Palut and Zanone [99] configured a phase diagram plotting the imaginary part of the Hilbert transform:

$$\begin{aligned} \phi(t) &= \phi\_1(r) - \phi\_2(r) \\ \phi(t) &= \arctan \frac{H\_1 \mathfrak{s}\_2(t) - H\_2(t)\mathfrak{s}\_1(t)}{\mathfrak{s}\_1(t)\mathfrak{s}\_2(t) + H\_1(t)H\_2(t)} \end{aligned} \tag{10}$$

Where, *H*1*s*2ð Þ*t* and *H*2ð Þ*t s*1ð Þ*t* were the Hilbert transform from the two compared signals [36, 99]. In football, longitudinal and lateral directions within the pitch were reported using near-in-phase synchronisation of each players' dyads [40, 41, 67]. The percentage of time spent in each near-in-phase mode of coordination was computed to verify in-phase synchronisation (�30° ≤ *ϕ* ≤ 30°), anti-phase synchronisation (�180° ≤*ϕ* ≤ �150° or 150° ≤ *ϕ* ≤ 180°) or without pattern synchronisation (all other *ϕ* degree) [36, 99]. Oscillation in football environment has been a recurrent non-linear approaches to process *x*- and *y*-directions and positions covered by football players in centroid and effective playing space zone [38, 101]. Sampaio and Vitor [64] displayed the relative phase post-test value to measure movement patterns and inter-player coordination. This study reported a higher regularity of the patterns with the increasing of the expertise level. There is a gap in understanding how the physiological dataset can influence the intra- and inter-team variability that needs to be further studied [47]. It remains to be understood how this varies across the different levels of the expertise.

#### **3.3 Complex index**

Non-linear parameters are often transformed into reliable complexes indices to measure complexity in football settings [43, 69, 93]. Dynamic overlap is a complex index used to compare time-series against the average cosine auto-similarity of the overlap between configurations within time lags [102]. It is an informational nonlinear parameter that expresses how timescale statured in a dynamic behaviour using the exploratory breadth at different timescales [43, 69]:

$$ = (\mathbf{1} - q\_{\rm stat})t^a + q\_{\rm stat} \tag{11}$$

Where, *< q***d**ð Þ*t >* is the dynamic overload value, *t* is the time lag, *q***stat** is the horizontal asymptote and *α* expresses the gradient. Dynamic overlap tends to infinity wherefore predictable and unpredictable reflects zero an one values, respectively (0 ≤ *q***d**ð Þ*t* ≤ 1) [36, 43, 69]. Additionally, trapping strength is the overall behavioural flexibility performed at lower and highest values of the time scale [43].

Another complex index reported in the literature is the stretch index, which can be defined as distances amongst players and the geometrical centre of the team [45, 85]. That is, this complex index measures the spatial expansion or contraction [103] as:

$$s\_{\rm ind} = \frac{\sum\_{i=1}^{N} w\_i d\_i}{\sum\_{i=1}^{N} w\_i} \tag{12}$$

Here *di* is the distance between player *i* and their geometrical team centroid [103]. Stretch index can be expressed in meters, and is also computed by CV and entropy parameters [93, 104]. Otherwise, spatial exploration index (SEI) is obtained with the width and length displacements, computing the distance from each data point in the time-series according to geometrical centre [1, 2].

*Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

#### **3.4 Correlation index**

Windowed and cross correlation were also applied to assess collective behaviour through positional data in football training and match environments [36, 65, 92]. Cross correlation function is well-supported in the human movement research, wherein the overlapping time windows that enclosed the time-series sequence under analysis [105, 106]. Cross-Correlation function multiplied the point-to-point amongst two time-series data series, reporting the sum of the products and the respective relationship quantification [105]:

$$r\_{\mathbf{x}\mathbf{y}} = \sum\_{i=0}^{N-1} \mathbf{x}\_i \mathbf{y}\_i \tag{13}$$

Where *rxy* is the correlation across the window length of each analysed timeseries (*N*); *xiyi* are the data point of the calculated data series. Boker et al. [106] described cross-correlation with the pairwise dataset at two different time-series signals in accordance with:

$$r\_{\mathbf{xyr}} = \frac{\mathbf{1}}{N - \pi} \sum\_{i=1}^{N-\pi} \frac{(\boldsymbol{\kappa}\_i - \overline{\boldsymbol{\kappa}})(\boldsymbol{y}\_{i+\tau} - \overline{\boldsymbol{\chi}})}{\sigma(\boldsymbol{\kappa})\sigma(\boldsymbol{y})} \tag{14}$$

Where *τ* is the observations across cross-correlation (*r*) amongst time-series data point (*xiyi* Þ; *x* and *y* are the means, *σ*ð Þ *x σ y* � � are the standard deviations in the studied window length (*N*) reporting positives and negatives correlation (�1 ≤ *r* ≤ 1) [105–107]. Pearson coefficient (*r*) was expressed as [105]:

$$r\_{xy} = \frac{N(\sum \mathbf{x}y) - (\sum \mathbf{x})(\sum y)}{\sqrt{[n\sum \mathbf{x}^2 - (\sum \mathbf{x}^2)][n\sum y^2 - (\sum y^2)]}} \tag{15}$$

where, *rxy* maintains the meaning to the previous equation. Person-moment correlation was used to estimate in-phase synchronisation (i.e., *r* values close to �1), anti-phase synchronisation (i.e., *r* values close to 1) and without pattern (i.e., *r* = 0) [36, 92]. It remains to be seen whether matrix correlation may be applied to correlate players synchronisation with physiological insight, since it was only applied to assess dynamic collective behaviours in isolation individual physiological demands [65, 92, 107]. Complementarily, a study associated the cross-correlation with the an vector coding technique to analyse coordination patterns between teams during offensive sequences that ended in shots on goal or defensive tackles [108]. The authors based their non-linear analysis the relative motion plot proposed by Sparrow et al. [109]:

$$\theta(i) = \arctan \left| \frac{\theta\_2(i+1) - \theta\_2(i)}{\theta\_1(i+1) - \theta\_1(i)} \right|, i = 1, 2, \dots, n - 1 \tag{16}$$

where *i* is the time-series data point in a right horizontal and *n* is the total frame for each timescale [108]. As well, the *θ*ð Þ*i* is the coupling angles between in the second, third and fourth adjusted quadrant (i.e., *π* � *θi*, *π* þ *θi*, 2*π* � *θi*) [36, 109]. Near-in-phase synchronisation were computed using this non-linear technique, expressing the in-phase (22.5° ≤*θ<sup>i</sup>* < 67.5° or 202.5° ≤*θ<sup>i</sup>* < 247.5°) and anti-phase (22.5° ≤*θ<sup>i</sup>* < 67.5° or 202.5° ≤*θ<sup>i</sup>* < 247.5°). Even more, attacking and defensive team phases are also reported by Moura et al. [108]. By distinguishing the phases of the play, this non-linear method could allow to understand how physiological

demands can affect the intra- and inter-synchronisation in offensive and defensive phases.

#### **3.5 Fractality**

Fractal dimension is an invariant non-linear parameter characterised by the unchanged proprieties of the system over time and/or space [86]. Multifractal time series expresses different local scaling exponents for a time-series dataset scaling different exponents at different times [86, 110]. Few studies applied fractal dimension to predict stability and predictability of the football players along specific training tasks [110, 111] and competitive matches [110–112]. Fractal calculus (FC) was reported using Shanon and Grünwald-Letnikov approaches [111, 112]. Grünwald–Letnikov fractional differential consideres the matrix containing the multi-player positions [111, 113]:

$$\mathbf{X}\_{\delta}[t] = \begin{bmatrix} \mathbf{x}\_{1}[t] \\ \vdots \\ \mathbf{x}\_{N\_{\delta}[t]} \end{bmatrix}, \left(\mathbf{x}\_{i}[t] \in \mathbf{R}^{2}\right) \tag{17}$$

where *N<sup>δ</sup>* expresses the number of players in the team *N<sup>δ</sup>* across a time-series analysis. The matrix *Xδ*½ �*t* is the planar positioning matric of player (*i*) in a target team ð Þ*δ* at the a concrete time (*t*) [111, 113]. Shannon information was expressed by [112]:

$$I[P(\mathbf{x})] = -\log P(\mathbf{x})\tag{18}$$

whereby*I Px* ½ � ð Þ is the function between the cases *<sup>D</sup>*�<sup>1</sup> *IPx* ½ �¼ ð Þ *P x*ð Þ½ � <sup>1</sup> � log *P x*ð Þ and *<sup>D</sup>*<sup>1</sup> *IPx* ½ �¼ ð Þ *P x*ð Þ <sup>1</sup> � <sup>1</sup> *P x*ð Þ h i . *<sup>I</sup>* and *<sup>D</sup>* are the integral and descriptive operations [112].

A study applied multifractal dimension in football movement behaviour using Hausdorff dimension (*D*) [110, 112, 114]:

$$D(E) = \lim\_{\varepsilon \to 0} \sup \frac{\log N\_{\mathcal{E}}(\varepsilon)}{-\log \varepsilon} \tag{19}$$

where *N*<sup>E</sup> is the minimal diameter at the most *ε* needed to cover [114]. Fractional dynamics may tracking football players trajectory, which can be useful to increase the autonomy of tracking systems [113]. Additionally, fractal and multifractal analysis can be used to analyse the regularity and synchronisation of the team, as well the players' movement dynamics [110, 111, 113]. However, the use of fractal variables within an integrative approach remains little explored whereas it is important understanding the links between collective behaviour with the fractional properties of movement and its physical and physiological repercussions [47, 110]. Fractal proprieties may also increase the autonomy of the tracking systems to collect information in tracking systems such as making decisions based on it [111].

#### **3.6 Clustering methods**

Clustering methods have become popular in data mining in several research areas, including sports sciences [70, 92, 115]. Rokach and Maimon [115] was described the clustering methods in different typologies as hierarchical, partitioning, density-based, model-based, grid-based, and soft-computing methods. *Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

Duarte et al. [92] pioneered applied a clustering method to measure overall and player team collective synchronisation in football. This method is derived from Hibert transform to calculate individual phase time-series and subsequently the cluster phase of these time-series by the natural exponential function [36, 92]. Originally, cluster phase analysis was proposed by Frank and Richardson [116] using Kuramoto's parameters for group synchronisation [117]. This clustering method calculates the mean and continuous group synchronisation

*ρ***group or** *ρ***group**,*<sup>i</sup>* � � whence individual's relative phase *<sup>ϕ</sup><sup>k</sup>* ð Þ was measured in relation to the group measure [116]. After the Hilbert transform calculation, the continuous degree of overall team synchronisation was clustered [92]:

$$\rho\_{\text{group}}(t\_i) = \left| \frac{1}{n} \sum\_{k=1}^{n} \exp\left\{i \left(\phi\_k(t\_i) - \overline{\phi\_k}\right)\right\} \right| \tag{20}$$

where overall team synchronisation *ρ*groupð Þ*ti* ∈½ � 0, 1 and mean degree to group synchronisation at every point-to-point of the time-series data ð Þ*ti* [36, 92]. *ρ*group,*<sup>i</sup>* was computed by the inverse of the circular variance of relative phase of the cluster amplitude, *<sup>ϕ</sup>k*ð Þ*ti* , while *<sup>i</sup>* <sup>¼</sup> ffiffiffiffiffiffi �<sup>1</sup> <sup>p</sup> as [92]:

$$\rho\_{\text{group}}(t\_i) = \frac{1}{N} \sum\_{i=1}^{n} \rho\_{\text{group},i} \tag{21}$$

Synchronisation cut-off values is zero to one representing synchronisation and unsynchronisation (**0 ≤** *ρ***group or** *ρ***group**,*<sup>i</sup>* **≤ 1**), respectively [36, 92]. Cluster phase analysis has not yet been applied to integrate physiological and behavioural data in football [38, 101].

Furthermore, average mutual information (AMI) was also applied to measure complexity of the football patterns and expresses the amount of information one random variable contains another [118]. AMI is calculated by relative entropy between probabilities distribution and the product midst two selected variables [36]. The mathematical equation described by Cover and Thomas [118] for the calculation of mutual information is:

$$I(\mathbf{x}, \mathbf{y}) = \sum\_{X, Y \in A} P(\mathbf{x}, \mathbf{y}) \log \left( \frac{P(\mathbf{x}, \mathbf{y})}{P(\mathbf{x}) P(\mathbf{y})} \right) \tag{22}$$

Whereð Þ *x* and *y* � � is the team's centroid movement coordinates and *A* is the space discretisation [119]. *P x*ð Þ and *P y*� � are the marginality of the probabilities distributions [36]. The AMI identify the relationships between time-series points that are not detected by linear correlation [118, 119]. Similar correlation cut-offs values were applied to predict uncertainty in less values and independence in higher values (0 ≤ AMI <1) [119].

#### **3.7 Frequency domain**

Non-linear techniques as entropy measures can also be expressed in the frequency domain [86]. Several studies have evaluated the variability of movement comparing informational and frequency domains [10, 41, 42]. CV expresses the magnitude of the variability in the distance amongst players', expressed as percentage (%) [10, 93]:

$$\text{CV}(\text{@}) = \frac{\sigma}{\overline{\mathfrak{x}}} \text{ 100} \tag{23}$$

Speed synchronisation has also applied into a integrative approaches in some studies [10, 41]. The near-in-phase synchronisation to players' displacements is expressed in time spent (%) of time according to speed intensity zones: 0.0– 3.5 km h�<sup>1</sup> (low intensity); 3.6–14.3 km h�<sup>1</sup> (moderate intensity); 14.4–19.7 km h�<sup>1</sup> (high intensity); and >19.8 km h�<sup>1</sup> (very high intensity) [41]. Summary diagram for mathematical models and methodological procedures of the non-linear approaches are presented in **Figure 2**.

**Table 1** displayed the corresponding equation, thresholds, advantages, disadvantages and practical application for each nonlinear variable.

#### **4. Practical considerations, criticisms and future perspectives**

Matlab® routines (Math-Works, Inc., Massachusetts, USA) were the most selected procedure to analyse positional dataset in football. Universal Transverse Mercator (UTM) coordinate system were used to transform latitude and longitude data points [40, 66]. Methodological procedures differ on the correction guidelines to be used and reduce tracking signal noise, advising the use of 3 Hz Butterworth low pass filter [64, 92]. Several authors ran non-linear logarithms using 20 windows of 3000 points per data collect (i.e. ranged 5�25 Hz) [40, 67]. Integrating notational analysis and video-based tracking systems has been a worthwhile strategy to provide contextual significance to positional data [67, 68, 93]. Applying new analysis techniques based on big data still lack an integrative approach, and it will be interesting to understand how future studies can do so with techniques such as machine learning, deep learning or network analysis [13, 14]. These techniques have been extensively used to analyse positional and physiological variables, but there are still few studies under an integrative perspective [13, 36, 46]. There is a


#### *Methodological Procedures for Non-Linear Analyses of Physiological and Behavioural Data… DOI: http://dx.doi.org/10.5772/intechopen.102577*

