Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Identification and Monitoring of Parkinson’s Disease Dysgraphia Based on Fractional-Order Derivatives of Online Handwriting

Identification and Monitoring of Parkinson’s Disease Dysgraphia Based on Fractional-Order... applied sciences Article Identification and Monitoring of Parkinson’s Disease Dysgraphia Based on Fractional-Order Derivatives of Online Handwriting 1 1 1,2 3 Jan Mucha , Jiri Mekyska , Zoltan Galaz , Marcos Faundez-Zanuy , 4 1 1 1 Karmele Lopez-de-Ipina , Vojtech Zvoncak , Tomas Kiska , Zdenek Smekal , 2 2,5, Lubos Brabenec and Irena Rektorova * Department of Telecommunications and SIX Research Centre, Brno University of Technology, Technicka 10, 61600 Brno, Czech Republic; muchajano@phd.feec.vutbr.cz (J.M.); mekyska@feec.vutbr.cz (J.M.) z.galaz@phd.feec.vutbr.cz (Z.G.); vojtech.zvoncak@phd.feec.vutbr.cz (V.Z.); kiskatomas@phd.feec.vutbr.cz (T.K.); smekal@feec.vutbr.cz (Z.S.) Applied Neuroscience Research Group, Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic; lubos.brabenec@ceitec.muni.cz Escola Superior Politecnica, Tecnocampus Avda. Ernest Lluch 32, 08302 Mataro, Barcelona, Spain; faundez@tecnocampus.cat Department of Systems Engineering and Automation, University of the Basque Country UPV/EHU, Av de Tolosa 54, 20018 Donostia, Spain; karmele.ipina@ehu.eus First Department of Neurology, Masaryk University and St. Anne’s University Hospital, Pekarska 53, 65691 Brno, Czech Republic * Correspondence: irena.rektorova@fnusa.cz; Tel.: +420-543-182-639 † This paper is an extended version of our paper published in Mucha, J.; Zvoncak, V.; Galaz, Z.; Faundez-Zanuy, M.; Mekyska, J.; Kiska, T.; Smekal, Z.; Brabenec, L.; Rektorova, I.; Ipina, K.L. Fractional Derivatives of Online Handwriting: a New Approach of Parkinsonic Dysgraphia Analysis. In Proceedings of the 2018 41st International Conference on Telecommunications and Signal Processing (TSP), Athens, Greece, 4–6 July 2018; pp. 1–4. Received: 30 October 2018; Accepted: 6 December 2018; Published: 11 December 2018 Abstract: Parkinson’s disease dysgraphia affects the majority of Parkinson’s disease (PD) patients and is the result of handwriting abnormalities mainly caused by motor dysfunctions. Several effective approaches to quantitative PD dysgraphia analysis, such as online handwriting processing, have been utilized. In this study, we aim to deeply explore the impact of advanced online handwriting parameterization based on fractional-order derivatives (FD) on the PD dysgraphia diagnosis and its monitoring. For this purpose, we used 33 PD patients and 36 healthy controls from the PaHaW (PD handwriting database). Partial correlation analysis (Spearman’s and Pearson’s) was performed to investigate the relationship between the newly designed features and patients’ clinical data. Next, the discrimination power of the FD features was evaluated by a binary classification analysis. Finally, regression models were trained to explore the new features’ ability to assess the progress and severity of PD. These results were compared to a baseline, which is based on conventional online handwriting features. In comparison with the conventional parameters, the FD handwriting features correlated more significantly with the patients’ clinical characteristics and provided a more accurate assessment of PD severity (error around 12%). On the other hand, the highest classification accuracy (ACC = 97.14%) was obtained by the conventional parameters. The results of this study suggest that utilization of FD in combination with properly selected tasks (continuous and/or repetitive, such as the Archimedean spiral) could improve computerized PD severity assessment. Keywords: Parkinson’s disease dysgraphia; micrographia; online handwriting; kinematic analysis; fractional-order derivative; fractional calculus Appl. Sci. 2018, 8, 2566; doi:10.3390/app8122566 www.mdpi.com/journal/applsci Appl. Sci. 2018, 8, 2566 2 of 18 1. Introduction As a second most common neurodegenerative disorder, Parkinson’s disease (PD) is expected to impose an increasing social and economic burden on societies as populations age [1]. Its prevalence rate is estimated to approximately 1.5% for people aged over 65 years [2]. The risk of being affected by PD strongly increases with age, and, in the next 15 years, the incidence of PD is expected to be doubled [3,4]. The rapid degeneration of dopaminergic cells in the substantia nigra pars compacta [5] arose as the most significant biological finding associated with the disease, but the exact pathophysiological cause of PD has not yet been discovered. PD cardinal motor symptoms involve bradykinesia (slowness of movement), tremor at rest, rigidity, gait impairment, and postural instability [6–8]. A variety of non-motor symptoms may emerge as well—for instance, cognitive impairment, dementia, depression, sleep disorders, or anxiety [6,9,10]. Handwriting requires cognitive, perceptual, and fine motor abilities. In conjunction with motor dysfunctions in people suffering from PD, it has been proven that disrupted handwriting may be used as a significant biomarker for PD diagnosis [11,12]. Micrographia, which is associated with the progressive decrease in letters’ amplitude, is the most commonly observed handwriting abnormality in patients with PD [13,14]. Moreover, according to McLennan et al. [14], in approximately 5% of PD patients, micrographia may be observed even before the onset of the cardinal motor symptoms. The recent advantage of new technologies coming hand-in-hand with Health 4.0 systems enables the acquisition of online handwriting signals, where temporal information is added to the x and y position. Therefore, by using a digitizing tablet, the analysis is not limited to spatial features which mainly quantify PD micrographia. In addition, we are able to quantify temporal, kinematic, and dynamic manifestations of PD dysgraphia, such as hesitations, pauses, and slow movement [7], which cannot be studied objectively using a classical paper-and-pen method. Due to this complexity, Letanneux et al. [15] started to refer to these manifestations using the generalized term PD dysgraphia. Several research teams have explored the impact of quantitative PD dysgraphia analysis utilizing simple handwriting/drawing tasks (e.g., separate characters, a combination of two or three characters, repetitive loops, circles), as well as more complex ones (e.g., words, sentences, figures, 3D objects, and the Archimedean spiral) [8,16–20]. An overview of recent related works (2015–present) can be seen in Table 1. Most of them confirm the irreplaceability of kinematic features in PD dysgraphia analysis. Additionally, the researchers usually employ temporal, spatial, and dynamic features. Some more advanced parameters are reported too. For instance, Drotar et al. [8,16,17] demonstrated a combination of kinematic, pressure, energy, or empirical mode decomposition (EMD)-based features that resulted in a classification accuracy of up to 89% using several handwriting tasks. Kotsavasilogloua et al. [21] achieved an average prediction accuracy of 91% using simple horizontal lines and features describing the variability in the pen tip’s velocity, a deviation from the horizontal plane, and the trajectory’s entropy. Other works report even higher classification accuracies (approximately 97%), e.g., Loconsole et al. [18], who used computer vision and electromyography signal processing techniques, or Taleb et al. [22], who used a combination of features related to the correlation between kinematic and pressure characteristics (but, in this case, applied to a very small dataset). Another promising approach was published by Moetesum et al. [23], who reached an 83% classification accuracy by employing convolutional neural networks (CNN) that were used to extract discriminating visual features from handwriting data transformed into the offline mode. In 2018, Impedovo et al. reported the results of a study focused only on the early stages of PD; the best accuracy was 74.76% for a combination of three handwriting tasks. Finally, in our previous work [20], we proposed a new approach of advanced kinematic feature extraction that utilizes fractional-order derivatives (FD). This approach increased the classification accuracy by 10% (72.39%) for Archimedean spiral tasks in comparison with the baseline [20]. Appl. Sci. 2018, 8, 2566 3 of 18 Table 1. Overview of related works focused on computerized analysis of Parkinson’s disease (PD) dysgraphia. First Author Year PD/HC Handwriting Task Analysis Features Conclusions letters, words, differential kinematic, temporal, spatial, The highest classification accuracy after Drotar * [17] 2015 37/38 sentences analysis (SVM) entropy, EMD, signal energy feature selection approach was 88.13%. letters, words, differential kinematic, temporal, spatial, Classification performance was at its peak Drotar * [16] 2015 37/38 sentences analysis (SVM) entropy, EMD, pressure with on-surface features equal to AUC = 89.09%. up/down strokes at Significant difference between groups was in Heremans [24] 2015 34/10 ANOVA spatial and kinematic varying amplitudes spatial (F(2.41) = 3.97; p = 0.03). differential an. mean relative tremor The best results were obtained by NB classifier Pereira [25] 2015 37/18 Archimedean spiral (SVM, NB, OPF) and spatial parameters that provided around 79% classification accuracy. letters, words, Archimedean differential an. kinematic, temporal, spatial, Combining all exercises, SVM proved to be Drotar * [8] 2016 37/38 spiral, sentences (SVM, K-NN, ADA) entropy, EMD, pressure the best classifier with 82.5% accuracy. ANOVA, PD dysgraphia is more severe in patients with Heremans [26] 2016 30/15 repetitive cursive loops writing amplitude and velocity correlation an. freezing of gait. Archimedean spiral, differential an. The best result was obtained by CNN with Pereira [27] 2016 14/21 pen-based features meander (CNN, OPF) 87.14% classification accuracy using meander task. differential Kotsavasil [21] 2017 24/20 horizontal lines kinematic Average classification accuracy was 91%. analysis (NB) differential Highest classification accuracy (96.81%) was Loconsole [18] 2017 4/7 sentence, repetitive loops temporal, kinematic, spatial analysis (ANN) achieved using all the extracted features. differential kinematic, stroke, pressure, The highest classification accuracy was 96.88% Taleb [22] 2017 16/16 letters, waves, words analysis (SVM) entropy, energy, EMD for 12 kinematic and pressure features. Archimedean spiral, Extraction of features using CNN applied on raw differential Moetesum * [23] 2018 37/38 letters, words, CNN-based features handwriting data resulted in 83% classification analysis (SVM) sentence, loops accuracy. differential fractional derivatives based Improvement of classification accuracy Mucha * [20] 2018 30/36 Archimedean spiral analysis (RF, SVM) kinematic features by 10% (72.38%) in comparison to the baseline. differential an. Archimedean spiral kinematic, temporal, spatial, Analysis focused on PD diagnosis at earlier stages Impedovo * [28] 2018 37/38 (RF, SVM, K-NN, letters, words, sentence entropy, EMD, pressure resulted in 74.76% classification accuracy. NB, LDA, ADA) SVM—support vector machine; EMD—empirical mode decomposition; K-NN—K-nearest neighbors; ANOVA—analysis of variance; NB—naïve Bayes classifier; OPF—optimum path forest; ANN—artificial neural network; CNN—convolutional neural network; RF—random forests; LDA—Linear Discriminant Analysis; ADA—AdaBoost; AUC—area under the receiver operating characteristics (ROC) curve; articles are sorted by the year of release and then alphabetically; * analyzes performed on the same database (Parkinson’s disease handwriting database (PaHaW) [8]). Appl. Sci. 2018, 8, 2566 4 of 18 Although the authors of the previously mentioned studies reported high classification accuracies, further signal processing and machine learning pipeline improvements are expected to make the differential analysis even more accurate. One possible approach could involve an advanced feature extraction methodology based on fractional calculus (FC) [29,30], which enables the use of an arbitrary order of derivatives and/or integrals. Generally, FC has many applications in different fields of science [31–33]. For instance, it has been advantageously used during the modeling of different diseases, such as human immunodeficiency virus (HIV) [34] and malaria [35]. In addition, FC-based analytical tools have outperformed classical techniques in geology [36,37], economics and finance [38,39], etc. Moreover, in our recent paper [20], we identified a high potential for the use of FC in the kinematic analysis of PD drawings. Based on these preliminary results, we assume that FD-based handwriting features may bring improvements to PD diagnosis and assessment. In the frame of this article, we would like to go further and deeply explore the impact of FD on the PD dysgraphia diagnosis and its monitoring. More specifically, we aim to: investigate the relationship between newly designed FD handwriting features and a patient’s clinical data and compare these results with a baseline (i.e., results based on conventional parameters), evaluate the discrimination power of the FD features in terms of binary classification accuracy and compare the results to the baseline, use the newly designed features to establish regression models that will estimate the severity of PD and compare its performance to that of a baseline. The rest of this paper is organized as follows: Section 2 describes the cohort of patients and the methodology, and Section 3 includes the results. A discussion is presented in Section 4, and, finally, conclusions are drawn in Section 5. 2. Materials and Methods 2.1. Dataset For the purpose of this work, the Parkinson’s disease handwriting database (PaHaW) [8], which consists of multiple handwriting/drawing samples from 37 PD patients and 38 age- and gender-matched healthy controls (HC), was used. Since the Archimedean spiral drawing task is missing for some participants, we reduced the analyzed cohort to 33 PD patients and 36 HC. Demographic and clinical data of the participants can be found in Table 2. The participants were enrolled at the First Department of Neurology, St. Anne’s University Hospital in Brno, Czech Republic. All participants reported the Czech language as their native language and were right-handed. The patients completed their tasks approximately 1 h after their regular dopaminergic medication (L-dopa). All participants signed an informed consent form approved by the local ethics committee. Unified Parkinson’s disease rating scale, part V (UPDRS V): Modified Hoehn and Yahr staging score [40], was used to assess clinical symptoms of PD. In the frame of this work, the duration of the disease was considered as well. Descriptive visualization (histograms, regression, and residual plots) of the clinical data for the subjects participating in this study can be seen in Figure 1. Appl. Sci. 2018, 8, 2566 5 of 18 Table 2. Demographic and clinical data of the enrolled participants. Gender N Age [years] PD dur [years] UPDRS V LED [mg/day] Parkinson’s disease patients Females 17 71.76  10.93 9.88  5.27 2.18  0.86 1146.03  543.89 Males 16 66.50  13.44 7.44  4.04 2.31  0.75 1673.38  616.66 All 33 69.21  11.10 8.70  4.82 2.24  0.80 1401.72  630.71 Healthy controls Females 17 61.59  10.17 - - - Males 19 63.32  13.14 - - - All 36 62.50  11.70 - - - PD—Parkinson’s disease; N—number of subjects; PD dur—PD duration; UPDRS V—Unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score [40]; LED—L-dopa equivalent daily dose [41]. 17.5 15.0 12.5 10.0 7.5 5.0 2.5 Length of PD UPDRS V 2.0 1.5 1.0 0.5 0.0 0.5 1.0 2.5 5.0 7.5 10.0 12.5 15.0 17.5 1 2 3 4 5 Length of PD UPDRS V Figure 1. Descriptive graphs of patients’ clinical characteristics: Unified Parkinson’s disease rating scale (UPDRS V) and Parkinson’s disease (PD) duration (in years). Histograms are visualized on the diagonal. A scatterplot with a line fitted using linear regression is visualized in the top-right corner. Residuals of the trained linear model are visualized in the bottom-left corner. 2.2. Data Acquisition The PaHaW database [8] includes nine different handwriting tasks written in the Czech language. Their description and translation to English can be found in Table 3. During all handwriting tasks, the participants were rested and seated in a comfortable position with the possibility to look at the prefilled template (see Figure 2). A digitizing tablet (Wacom Intuos 4M, Wacom, Kazo, Saitama, Japan) UPDRS V Length of PD UPDRS V Length of PD Appl. Sci. 2018, 8, 2566 6 of 18 was overlaid with an empty paper template and participants were asked to perform all tasks using a special Wacom inking pen that gave the patients immediate visual feedback. Online handwriting signals were recorded with a sampling frequency of f = 150 Hz. The following time sequences were acquired: x and y coordinates (x[t], y[t]); time-stamp (t); in-air/on-surface (on-surface movement is a movement of a pen when its tip is touching the surface, e.g., paper (i.e., it provides the information about the pen writing/drawing on the paper); vice versa, in-air movement is a movement of a pen when its tip is up to 1.5 cm above the surface [42,43]) status (b[t]); pressure (p[t]); azimuth (az[t]); and altitude (al[t]). Figure 2. Filled template of the PaHaW database. Table 3. Description of the PaHaW handwriting tasks. N Task Czech (Original) English (Translation) 1 Archimedean spiral - - 2 repetitive loops - - 3 letter l l 4 syllable le le 5 word les forest 6 word lektorka lecturer 7 word porovnat compare 8 word nepopadnout not grasped 9 sentence Tramvaj dnes už nepojede. The tram will no longer go today. 2.3. Feature Extraction The main goal of this work is to compare a set of commonly used kinematic features with newly proposed FD-based features in terms of quantitative PD dysgraphia analysis. All of the handwriting features were computed using both on-surface as well as in-air movements. The two movements were quantified separately using velocity (rate at which the position of the pen changes with time [mm/s]), acceleration (rate at which the velocity of the pen changes with time [mm/s ]), jerk (rate at which the acceleration of the pen changes with time [mm/s ]), and their horizontal and vertical variants [8,44,45]. FD-based features were extracted for different values of a. In the frame of this work, a ranging from 0.1 Appl. Sci. 2018, 8, 2566 7 of 18 to 1.0 with a step of 0.1 was used. Subsequently, the statistical properties of the computed handwriting features were described using the mean, median, standard deviation (std), and maximum (max). Finally, all of the extracted features were divided into nine different feature sets according to the type of the movement (on-surface, in-air, and combined) and the calculation approach, i.e., the type of feature (FD-based, conventional, and combined). For more information, see Table 4. Table 4. Feature sets matrix. Movement FD (Count) Conventional (Count) Together (Count) on-surface 4536 618 5154 in-air 2916 404 3320 together 7452 1022 8474 Fractional-Order Derivatives Utilization of the FD as a substitution for the conventional differential derivative during calculation of the basic kinematic features provides a new advanced approach. The advantage of FDs is in their wide range of settings and many different approaches to approximation, e.g., Riemann–Liouville, Caputo, or Grünwald–Letnikov formulations [31,46,47]. For the purpose of this work, Jonathan Hadida’s FD Matlab implementation was used following the Grünwald–Letnikov approximation [31,48]. A direct definition of the FD D y(t) is based on the finite differences of an equidistant grid in [0, t], assuming that the function y(t) satisfies certain smoothness conditions in every finite interval (0, t), t  T. Choosing the grid [31], 0 = t < t < ... < t = t = (n + 1)h (1) 0 1 n+1 with t t = h (2) k+1 k and using the notation of finite differences n+1 1 1 a a D y(t) = y(t ) c y(t ) , (3) n+1 n+1v å v a h a h h v=1 where a v1 a c = (1) ( ). (4) v v The Grünwald–Letnikov implementation is defined as a a D y(t) = lim D y(t), (5) h!0 h where D y(t) denotes a derivative with order a of function y(t), and h represents a sampling lattice. 2.4. Statistical Analysis Prior to providing a description of the analytical setup, it is important to note that the effect of well-known confounding factors, also known as covariates, was controlled for in all of the analytical steps described below. In the frame of this work, we controlled for the effect of participants’ age, gender, and L-dopa [41] (dopaminergic medication). To assess the strength of the relationship between the computed handwriting features and patient’s clinical data (UPDRS V and PD duration), we computed the partial Pearson’s correlation coefficient (assessment of a linear relationship), as well as the partial Spearman’s correlation coefficient (assessment of a monotonic relationship). With this approach, we aimed to identify the handwriting features that are significantly correlated with the clinical measures under focus and also to compare Appl. Sci. 2018, 8, 2566 8 of 18 the FD features with conventional ones. A significance level of correlation (p) of 0.05 was selected for both of the correlation types. Only the results with a p-value below the significance level in both correlation coefficients were considered statistically significant. Next, to evaluate and compare the power of the handwriting features to discriminate PD patients and HC, multivariate binary classification analysis was performed. For this purpose, state-of-the-art gradient boosted trees were employed. Specifically, we used the famous XGBoost algorithm [49]. The XGBoost algorithm was chosen for its ability to achieve a good performance, even for small datasets; its inherent robustness to outliers; its ability to model complex interdependencies in the data; and also its recent successes in the field of machine learning (e.g., the winning algorithm in many www.kaggle.com competitions). To train and evaluate the models, we used the following supervised learning setup: stratified 10-fold cross-validation with 20 repetitions. The performance of the trained classification models was evaluated by Matthew’s correlation coefficient (MCC) [50], classification accuracy (ACC), sensitivity (SEN), and specificity (SPE), which are defined as follows: TP TN FP FN MCC = p , (6) (TP + FP)(TP + FN)(TN + FP)(TN + FN) TP + TN ACC =  100 [%], (7) TP + TN + FP + FN TP SEN =  100 [%], (8) TP + FN TN SPE =  100 [%], (9) TN + FP where TP is the number of true positives, TN is the number of true negatives, FP is the number of false positives, and FN is the number false negatives. Finally, to evaluate and compare the power of the handwriting features’ ability to predict the values of the selected clinical characteristics (UPDRS V and PD duration), multivariate regression analysis was performed. For this purpose, the same boosting tree algorithm (XGBoost) and the supervised learning setup were used. The performance of the trained regression models was evaluated by the mean absolute error (MAE), root mean square error (RMSE), and estimated error rate (EER), which are defined as follows: MAE = jy y ˆj, (10) å i i i=1 RMSE = (y y ˆ ) , (11) å i i i=1 EER = jy y ˆj 100 [%], (12) å i i n r i=1 where y represents the true label of the ith observation, y denotes the predicted label of the ith i i observation, n is the number of observations, and r is the range of the values of the predicted clinical characteristic (not the range that can be theoretically reached, but the actual range of the values in the dataset). Therefore, the EER describes a percentage of error predictions in regard to the statistical properties of the data. 3. Results In Table 5, the results of partial correlation analysis between the handwriting features (FD-based features, conventional features) and patients’ clinical characteristics (UPDRS V, PD duration) are summarized. The table shows the five best features according to Spearman’s correlation coefficient for each movement (on-surface, in-air). Appl. Sci. 2018, 8, 2566 9 of 18 In the case of UPDRS V (on-surface movement), the following FD-based features achieved a statistical significance of correlation: the median of jerk (a = 0.3, a = 0.4) and horizontal velocity (a = 0.1) for the repetitive letter l, the mean of vertical acceleration (a = 0.7) for repetitive loops, and the standard deviation of the vertical velocity (a = 0.3) for the sentence. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the maximum of horizontal jerk and velocity for the repetitive letters le, the maximum of horizontal jerk and horizontal velocity for the repetitive letter l, and the maximum of horizontal velocity for the letter l. Regarding UPDRS V (in-air movement), the following FD-based features achieved a statistical significance of correlation: the median of vertical velocity (a = 0.9, a = 0.8, a = 0.7) for the sentence and the median of horizontal velocity (a = 0.5) and vertical jerk (a = 0.3) for the repetitive letters le. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the mean of acceleration for the repetitive word lektorka, the maximum of horizontal jerk for the word porovnat, the median of the vertical velocity for the repetitive letter l, and the median of the horizontal velocity of the repetitive letters le. Table 5. Results of partial correlation analysis between handwriting features and clinical data. UPDRS V FD on-suface Conventional on-surface feature name a task r r r r task feature name p s s p jerk (median) 0.3 r. letters l 0.37 * 0.48 ** 0.45 * 0.24 r. letters le h. jerk (max) jerk (median) 0.4 r. letters l 0.43 * 0.46 * 0.43 * 0.2 r. letters le velocity (max) h. velocity (std) 0.1 r. letters l 0.42 * 0.41 * 0.42 * 0.25 r. letters l h. jerk (max) v. acceleration (mean) 0.7 r. loops 0.48 ** 0.40 * 0.42 * 0.16 r. letters l h. velocity (max) v. velocity (std) 0.3 sentence 0.40 * 0.40 * 0.41 * 0.15 letter l h. velocity (max) FD in-air Conventional in-air feature name a task r r r r task feature name p s s p v. velocity (median) 0.9 sentence 0.44 * 0.53 ** 0.43 * 0.28 r. word lektorka acceleration (mean) v. velocity (median) 0.8 sentence 0.40 * 0.52 ** 0.37 * 0.31 word porovnat h. jerk (max) h. velocity (median) 0.5 r. letters le 0.38 * 0.49 ** 0.36 * 0.25 r. letters l v. velocity (median) v. jerk (median) 0.3 r. letters le 0.43 * 0.49 ** 0.35 0.41 * r. letters le h. velocity (median) v. velocity (median) 0.7 sentence 0.37 * 0.48 ** 0.35 0.19 r. word lektorka acceleration (median) PD Duration FD on-surface Conventional on-surface feature name a task r r r r task feature name p s s p velocity (max) 0.1 spiral 0.54 ** 0.55 ** 0.46 * 0.40 * r. letters l h. velocity (max) acceleration (max) 0.8 spiral 0.54 ** 0.54 ** 0.40 * 0.37 * r. letters l h. jerk (max) acceleration (max) 0.6 spiral 0.54 ** 0.54 ** 0.38 * 0.37 * r. letters l velocity (max) acceleration (max) 0.2 spiral 0.54 ** 0.54 ** 0.46 ** 0.34 spiral v. velocity (mean) acceleration (max) 0.7 spiral 0.54 ** 0.53 ** 0.40 * 0.14 r. loops h. acceleration (mean) FD in-air Conventional in-air feature name a task r r r r task feature name p s s p jerk (median) 0.4 sentence 0.37 * 0.49 ** 0.44 * 0.38 * word lektorka h. jerk (median) jerk (max) 0.1 r. word les 0.57 ** 0.46 * 0.38 * 0.40 * word nepopad. velocity (max) jerk (max) 0.3 r. word les 0.57 ** 0.45 * 0.37 * 0.42 * word lektorka h. n. jerk (mean) velocity (max) 0.1 r. word les 0.57 ** 0.45 * 0.47 ** 0.13 r. word lektorka h. velocity (mean) jerk (max) 0.2 r. word les 0.57 ** 0.45 * 0.42 * 0.13 word nepopad. h. velocity (mean) a—order of FD; r —Pearson’s correlation coefficient; r —Spearman’s correlation coefficient; v.—vertical; p s h.—horizontal; r.—repetitive task; *—p < 0.05; **—p < 0.01; rows are ordered by the absolute value of Spearman’s correlation coefficient. For PD duration (on-surface movement), the following FD-based features achieved a statistical significance of correlation (of note: all of these features satisfied the stronger threshold for statistical significance of correlation p < 0.01): the maximum of the velocity (a = 0.1) and acceleration (a = 0.8, a = 0.7, a = 0.6, a = 0.2) for the Archimedean spiral. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): Appl. Sci. 2018, 8, 2566 10 of 18 the maximum of horizontal velocity, horizontal jerk, and velocity for the repetitive letter l; the mean of the vertical velocity for the Archimedean spiral; and the mean of horizontal acceleration for repetitive loops. For PD duration (in-air movement), the following FD-based features achieved a statistical significance of correlation: the median of jerk (a = 0.4) for sentence, the maximum of jerk (a = 0.1, a = 0.2, a = 0.3) and velocity (a = 0.1) for repetitive word les. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the median and mean of horizontal jerk for the word lektorka, the maximum of the velocity for the word nepopadnout, and the mean of horizontal velocity for the repetitive word lektorka and the word nepopadnout. The results of the multivariate binary classification analysis are summarized in Table 6. In total, we built and evaluated nine different classification models. These models were selected according to the following criteria: movement type (on-surface, in-air, all), feature type (FD features, conventional features, all). We built models based on the combinations of these criteria as well. For more information, see Table 4. Table 6. Results of multivariate binary classification analysis (PD/HC). Feature Set MCC ACC [%] SEN [%] SPE [%] Feat conventional on-surface 0.83  0.18 91.19  9.65 93.00  15.52 70.00  0.46 1 conventional in-air 0.95  0.10 97.14  5.71 95.50  9.07 100.00  0.00 1 conventional together 0.95  0.11 97.14  5.71 95.50  9.07 100.00  0.00 1 FD on-surface 0.95  0.12 87.14  13.48 82.00  21.24 90.00  30.00 1 FD in-air 0.95  0.13 81.43  12.86 71.50  30.83 60.00  48.99 3 FD together 0.95  0.14 81.43  15.71 69.50  32.13 70.00  45.83 2 all on-surface 0.95  0.15 88.33  14.06 89.00  22.11 70.00  45.83 2 all in-air 0.95  0.16 97.14  5.71 95.50  9.07 100.00  0.00 1 all together 0.95  0.17 97.14  5.71 95.50  9.07 100.00  0.00 1 MCC—Matthew’s correlation coefficient; ACC—accuracy; SEN—sensitivity; SPE—specificity; feat.—number of features important for the trained model (i.e., feature importance of the feature > 0.0); The feature importances, as well as the exact names of these features, are summarized in the text. With respect to the classification performance, the highest MCC achieved was 0.95 was for eight out of the total nine feature sets (with the exception being the feature set composed of conventional handwriting features computed for the on-surface movements). An interesting fact to note is that for all models based on conventional handwriting features, only a single feature was capable of providing the classification models with such a high discrimination power. In terms of the specific features important for the trained models, the following feature importances were returned by the models (feature importance quantifies the relative importance of the features in the ensemble of the trained XGBoost model [49]; therefore, the higher the value of the feature importance, the more important the feature for the prediction of the dependent variable): conventional on-surface (horizontal jerk (median) of repetitive loops), conventional in-air (horizontal velocity (median) of the sentence), conventional together (horizontal velocity (median) of the sentence), FD on-surface (jerk (max) a = 0.3 of the letters le), FD in-air (vertical acceleration (mean) a = 0.6 of the word nepopadnout (FI = 0.33), horizontal jerk (mean) a = 0.9 of the word nepopadnout (FI = 0.33), horizontal jerk (mean) a = 0.2 of the repetitive word lektorka (FI = 0.33)), FD together (jerk (max) a = 0.3 of the letters le (on-surface; FI = 0.67), horizontal jerk (mean) a = 0.9 of the word nepopadnout (in-air; FI = 0.33)), all on-surface (horizontal jerk (median) of repetitive loops (FI = 0.50), jerk (max) a = 0.3 of the letters le (FI = 0.50)), all in-air (horizontal velocity (median) of the sentence), and all together (horizontal velocity (median) of the sentence (in-air)). The results of multivariate regression analysis are summarized in Table 7. For this purpose, we used UPDRS V and PD duration as our target variables. As in the case of binary classification, we built and evaluated nine different regression models according to the same criteria. For each of the Appl. Sci. 2018, 8, 2566 11 of 18 rating scales, the table shows the results achieved using the trained models and the associated feature importance values. All obtained results are discussed in the following section. Table 7. Results of regression analysis for clinical data. Feature Set MAE RMSE EER [%] Feat UPDRS V conventional on-surface 0.59  0.29 0.71  0.41 13.82  6.71 1 conventional in-air 0.60  0.30 0.72  0.42 14.01  6.98 1 conventional together 0.60  0.31 0.73  0.42 14.05  6.90 1 FD on-surface 0.60  0.32 0.65  0.45 12.51  7.55 1 FD in-air 0.60  0.33 0.68  0.43 13.49  7.29 1 FD together 0.60  0.34 0.66  0.45 13.06  7.55 2 all on-surface 0.60  0.35 0.65  0.45 12.51  7.55 1 all in-air 0.60  0.36 0.71  0.43 13.72  7.36 1 all together 0.60  0.37 0.66  0.45 13.06  7.55 2 PD duration conventional on-surface 4.29  0.94 5.03  1.09 24.52  5.39 18 conventional in-air 4.91  1.38 5.56  1.50 28.03  7.85 16 conventional together 4.14  1.32 4.85  1.52 23.64  7.55 16 FD on-surface 4.45  0.66 5.06  0.85 25.40  3.75 14 FD in-air 4.79  0.73 5.48  0.72 27.36  4.20 19 FD together 4.55  0.68 5.32  0.78 26.00  3.88 21 all on-surface 4.48  0.86 5.12  0.96 25.62  4.92 16 (12 F, 4 C) all in-air 4.95  1.18 5.59  1.17 28.30  6.75 17 (13 F, 4 C) all together 4.70  1.10 5.45  1.23 26.82  6.30 17 (12 F, 6 C) UPDRS V—Unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score [40]; MAE—mean absolute error; RMSE—root mean squared error; EER—estimation error rate; F—FD-based features; C—conventional handwriting features; feat.—number of features important for the trained model (i.e., feature importance of the feature > 0.0); The feature importances, as well as the exact names of these features for models built to assess UPDRS V, are summarized in the text. In the case of PD duration, this data can be found in Table S1 provided in the Supplementary Material. Considering EER as our performance evaluation metric, the following results are worth pointing out. In the case of UPDRS V, the lowest EER was achieved using a single FD-based feature—specifically, the standard deviation of vertical velocity (a = 0.1) computed for the on-surface movements (12.51 7.55%). The same feature was selected when both FD and conventional features were considered while building the model. In general, all models achieved an EER of around 12–13%. In comparison with the conventional features, the FD-based features performed better, with a difference of about 1%. In terms of the specific features important for the trained models, the following feature importances were returned by the models: conventional on-surface (vertical normalized jerk (mean) of the repetitive word lektorka), conventional in-air (vertical velocity (mean) of the sentence), conventional together (vertical velocity (mean) of the sentence), FD on-surface (vertical velocity (std) a = 0.1 of the sentence), FD in-air (vertical velocity (median) a = 0.3 of the sentence), FD together (vertical velocity (std) a = 0.1 of the sentence (on-surface; FI = 0.50), vertical velocity (median) a = 0.3 of the sentence (in-air; FI = 0.50)), all on-surface (vertical velocity (std) a = 0.1 of the sentence), all in-air (vertical velocity (median) a = 0.3 of the sentence), and all together (vertical velocity (std) a = 0.1 of the sentence (on-surface; FI = 0.50), vertical velocity (median) a = 0.3 of the sentence (in-air; FI = 0.50)). With respect to PD duration, the lowest EER was achieved using conventional handwriting features computed for both on-surface as well as in-air movements (23.64 7.55%). 4. Discussion To the best of our knowledge, except for our pilot work [20], there are no prior studies which integrate FD into a handwriting parameterization for quantitative PD dysgraphia analysis. Therefore, the results published in this paper are exploratory in nature. Appl. Sci. 2018, 8, 2566 12 of 18 In comparison with the conventional kinematic features, FD-based ones correlate more significantly with the clinical characteristics (UPDRS V and PD duration). We observed especially strong correlations for handwriting tasks based on the periodic repetition of specific movements (Archimedean spiral; repetitive letter l, syllable le, or word les). Although the levels of significance based on the conventional handwriting parameters are lower, similar handwriting tasks are involved in the most significant results. We hypothesize that this is due to their ability to highlight or better quantify the cardinal motor symptoms of PD. For example, the most significant relationship between handwriting performance and PD duration was identified in acceleration extracted from the Archimedean spiral. Rigidity combined with tremor and/or bradykinesia makes a PD patient’s handwriting/drawing less fluent (increased changes in velocity and higher acceleration). This is highlighted in a task such as the spiral, where the proper coordination of the fingers, wrist, and arm is required. Generally, the observed problems with coordination are in line with the work of Dounskaia et al. [51] and Teulings et al. [52]. To better illustrate these manifestations, Figure 3 plots the velocity profiles of repetitive loops for a healthy control and a PD patient. As can be seen, the patient introduced more changes in velocity, and their drawing became much more non-fluent. To summarize these findings, FD features in combination with properly selected tasks provide a stronger relationship with the severity and progress of PD. Healthly control 1.0 1.10 0.8 1.05 0.6 0.4 1.00 0.2 0.95 0.0 1.65 1.70 1.75 0 1 2 3 4 5 6 7 8 9 X [cm] time [s] PD patient 0.5 0.4 1.04 0.3 1.02 0.2 1.00 0.1 0.98 0.0 1.125 1.150 1.175 0 1 2 3 4 5 X [cm] time [s] Figure 3. Handwriting samples of the repetitive loop task for HC and PD patients are on the left, and the resulting velocity profiles are on the right. On the other hand, in terms of binary classification, the conventional parameters provided the best results. The classification performance is remarkable: ACC = 97.74%, SEN = 95.50%, and SPE = 100%. In fact, our results represent the highest classification accuracy that has ever been reported based on the PaHaW database (see Table 1). We hypothesize that the improvement was caused by the inclusion of the state-of-the-art XGBoost algorithm into our machine learning pipelines. As already mentioned, the result is based on one in-air feature: median horizontal velocity of a sentence. In comparison with the HC cohort, the PD patients exhibited much lower values of this measure, i.e., while writing the sentence, the PD patients were not able to perform horizontal transitions (movement between neighboring letters or words) as quickly as the HC could. This finding is in line with the work of Ma et al. [53], who observed that wrist extension stiffness in PD patients makes the handwriting in the horizontal direction more problematic. Therefore, scientists started to use the term horizontal Y [cm] Y [cm] velocity [cm/s] velocity [cm/s] Appl. Sci. 2018, 8, 2566 13 of 18 dysgraphia [13]. Generally, vertical or horizontal dysgraphia may be considered a presymptomatic neurobehavioral biomarker of PD with possible significance in early PD diagnosis [13]. In [20], we proved that the FD features improved the accuracy of PD dysgraphia diagnosis in the Archimedean spiral drawing task by 10%. Contrary to our pilot results, in the frame of this work, these features did not lead to any improvements. After a deeper analysis, we found that this was caused by a combined task approach. Performance of the Archimedean spiral is a quasiparticle and continuous task with some repetitive patterns. It looks as though the FD features work especially well in these specific cases. Nevertheless, when combining these tasks with a complex handwriting task (such as a sentence), the measures quantifying in-air movement tend to be more discriminative (in our case, the median in-air horizontal velocity of a sentence). This brings us to the same conclusion that was given during the correlation analysis—the FD features advance the PD dysgraphia diagnosis only in some specific cases. The best regression model, estimating the UPDRS V score with a 12.51% error, is based only on the standard deviation of on-surface vertical velocity (a = 0.1) extracted from the sentence. This FD-based parameter was selected from the feature set combining all on-surface measures; therefore, we can confirm the positive influence of FC on the regression analysis performance. In fact, the FD features outperformed the conventional ones in all scenarios. To better understand this result, we plotted vertical velocity patterns of the sentence task for different orders of FD (see Figure 4). We can observe a big difference between a = 0.1 and the rest of the orders, including the full derivative. This large distance means that we are working with completely new information that is far from that contained in the full derivative. Although it is difficult to clinically interpret this information, it is clear that FC opens new possibilities for monitoring PD severity. = 0.1 = 0.6 = 0.2 = 0.7 = 0.3 = 0.8 = 0.4 = 0.9 = 0.5 = 1.0 0 2 4 6 8 10 12 time [s] Figure 4. Vertical velocity patterns of the sentence task for different orders of fractional-order derivatives (FD). Regarding the PD duration estimation results, the most successful model (EER = 23.46%) consists of 16 conventional on-surface/in-air features (all features’ importance values can be found in Supplementary Table S1). The most frequent feature with the highest feature importance is the jerk extracted from several handwriting tasks. This probably means that as PD progresses, handwriting becomes more jerky and velocity [-] Appl. Sci. 2018, 8, 2566 14 of 18 irregular. Vertical velocity is the second most frequent feature involved in the models, which is probably linked with micrographia. Generally, in the case of PD duration estimation, the FD-based features did not yield any improvement. In conclusion, the FD-based features are better for modeling PD severity (in terms of UPDRS V score estimation), but they do not lead to an improvement in PD duration modeling. The progress of PD is nonlinear and very individual. This means that patients with the same PD duration can be in different stages of the disease. This fact supports our results: the estimation error of PD duration was generally much worse than the estimation error of the UPDRS V score. Since PD duration estimation is a difficult task with poor results, fine improvements based on FD parameters play no role. 5. Conclusions This study deals with advanced approaches to PD dysgraphia diagnosis and monitoring based on FC integrated with online handwriting/drawing parameterization. To the best of our knowledge, it is the first work that performs a complex investigation into the possibilities for FC in online handwriting processing and proposes new advances in kinematic analyses based on FD. Although the conventional features provided better and very high classification accuracy, which is at the top of the state-of-the-art analyses based on the PaHaW database (ACC = 97.74%, SEN = 95.50%, and SPE = 100%), the newly designed parameters were proven to work better for specific tasks (continuous and/or repetitive, such as the Archimedean spiral) and for specific applications, i.e., PD severity estimation (EER = 12.51%). However, our results need to be confirmed by subsequent scientific research. This study has several limitations and suggestions for further improvements. Since the dataset is small, to be able to generalize the results, bigger databases should be involved. On the other hand, it is common to have such small numbers of PD patients and HC samples in PD dysgraphia analysis, e.g., see our review in Table 1. Next, we considered only the kinematic measures. To better evaluate the discrimination power of the FD features and better evaluate their ability to estimate PD severity or progress, other feature types, such as temporal, spatial, and dynamic, should be included in future comparisons. Finally, the FD-based parameters could be further explored. For instance, we can consider other approximations (e.g., Caputo) or employ FC for other measures (e.g., entropies). Supplementary Materials: The following are available online at http://www.mdpi.com/2076-3417/8/12/2566/s1, Table S1: Feature relevance from multivariate regression (modeling PD duration). Author Contributions: Conceptualization, J.M. (Jan Mucha), J.M. (Jiri Mekyska) and M.F.-Z.; methodology, J.M. (Jan Mucha), M.F.-Z. and J.M. (Jiri Mekyska); software, J.M. (Jan Mucha) and J.M. (Jiri Mekyska); validation, J.M. (Jan Mucha); formal analysis, J.M. (Jan Mucha) and J.M. (Jiri Mekyska); investigation, J.M. (Jan Mucha), M.F.-Z., J.M. (Jiri Mekyska), I.R. and L.B.; resources, I.R. and L.B.; data curation, J.M. (Jan Mucha), J.M. (Jiri Mekyska), Z.G., V.Z., T.K., L.B. and I.R.; writing—original draft preparation, J.M. (Jan Mucha), J.M. (Jiri Mekyska), Z.G. and M.F.-Z.; visualization, J.M. (Jan Mucha) and Z.G.; supervision, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I.; project administration, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I.; funding acquisition, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I. Funding: This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 734718 (CoBeN). In addition, this work was supported by the grant of the Czech Science Foundation 18-16835S (Research of advanced developmental dysgraphia diagnosis and rating methods based on quantitative analysis of online handwriting and drawing) and the following projects: LO1401, FEDER and MEC, and TEC2016-77791-C4-2-R from the Ministry of Economic Affairs and Competitiveness of Spain. This article is based upon work from COST Action CA15225, a network supported by COST (European Cooperation in Science and Technology), and, for the research, infrastructure of the SIX Center was used. Conflicts of Interest: The authors declare no conflict of interest. Disclaimer: The article reflects only the authors’ view, and the Research Executive Agency (REA) is not responsible for any use that may be made of the information it contains. Appl. Sci. 2018, 8, 2566 15 of 18 Abbreviations The following abbreviations are used in this manuscript: ACC accuracy ADA AdaBoost ANN artificial neural network ANOVA analysis of variance AUC area under the ROC curve CNN convolutional neural network EMD empirical mode decomposition EER estimated error rate FN false negatives FP false positives FC fractional calculus FD fractional-order derivative FI feature importance K-NN K-nearest neighbors LED L-dopa equivalent daily dose LDA linear discriminant analysis MCC Matthew’s correlation coefficient max maximum MAE mean absolute error NB naïve Bayes classifier OPF optimum path forest PD Parkinson’s disease RF random forests RMSE root mean squared error SEN sensitivity r Pearson’s correlation coefficient r Spearman’s correlation coefficient SPE specificity std standard deviation TN true negatives TP true positives SVM support vector machine UPDRS V unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score References 1. Bellou, V.; Belbasis, L.; Tzoulaki, I.; Evangelou, E.; Ioannidis, J.P. Environmental risk factors and Parkinson’s disease: An umbrella review of meta-analyses. Parkinsonism Relat. Disord. 2016, 23, 1–9. [CrossRef] [PubMed] 2. Berg, D.; Postuma, R.B.; Adler, C.H.; Bloem, B.R.; Chan, P.; Dubois, B.; Gasser, T.; Goetz, C.G.; Halliday, G.; Joseph, L.; et al. MDS research criteria for prodromal Parkinson’s disease. Mov. Disord. 2015, 30, 1600–1611. [CrossRef] [PubMed] 3. Sapir, S.; Ramig, L.; Fox, C. Speech and swallowing disorders in Parkinson disease. Curr. Opin. Otolaryngol. Head Neck Surg. 2008, 16, 205–210. [CrossRef] [PubMed] 4. Hirsch, L.; Jette, N.; Frolkis, A.; Steeves, T.; Pringsheim, T. The incidence of Parkinson’s disease: A systematic review and meta-analysis. Neuroepidemiology 2016, 46, 292–300. [CrossRef] [PubMed] 5. Hornykiewicz, O. Biochemical aspects of Parkinson’s disease. Neurology 1998, 51, S2–S9. [CrossRef] [PubMed] 6. Elbaz, A.; Carcaillon, L.; Kab, S.; Moisan, F. Epidemiology of Parkinson’s disease. Rev. Neurol. 2016, 172, 14–26, doi:10.1016/j.neurol.2015.09.012. [CrossRef] [PubMed] 7. Contreras-Vidal, J.L.; Poluha, P.; Teulings, H.L.; Stelmach, G.E. Neural dynamics of short and medium-term motor control effects of levodopa therapy in Parkinson’s disease. Artif. Intell. Med. 1998, 13, 57–79. [CrossRef] Appl. Sci. 2018, 8, 2566 16 of 18 8. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Faundez-Zanuy, M. Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson’s disease. Artif. Intell. Med. 2016, 67, 39–46. [CrossRef] 9. Brabenec, L.; Mekyska, J.; Galaz, Z.; Rektorova, I. Speech disorders in Parkinson’s disease: Early diagnostics and effects of medication and brain stimulation. J. Neural Transm. 2017, 124, 303–334. [CrossRef] 10. Mucha, J.; Galaz, Z.; Mekyska, J.; Kiska, T.; Zvoncak, V.; Smekal, Z.; Eliasova, I.; Mrackova, M.; Kostalova, M.; Rektorova, I.; et al. Identification of hypokinetic dysarthria using acoustic analysis of poem recitation. In Proceedings of the 2017 40th International Conference on Telecommunications and Signal Processing (TSP), Barcelona, Spain, 5–7 July 2017; pp. 739–742. 11. De Stefano, C.; Fontanella, F.; Impedovo, D.; Pirlo, G.; di Freca, A.S. Handwriting analysis to support neurodegenerative diseases diagnosis: A review. Pattern Recognit. Lett. 2018, in press. [CrossRef] 12. Rosenblum, S.; Samuel, M.; Zlotnik, S.; Erikh, I.; Schlesinger, I. Handwriting as an objective tool for Parkinson’s disease diagnosis. J. Neurol. 2013, 260, 2357–2361. [CrossRef] 13. Thomas, M.; Lenka, A.; Kumar Pal, P. Handwriting Analysis in Parkinson’s Disease: Current Status and Future Directions. Mov. Disord. Clin. Pract. 2017, 4, 806–818. [CrossRef] [PubMed] 14. McLennan, J.; Nakano, K.; Tyler, H.; Schwab, R. Micrographia in Parkinson’s Disease. J. Neurol. Sci. 1972, 15, 141–152. [CrossRef] 15. Letanneux, A.; Danna, J.; Velay, J.L.; Viallet, F.; Pinto, S. From micrographia to Parkinson’s disease dysgraphia. Mov. Disord. 2014, 29, 1467–1475. [CrossRef] [PubMed] 16. Drotar, P.; Mekyska, J.; Smekal, Z.; Rektorova, I.; Masarova, L.; Faundez-Zanuy, M. Contribution of different handwriting modalities to differential diagnosis of Parkinson’s Disease. In Proceedings of the 2015 IEEE International Symposium Medical Measurements and Applications (MeMeA), Turin, Italy, 7–9 May 2015; pp. 1–5. 17. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Zanuy, M.F. Decision support framework for Parkinson’s disease based on novel handwriting markers. IEEE Trans. Neural Syst. Rehabil. Eng. 2015, 23, 508–516. [CrossRef] [PubMed] 18. Loconsole, C.; Trotta, G.F.; Brunetti, A.; Trotta, J.; Schiavone, A.; Tato, S.I.; Losavio, G.; Bevilacqua, V. Computer Vision and EMG-Based Handwriting Analysis for Classification in Parkinson’s Disease. In Intelligent Computing Theories and Application; Springer International Publishing: Cham, Switzerland, 2017; pp. 493–503, doi:10.1007/978-3-319-63312-1_43. 19. Nackaerts, E.; Broeder, S.; Pereira, M.P.; Swinnen, S.P.; Vandenberghe, W.; Nieuwboer, A.; Heremans, E. Handwriting training in Parkinson’s disease: A trade-off between size, speed and fluency. PLoS ONE 2017, 12, e0190223. [CrossRef] [PubMed] 20. Mucha, J.; Zvoncak, V.; Galaz, Z.; Faundez-Zanuy, M.; Mekyska, J.; Kiska, T.; Smekal, Z.; Brabenec, L.; Rektorova, I.; Lopez-de Ipina, K. Fractional Derivatives of Online Handwriting: A New Approach of Parkinsonic Dysgraphia Analysis. In Proceedings of the 2018 41st International Conference on Telecommunications and Signal Processing (TSP), Athens, Greece, 4–6 July 2018; pp. 214–217. 21. Kotsavasiloglou, C.; Kostikis, N.; Hristu-Varsakelis, D.; Arnaoutoglou, M. Machine learning-based classification of simple drawing movements in Parkinson’s disease. Biomed. Signal Process. Control 2017, 31, 174–180. [CrossRef] 22. Taleb, C.; Khachab, M.; Mokbel, C.; Likforman-Sulem, L. Feature selection for an improved Parkinson’s disease identification based on handwriting. In Proceedings of the 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), Nancy, France, 3–5 April 2017; pp. 52–56. 23. Moetesum, M.; Siddiqi, I.; Vincent, N.; Cloppet, F. Assessing visual attributes of handwriting for prediction of neurological disorders: A case study on Parkinson’s disease. Pattern Recognit. Lett. 2018. [CrossRef] 24. Heremans, E.; Nackaerts, E.; Vervoort, G.; Vercruysse, S.; Broeder, S.; Strouwen, C.; Swinnen, S.P.; Nieuwboer, A. Amplitude Manipulation Evokes Upper Limb Freezing during Handwriting in Patients with Parkinson’s Disease with Freezing of Gait. PLoS ONE 2015, 10, e0142874. [CrossRef] [PubMed] 25. Pereira, C.R.; Pereira, D.R.; da Silva, F.A.; Hook, C.; Weber, S.A.; Pereira, L.A.; Papa, J.P. A step towards the automated diagnosis of parkinson’s disease: Analyzing handwriting movements. In Proceedings of the 2015 IEEE 28th International Symposium on Computer-Based Medical Systems (CBMS), Sao Carlos, Brazil, 22–25 June 2015; pp. 171–176. Appl. Sci. 2018, 8, 2566 17 of 18 26. Heremans, E.; Nackaerts, E.; Broeder, S.; Vervoort, G.; Swinnen, S.P.; Nieuwboer, A. Handwriting Impairments in People With Parkinson’s Disease and Freezing of Gait. Neurorehabil. Neural Repair 2016, 30, 911–919. [CrossRef] 27. Pereira, C.R.; Weber, S.A.; Hook, C.; Rosa, G.H.; Papa, J.P. Deep Learning-Aided Parkinson. In Proceedings of the 2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Sao Paulo, Brazil, 4–7 October 2016; pp. 340–346. 28. Impedovo, D.; Pirlo, G.; Vessio, G. Dynamic Handwriting Analysis for Supporting Earlier Parkinson’s Disease Diagnosis. Information 2018, 9, 247. [CrossRef] 29. Baleanu, D.; Jajarmi, A.; Bonyah, E.; Hajipour, M. New aspects of poor nutrition in the life cycle within the fractional calculus. Adv. Differ. Equ. 2018, 2018, 230. [CrossRef] 30. Khalili Golmankhaneh, A.; Baleanu, D. New Derivatives on the Fractal Subset of Real-Line. Entropy 2016, 18, 1. [CrossRef] 31. Podlubny, I. Fractional Differential Equations an Introduction to Fractional Derivatives, Fractional Differential Equations, to Methods of Their Solution and Some of Their Applications; Academic Press: San Diego, CA, USA, 1999. 32. Uchaikin, V.V. Fractional Derivatives for Physicists and Engineers; Springer: Berlin, Germany, 2013; Volume 2. 33. Samko, S.G.; Kilbas, A.A.; Marichev, O.I. Fractional Integrals and Derivatives: Theory and Applications; Gordon and Breach: Yverdon, Switzerland, 1993; p. 44. 34. Arshad, S.; Baleanu, D.; Bu, W.; Tang, Y. Effects of HIV infection on CD4+ T-cell population based on a fractional-order model. Adv. Differ. Equ. 2017, 2017, 92. [CrossRef] 35. Pinto, C.M.; Machado, J.T. Fractional model for malaria transmission under control strategies. Comput. Math. Appl. 2013, 66, 908–916. doi:10.1016/j.camwa.2012.11.017. [CrossRef] 36. Lopes, A.M.; Machado, J.T. Integer and fractional-order entropy analysis of earthquake data series. Nonlinear Dyn. 2016, 84, 79–90. [CrossRef] 37. Lopes, A.M.; Machado, J.T. Application of fractional techniques in the analysis of forest fires. Int. J. Nonlinear Sci. Numer. Simul. 2016, 17, 381–390. [CrossRef] 38. Scalas, E.; Gorenflo, R.; Mainardi, F. Fractional calculus and continuous-time finance. Phys. A Stat. Mech. Its Appl. 2000, 284, 376–384. [CrossRef] 39. Baleanu, D. Fractional Calculus: Models and Numerical Methods; World Scientific: Singapore, 2012; Volume 3. 40. Fahn, S.; Elton, R.L. UPDRS Development Committee (1987) Unified Parkinson’s Disease Rating Scale. In Recent Developments in Parkinson’s Disease; Macmillan: Florham Park, NJ, USA, 1987. 41. Lee, J.Y.; Kim, J.W.; Lee, W.Y.; Kim, J.M.; Ahn, T.B.; Kim, H.J.; Cho, J.; Jeon, B.S. Daily dose of dopaminergic medications in Parkinson’s disease: Clinical correlates and a posteriori equation. Neurol. Asia 2010, 15, 137–143. 42. Sesa-Nogueras, E.; Faundez-Zanuy, M.; Mekyska, J. An information analysis of in-air and on-surface trajectories in online handwriting. Cogn. Comput. 2012, 4, 195–205. [CrossRef] 43. Alonso-Martinez, C.; Faundez-Zanuy, M.; Mekyska, J. A comparative study of in-air trajectories at short and long distances in online handwriting. Cogn. Comput. 2017, 9, 712–720. [CrossRef] [PubMed] 44. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Faundez-Zanuy, M. A new modality for quantitative evaluation of Parkinson’s disease: In-air movement. In Proceedings of the 13th IEEE International Conference on BioInformatics and BioEngineering (BIBE 2013), Chania, Greece, 10–13 November 2013; pp. 1–4. 45. Mekyska, J.; Faundez-Zanuy, M.; Mzourek, Z.; Galaz, Z.; Smekal, Z.; Rosenblum, S. Identification and rating of developmental dysgraphia by handwriting analysis. IEEE Trans. Hum.-Mach. Syst. 2017, 47, 235–248. [CrossRef] 46. Tenreiro Machado, J. Fractional coins and fractional derivatives. Abstr. Appl. Anal. 2013, 2013, 205097. [CrossRef] 47. Kilbas, A.A.; Srivastava, H.M.; Trujillo, J.J. Theory and Applications of Fractional Differential Equations; Elsevier: Amsterdam, The Netherlands, 2006. 48. Scherer, R.; Kalla, S.L.; Tang, Y.; Huang, J. The Grünwald–Letnikov method for fractional differential equations. Comput. Math. Appl. 2011, 62, 902–917. [CrossRef] Appl. Sci. 2018, 8, 2566 18 of 18 49. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. 50. Matthews, B.W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta 1975, 405, 442–451. [CrossRef] 51. Dounskaia, N.; Van Gemmert, A.W.; Leis, B.C.; Stelmach, G.E. Biased wrist and finger coordination in Parkinsonian patients during performance of graphical tasks. Neuropsychologia 2009, 47, 2504–2514. [CrossRef] 52. Teulings, H.L.; Contreras-Vidal, J.L.; Stelmach, G.E.; Adler, C.H. Parkinsonism Reduces Coordination of Fingers, Wrist, and Arm in Fine Motor Control. Exp. Neurol. 1997, 146, 159–170. [CrossRef] 53. Ma, H.I.; Hwang, W.J.; Chang, S.H.; Wang, T.Y. Progressive micrographia shown in horizontal, but not vertical, writing in Parkinson’s disease. Behav. Neurol. 2013, 27, 169–174. [CrossRef] c 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Applied Sciences Multidisciplinary Digital Publishing Institute

Identification and Monitoring of Parkinson’s Disease Dysgraphia Based on Fractional-Order Derivatives of Online Handwriting

Loading next page...
 
/lp/multidisciplinary-digital-publishing-institute/identification-and-monitoring-of-parkinson-s-disease-dysgraphia-based-MMAJ9rNpoe

References (39)

Publisher
Multidisciplinary Digital Publishing Institute
Copyright
© 1996-2019 MDPI (Basel, Switzerland) unless otherwise stated
ISSN
2076-3417
DOI
10.3390/app8122566
Publisher site
See Article on Publisher Site

Abstract

applied sciences Article Identification and Monitoring of Parkinson’s Disease Dysgraphia Based on Fractional-Order Derivatives of Online Handwriting 1 1 1,2 3 Jan Mucha , Jiri Mekyska , Zoltan Galaz , Marcos Faundez-Zanuy , 4 1 1 1 Karmele Lopez-de-Ipina , Vojtech Zvoncak , Tomas Kiska , Zdenek Smekal , 2 2,5, Lubos Brabenec and Irena Rektorova * Department of Telecommunications and SIX Research Centre, Brno University of Technology, Technicka 10, 61600 Brno, Czech Republic; muchajano@phd.feec.vutbr.cz (J.M.); mekyska@feec.vutbr.cz (J.M.) z.galaz@phd.feec.vutbr.cz (Z.G.); vojtech.zvoncak@phd.feec.vutbr.cz (V.Z.); kiskatomas@phd.feec.vutbr.cz (T.K.); smekal@feec.vutbr.cz (Z.S.) Applied Neuroscience Research Group, Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic; lubos.brabenec@ceitec.muni.cz Escola Superior Politecnica, Tecnocampus Avda. Ernest Lluch 32, 08302 Mataro, Barcelona, Spain; faundez@tecnocampus.cat Department of Systems Engineering and Automation, University of the Basque Country UPV/EHU, Av de Tolosa 54, 20018 Donostia, Spain; karmele.ipina@ehu.eus First Department of Neurology, Masaryk University and St. Anne’s University Hospital, Pekarska 53, 65691 Brno, Czech Republic * Correspondence: irena.rektorova@fnusa.cz; Tel.: +420-543-182-639 † This paper is an extended version of our paper published in Mucha, J.; Zvoncak, V.; Galaz, Z.; Faundez-Zanuy, M.; Mekyska, J.; Kiska, T.; Smekal, Z.; Brabenec, L.; Rektorova, I.; Ipina, K.L. Fractional Derivatives of Online Handwriting: a New Approach of Parkinsonic Dysgraphia Analysis. In Proceedings of the 2018 41st International Conference on Telecommunications and Signal Processing (TSP), Athens, Greece, 4–6 July 2018; pp. 1–4. Received: 30 October 2018; Accepted: 6 December 2018; Published: 11 December 2018 Abstract: Parkinson’s disease dysgraphia affects the majority of Parkinson’s disease (PD) patients and is the result of handwriting abnormalities mainly caused by motor dysfunctions. Several effective approaches to quantitative PD dysgraphia analysis, such as online handwriting processing, have been utilized. In this study, we aim to deeply explore the impact of advanced online handwriting parameterization based on fractional-order derivatives (FD) on the PD dysgraphia diagnosis and its monitoring. For this purpose, we used 33 PD patients and 36 healthy controls from the PaHaW (PD handwriting database). Partial correlation analysis (Spearman’s and Pearson’s) was performed to investigate the relationship between the newly designed features and patients’ clinical data. Next, the discrimination power of the FD features was evaluated by a binary classification analysis. Finally, regression models were trained to explore the new features’ ability to assess the progress and severity of PD. These results were compared to a baseline, which is based on conventional online handwriting features. In comparison with the conventional parameters, the FD handwriting features correlated more significantly with the patients’ clinical characteristics and provided a more accurate assessment of PD severity (error around 12%). On the other hand, the highest classification accuracy (ACC = 97.14%) was obtained by the conventional parameters. The results of this study suggest that utilization of FD in combination with properly selected tasks (continuous and/or repetitive, such as the Archimedean spiral) could improve computerized PD severity assessment. Keywords: Parkinson’s disease dysgraphia; micrographia; online handwriting; kinematic analysis; fractional-order derivative; fractional calculus Appl. Sci. 2018, 8, 2566; doi:10.3390/app8122566 www.mdpi.com/journal/applsci Appl. Sci. 2018, 8, 2566 2 of 18 1. Introduction As a second most common neurodegenerative disorder, Parkinson’s disease (PD) is expected to impose an increasing social and economic burden on societies as populations age [1]. Its prevalence rate is estimated to approximately 1.5% for people aged over 65 years [2]. The risk of being affected by PD strongly increases with age, and, in the next 15 years, the incidence of PD is expected to be doubled [3,4]. The rapid degeneration of dopaminergic cells in the substantia nigra pars compacta [5] arose as the most significant biological finding associated with the disease, but the exact pathophysiological cause of PD has not yet been discovered. PD cardinal motor symptoms involve bradykinesia (slowness of movement), tremor at rest, rigidity, gait impairment, and postural instability [6–8]. A variety of non-motor symptoms may emerge as well—for instance, cognitive impairment, dementia, depression, sleep disorders, or anxiety [6,9,10]. Handwriting requires cognitive, perceptual, and fine motor abilities. In conjunction with motor dysfunctions in people suffering from PD, it has been proven that disrupted handwriting may be used as a significant biomarker for PD diagnosis [11,12]. Micrographia, which is associated with the progressive decrease in letters’ amplitude, is the most commonly observed handwriting abnormality in patients with PD [13,14]. Moreover, according to McLennan et al. [14], in approximately 5% of PD patients, micrographia may be observed even before the onset of the cardinal motor symptoms. The recent advantage of new technologies coming hand-in-hand with Health 4.0 systems enables the acquisition of online handwriting signals, where temporal information is added to the x and y position. Therefore, by using a digitizing tablet, the analysis is not limited to spatial features which mainly quantify PD micrographia. In addition, we are able to quantify temporal, kinematic, and dynamic manifestations of PD dysgraphia, such as hesitations, pauses, and slow movement [7], which cannot be studied objectively using a classical paper-and-pen method. Due to this complexity, Letanneux et al. [15] started to refer to these manifestations using the generalized term PD dysgraphia. Several research teams have explored the impact of quantitative PD dysgraphia analysis utilizing simple handwriting/drawing tasks (e.g., separate characters, a combination of two or three characters, repetitive loops, circles), as well as more complex ones (e.g., words, sentences, figures, 3D objects, and the Archimedean spiral) [8,16–20]. An overview of recent related works (2015–present) can be seen in Table 1. Most of them confirm the irreplaceability of kinematic features in PD dysgraphia analysis. Additionally, the researchers usually employ temporal, spatial, and dynamic features. Some more advanced parameters are reported too. For instance, Drotar et al. [8,16,17] demonstrated a combination of kinematic, pressure, energy, or empirical mode decomposition (EMD)-based features that resulted in a classification accuracy of up to 89% using several handwriting tasks. Kotsavasilogloua et al. [21] achieved an average prediction accuracy of 91% using simple horizontal lines and features describing the variability in the pen tip’s velocity, a deviation from the horizontal plane, and the trajectory’s entropy. Other works report even higher classification accuracies (approximately 97%), e.g., Loconsole et al. [18], who used computer vision and electromyography signal processing techniques, or Taleb et al. [22], who used a combination of features related to the correlation between kinematic and pressure characteristics (but, in this case, applied to a very small dataset). Another promising approach was published by Moetesum et al. [23], who reached an 83% classification accuracy by employing convolutional neural networks (CNN) that were used to extract discriminating visual features from handwriting data transformed into the offline mode. In 2018, Impedovo et al. reported the results of a study focused only on the early stages of PD; the best accuracy was 74.76% for a combination of three handwriting tasks. Finally, in our previous work [20], we proposed a new approach of advanced kinematic feature extraction that utilizes fractional-order derivatives (FD). This approach increased the classification accuracy by 10% (72.39%) for Archimedean spiral tasks in comparison with the baseline [20]. Appl. Sci. 2018, 8, 2566 3 of 18 Table 1. Overview of related works focused on computerized analysis of Parkinson’s disease (PD) dysgraphia. First Author Year PD/HC Handwriting Task Analysis Features Conclusions letters, words, differential kinematic, temporal, spatial, The highest classification accuracy after Drotar * [17] 2015 37/38 sentences analysis (SVM) entropy, EMD, signal energy feature selection approach was 88.13%. letters, words, differential kinematic, temporal, spatial, Classification performance was at its peak Drotar * [16] 2015 37/38 sentences analysis (SVM) entropy, EMD, pressure with on-surface features equal to AUC = 89.09%. up/down strokes at Significant difference between groups was in Heremans [24] 2015 34/10 ANOVA spatial and kinematic varying amplitudes spatial (F(2.41) = 3.97; p = 0.03). differential an. mean relative tremor The best results were obtained by NB classifier Pereira [25] 2015 37/18 Archimedean spiral (SVM, NB, OPF) and spatial parameters that provided around 79% classification accuracy. letters, words, Archimedean differential an. kinematic, temporal, spatial, Combining all exercises, SVM proved to be Drotar * [8] 2016 37/38 spiral, sentences (SVM, K-NN, ADA) entropy, EMD, pressure the best classifier with 82.5% accuracy. ANOVA, PD dysgraphia is more severe in patients with Heremans [26] 2016 30/15 repetitive cursive loops writing amplitude and velocity correlation an. freezing of gait. Archimedean spiral, differential an. The best result was obtained by CNN with Pereira [27] 2016 14/21 pen-based features meander (CNN, OPF) 87.14% classification accuracy using meander task. differential Kotsavasil [21] 2017 24/20 horizontal lines kinematic Average classification accuracy was 91%. analysis (NB) differential Highest classification accuracy (96.81%) was Loconsole [18] 2017 4/7 sentence, repetitive loops temporal, kinematic, spatial analysis (ANN) achieved using all the extracted features. differential kinematic, stroke, pressure, The highest classification accuracy was 96.88% Taleb [22] 2017 16/16 letters, waves, words analysis (SVM) entropy, energy, EMD for 12 kinematic and pressure features. Archimedean spiral, Extraction of features using CNN applied on raw differential Moetesum * [23] 2018 37/38 letters, words, CNN-based features handwriting data resulted in 83% classification analysis (SVM) sentence, loops accuracy. differential fractional derivatives based Improvement of classification accuracy Mucha * [20] 2018 30/36 Archimedean spiral analysis (RF, SVM) kinematic features by 10% (72.38%) in comparison to the baseline. differential an. Archimedean spiral kinematic, temporal, spatial, Analysis focused on PD diagnosis at earlier stages Impedovo * [28] 2018 37/38 (RF, SVM, K-NN, letters, words, sentence entropy, EMD, pressure resulted in 74.76% classification accuracy. NB, LDA, ADA) SVM—support vector machine; EMD—empirical mode decomposition; K-NN—K-nearest neighbors; ANOVA—analysis of variance; NB—naïve Bayes classifier; OPF—optimum path forest; ANN—artificial neural network; CNN—convolutional neural network; RF—random forests; LDA—Linear Discriminant Analysis; ADA—AdaBoost; AUC—area under the receiver operating characteristics (ROC) curve; articles are sorted by the year of release and then alphabetically; * analyzes performed on the same database (Parkinson’s disease handwriting database (PaHaW) [8]). Appl. Sci. 2018, 8, 2566 4 of 18 Although the authors of the previously mentioned studies reported high classification accuracies, further signal processing and machine learning pipeline improvements are expected to make the differential analysis even more accurate. One possible approach could involve an advanced feature extraction methodology based on fractional calculus (FC) [29,30], which enables the use of an arbitrary order of derivatives and/or integrals. Generally, FC has many applications in different fields of science [31–33]. For instance, it has been advantageously used during the modeling of different diseases, such as human immunodeficiency virus (HIV) [34] and malaria [35]. In addition, FC-based analytical tools have outperformed classical techniques in geology [36,37], economics and finance [38,39], etc. Moreover, in our recent paper [20], we identified a high potential for the use of FC in the kinematic analysis of PD drawings. Based on these preliminary results, we assume that FD-based handwriting features may bring improvements to PD diagnosis and assessment. In the frame of this article, we would like to go further and deeply explore the impact of FD on the PD dysgraphia diagnosis and its monitoring. More specifically, we aim to: investigate the relationship between newly designed FD handwriting features and a patient’s clinical data and compare these results with a baseline (i.e., results based on conventional parameters), evaluate the discrimination power of the FD features in terms of binary classification accuracy and compare the results to the baseline, use the newly designed features to establish regression models that will estimate the severity of PD and compare its performance to that of a baseline. The rest of this paper is organized as follows: Section 2 describes the cohort of patients and the methodology, and Section 3 includes the results. A discussion is presented in Section 4, and, finally, conclusions are drawn in Section 5. 2. Materials and Methods 2.1. Dataset For the purpose of this work, the Parkinson’s disease handwriting database (PaHaW) [8], which consists of multiple handwriting/drawing samples from 37 PD patients and 38 age- and gender-matched healthy controls (HC), was used. Since the Archimedean spiral drawing task is missing for some participants, we reduced the analyzed cohort to 33 PD patients and 36 HC. Demographic and clinical data of the participants can be found in Table 2. The participants were enrolled at the First Department of Neurology, St. Anne’s University Hospital in Brno, Czech Republic. All participants reported the Czech language as their native language and were right-handed. The patients completed their tasks approximately 1 h after their regular dopaminergic medication (L-dopa). All participants signed an informed consent form approved by the local ethics committee. Unified Parkinson’s disease rating scale, part V (UPDRS V): Modified Hoehn and Yahr staging score [40], was used to assess clinical symptoms of PD. In the frame of this work, the duration of the disease was considered as well. Descriptive visualization (histograms, regression, and residual plots) of the clinical data for the subjects participating in this study can be seen in Figure 1. Appl. Sci. 2018, 8, 2566 5 of 18 Table 2. Demographic and clinical data of the enrolled participants. Gender N Age [years] PD dur [years] UPDRS V LED [mg/day] Parkinson’s disease patients Females 17 71.76  10.93 9.88  5.27 2.18  0.86 1146.03  543.89 Males 16 66.50  13.44 7.44  4.04 2.31  0.75 1673.38  616.66 All 33 69.21  11.10 8.70  4.82 2.24  0.80 1401.72  630.71 Healthy controls Females 17 61.59  10.17 - - - Males 19 63.32  13.14 - - - All 36 62.50  11.70 - - - PD—Parkinson’s disease; N—number of subjects; PD dur—PD duration; UPDRS V—Unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score [40]; LED—L-dopa equivalent daily dose [41]. 17.5 15.0 12.5 10.0 7.5 5.0 2.5 Length of PD UPDRS V 2.0 1.5 1.0 0.5 0.0 0.5 1.0 2.5 5.0 7.5 10.0 12.5 15.0 17.5 1 2 3 4 5 Length of PD UPDRS V Figure 1. Descriptive graphs of patients’ clinical characteristics: Unified Parkinson’s disease rating scale (UPDRS V) and Parkinson’s disease (PD) duration (in years). Histograms are visualized on the diagonal. A scatterplot with a line fitted using linear regression is visualized in the top-right corner. Residuals of the trained linear model are visualized in the bottom-left corner. 2.2. Data Acquisition The PaHaW database [8] includes nine different handwriting tasks written in the Czech language. Their description and translation to English can be found in Table 3. During all handwriting tasks, the participants were rested and seated in a comfortable position with the possibility to look at the prefilled template (see Figure 2). A digitizing tablet (Wacom Intuos 4M, Wacom, Kazo, Saitama, Japan) UPDRS V Length of PD UPDRS V Length of PD Appl. Sci. 2018, 8, 2566 6 of 18 was overlaid with an empty paper template and participants were asked to perform all tasks using a special Wacom inking pen that gave the patients immediate visual feedback. Online handwriting signals were recorded with a sampling frequency of f = 150 Hz. The following time sequences were acquired: x and y coordinates (x[t], y[t]); time-stamp (t); in-air/on-surface (on-surface movement is a movement of a pen when its tip is touching the surface, e.g., paper (i.e., it provides the information about the pen writing/drawing on the paper); vice versa, in-air movement is a movement of a pen when its tip is up to 1.5 cm above the surface [42,43]) status (b[t]); pressure (p[t]); azimuth (az[t]); and altitude (al[t]). Figure 2. Filled template of the PaHaW database. Table 3. Description of the PaHaW handwriting tasks. N Task Czech (Original) English (Translation) 1 Archimedean spiral - - 2 repetitive loops - - 3 letter l l 4 syllable le le 5 word les forest 6 word lektorka lecturer 7 word porovnat compare 8 word nepopadnout not grasped 9 sentence Tramvaj dnes už nepojede. The tram will no longer go today. 2.3. Feature Extraction The main goal of this work is to compare a set of commonly used kinematic features with newly proposed FD-based features in terms of quantitative PD dysgraphia analysis. All of the handwriting features were computed using both on-surface as well as in-air movements. The two movements were quantified separately using velocity (rate at which the position of the pen changes with time [mm/s]), acceleration (rate at which the velocity of the pen changes with time [mm/s ]), jerk (rate at which the acceleration of the pen changes with time [mm/s ]), and their horizontal and vertical variants [8,44,45]. FD-based features were extracted for different values of a. In the frame of this work, a ranging from 0.1 Appl. Sci. 2018, 8, 2566 7 of 18 to 1.0 with a step of 0.1 was used. Subsequently, the statistical properties of the computed handwriting features were described using the mean, median, standard deviation (std), and maximum (max). Finally, all of the extracted features were divided into nine different feature sets according to the type of the movement (on-surface, in-air, and combined) and the calculation approach, i.e., the type of feature (FD-based, conventional, and combined). For more information, see Table 4. Table 4. Feature sets matrix. Movement FD (Count) Conventional (Count) Together (Count) on-surface 4536 618 5154 in-air 2916 404 3320 together 7452 1022 8474 Fractional-Order Derivatives Utilization of the FD as a substitution for the conventional differential derivative during calculation of the basic kinematic features provides a new advanced approach. The advantage of FDs is in their wide range of settings and many different approaches to approximation, e.g., Riemann–Liouville, Caputo, or Grünwald–Letnikov formulations [31,46,47]. For the purpose of this work, Jonathan Hadida’s FD Matlab implementation was used following the Grünwald–Letnikov approximation [31,48]. A direct definition of the FD D y(t) is based on the finite differences of an equidistant grid in [0, t], assuming that the function y(t) satisfies certain smoothness conditions in every finite interval (0, t), t  T. Choosing the grid [31], 0 = t < t < ... < t = t = (n + 1)h (1) 0 1 n+1 with t t = h (2) k+1 k and using the notation of finite differences n+1 1 1 a a D y(t) = y(t ) c y(t ) , (3) n+1 n+1v å v a h a h h v=1 where a v1 a c = (1) ( ). (4) v v The Grünwald–Letnikov implementation is defined as a a D y(t) = lim D y(t), (5) h!0 h where D y(t) denotes a derivative with order a of function y(t), and h represents a sampling lattice. 2.4. Statistical Analysis Prior to providing a description of the analytical setup, it is important to note that the effect of well-known confounding factors, also known as covariates, was controlled for in all of the analytical steps described below. In the frame of this work, we controlled for the effect of participants’ age, gender, and L-dopa [41] (dopaminergic medication). To assess the strength of the relationship between the computed handwriting features and patient’s clinical data (UPDRS V and PD duration), we computed the partial Pearson’s correlation coefficient (assessment of a linear relationship), as well as the partial Spearman’s correlation coefficient (assessment of a monotonic relationship). With this approach, we aimed to identify the handwriting features that are significantly correlated with the clinical measures under focus and also to compare Appl. Sci. 2018, 8, 2566 8 of 18 the FD features with conventional ones. A significance level of correlation (p) of 0.05 was selected for both of the correlation types. Only the results with a p-value below the significance level in both correlation coefficients were considered statistically significant. Next, to evaluate and compare the power of the handwriting features to discriminate PD patients and HC, multivariate binary classification analysis was performed. For this purpose, state-of-the-art gradient boosted trees were employed. Specifically, we used the famous XGBoost algorithm [49]. The XGBoost algorithm was chosen for its ability to achieve a good performance, even for small datasets; its inherent robustness to outliers; its ability to model complex interdependencies in the data; and also its recent successes in the field of machine learning (e.g., the winning algorithm in many www.kaggle.com competitions). To train and evaluate the models, we used the following supervised learning setup: stratified 10-fold cross-validation with 20 repetitions. The performance of the trained classification models was evaluated by Matthew’s correlation coefficient (MCC) [50], classification accuracy (ACC), sensitivity (SEN), and specificity (SPE), which are defined as follows: TP TN FP FN MCC = p , (6) (TP + FP)(TP + FN)(TN + FP)(TN + FN) TP + TN ACC =  100 [%], (7) TP + TN + FP + FN TP SEN =  100 [%], (8) TP + FN TN SPE =  100 [%], (9) TN + FP where TP is the number of true positives, TN is the number of true negatives, FP is the number of false positives, and FN is the number false negatives. Finally, to evaluate and compare the power of the handwriting features’ ability to predict the values of the selected clinical characteristics (UPDRS V and PD duration), multivariate regression analysis was performed. For this purpose, the same boosting tree algorithm (XGBoost) and the supervised learning setup were used. The performance of the trained regression models was evaluated by the mean absolute error (MAE), root mean square error (RMSE), and estimated error rate (EER), which are defined as follows: MAE = jy y ˆj, (10) å i i i=1 RMSE = (y y ˆ ) , (11) å i i i=1 EER = jy y ˆj 100 [%], (12) å i i n r i=1 where y represents the true label of the ith observation, y denotes the predicted label of the ith i i observation, n is the number of observations, and r is the range of the values of the predicted clinical characteristic (not the range that can be theoretically reached, but the actual range of the values in the dataset). Therefore, the EER describes a percentage of error predictions in regard to the statistical properties of the data. 3. Results In Table 5, the results of partial correlation analysis between the handwriting features (FD-based features, conventional features) and patients’ clinical characteristics (UPDRS V, PD duration) are summarized. The table shows the five best features according to Spearman’s correlation coefficient for each movement (on-surface, in-air). Appl. Sci. 2018, 8, 2566 9 of 18 In the case of UPDRS V (on-surface movement), the following FD-based features achieved a statistical significance of correlation: the median of jerk (a = 0.3, a = 0.4) and horizontal velocity (a = 0.1) for the repetitive letter l, the mean of vertical acceleration (a = 0.7) for repetitive loops, and the standard deviation of the vertical velocity (a = 0.3) for the sentence. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the maximum of horizontal jerk and velocity for the repetitive letters le, the maximum of horizontal jerk and horizontal velocity for the repetitive letter l, and the maximum of horizontal velocity for the letter l. Regarding UPDRS V (in-air movement), the following FD-based features achieved a statistical significance of correlation: the median of vertical velocity (a = 0.9, a = 0.8, a = 0.7) for the sentence and the median of horizontal velocity (a = 0.5) and vertical jerk (a = 0.3) for the repetitive letters le. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the mean of acceleration for the repetitive word lektorka, the maximum of horizontal jerk for the word porovnat, the median of the vertical velocity for the repetitive letter l, and the median of the horizontal velocity of the repetitive letters le. Table 5. Results of partial correlation analysis between handwriting features and clinical data. UPDRS V FD on-suface Conventional on-surface feature name a task r r r r task feature name p s s p jerk (median) 0.3 r. letters l 0.37 * 0.48 ** 0.45 * 0.24 r. letters le h. jerk (max) jerk (median) 0.4 r. letters l 0.43 * 0.46 * 0.43 * 0.2 r. letters le velocity (max) h. velocity (std) 0.1 r. letters l 0.42 * 0.41 * 0.42 * 0.25 r. letters l h. jerk (max) v. acceleration (mean) 0.7 r. loops 0.48 ** 0.40 * 0.42 * 0.16 r. letters l h. velocity (max) v. velocity (std) 0.3 sentence 0.40 * 0.40 * 0.41 * 0.15 letter l h. velocity (max) FD in-air Conventional in-air feature name a task r r r r task feature name p s s p v. velocity (median) 0.9 sentence 0.44 * 0.53 ** 0.43 * 0.28 r. word lektorka acceleration (mean) v. velocity (median) 0.8 sentence 0.40 * 0.52 ** 0.37 * 0.31 word porovnat h. jerk (max) h. velocity (median) 0.5 r. letters le 0.38 * 0.49 ** 0.36 * 0.25 r. letters l v. velocity (median) v. jerk (median) 0.3 r. letters le 0.43 * 0.49 ** 0.35 0.41 * r. letters le h. velocity (median) v. velocity (median) 0.7 sentence 0.37 * 0.48 ** 0.35 0.19 r. word lektorka acceleration (median) PD Duration FD on-surface Conventional on-surface feature name a task r r r r task feature name p s s p velocity (max) 0.1 spiral 0.54 ** 0.55 ** 0.46 * 0.40 * r. letters l h. velocity (max) acceleration (max) 0.8 spiral 0.54 ** 0.54 ** 0.40 * 0.37 * r. letters l h. jerk (max) acceleration (max) 0.6 spiral 0.54 ** 0.54 ** 0.38 * 0.37 * r. letters l velocity (max) acceleration (max) 0.2 spiral 0.54 ** 0.54 ** 0.46 ** 0.34 spiral v. velocity (mean) acceleration (max) 0.7 spiral 0.54 ** 0.53 ** 0.40 * 0.14 r. loops h. acceleration (mean) FD in-air Conventional in-air feature name a task r r r r task feature name p s s p jerk (median) 0.4 sentence 0.37 * 0.49 ** 0.44 * 0.38 * word lektorka h. jerk (median) jerk (max) 0.1 r. word les 0.57 ** 0.46 * 0.38 * 0.40 * word nepopad. velocity (max) jerk (max) 0.3 r. word les 0.57 ** 0.45 * 0.37 * 0.42 * word lektorka h. n. jerk (mean) velocity (max) 0.1 r. word les 0.57 ** 0.45 * 0.47 ** 0.13 r. word lektorka h. velocity (mean) jerk (max) 0.2 r. word les 0.57 ** 0.45 * 0.42 * 0.13 word nepopad. h. velocity (mean) a—order of FD; r —Pearson’s correlation coefficient; r —Spearman’s correlation coefficient; v.—vertical; p s h.—horizontal; r.—repetitive task; *—p < 0.05; **—p < 0.01; rows are ordered by the absolute value of Spearman’s correlation coefficient. For PD duration (on-surface movement), the following FD-based features achieved a statistical significance of correlation (of note: all of these features satisfied the stronger threshold for statistical significance of correlation p < 0.01): the maximum of the velocity (a = 0.1) and acceleration (a = 0.8, a = 0.7, a = 0.6, a = 0.2) for the Archimedean spiral. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): Appl. Sci. 2018, 8, 2566 10 of 18 the maximum of horizontal velocity, horizontal jerk, and velocity for the repetitive letter l; the mean of the vertical velocity for the Archimedean spiral; and the mean of horizontal acceleration for repetitive loops. For PD duration (in-air movement), the following FD-based features achieved a statistical significance of correlation: the median of jerk (a = 0.4) for sentence, the maximum of jerk (a = 0.1, a = 0.2, a = 0.3) and velocity (a = 0.1) for repetitive word les. The following conventional features achieved a statistical significance of correlation (p-value of only one of the coefficients was below the threshold): the median and mean of horizontal jerk for the word lektorka, the maximum of the velocity for the word nepopadnout, and the mean of horizontal velocity for the repetitive word lektorka and the word nepopadnout. The results of the multivariate binary classification analysis are summarized in Table 6. In total, we built and evaluated nine different classification models. These models were selected according to the following criteria: movement type (on-surface, in-air, all), feature type (FD features, conventional features, all). We built models based on the combinations of these criteria as well. For more information, see Table 4. Table 6. Results of multivariate binary classification analysis (PD/HC). Feature Set MCC ACC [%] SEN [%] SPE [%] Feat conventional on-surface 0.83  0.18 91.19  9.65 93.00  15.52 70.00  0.46 1 conventional in-air 0.95  0.10 97.14  5.71 95.50  9.07 100.00  0.00 1 conventional together 0.95  0.11 97.14  5.71 95.50  9.07 100.00  0.00 1 FD on-surface 0.95  0.12 87.14  13.48 82.00  21.24 90.00  30.00 1 FD in-air 0.95  0.13 81.43  12.86 71.50  30.83 60.00  48.99 3 FD together 0.95  0.14 81.43  15.71 69.50  32.13 70.00  45.83 2 all on-surface 0.95  0.15 88.33  14.06 89.00  22.11 70.00  45.83 2 all in-air 0.95  0.16 97.14  5.71 95.50  9.07 100.00  0.00 1 all together 0.95  0.17 97.14  5.71 95.50  9.07 100.00  0.00 1 MCC—Matthew’s correlation coefficient; ACC—accuracy; SEN—sensitivity; SPE—specificity; feat.—number of features important for the trained model (i.e., feature importance of the feature > 0.0); The feature importances, as well as the exact names of these features, are summarized in the text. With respect to the classification performance, the highest MCC achieved was 0.95 was for eight out of the total nine feature sets (with the exception being the feature set composed of conventional handwriting features computed for the on-surface movements). An interesting fact to note is that for all models based on conventional handwriting features, only a single feature was capable of providing the classification models with such a high discrimination power. In terms of the specific features important for the trained models, the following feature importances were returned by the models (feature importance quantifies the relative importance of the features in the ensemble of the trained XGBoost model [49]; therefore, the higher the value of the feature importance, the more important the feature for the prediction of the dependent variable): conventional on-surface (horizontal jerk (median) of repetitive loops), conventional in-air (horizontal velocity (median) of the sentence), conventional together (horizontal velocity (median) of the sentence), FD on-surface (jerk (max) a = 0.3 of the letters le), FD in-air (vertical acceleration (mean) a = 0.6 of the word nepopadnout (FI = 0.33), horizontal jerk (mean) a = 0.9 of the word nepopadnout (FI = 0.33), horizontal jerk (mean) a = 0.2 of the repetitive word lektorka (FI = 0.33)), FD together (jerk (max) a = 0.3 of the letters le (on-surface; FI = 0.67), horizontal jerk (mean) a = 0.9 of the word nepopadnout (in-air; FI = 0.33)), all on-surface (horizontal jerk (median) of repetitive loops (FI = 0.50), jerk (max) a = 0.3 of the letters le (FI = 0.50)), all in-air (horizontal velocity (median) of the sentence), and all together (horizontal velocity (median) of the sentence (in-air)). The results of multivariate regression analysis are summarized in Table 7. For this purpose, we used UPDRS V and PD duration as our target variables. As in the case of binary classification, we built and evaluated nine different regression models according to the same criteria. For each of the Appl. Sci. 2018, 8, 2566 11 of 18 rating scales, the table shows the results achieved using the trained models and the associated feature importance values. All obtained results are discussed in the following section. Table 7. Results of regression analysis for clinical data. Feature Set MAE RMSE EER [%] Feat UPDRS V conventional on-surface 0.59  0.29 0.71  0.41 13.82  6.71 1 conventional in-air 0.60  0.30 0.72  0.42 14.01  6.98 1 conventional together 0.60  0.31 0.73  0.42 14.05  6.90 1 FD on-surface 0.60  0.32 0.65  0.45 12.51  7.55 1 FD in-air 0.60  0.33 0.68  0.43 13.49  7.29 1 FD together 0.60  0.34 0.66  0.45 13.06  7.55 2 all on-surface 0.60  0.35 0.65  0.45 12.51  7.55 1 all in-air 0.60  0.36 0.71  0.43 13.72  7.36 1 all together 0.60  0.37 0.66  0.45 13.06  7.55 2 PD duration conventional on-surface 4.29  0.94 5.03  1.09 24.52  5.39 18 conventional in-air 4.91  1.38 5.56  1.50 28.03  7.85 16 conventional together 4.14  1.32 4.85  1.52 23.64  7.55 16 FD on-surface 4.45  0.66 5.06  0.85 25.40  3.75 14 FD in-air 4.79  0.73 5.48  0.72 27.36  4.20 19 FD together 4.55  0.68 5.32  0.78 26.00  3.88 21 all on-surface 4.48  0.86 5.12  0.96 25.62  4.92 16 (12 F, 4 C) all in-air 4.95  1.18 5.59  1.17 28.30  6.75 17 (13 F, 4 C) all together 4.70  1.10 5.45  1.23 26.82  6.30 17 (12 F, 6 C) UPDRS V—Unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score [40]; MAE—mean absolute error; RMSE—root mean squared error; EER—estimation error rate; F—FD-based features; C—conventional handwriting features; feat.—number of features important for the trained model (i.e., feature importance of the feature > 0.0); The feature importances, as well as the exact names of these features for models built to assess UPDRS V, are summarized in the text. In the case of PD duration, this data can be found in Table S1 provided in the Supplementary Material. Considering EER as our performance evaluation metric, the following results are worth pointing out. In the case of UPDRS V, the lowest EER was achieved using a single FD-based feature—specifically, the standard deviation of vertical velocity (a = 0.1) computed for the on-surface movements (12.51 7.55%). The same feature was selected when both FD and conventional features were considered while building the model. In general, all models achieved an EER of around 12–13%. In comparison with the conventional features, the FD-based features performed better, with a difference of about 1%. In terms of the specific features important for the trained models, the following feature importances were returned by the models: conventional on-surface (vertical normalized jerk (mean) of the repetitive word lektorka), conventional in-air (vertical velocity (mean) of the sentence), conventional together (vertical velocity (mean) of the sentence), FD on-surface (vertical velocity (std) a = 0.1 of the sentence), FD in-air (vertical velocity (median) a = 0.3 of the sentence), FD together (vertical velocity (std) a = 0.1 of the sentence (on-surface; FI = 0.50), vertical velocity (median) a = 0.3 of the sentence (in-air; FI = 0.50)), all on-surface (vertical velocity (std) a = 0.1 of the sentence), all in-air (vertical velocity (median) a = 0.3 of the sentence), and all together (vertical velocity (std) a = 0.1 of the sentence (on-surface; FI = 0.50), vertical velocity (median) a = 0.3 of the sentence (in-air; FI = 0.50)). With respect to PD duration, the lowest EER was achieved using conventional handwriting features computed for both on-surface as well as in-air movements (23.64 7.55%). 4. Discussion To the best of our knowledge, except for our pilot work [20], there are no prior studies which integrate FD into a handwriting parameterization for quantitative PD dysgraphia analysis. Therefore, the results published in this paper are exploratory in nature. Appl. Sci. 2018, 8, 2566 12 of 18 In comparison with the conventional kinematic features, FD-based ones correlate more significantly with the clinical characteristics (UPDRS V and PD duration). We observed especially strong correlations for handwriting tasks based on the periodic repetition of specific movements (Archimedean spiral; repetitive letter l, syllable le, or word les). Although the levels of significance based on the conventional handwriting parameters are lower, similar handwriting tasks are involved in the most significant results. We hypothesize that this is due to their ability to highlight or better quantify the cardinal motor symptoms of PD. For example, the most significant relationship between handwriting performance and PD duration was identified in acceleration extracted from the Archimedean spiral. Rigidity combined with tremor and/or bradykinesia makes a PD patient’s handwriting/drawing less fluent (increased changes in velocity and higher acceleration). This is highlighted in a task such as the spiral, where the proper coordination of the fingers, wrist, and arm is required. Generally, the observed problems with coordination are in line with the work of Dounskaia et al. [51] and Teulings et al. [52]. To better illustrate these manifestations, Figure 3 plots the velocity profiles of repetitive loops for a healthy control and a PD patient. As can be seen, the patient introduced more changes in velocity, and their drawing became much more non-fluent. To summarize these findings, FD features in combination with properly selected tasks provide a stronger relationship with the severity and progress of PD. Healthly control 1.0 1.10 0.8 1.05 0.6 0.4 1.00 0.2 0.95 0.0 1.65 1.70 1.75 0 1 2 3 4 5 6 7 8 9 X [cm] time [s] PD patient 0.5 0.4 1.04 0.3 1.02 0.2 1.00 0.1 0.98 0.0 1.125 1.150 1.175 0 1 2 3 4 5 X [cm] time [s] Figure 3. Handwriting samples of the repetitive loop task for HC and PD patients are on the left, and the resulting velocity profiles are on the right. On the other hand, in terms of binary classification, the conventional parameters provided the best results. The classification performance is remarkable: ACC = 97.74%, SEN = 95.50%, and SPE = 100%. In fact, our results represent the highest classification accuracy that has ever been reported based on the PaHaW database (see Table 1). We hypothesize that the improvement was caused by the inclusion of the state-of-the-art XGBoost algorithm into our machine learning pipelines. As already mentioned, the result is based on one in-air feature: median horizontal velocity of a sentence. In comparison with the HC cohort, the PD patients exhibited much lower values of this measure, i.e., while writing the sentence, the PD patients were not able to perform horizontal transitions (movement between neighboring letters or words) as quickly as the HC could. This finding is in line with the work of Ma et al. [53], who observed that wrist extension stiffness in PD patients makes the handwriting in the horizontal direction more problematic. Therefore, scientists started to use the term horizontal Y [cm] Y [cm] velocity [cm/s] velocity [cm/s] Appl. Sci. 2018, 8, 2566 13 of 18 dysgraphia [13]. Generally, vertical or horizontal dysgraphia may be considered a presymptomatic neurobehavioral biomarker of PD with possible significance in early PD diagnosis [13]. In [20], we proved that the FD features improved the accuracy of PD dysgraphia diagnosis in the Archimedean spiral drawing task by 10%. Contrary to our pilot results, in the frame of this work, these features did not lead to any improvements. After a deeper analysis, we found that this was caused by a combined task approach. Performance of the Archimedean spiral is a quasiparticle and continuous task with some repetitive patterns. It looks as though the FD features work especially well in these specific cases. Nevertheless, when combining these tasks with a complex handwriting task (such as a sentence), the measures quantifying in-air movement tend to be more discriminative (in our case, the median in-air horizontal velocity of a sentence). This brings us to the same conclusion that was given during the correlation analysis—the FD features advance the PD dysgraphia diagnosis only in some specific cases. The best regression model, estimating the UPDRS V score with a 12.51% error, is based only on the standard deviation of on-surface vertical velocity (a = 0.1) extracted from the sentence. This FD-based parameter was selected from the feature set combining all on-surface measures; therefore, we can confirm the positive influence of FC on the regression analysis performance. In fact, the FD features outperformed the conventional ones in all scenarios. To better understand this result, we plotted vertical velocity patterns of the sentence task for different orders of FD (see Figure 4). We can observe a big difference between a = 0.1 and the rest of the orders, including the full derivative. This large distance means that we are working with completely new information that is far from that contained in the full derivative. Although it is difficult to clinically interpret this information, it is clear that FC opens new possibilities for monitoring PD severity. = 0.1 = 0.6 = 0.2 = 0.7 = 0.3 = 0.8 = 0.4 = 0.9 = 0.5 = 1.0 0 2 4 6 8 10 12 time [s] Figure 4. Vertical velocity patterns of the sentence task for different orders of fractional-order derivatives (FD). Regarding the PD duration estimation results, the most successful model (EER = 23.46%) consists of 16 conventional on-surface/in-air features (all features’ importance values can be found in Supplementary Table S1). The most frequent feature with the highest feature importance is the jerk extracted from several handwriting tasks. This probably means that as PD progresses, handwriting becomes more jerky and velocity [-] Appl. Sci. 2018, 8, 2566 14 of 18 irregular. Vertical velocity is the second most frequent feature involved in the models, which is probably linked with micrographia. Generally, in the case of PD duration estimation, the FD-based features did not yield any improvement. In conclusion, the FD-based features are better for modeling PD severity (in terms of UPDRS V score estimation), but they do not lead to an improvement in PD duration modeling. The progress of PD is nonlinear and very individual. This means that patients with the same PD duration can be in different stages of the disease. This fact supports our results: the estimation error of PD duration was generally much worse than the estimation error of the UPDRS V score. Since PD duration estimation is a difficult task with poor results, fine improvements based on FD parameters play no role. 5. Conclusions This study deals with advanced approaches to PD dysgraphia diagnosis and monitoring based on FC integrated with online handwriting/drawing parameterization. To the best of our knowledge, it is the first work that performs a complex investigation into the possibilities for FC in online handwriting processing and proposes new advances in kinematic analyses based on FD. Although the conventional features provided better and very high classification accuracy, which is at the top of the state-of-the-art analyses based on the PaHaW database (ACC = 97.74%, SEN = 95.50%, and SPE = 100%), the newly designed parameters were proven to work better for specific tasks (continuous and/or repetitive, such as the Archimedean spiral) and for specific applications, i.e., PD severity estimation (EER = 12.51%). However, our results need to be confirmed by subsequent scientific research. This study has several limitations and suggestions for further improvements. Since the dataset is small, to be able to generalize the results, bigger databases should be involved. On the other hand, it is common to have such small numbers of PD patients and HC samples in PD dysgraphia analysis, e.g., see our review in Table 1. Next, we considered only the kinematic measures. To better evaluate the discrimination power of the FD features and better evaluate their ability to estimate PD severity or progress, other feature types, such as temporal, spatial, and dynamic, should be included in future comparisons. Finally, the FD-based parameters could be further explored. For instance, we can consider other approximations (e.g., Caputo) or employ FC for other measures (e.g., entropies). Supplementary Materials: The following are available online at http://www.mdpi.com/2076-3417/8/12/2566/s1, Table S1: Feature relevance from multivariate regression (modeling PD duration). Author Contributions: Conceptualization, J.M. (Jan Mucha), J.M. (Jiri Mekyska) and M.F.-Z.; methodology, J.M. (Jan Mucha), M.F.-Z. and J.M. (Jiri Mekyska); software, J.M. (Jan Mucha) and J.M. (Jiri Mekyska); validation, J.M. (Jan Mucha); formal analysis, J.M. (Jan Mucha) and J.M. (Jiri Mekyska); investigation, J.M. (Jan Mucha), M.F.-Z., J.M. (Jiri Mekyska), I.R. and L.B.; resources, I.R. and L.B.; data curation, J.M. (Jan Mucha), J.M. (Jiri Mekyska), Z.G., V.Z., T.K., L.B. and I.R.; writing—original draft preparation, J.M. (Jan Mucha), J.M. (Jiri Mekyska), Z.G. and M.F.-Z.; visualization, J.M. (Jan Mucha) and Z.G.; supervision, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I.; project administration, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I.; funding acquisition, J.M. (Jiri Mekyska), Z.S., I.R., M.F.-Z. and K.L.-d.-I. Funding: This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 734718 (CoBeN). In addition, this work was supported by the grant of the Czech Science Foundation 18-16835S (Research of advanced developmental dysgraphia diagnosis and rating methods based on quantitative analysis of online handwriting and drawing) and the following projects: LO1401, FEDER and MEC, and TEC2016-77791-C4-2-R from the Ministry of Economic Affairs and Competitiveness of Spain. This article is based upon work from COST Action CA15225, a network supported by COST (European Cooperation in Science and Technology), and, for the research, infrastructure of the SIX Center was used. Conflicts of Interest: The authors declare no conflict of interest. Disclaimer: The article reflects only the authors’ view, and the Research Executive Agency (REA) is not responsible for any use that may be made of the information it contains. Appl. Sci. 2018, 8, 2566 15 of 18 Abbreviations The following abbreviations are used in this manuscript: ACC accuracy ADA AdaBoost ANN artificial neural network ANOVA analysis of variance AUC area under the ROC curve CNN convolutional neural network EMD empirical mode decomposition EER estimated error rate FN false negatives FP false positives FC fractional calculus FD fractional-order derivative FI feature importance K-NN K-nearest neighbors LED L-dopa equivalent daily dose LDA linear discriminant analysis MCC Matthew’s correlation coefficient max maximum MAE mean absolute error NB naïve Bayes classifier OPF optimum path forest PD Parkinson’s disease RF random forests RMSE root mean squared error SEN sensitivity r Pearson’s correlation coefficient r Spearman’s correlation coefficient SPE specificity std standard deviation TN true negatives TP true positives SVM support vector machine UPDRS V unified Parkinson’s disease rating scale, part V: Modified Hoehn and Yahr staging score References 1. Bellou, V.; Belbasis, L.; Tzoulaki, I.; Evangelou, E.; Ioannidis, J.P. Environmental risk factors and Parkinson’s disease: An umbrella review of meta-analyses. Parkinsonism Relat. Disord. 2016, 23, 1–9. [CrossRef] [PubMed] 2. Berg, D.; Postuma, R.B.; Adler, C.H.; Bloem, B.R.; Chan, P.; Dubois, B.; Gasser, T.; Goetz, C.G.; Halliday, G.; Joseph, L.; et al. MDS research criteria for prodromal Parkinson’s disease. Mov. Disord. 2015, 30, 1600–1611. [CrossRef] [PubMed] 3. Sapir, S.; Ramig, L.; Fox, C. Speech and swallowing disorders in Parkinson disease. Curr. Opin. Otolaryngol. Head Neck Surg. 2008, 16, 205–210. [CrossRef] [PubMed] 4. Hirsch, L.; Jette, N.; Frolkis, A.; Steeves, T.; Pringsheim, T. The incidence of Parkinson’s disease: A systematic review and meta-analysis. Neuroepidemiology 2016, 46, 292–300. [CrossRef] [PubMed] 5. Hornykiewicz, O. Biochemical aspects of Parkinson’s disease. Neurology 1998, 51, S2–S9. [CrossRef] [PubMed] 6. Elbaz, A.; Carcaillon, L.; Kab, S.; Moisan, F. Epidemiology of Parkinson’s disease. Rev. Neurol. 2016, 172, 14–26, doi:10.1016/j.neurol.2015.09.012. [CrossRef] [PubMed] 7. Contreras-Vidal, J.L.; Poluha, P.; Teulings, H.L.; Stelmach, G.E. Neural dynamics of short and medium-term motor control effects of levodopa therapy in Parkinson’s disease. Artif. Intell. Med. 1998, 13, 57–79. [CrossRef] Appl. Sci. 2018, 8, 2566 16 of 18 8. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Faundez-Zanuy, M. Evaluation of handwriting kinematics and pressure for differential diagnosis of Parkinson’s disease. Artif. Intell. Med. 2016, 67, 39–46. [CrossRef] 9. Brabenec, L.; Mekyska, J.; Galaz, Z.; Rektorova, I. Speech disorders in Parkinson’s disease: Early diagnostics and effects of medication and brain stimulation. J. Neural Transm. 2017, 124, 303–334. [CrossRef] 10. Mucha, J.; Galaz, Z.; Mekyska, J.; Kiska, T.; Zvoncak, V.; Smekal, Z.; Eliasova, I.; Mrackova, M.; Kostalova, M.; Rektorova, I.; et al. Identification of hypokinetic dysarthria using acoustic analysis of poem recitation. In Proceedings of the 2017 40th International Conference on Telecommunications and Signal Processing (TSP), Barcelona, Spain, 5–7 July 2017; pp. 739–742. 11. De Stefano, C.; Fontanella, F.; Impedovo, D.; Pirlo, G.; di Freca, A.S. Handwriting analysis to support neurodegenerative diseases diagnosis: A review. Pattern Recognit. Lett. 2018, in press. [CrossRef] 12. Rosenblum, S.; Samuel, M.; Zlotnik, S.; Erikh, I.; Schlesinger, I. Handwriting as an objective tool for Parkinson’s disease diagnosis. J. Neurol. 2013, 260, 2357–2361. [CrossRef] 13. Thomas, M.; Lenka, A.; Kumar Pal, P. Handwriting Analysis in Parkinson’s Disease: Current Status and Future Directions. Mov. Disord. Clin. Pract. 2017, 4, 806–818. [CrossRef] [PubMed] 14. McLennan, J.; Nakano, K.; Tyler, H.; Schwab, R. Micrographia in Parkinson’s Disease. J. Neurol. Sci. 1972, 15, 141–152. [CrossRef] 15. Letanneux, A.; Danna, J.; Velay, J.L.; Viallet, F.; Pinto, S. From micrographia to Parkinson’s disease dysgraphia. Mov. Disord. 2014, 29, 1467–1475. [CrossRef] [PubMed] 16. Drotar, P.; Mekyska, J.; Smekal, Z.; Rektorova, I.; Masarova, L.; Faundez-Zanuy, M. Contribution of different handwriting modalities to differential diagnosis of Parkinson’s Disease. In Proceedings of the 2015 IEEE International Symposium Medical Measurements and Applications (MeMeA), Turin, Italy, 7–9 May 2015; pp. 1–5. 17. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Zanuy, M.F. Decision support framework for Parkinson’s disease based on novel handwriting markers. IEEE Trans. Neural Syst. Rehabil. Eng. 2015, 23, 508–516. [CrossRef] [PubMed] 18. Loconsole, C.; Trotta, G.F.; Brunetti, A.; Trotta, J.; Schiavone, A.; Tato, S.I.; Losavio, G.; Bevilacqua, V. Computer Vision and EMG-Based Handwriting Analysis for Classification in Parkinson’s Disease. In Intelligent Computing Theories and Application; Springer International Publishing: Cham, Switzerland, 2017; pp. 493–503, doi:10.1007/978-3-319-63312-1_43. 19. Nackaerts, E.; Broeder, S.; Pereira, M.P.; Swinnen, S.P.; Vandenberghe, W.; Nieuwboer, A.; Heremans, E. Handwriting training in Parkinson’s disease: A trade-off between size, speed and fluency. PLoS ONE 2017, 12, e0190223. [CrossRef] [PubMed] 20. Mucha, J.; Zvoncak, V.; Galaz, Z.; Faundez-Zanuy, M.; Mekyska, J.; Kiska, T.; Smekal, Z.; Brabenec, L.; Rektorova, I.; Lopez-de Ipina, K. Fractional Derivatives of Online Handwriting: A New Approach of Parkinsonic Dysgraphia Analysis. In Proceedings of the 2018 41st International Conference on Telecommunications and Signal Processing (TSP), Athens, Greece, 4–6 July 2018; pp. 214–217. 21. Kotsavasiloglou, C.; Kostikis, N.; Hristu-Varsakelis, D.; Arnaoutoglou, M. Machine learning-based classification of simple drawing movements in Parkinson’s disease. Biomed. Signal Process. Control 2017, 31, 174–180. [CrossRef] 22. Taleb, C.; Khachab, M.; Mokbel, C.; Likforman-Sulem, L. Feature selection for an improved Parkinson’s disease identification based on handwriting. In Proceedings of the 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), Nancy, France, 3–5 April 2017; pp. 52–56. 23. Moetesum, M.; Siddiqi, I.; Vincent, N.; Cloppet, F. Assessing visual attributes of handwriting for prediction of neurological disorders: A case study on Parkinson’s disease. Pattern Recognit. Lett. 2018. [CrossRef] 24. Heremans, E.; Nackaerts, E.; Vervoort, G.; Vercruysse, S.; Broeder, S.; Strouwen, C.; Swinnen, S.P.; Nieuwboer, A. Amplitude Manipulation Evokes Upper Limb Freezing during Handwriting in Patients with Parkinson’s Disease with Freezing of Gait. PLoS ONE 2015, 10, e0142874. [CrossRef] [PubMed] 25. Pereira, C.R.; Pereira, D.R.; da Silva, F.A.; Hook, C.; Weber, S.A.; Pereira, L.A.; Papa, J.P. A step towards the automated diagnosis of parkinson’s disease: Analyzing handwriting movements. In Proceedings of the 2015 IEEE 28th International Symposium on Computer-Based Medical Systems (CBMS), Sao Carlos, Brazil, 22–25 June 2015; pp. 171–176. Appl. Sci. 2018, 8, 2566 17 of 18 26. Heremans, E.; Nackaerts, E.; Broeder, S.; Vervoort, G.; Swinnen, S.P.; Nieuwboer, A. Handwriting Impairments in People With Parkinson’s Disease and Freezing of Gait. Neurorehabil. Neural Repair 2016, 30, 911–919. [CrossRef] 27. Pereira, C.R.; Weber, S.A.; Hook, C.; Rosa, G.H.; Papa, J.P. Deep Learning-Aided Parkinson. In Proceedings of the 2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Sao Paulo, Brazil, 4–7 October 2016; pp. 340–346. 28. Impedovo, D.; Pirlo, G.; Vessio, G. Dynamic Handwriting Analysis for Supporting Earlier Parkinson’s Disease Diagnosis. Information 2018, 9, 247. [CrossRef] 29. Baleanu, D.; Jajarmi, A.; Bonyah, E.; Hajipour, M. New aspects of poor nutrition in the life cycle within the fractional calculus. Adv. Differ. Equ. 2018, 2018, 230. [CrossRef] 30. Khalili Golmankhaneh, A.; Baleanu, D. New Derivatives on the Fractal Subset of Real-Line. Entropy 2016, 18, 1. [CrossRef] 31. Podlubny, I. Fractional Differential Equations an Introduction to Fractional Derivatives, Fractional Differential Equations, to Methods of Their Solution and Some of Their Applications; Academic Press: San Diego, CA, USA, 1999. 32. Uchaikin, V.V. Fractional Derivatives for Physicists and Engineers; Springer: Berlin, Germany, 2013; Volume 2. 33. Samko, S.G.; Kilbas, A.A.; Marichev, O.I. Fractional Integrals and Derivatives: Theory and Applications; Gordon and Breach: Yverdon, Switzerland, 1993; p. 44. 34. Arshad, S.; Baleanu, D.; Bu, W.; Tang, Y. Effects of HIV infection on CD4+ T-cell population based on a fractional-order model. Adv. Differ. Equ. 2017, 2017, 92. [CrossRef] 35. Pinto, C.M.; Machado, J.T. Fractional model for malaria transmission under control strategies. Comput. Math. Appl. 2013, 66, 908–916. doi:10.1016/j.camwa.2012.11.017. [CrossRef] 36. Lopes, A.M.; Machado, J.T. Integer and fractional-order entropy analysis of earthquake data series. Nonlinear Dyn. 2016, 84, 79–90. [CrossRef] 37. Lopes, A.M.; Machado, J.T. Application of fractional techniques in the analysis of forest fires. Int. J. Nonlinear Sci. Numer. Simul. 2016, 17, 381–390. [CrossRef] 38. Scalas, E.; Gorenflo, R.; Mainardi, F. Fractional calculus and continuous-time finance. Phys. A Stat. Mech. Its Appl. 2000, 284, 376–384. [CrossRef] 39. Baleanu, D. Fractional Calculus: Models and Numerical Methods; World Scientific: Singapore, 2012; Volume 3. 40. Fahn, S.; Elton, R.L. UPDRS Development Committee (1987) Unified Parkinson’s Disease Rating Scale. In Recent Developments in Parkinson’s Disease; Macmillan: Florham Park, NJ, USA, 1987. 41. Lee, J.Y.; Kim, J.W.; Lee, W.Y.; Kim, J.M.; Ahn, T.B.; Kim, H.J.; Cho, J.; Jeon, B.S. Daily dose of dopaminergic medications in Parkinson’s disease: Clinical correlates and a posteriori equation. Neurol. Asia 2010, 15, 137–143. 42. Sesa-Nogueras, E.; Faundez-Zanuy, M.; Mekyska, J. An information analysis of in-air and on-surface trajectories in online handwriting. Cogn. Comput. 2012, 4, 195–205. [CrossRef] 43. Alonso-Martinez, C.; Faundez-Zanuy, M.; Mekyska, J. A comparative study of in-air trajectories at short and long distances in online handwriting. Cogn. Comput. 2017, 9, 712–720. [CrossRef] [PubMed] 44. Drotar, P.; Mekyska, J.; Rektorova, I.; Masarova, L.; Smekal, Z.; Faundez-Zanuy, M. A new modality for quantitative evaluation of Parkinson’s disease: In-air movement. In Proceedings of the 13th IEEE International Conference on BioInformatics and BioEngineering (BIBE 2013), Chania, Greece, 10–13 November 2013; pp. 1–4. 45. Mekyska, J.; Faundez-Zanuy, M.; Mzourek, Z.; Galaz, Z.; Smekal, Z.; Rosenblum, S. Identification and rating of developmental dysgraphia by handwriting analysis. IEEE Trans. Hum.-Mach. Syst. 2017, 47, 235–248. [CrossRef] 46. Tenreiro Machado, J. Fractional coins and fractional derivatives. Abstr. Appl. Anal. 2013, 2013, 205097. [CrossRef] 47. Kilbas, A.A.; Srivastava, H.M.; Trujillo, J.J. Theory and Applications of Fractional Differential Equations; Elsevier: Amsterdam, The Netherlands, 2006. 48. Scherer, R.; Kalla, S.L.; Tang, Y.; Huang, J. The Grünwald–Letnikov method for fractional differential equations. Comput. Math. Appl. 2011, 62, 902–917. [CrossRef] Appl. Sci. 2018, 8, 2566 18 of 18 49. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. 50. Matthews, B.W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim. Biophys. Acta 1975, 405, 442–451. [CrossRef] 51. Dounskaia, N.; Van Gemmert, A.W.; Leis, B.C.; Stelmach, G.E. Biased wrist and finger coordination in Parkinsonian patients during performance of graphical tasks. Neuropsychologia 2009, 47, 2504–2514. [CrossRef] 52. Teulings, H.L.; Contreras-Vidal, J.L.; Stelmach, G.E.; Adler, C.H. Parkinsonism Reduces Coordination of Fingers, Wrist, and Arm in Fine Motor Control. Exp. Neurol. 1997, 146, 159–170. [CrossRef] 53. Ma, H.I.; Hwang, W.J.; Chang, S.H.; Wang, T.Y. Progressive micrographia shown in horizontal, but not vertical, writing in Parkinson’s disease. Behav. Neurol. 2013, 27, 169–174. [CrossRef] c 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Journal

Applied SciencesMultidisciplinary Digital Publishing Institute

Published: Dec 11, 2018

There are no references for this article.