Skip to main content

A generic outcome assessment of mobility capacity in neurorehabilitation: measurement properties of the de Morton Mobility Index



Mobility capacity is a key outcome domain in neurorehabilitation. The de Morton Mobility Index (DEMMI), an established and generic outcome assessment of mobility capacity in older patients, is promising for use in neurorehabilitation. The aim of this study was to examine the measurement properties of the DEMMI in rehabilitation inpatients with neurological conditions.


Cross-sectional study including a mixed sample of adult inpatients in a neurorehabilitation hospital. Structural validity, unidimensionality and measurement invariance (Rasch analysis), construct validity, internal consistency reliability, and inter-rater reliability of the DEMMI (scale range: 0–100 points) were established. The minimal detectable change, the 95% limits of agreement, and possible floor and ceiling effects were calculated to indicate interpretability.


We analyzed validity (n = 348) and reliability (n = 133) in two samples. In both samples, the majority of participants had a sub-acute stroke or Parkinson’s disease.

Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 59.4, P = 0.074). There was no relevant measurement invariance by disease group. Hypotheses-based correlation analyses (DEMMI and other functional outcome assessments) showed sufficient construct validity. Internal consistency reliability (Cronbach’s alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.94; 95% confidence interval: 0.91–0.95) were sufficient. The minimal detectable change with 90% confidence was 15.0 points and the limits of agreement were 39%. No floor or ceiling effects were observed.


Results indicate sufficient measurement properties of the DEMMI in rehabilitation inpatients with neurological conditions. The DEMMI can be used as a generic outcome assessment of mobility capacity in neurorehabilitation.

Trial registration

German Clinical Trials Register (DRKS00004681). Registered May 6, 2013.

Peer Review reports


For individuals with neurological conditions, mobility limitations are a frequent and critical issue which negatively affect independence in daily living and quality of life, and increase the risk of falls [1, 2]. Thus, improvements in mobility capacity, especially in walking and balancing, are considered the most important rehabilitation goals of patients with neurological conditions [3,4,5].

Although guideline-directed interventions vary for different neurological conditions, such as stroke or Parkinson’s disease (PD) [6, 7], patients are often treated in inpatient or outpatient rehabilitation facilities which are not focused on a single disorder. In such settings, generic outcome assessments are used to measure outcome domains that are common in many disease groups. For example, the Functional Independence Measure is used as generic measure of disability across different neurological conditions [8, 9].

Mobility capacity, too, is a health-related outcome domain relevant to neurorehabilitation that is often assessed across different disease groups with generic outcome assessments. However, commonly used assessments, such as timed walking tests for gait speed, the timed up and go test, and the 6-min walk test [5, 10,11,12], cover only a limited range of mobility components (e.g., walking on plane surface) and their clinical usability is limited, because they require the patient’s ability to walk. Thus, a considerable number of individuals cannot be assessed, particularly in the early and sub-acute stages of rehabilitation after an acute neurological event, such as a stroke [13, 14]. Ceiling effects also impact clinical utility, particularly when an assessment is intended to measure progress over the duration of recovery [10].

An ideal generic outcome assessment of mobility capacity in neurorehabilitation needs to fulfil the following characteristics: performance-based, measure on interval level, affordable, easy to learn for assessors, feasible, safe, valid over the whole mobility spectrum, sound measurement properties (e.g. validity, reliability, responsiveness), and invariant across disease groups and other patient characteristics [12]. A clinical outcome assessment that fulfils these requirements in geriatric care is the de Morton Mobility Index (DEMMI) [15]. The DEMMI was developed based on the Rasch model [16, 17] to measure the mobility capacity of older hospital patients [15] – a heterogenous population of individuals with a broad range of limitations in self-care, mobility, and cognition. This performance-based clinical outcome assessment has a broad scale width, covering low- to high-order mobility abilities and producing interval-level test scores. The DEMMI’s measurement properties have been examined in various health conditions and care settings, indicating sufficient validity, reliability, responsiveness to change and unidimensionality in acute and sub-acute older patients [15, 18,19,20], osteoarthritis [21], hip fracture [22] dementia [23,24,25], and critically ill patients [26]. Some studies provide very promising evidence that the DEMMI is a feasible, valid, reliable and unidimensional assessment for individuals with neurological conditions, such as stroke [27, 28], PD [29, 30], and mixed neurological conditions [27]. The DEMMI form fits on one paper sheet and can be administered by health professionals within 5─10 min without special equipment [15, 20, 23].

However, the DEMMIs suitability as a generic outcome assessment of mobility capacity and most of its measurement properties in neurorehabilitation have never been investigated. We hypothesized that the DEMMI is a generic outcome assessment of mobility capacity in neurorehabilitation and we aimed to evaluate the DEMMI’s measurement properties in a mixed sample of rehabilitation inpatients with neurological conditions.


Reporting of this study was informed by the STROBE guideline for observational studies, the GRRAS guideline for reliability studies and the criteria of the COSMIN risk of bias checklist [31,32,33,34].

Design and setting

We performed a cross-sectional study of the DEMMI’s measurement properties in neurorehabilitation. This study was approved by the Local Committee for Ethics in Medical Research (Canton of Thurgau, Switzerland: 2013/13), registered a priori (German Clinical Trials Register: DRKS00004681), performed according to the ethical principles described in the Declaration of Helsinki, and all participants gave written informed consent. All methods were performed in accordance with the relevant guidelines and regulations.

Briefly, rehabilitation inpatients with neurological conditions were examined with the DEMMI and a set of functional assessments (listed below) on several occasions during their rehabilitation course to analyze the DEMMI’s psychometric properties. The present study reports on the DEMMI’s structural and construct validity, internal consistency, inter-rater reliability, measurement error, interpretability, and feasibility for the complete sample of rehabilitation inpatients with neurological conditions. The DEMMI’s measurement properties for sub-samples of the total trial sample with stroke (n = 121) and PD (n = 116) have been published previously [28, 29].

The study was conducted in a neurological rehabilitation hospital in Switzerland, where patients were typically referred from acute hospitals, neurologist consultants, or general practitioners located in the eastern and central parts of Switzerland.


The study sample consisted of all inpatients present on May 8, 2013 or entering the rehabilitation hospital consecutively within the following 20 weeks. Inclusion criteria were a neurological disorder and an age of 18 years and older. The main exclusion criteria were severe cognitive impairment and a contraindication for mobilization (for all criteria, see Fig. 1).

Fig. 1
figure 1

Flow chart of study participants


Eligible participants were examined by the primary investigator (TB) in a single session of 30─45 min scheduled within the first 7 days after hospital admission, if possible. The DEMMI and a comprehensive set of functional assessments were performed in a standardized order (baseline).

The participants’ socio-demographic data were taken from the medical records. For common disorders, disease-specific measures were performed to describe disease severity and functional capacity. For participants with stroke, the National Institutes of Health Stroke Scale was assessed to measure the global severity of stroke symptoms [35]. For participants with PD and Multiple Sclerosis, Hoehn and Yahr staging [36] and the Expanded Disability Status Scale [37] were completed by the hospital neurologist, respectively. In all three scales, higher scores indicate higher impairment or disease severity.

Inter-rater reliability was examined between 2 trained and experienced physiotherapists, the primary investigator (TB) and a second rater (DM). Characteristics of both raters are described elsewhere [28, 29].

The second rater performed the DEMMI independently in a convenient sub-sample (reliability sample). Participant selection was mainly based on the second rater’s availability (temporal resources) and on participants’ consent to perform a second study assessment. Both DEMMI assessments were performed within 2 days. To create a stable retest situation, participants were excluded if they reported a change in their physical or mental condition with respect to the first session (e.g., fatigue, pain, ON/OFF state in PD). The test environment (patient’s room) was similar for both sessions (baseline and retest). Both raters were blinded toward each other’s ratings and we tried to balance the number of participants each rater visited first.

A sample size of ≥ 50 participants for reliability studies has been proposed to be “good” at the times of study conduction [38, 39]. However, within the initial recruitment period (20 weeks), we could not include ≥ 50 participants for each major sub-sample of participants with stroke and PD, respectively. Hence, we set up a second recruitment period, using the same inclusion criteria, and screened all present and incoming patients over a period of 9 consecutive days. This additional sample of convenience was only included in the inter-rater reliability analysis.


Participants were assessed with the DEMMI, together with a set of functional assessments, including Berg Balance Scale, timed up and go test, 10-m walk test, Functional Ambulation Categories (FAC), 6-min walk test, Performance Oriented Mobility Assessment, and Functional Independence Measure. For the sub-samples of participants with stroke and PD, we performed additional functional assessments, which were only used to analyze these sub-samples [28, 29].

A detailed description of the assessment procedures and a description of the comparator assessments are given in the Additional file 1. Table 1 provides an overview of the scale width and constructs measured by the comparator assessments.

Table 1 Construct validity of the de Morton Mobility Index (n = 348) including the hypotheses on construct validity and the constructs of the comparison measurement instruments


The DEMMI is a performance-based clinical outcome assessment of mobility capacity, consisting of 15 hierarchical mobility items [15, 20, 40, 41]. The patient is asked to perform functional tasks related to bed and chair mobility, ambulation, static balance, and dynamic balance. The items are rated with 2-or 3-point response options, resulting in a maximum ordinal score of 19 points. This raw score is transformed into a total interval DEMMI score of 0–100 points, with higher scores indicating a higher level of mobility capacity.

Statistical analysis

Data were analyzed using SPSS version 23.0 and Microsoft Excel (Professional Plus 2016) for all analyses except the Rasch analysis, which was completed using RUMM2030 version 5.1 software. Descriptive statistics were used to present sample characteristics. Interval-based data were examined for normal distribution with the Shapiro–Wilk test of normality and by visual inspection of the related histograms and P-P-plots. The DEMMI scores were not normally distributed (p < 0.001); therefore, only non-parametric statistics were applied. A significance value of 5% was used.

Measurement properties

Structural validity (Rasch analysis)

The Rasch model is a probabilistic model asserting that item response is a logistic function of item difficulty and person ability [16]. The DEMMI was developed based on the Rasch model in geriatric inpatients [15] and data fitted the model in various other medical conditions [20, 22, 23, 28, 30].

We performed a Rasch analysis to evaluate the following properties of the DEMMI in neurological inpatients: stochastic (probabilistic) ordering of items, monotonicity (increase in item responses consistent with the underlying trait), local item independence (zero correlation between items when conditioned on the score), unidimensionality, and group invariance (no difference in response to item by group membership when at the same level of (in this case) ‘mobility capacity’), which is also called differential item functioning (DIF). Data fit to the model was deemed acceptable if a set of criteria was fulfilled (Additional file 1). Full details of the Rasch analysis process are given elsewhere [17, 42]. Reporting followed established recommendations [17].

A target sample size of at least 150 was set to provide 99% confidence within ± 0.5 logits [43]. The unrestricted (partial credit) Rasch polytomous model was used with a conditional pair-wise parameter estimation.

Construct validity

In absence of a ‘gold standard’ for ‘mobility capacity’, construct validity was assessed by following the methodological approach of hypotheses testing [38, 39]. We used the other functional outcomes and participants’ clinical information to assess the DEMMI’s construct validity. Aspects of convergent and known-groups validity were used to formulate 11 hypotheses (H1–H15) [39, 44]. All hypotheses were formulated a priori, based on existing literature, and the clinical expertise of clinicians and the research team [15, 20, 23, 30, 45]. Formulated and shortened versions of the hypotheses are presented in Additional file 1 and Table 1, respectively. Details on the statistical analyses and interpretation of hypotheses testing are given in Additional file 1. A sample size of ≥ 100 participants is recommended [46].


Cronbach’s alpha and the Person-Item-Separation Index, which are measures of internal consistency reliability in case of a unidimensional scale, were derived from the validity sample because of its larger size [39]. An outcome between 0.70 and 0.95 was considered acceptable [39].

Inter-rater reliability was examined using the intra-class correlation coefficient (ICC) model 2.1 (two-way random effects model; ICCAGREEMENT) [44]. An ICC of ≥ 0.7 or higher was deemed acceptable [39]. The standard error of measurement (SEMAGREEMENT) was calculated and deemed satisfactory if it was ≤ 10% of the total scale range (100 DEMMI points) [44, 47]. The absolute and relative agreement between both raters per DEMMI item was calculated as a percentage (%) and as the weighted kappa with linear weights (ƙ) [44]. Agreement per item equal or above 70% and ƙ ≥ 0.70 was considered acceptable [39]. For additional information on reliability statistics, see Additional file 1.


Bland and Altman’s method was used to illustrate agreement between the two raters [48]. The minimal detectable change (MDC) with 90% and 95% confidence was calculated for individual subjects (MDCind) as well as for comparisons of mean scores between groups (MDCgroup) [44, 49]. A floor or ceiling effect was considered if ≥ 15% of the participants scored the highest or lowest possible DEMMI score [39]. Additional file 1 gives more information on the statistical methods.


We calculated the mean administration time for the DEMMI in minutes and related the administration time to the participants’ functional status. We documented any adverse events, such as falls, reports of pain, atypical and severe changes of muscle tone, or significant fatigue.


Of 505 neurological inpatients screened for eligibility, 348 (69%) were assessed within the first recruitment period for the validity sample. For the inter-rater reliability analysis (n = 133), 102 participants could be reassessed and an additional sample of 31 participants was recruited. Figure 1 shows the flow of participants throughout the study. Table 2 gives the participants’ demographics according to psychometric sampling.

Table 2 Baseline characteristics of the participants by sample

The participants’ mean age was 66 ± 13 years, 218 (63%) were male, and 230 (66%) were able to ambulate independently in the hospital (FAC level ≥ 4). Most participants had a stroke (nontraumatic intracerebral hemorrhage or cerebral infarction; n = 126; 36%), including 109 (31%) in the sub-acute phase (< 6 months after stroke onset) and 17 (5%) in the chronic phase (≥ 6 months). Of 108 (31%) participants with an extrapyramidal or movement disorder, 100 (29%) had PD. Other frequent disorders were multiple sclerosis (n = 18; 5%), neoplasms of the brain or the central nervous system (n = 18; 5%) and traumatic brain injury (n = 13; 4%). Fifteen (4%) participants presented with a non-traumatic spinal cord injury due to various diseases, such as intervertebral disc disorders. The detailed sample composition according to ICD-10 diagnoses is presented in the table in Additional file 2.

In 78% of the participants, the study assessment was performed within the first 7 days after admission. There were no missing items for any DEMMI assessment. The distribution of DEMMI scores is illustrated in the figure in Additional file 2. Table 1 includes the mobility-related outcomes for all comparator assessments.

Structural validity (Rasch analysis)

Rasch analysis was performed on the complete DEMMI item sets of 348 participants and on the complete 15-item scale. Summary fit statistics are given in Table 3. There was overall fit to the model, including no mis-fitting persons and no mis-fitting items. We found no disordered thresholds, indicating that the responses to the items were consistent with the metric estimate of the underlying construct of mobility capacity. Unidimensionality was confirmed and data were free of local dependency. Overall, the participants exhibited a higher level of mobility (mean: 2.1 logits) than the scale average (0.0 logits; person-item distribution map in Additional file 2).

Table 3 Summary Fit Statistics for Rasch analyses

There was no DIF (measurement invariance) by sex, age, or disease phase (sub-acute and chronic). There was uniform DIF by disease group (stroke, PD, and ‘other’) for two items. Participants with PD were less likely to achieve item #2 ‘roll’ (found it harder; F = 7.8, p < 0.001) and more likely to achieve item #15 ‘jump’ (found it easier; F = 9.7, p < 0.001) than participants in the other two diagnosis groups, respectively (figures of the Item Characteristic Curves in Additional file 2). Further assessment indicated that both items showed ‘real’ (and no artificial) DIF [50].

The importance of DIF exhibited by item #2 and #15 was further assessed, as recommended [51], by comparing the Rasch estimate between a ‘pure’ dataset (excluding these 2 items) and a fully anchored dataset. There were trivial, non-significant differences in mean individual person logits between the two datasets (without DIF items mean: 2.34 ± 4.55, with DIF items mean: 2.26 ± 3.94, p = 0.076). In total, 33.0% of the person estimates differed by more than 0.5 logits, but the correlation between the two sets of person estimates was quite high (ICC = 0.981, 95% CI: 0.976–0.984). Overall, the results indicate that the identified DIF was unimportant.

The DEMMI’s item hierarchy in the sample of inpatients in neurorehabilitation compared to that of the development sample of geriatric inpatients [15] is illustrated in the figure in Additional file 2. A high positive logit location (e.g., tandem standing with eyes closed) indicates harder item difficulty compared to a negative logit location (e.g., sit unsupported). Deviations from the original item hierarchy are indicated by non-overlapping 95% confidence bands in 6 items.

Construct validity

All 11 (100%) a priori stated hypotheses about correlations of the DEMMI with other clinical outcome assessments and known-group differences were confirmed. Correlations between the DEMMI and other broad outcome measures of mobility, ambulation, walking endurance, balance and functional independence (H1–H8; Table 1) were between 0.73 and 0.94. Groups of participants who walked without a walking aid (H9), who were independent in walking (H10), and who were able to climb stairs (H11) had significantly higher DEMMI mean scores than the less able comparison groups (Additional file 2).


Cronbach’s alpha and the Person-item Separation Index of the DEMMI were 0.94 and 0.90, respectively, indicating excellent internal consistency reliability.

For the inter-rater reliability analysis, the DEMMI assessment was performed twice by two different physiotherapists on 133 participants. Sample characteristics are given in Table 2. Rater 1 (TB) administered the first DEMMI measure in 85 (64%) participants. The two DEMMI assessments were performed on the same day in 77 (58%) participants, and within 2 days in 56 (42%) participants. There was no statistically significant mean difference in DEMMI scores between both assessors (0.1 points; 95% CI: -1.5–1.6; P = 0.92) and there was no considerable variance (0.3) due to systematic differences between the two raters. The variance between participants was 616.7 and the residual variance was 42.3. The ICCAGREEMENT was 0.94 (95% CI: 0.91–0.95).

Measurement error

The SEMAGREEMENT was 6.5 points and considered acceptable (6.5% of the total DEMMI scale range).

The absolute and relative agreement per item are presented in the table in Additional file 2. There was no DEMMI item with absolute agreement < 70% (range: 80% to 99%), but 3 items with ƙ < 0.7 (range: 0.29 to 0.92).


The Bland–Altman plot is illustrated in Fig. 2. The data were heteroscedastic (τ = 0.26) and differences were not normally distributed (P < 0.01). The 95% limits of agreement were 0.39X + 0.1 and -0.39X + 0.1, respectively, with X denoting the mean score.

Fig. 2
figure 2

Bland–Altman plot of de Morton Mobility Index (DEMMI) scores by two raters. The x-axis represents the mean sores of the raters and the y-axis represents the difference between the raters. The dotted black line represents the mean difference between both measures; dotted red lines represent the 95% upper and lower limits of agreement. The bar chart on the right side illustrates the frequency of differences between the two raters

The MDCind90, MDCind95, MDCgroup90, and MDCgroup95 were 15.0, 18.0, 1.3, and 1.6 points, respectively.

There were no absolute floor or ceiling effects, with 15 (4%) participants scoring 0 and 21 (6%) participant scoring 100 DEMMI points, respectively (histogram in Additional file 2).


The mean administration time of 100 DEMMI assessments was 6.3 ± 2.1 (range: 1–14) minutes (figure in Additional file 2). DEMMI administration took ≤ 10 min in 96% (n = 333) of participants. In non-ambulant or dependent walkers (FAC ≤ 3, n = 118) and independent walkers (FAC ≥ 4, n = 230), the administration time was 6.8 ± 2.7 and 6.0 ± 1.6 min, respectively. No adverse events occurred in any DEMMI assessment.


This study provides evidence of the DEMMI’s sound structural and construct validity, internal consistency, inter-rater reliability, measurement error, interpretability, and feasibility in a mixed sample of rehabilitation inpatients with neurological conditions.

Rasch analysis confirmed structural validity in terms of unidimensionality, hierarchical order, measurement invariance, and logistic item structure. This is in keeping with results of other studies supporting the DEMMI’s structural validity examined in geriatric [15, 20, 22, 23] and neurological populations [28,29,30]. Two facets seem notably important because of their clinical relevance. Unidimensionality indicates that the DEMMI measures one single underlying construct (mobility capacity). Measurement invariance (no DIF) indicates that DEMMI items do not function differently for different members of a sample group (e.g., for men and women or for different disease groups). No (important) DIF was observed by sex, age, disease phase or disease group; therefore, clinicians and researchers can be confident that the DEMMI is a ‘fair’ test. That means, that every patient/person with the same level of mobility capacity has the same change to endorse each DEMMI items and exceed the same final DEMMI score.

The DEMMI’s construct validity in neurorehabilitation is indicated by strong correlations with other commonly applied and validated outcome assessments of mobility, ambulation, walking endurance, balance, and functional independence. The following results strengthen the conclusion of sufficient construct validity: All hypotheses were confirmed; even the lower confidence bounds of most correlations were > 0.7; and the difference in DEMMI mean scores between clinical groups was larger than the minimal important change of 10 DEMMI points [15]. This result is not surprising, since strong correlations with other measures of mobility have been reported consistently in older patients and individuals in neurorehabilitation [15, 27,28,29].

Cronbach’s alpha (0.94) was within the proposed acceptable range of 0.70 to 0.95 and can be judged as excellent [39]. The inter-rater reliability of the DEMMI performed by two experienced physiotherapists was 0.94 (95% CI: 0.91–0.95) and is comparable to other inter-rater reliability estimations between 0.85 and 0.94 reported by others [15, 18, 21, 23]. An ICC of ≥ 0.7 is considered sufficient for group comparisons, and a value of ≥ 0.90 is an indicator of acceptable reliability for individual-level monitoring [39, 52].

Although the ICC is quite high, the DEMMI is not free of measurement error. The SEM (6.5 points) is considered acceptable and comparable to other estimations (4.1 to 7.5 points) [20, 28, 29]. In addition, there was no item with absolute agreement < 70%. This evidence of sufficient inter-rater reliability has some crucial clinical implications in those clinical situations, in which the DEMMI is assessed on a single patient twice by two different assessors (e.g., different physiotherapists on admission and discharge). In this situation, provided that both assessors carefully synchronize before clinical use (agree on standardized administration procedures), one can be very confident that each DEMMI value represents the ‘true’ level of mobility capacity of that individual patient at that time, and that the different assessors would obtain similar scores.

We used the reliability data to establish information on the DEMMI’s interpretability [38]. We found relatively large limits of agreement (39%), which were in line with previous values reported for patients with PD (31%) [29], sub-acute stroke (42%) [28], and sub-acute geriatric conditions (-8.4 to 11.8 points) [20]. The MDCind90 value of 15.0 points is considerably higher than the MDCind range of 6 to 10 points reported for older adults [15, 18,19,20, 24], but comparable to the MDCind values of 12.5 and 17.5 reported for the sub-samples with stroke and PD, respectively [28, 29]. Thus, a DEMMI change score, assessed by two different assessors at two different time points, needs to be ≥ 15 points (or ± 39%) to have high confidence that this change score is free of measurement error.

A possible explanation for this relatively large MDC value (and limits of agreement) could be the high variability of mobility capacity in the reliability sample (standard deviation of 25 points = 42% of the sample’s mean score). A further explanation could be that the calculation of MDC values performed here include the inter-rater variance and the participants’ intra-individual variance. Thus, the comparably large MDC values might be biased and overestimated by the included inter-rater variance. Our MDC estimations should be considered with caution and verified by future studies, which should use test–retest reliability estimations for stable patients generated by a single assessor [44, 49].

This study provides evidence for the DEMMI’s high feasibility over the whole mobility spectrum of individuals with neurological conditions, since no floor or ceiling effects occurred at hospital admission. Other authors also reported no floor or ceiling effects on admission in samples of patients with PD and stroke [27,28,29,30]. This might be an important advantage of the DEMMI over other established outcome assessments of mobility and ambulation in neurorehabilitation. We observed significant floor effects (approximately 20% of participants not able to perform these assessments) for the Timed Up and Go test, gait speed assessment, and the 6-min walk test (Table 1). With these assessments, longitudinal monitoring of mobility capacity from admission to a later point in rehabilitation would not have been possible. Similar floor effects of gait assessments in patients with stroke have been reported [13, 14]. However, a mild ceiling effect (19%) for the DEMMI in patients with stroke at hospital discharge has been reported in one study [27]. In the present sample, only 6% of participants scored the highest DEMMI score of 100 points at hospital admission. The DEMMI includes some high-level mobility items (e.g., jump, tandem standing with eyes closed) and further research is needed to evaluate whether patients who complete these items successfully (and reach the maximum score) suffer from subjective or objective mobility limitations at all; especially compared to healthy individuals of the same age.

The mean administration time of 6.3 min achieved by an experienced assessor is comparable to previous findings [15, 20, 23, 28, 29]. Based on existing evidence and our clinical experience, the DEMMI can be completed within 5–10 min in most neurological inpatients by a trained healthcare professional. High feasibility, information on interpretability, and short administration times of outcome assessments facilitate routine clinical application and enlarge therapy time [53].

Strengths and limitations

We examined a broad set of measurement properties in a sufficiently large [43, 46] and consecutive sample of hospital inpatients with neurological conditions, supporting the generalizability of results. The included participants presented with a wide spectrum of disability, age range (18–90 years), disease duration, sub-acute and chronic conditions, and various diseases. However, the external validity of this study might be limited because the data were collected in a single rehabilitation hospital only and, with respect to prevalence estimations for central Europe, participants with PD were over-represented, whereas other conditions, such as Multiple Sclerosis and spinal cord injury, were underrepresented [54,55,56].

We used a combination of modern methods of latent trait theory (Rasch analysis) and methods of classical test theory. Rasch analysis provides an especially powerful tool to analyze unidimensionality, measurement invariance, and logistic item structure [17].

We did not evaluate the DEMMI’s floor and ceiling effects at any later time of rehabilitation (e.g., hospital discharge). As previously mentioned, MDC values and limits of agreement need to be interpreted with caution and might be lower than reported in this study for test–retest situations.

Implications for clinical practice and further research

This study provides evidence that the DEMMI has sufficient key measurement properties in neurorehabilitation, including structural validity and internal consistency reliability. It seems feasible and safe, since no adverse events occurred during or immediately after test administration, which took only 6 min on average. The lack of any floor or ceiling effects on hospital admission indicates clinical value and applicability across the whole mobility spectrum of inpatients with neurological conditions. For the DEMMI administration, no long training period is required, no special equipment is needed, and there is no license charge. These advantages address some barriers to the use of outcome assessments by healthcare professionals [53, 57], and they could facilitate the application of this instrument in clinical care.

We found no relevant DIF by disease phase and disease group, indicating that the DEMMI measures were free of measurement invariance in this sample and that the DEMMI can be used as a generic measure of mobility capacity in this population.

Further research should focus on measurement properties that are still unknown in neurorehabilitation, such as test–retest reliability, responsiveness, minimal important change values, and prognostic validity. Replication of our findings is recommended [58] and should also evaluate the DEMMI’s psychometric properties in mixed samples with other compositions, including more individuals of other disease groups.

Since there are many assessments available to measure mobility capacity in neurorehabilitation, the DEMMI’s psychometric quality and clinical utility should be compared to other (generic) assessments in clinical trials and systematic reviews that follow recommended methods [59]. Future studies should investigate the DEMMI’s clinical utility for goal setting and guiding rehabilitation strategies.


The DEMMI seems to be a unidimensional, valid, and reliable performance-based clinical outcome assessment of mobility capacity in adult individuals with neurological conditions that can be used generically in this population. Provided that the high feasibility, clinical utility, and sufficient measurement properties found in this study are confirmed in future studies, the DEMMI might become the standard assessment of mobility capacity in neurorehabilitation.

Availability of data and materials

The datasets used and analysed during the current study are available from the corresponding author upon reasonable request.



De Morton Mobility Index


Differential item functioning


Functional Ambulation Categories


Intraclass correlation coefficient


Minimal detectable change


Parkinson’s disease


Standard error of measurement


  1. World Health Organization. International classification of functioning, disability and health: ICF. Geneva: World Health Organization; 2001.

    Google Scholar 

  2. Mayo NE, Wood-Dauphinee S, Côté R, Durcan L, Carlton J. Activity, participation, and quality of life 6 months poststroke. Arch Phys Med Rehabil. 2002;83:1035–42.

    Article  Google Scholar 

  3. Rice DB, McIntyre A, Mirkowski M, Janzen S, Viana R, Britt E, Teasell R. Patient-Centered Goal Setting in a Hospital-Based Outpatient Stroke Rehabilitation Center. PM R. 2017;9:856–65.

    Article  PubMed  Google Scholar 

  4. Plant S, Tyson SF. A multicentre study of how goal-setting is practised during inpatient stroke rehabilitation. Clin Rehabil. 2018;32:263–72.

    Article  PubMed  Google Scholar 

  5. Langhorne P, Bernhardt J, Kwakkel G. Stroke rehabilitation. Lancet. 2011;377:1693–702.

    Article  PubMed  Google Scholar 

  6. Keus S, Munneke M, Graziano M, Paltamaa J, Pelosin E, Domingos J, et al. European physiotherapy guideline for Parkinson’s disease. The Netherlands: KNGF/ParkinsonNet; 2014.

  7. Winstein CJ, Stein J, Arena R, Bates B, Cherney LR, Cramer SC, et al. Guidelines for Adult Stroke Rehabilitation and Recovery: A Guideline for Healthcare Professionals From the American Heart Association/American Stroke Association. Stroke. 2016;47:e98–169.

    Article  PubMed  Google Scholar 

  8. Khoo TC, FitzGerald A, MacDonald E, Bradley L. Outcomes for older adults in inpatient specialist neurorehabilitation. Ann Phys Rehabil Med. 2020;63:340–3.

    Article  PubMed  Google Scholar 

  9. Ottenbacher KJ, Hsu Y, Granger CV, Fiedler RC. The reliability of the functional independence measure: a quantitative review. Arch Phys Med Rehabil. 1996;77:1226–32.

    CAS  Article  Google Scholar 

  10. Ashford S, Brown S, Turner-Stokes L. Systematic review of patient-reported outcome measures for functional performance in the lower limb. J Rehabil Med. 2015;47:9–17.

    Article  PubMed  Google Scholar 

  11. Pollock C, Eng J, Garland S. Clinical measurement of walking balance in people post stroke: a systematic review. Clin Rehabil. 2011;25:693–708.

    Article  PubMed  Google Scholar 

  12. Hobart JC, Cano SJ, Zajicek JP, Thompson AJ. Rating scales as outcome measures for clinical trials in neurology: problems, solutions, and recommendations. Lancet Neurol. 2007;6:1094–105.

    Article  PubMed  Google Scholar 

  13. Jørgensen HS, Nakayama H, Raaschou HO, Olsen TS. Recovery of walking function in stroke patients: the Copenhagen Stroke Study. Arch Phys Med Rehabil. 1995;76:27–32.

    Article  Google Scholar 

  14. Hafsteinsdottir TB, Rensink M, Schuurmans M. Clinimetric properties of the Timed Up and Go Test for patients with stroke: a systematic review. Top Stroke Rehabil. 2014;21:197–210.

    Article  PubMed  Google Scholar 

  15. de Morton NA, Davidson M, Keating JL. The de Morton Mobility Index (DEMMI): an essential health index for an ageing world. Health Qual Life Outcomes. 2008;6:63.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press; 1980.

    Google Scholar 

  17. Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: What is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheum. 2007;57:1358–62.

    Article  PubMed  Google Scholar 

  18. de Morton N, Davidson M, Keating JL. Reliability of the de Morton mobility index (DEMMI) in an older acute medical population. Physiother Res Int. 2010;16:159–69.

    Article  PubMed  Google Scholar 

  19. de Morton NA, Lane K. Validity and reliability of the de Morton Mobility Index in the subacute hospital setting in a geriatric evaluation and management population. J Rehabil Med. 2010;42:956–61.

    Article  PubMed  Google Scholar 

  20. Braun T, Schulz R-J, Reinke J, van Meeteren NL, de Morton NA, Davidson M, et al. Reliability and validity of the German translation of the de Morton Mobility Index (DEMMI) performed by physiotherapists in patients admitted to a sub-acute inpatient geriatric rehabilitation hospital. BMC Geriatr. 2015;15:1660.

    Article  Google Scholar 

  21. Jans MP, Slootweg VC, Boot CR, de Morton NA, van der Sluis G, van Meeteren NL. Reproducibility and validity of the Dutch translation of the de Morton Mobility Index (DEMMI) used by physiotherapists in older patients with knee or hip osteoarthritis. Arch Phys Med Rehabil. 2011;92:1892–9.

    Article  PubMed  Google Scholar 

  22. de Morton NA, Harding KE, Taylor NF, Harrison G. Validity of the de Morton Mobility Index (DEMMI) for measuring the mobility of patients with hip fracture during rehabilitation. Disabil Rehabil. 2013;35:105–11.

    Article  Google Scholar 

  23. Braun T, Grüneberg C, Thiel C, Schulz R-J. Measuring mobility in older hospital patients with cognitive impairment using the de Morton Mobility Index. BMC Geriatr. 2018;18:100.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Braun T, Thiel C, Schulz R-J, Gruneberg C. Reliability of mobility measures in older medical patients with cognitive impairment. BMC Geriatr. 2019;19:20.

  25. Braun T, Thiel C, Schulz R-J, Grüneberg C. Responsiveness and interpretability of commonly used outcome assessments of mobility capacity in older hospital patients with cognitive spectrum disorders. Health Qual Life Outcomes. 2021;19:68.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Sommers J, Vredeveld T, Lindeboom R, Nollet F, Engelbert RH, van der Schaaf M. The de Morton Mobility Index Is Feasible, Reliable, and Valid in Critically Ill Patients. Phys Ther. 2016;96:1658–66.

    Article  PubMed  Google Scholar 

  27. New PW, Scroggie GD, Williams CM. The validity, reliability, responsiveness and minimal clinically important difference of the de Morton mobility index in rehabilitation. Disabil Rehabil. 2017;39:1039–43.

    Article  PubMed  Google Scholar 

  28. Braun T, Marks D, Thiel C, Grüneberg C. Reliability and validity of the de Morton Mobility Index in individuals with sub-acute stroke. Disabil Rehabil. 2019;41:1561–70.

    Article  PubMed  Google Scholar 

  29. Braun T, Marks D, Thiel C, Menig A, Grüneberg C. An investigation of the measurement properties of the de Morton Mobility Index for measuring mobility capacity in hospital patients with Parkinson’s disease. Clin Rehabil. 2021;35:423–35.

    Article  PubMed  Google Scholar 

  30. Johnston M, de Morton N, Harding K, Taylor N. Measuring mobility in patients living in the community with Parkinson disease. NeuroRehabilitation. 2013;32:957–66.

    Article  PubMed  Google Scholar 

  31. von Elm E, Altman DG, Egger M, Pocock SJ, Gotzsche PC, Vandenbroucke JP. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. J Clin Epidemiol. 2008;61:344–9.

    Article  Google Scholar 

  32. Kottner J, Audige L, Brorson S, Donner A, Gajewski BJ, Hrobjartsson A, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. J Clin Epidemiol. 2011;64:96–106.

    Article  PubMed  Google Scholar 

  33. Mokkink LB, de Vet HCW, Prinsen CAC, Patrick DL, Alonso J, Bouter LM, Terwee CB. COSMIN Risk of Bias checklist for systematic reviews of Patient-Reported Outcome Measures. Qual Life Res. 2018;27:1171–9.

    CAS  Article  PubMed  Google Scholar 

  34. Mokkink LB, Boers M, van der Vleuten CPM, Bouter LM, Alonso J, Patrick DL, et al. COSMIN Risk of Bias tool to assess the quality of studies on reliability or measurement error of outcome measurement instruments: a Delphi study. BMC Med Res Methodol. 2020;20:293.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  35. Criddle LM, Bonnono C, Fisher SK. Standardizing stroke assessment using the National Institutes of Health Stroke Scale. J Emerg Nurs. 2003;29:541–6.

    Article  PubMed  Google Scholar 

  36. Hoehn MM, Yahr MD. Parkinsonism: onset, progression and mortality. Neurology. 1967;17:427–42.

    CAS  Article  Google Scholar 

  37. Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. 1983;33:1444–52.

    CAS  Article  PubMed  Google Scholar 

  38. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737–45.

    Article  PubMed  Google Scholar 

  39. Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60:34–42.

    Article  PubMed  Google Scholar 

  40. Braun T, Schulz R-J, Hoffmann M, Reinke J, Tofaute L, Urner C, et al. German version of the de Morton Mobility Index. First clinical results from the process of the cross-cultural adaptation. Z Gerontol Geriatr. 2015;48:154–63.

  41. Braun T, Rieckmann A, Grüneberg C, Marks D, Thiel C. Hierarchical assessment of balance and mobility - German translation and cross-cultural adaptation. Z Gerontol Geriatr. 2016;49:386–97.

    Article  PubMed  Google Scholar 

  42. Pallant JF, Tennant A. An introduction to the Rasch measurement model: an example using the Hospital Anxiety and Depression Scale (HADS). Br J Clin Psychol. 2007;46:1–18.

    Article  Google Scholar 

  43. Linacre JM. Sample size and item calibration stability. Rasch Measurement Transactions. 1994;7:328.

    Google Scholar 

  44. de Vet HCW, Terwee CB, Mokkink LB, Knol DL. Measurement in medicine: A practical guide. Cambridge, New York: Cambridge University Press; 2011.

    Book  Google Scholar 

  45. Braun T, Grüneberg C, Coppers A, Tofaute L, Thiel C. Comparison of the de Morton Mobility Index and Hierarchical Assessment of Balance and Mobility in older acute medical patients. J Rehabil Med. 2018;50:292–301.

    Article  PubMed  Google Scholar 

  46. Terwee CB, Mokkink LB, Knol DL, Ostelo RWJG, Bouter LM, de Vet HCW. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21:651–7.

    Article  PubMed  Google Scholar 

  47. van Bloemendaal M, Bout W, Bus SA, Nollet F, Geurts AC, Beelen A. Validity and reproducibility of the Functional Gait Assessment in persons after stroke. Clin Rehabil. 2019;33:94–103.

    Article  PubMed  Google Scholar 

  48. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1:307–10.

    CAS  Article  Google Scholar 

  49. Stratford PW, Binkley JM, Riddle DL. Health status measures: strategies and analytic methods for assessing change scores. Phys Ther. 1996;76:1109–23.

    CAS  Article  Google Scholar 

  50. Andrich D, Hagquist C. Real and Artificial Differential Item Functioning. Journal of Educational and Behavioral Statistics. 2012;37:387–416.

    Article  Google Scholar 

  51. Tennant A, Pallant JF. DIF matters: A practical approach to test if differential item functioning makes a difference. Rasch Measurement Transactions. 2007;20:1082–4.

    Google Scholar 

  52. Scientific Advisory Committee of the Medical Outcomes Trust. Assessing health status and quality-of-life instruments: attributes and review criteria. Qual Life Res. 2002;11:193–205.

    Article  Google Scholar 

  53. Braun T, Rieckmann A, Weber F, Grüneberg C. Current use of measurement instruments by physiotherapists working in Germany: A cross-sectional online survey. BMC Health Serv Res. 2018;18:810.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Kingwell E, Marriott JJ, Jetté N, Pringsheim T, Makhani N, Morrow SA, et al. Incidence and prevalence of multiple sclerosis in Europe: a systematic review. BMC Neurol. 2013;13:128.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Wyndaele M, Wyndaele J-J. Incidence, prevalence and epidemiology of spinal cord injury: what learns a worldwide literature survey? Spinal Cord. 2006:523–9.

  56. Cattaneo D, Gervasoni E, Pupillo E, Bianchi E, Aprile I, Imbimbo I, et al. Mobility Disorders in Stroke, Parkinson Disease, and Multiple Sclerosis: A Multicenter Cross-Sectional Study. Am J Phys Med Rehabil. 2020;99:41–7.

    Article  PubMed  Google Scholar 

  57. van Peppen RPS, Maissan FJF, van Genderen FR, van Dolder R, van Meeteren NLU. Outcome measures in physiotherapy management of patients with stroke: a survey into self-reported use, and barriers to and facilitators for use. Physiother Res Int. 2008;13:255–70.

    Article  PubMed  Google Scholar 

  58. Ioannidis JPA. Why replication has more scientific value than original discovery. Behav Brain Sci. 2018;41: e137.

    Article  PubMed  Google Scholar 

  59. Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, Terwee CB. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27:1147–57.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

Download references


We thank all participants for their time and contributions. We further acknowledge the support of the physiotherapy and medical staff of the Rehaklinik Zihlschlacht.


Open Access funding enabled and organized by Projekt DEAL.

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations



Study concept and design: TB, DM, CG. Acquisition of data: TB, DM. Analysis of data: TB. Interpretation of data: TB, DM, CT, CG. Drafting the manuscript: TB. Manuscript revision for important intellectual content: TB, DM, CT, CG. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Tobias Braun.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Local Committee for Ethics in Medical Research (Canton of Thurgau, Switzerland: 2013/13) and all participants provided written informed consent. All methods were performed in accordance with the relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Additional information on study methods.

Additional file 2

. Additional results.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Braun, T., Marks, D., Thiel, C. et al. A generic outcome assessment of mobility capacity in neurorehabilitation: measurement properties of the de Morton Mobility Index. BMC Neurol 21, 298 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Neurological rehabilitation
  • Outcome assessment
  • Mobility limitation
  • Reproducibility of results
  • Rasch analysis
  • Validity