Research Article
Print
Research Article
Methodology for Monitoring the Mobility of Circular Labour Migrants in Moscow Region
expand article infoYulia Yu. Shitova
‡ Russian State University for the Humanities, Moscow, Russia
Open Access

Abstract

The paper presents the original author’s methodology for monitoring commuting labour migration in a region on the example of Moscow Region. The methodology is based on regular collection of real-time information from the Yandex.Probki platform on the regional transport network state by querying and saving travel time for a fixed set of car routes (basic sample) covering the region under study. An analysis of the data collected over the past two years enabled studying the structure and dynamics of travel time losses by commuting labour migrants. The time dynamics of losses are sensitive to events such as lockdowns and holidays. The estimates obtained show a stable cyclicity of travel time losses within a day and a week, which confirms the validity of the indicator proposed by the author. The study demonstrates that the loss of time on the commute during peak hours is 2.5 times higher than the loss when driving without traffic jams. In conclusion, the paper discusses the prospects for scaling the author’s methodology to any regions in which the Yandex.Probki platform is present.

Keywords

Commuting labour migration, Moscow, Moscow Region, geoinformation systems, GIS analysis

JEL codes: J22, J61

1. Introduction

Monitoring the structure and dynamics of circular labour migration (CLM; hereinafter, this abbreviation is used to denote both migrants and migration as a process, the specific option is clear from the context) provides important information for making decisions on the transport infrastructure development, growth points management, etc. At the same time, the development of research in this area is limited by the lack of statistical data.

Technologies related to geographic information systems (GIS) have become an in-demand method used to analyze population mobility and related problems. A literature review shows that GIS data is already being used to solve a number of problems in the research area under discussion.

One of the existing directions is the use of GIS to analyze the dynamics, structure and forecast of CLM. The spatial dynamics of CLM in South East Queensland (Australia) was studied based on data from 1996 and 2006 censuses (Li et al. 2016), where GIS was used to determine the average travel time of commuting migrants, as well as to estimate the proportion of people living and working in the same region (not CLM). The purpose of this analysis was to provide input into the debate on changing urban policies to reduce the length of circular migratory commutes. It is interesting that, as a result, the difference in the dynamics of CLM over 10 years turned out to be insignificant on a regional scale, only secondary effects were found at local levels. The relationship between CLM and spatial effects in Boise (Idaho, USA) was studied in (Niedzielski et al. 2013), the authors used GIS to simulate CLM routes. Based on the results obtained the authors then calculated spatial indicators of urban development. In (Mackenbach et al. 2016), the authors analyzed socio-economic changes in CLM, the impact of urban land use and transport policies on CLM in the city of Wellington (New Zealand) and its suburbs. In doing so, they combined data from the New Zealand Household Travel Survey and GIS data on land use and transport infrastructure. A reduction in the number of people using rail transport in 1990-2000 in the Osaka metropolitan area was found in (Sakanishi 2006), where growth had been observed for 40 years. In this paper, GIS was used to assess the change in the spatial location of residence and the pattern of CLM in relation to the railway network spatial location. It was found that the dispersion of work places in 1990-2000 significantly reduced the number of CLM using rail transport. Demographic analysis shows that Japan’s declining fertility rate and ageing population in the coming decades will further reduce the number of people using rail transport, which is fraught with serious economic problems for railroad companies. A GIS-based comparison of the spatial location of housing and work locations in the North West of England was used to develop policy recommendations for land use and business location preferences (Hincks and Wong 2010). Studies of the popular topic of «excessive» CLM were carried out in Leon County (Florida, USA), while using GIS to determine the time and length of trips, as well as linking and extracting information on various geoclusters of residence, trips, etc. (Horner 2008). Finally, the confrontation between centrifugal and centripetal directions of Canada’s agglomerations development was studied based on the GIS approach in (Partridge et al. 2007).

GIS and Geographic Zonation is another topical area of research that studies the relationship between geographic (geolocations of districts in a city, cities in a country, etc.) and other indicators. The correlation between business geography and the levels of development of territories and infrastructure based on the GIS approach was studied in India (Wood 2003).

In (Meredith et al. 2007), home-work paths were modelled using GIS for a sample of 15% of Irish people (220 thousand people, the sample is known as POWAR Database), which enabled drawing up a detailed map of mobile CLM patterns. An empirical relationship between the job-housing ratio (JHR) and CLM patterns was investigated using GIS in Portland, Oregon metropolitan area, USA, in terms of travel time and length (Peng 1997). In order to cluster functional urban areas (FUA) in Ecuador, QGIS and Stata GIS were used to determine the length and duration of trips between given geocoordinates (Obaco et al. 2016). Changes in residential areas and CLM patterns in the two largest Danish agglomerations (Copenhagen and Aarhus) from 1982 to 2002 were investigated in (Nielsen and Hovgesen 2005) using GIS ArcGIS/ArcView. If in the capital there is an increase in monocentric agglomeration (strengthening of the core, growth of suburbs and lengthening of the CLM), in Aarhus processes of polycentric agglomeration are manifested (the formation of several centers of attraction with the same growth of the peripheral zone). Boundaries and patterns of commercial activity in the center of Peshawar (Pakistan) were identified using GIS, which was used to delineate the area in terms of land use (places of residence) and jobs based on geospatial analysis (Ali et al. 2018).

Another popular trend in modern analytics is GIS transport analysis. CLM traffic patterns were reconstructed in New York and Amsterdam (Arribas-Bel and Gerritse 2012). The spatial distribution of the difference between CLM using personal and public transport during and outside peak hours was studied in Flanders (Belgium), with estimates showing significant differences against the background of a lack of alternatives to cars in many areas (Dewulf et al. 2015). A new public transport accessibility index (PTAI) was proposed using the example of Melbourne (Australia) (Saghapour et al. 2016). Based on data from The Victorian Integrated Survey of Travel and Activity (VISTA), the correlation between PTAI and the share of public transport use was examined. It was demonstrated, that PTAI correlates better with the share of public transport use than other indicators. The paper (Larsen et al. 2007) studied the impact of changes in public transport prices on CTM patterns — here a GIS developed by the Technical University of Copenhagen was used to provide information on the cost of travel depending on the geo-coordinate.

Other uses of GIS include analysis of the relationship between population mobility and housing prices; modern GIS enables building multimodal trips by different transport modes, on which the comparison of different transport modes used by CLM is based. The analysis of mobility by environmentally friendly (green) transport modes is gaining popularity: bicycles, scooters, etc. Bicycle networks have appeared in almost all major cities of the world, information from which is used in the analysis of population mobility in various ways. A large body of studies is devoted to the methodological issues of the approach: checking the accuracy of GIS calculations and the resulting systematic errors.

Summarizing the above-mentioned, it should be noted that the GIS approach to the analysis of CLM is a popular trend in modern analysis. At the same time, most scientific works solve separate applied problems. Modern GIS platforms operate continuously, producing big data, the analysis of which enables continuous monitoring of different indicators.

In this paper, we propose an original author’s technique for spatial microanalysis of CLM based on data obtained in a continuous mode from the constantly functioning online platform Yandex.Probki, which is a GIS. This enables monitoring the population mobility, implementing automatically updated indices, obtaining a unique coverage and spatial detail of CLM processes. The presented work is the next stage, continuing, developing and expanding the series of our studies of commuting labour migration in Moscow Region (Shitova 2010; Shitova and Shitov 2016; Shitova et al. 2020).

2. Research methodology

The main purpose of the study is to analyze the structure and dynamics of time losses in CLM car trips according to the monitoring data of highways in two Russian regions — Moscow and Moscow Region.

GIS platform

The methodology is based on the collection of primary information on the road network state from the Yandex.Maps GIS platform (Yandex.Maps; Yandex.Probki), which collects, processes and provides users with real-time information about traffic jams. The two main ways it works are: a) visualization of traffic jams; b) building a route of movement at points specified by the user, with and without traffic jams (see. Fig. 1).

Figure 1.

Left: traffic situation in Moscow at 16:30 on January 21, 2022 according to Yandex.Probki, traffic jams 7 points (red traffic light icon at the top left). Right: routing of a trip by car from Skhodnya to Moscow made by the Yandex.Map router server. The inset contains information about the trip given by Yandex: distance L = 37 km (green box), travel time TP = 79 min including traffic jams (red box) and T0 = 45 min on a free road (black box). Source: compiled by the author.

These functions are highly in demand, which determines the great popularity of Yandex products: Yandex mobile applications (Maps, Navigator, etc.) are installed and actively used by almost every motorist. As a result, Yandex receives information about the position of the user’s car (from the GPS of the mobile device) every five seconds. The analysis of this information allows Yandex to quickly (once every three minutes) calculate and update data on the state of the road network (the map in Fig. 1 on the left) detailed up to segments of individual streets (How we estimate ... 2017).

The scoring of the situation used by Yandex (How they work ...) is good for visual presentation of information and everyday use. However, for a number of reasons, it cannot be used for accurate calculations in research. The main problem is the relativity of this indicator, since points for each route are calculated relative to some reference travel time (without traffic jams) along this route, which: a) is set arbitrarily and b) can be changed at any time. Therefore, the scores:

  • cannot be directly converted into socio-economic indicators (time or money losses, etc.);
  • when saved as a time series, they may be incorrect, since the calculation system (reference travel time) can be changed without notice, and it is not possible to control this aspect.

Finally, the score is an integral indicator (for the entire region), which does not enable studying the spatial effects. In this regard, instead of Yandex.Probki points, the author of this work uses absolute figures: time and distance of the route. Calculations based on such indicators do not have the shortcomings described above for the scoring method.

Basic sample

For a comprehensive analysis of the situation, we must have information about the state of the transport network in the key geo-points of the region. Of course, we do not have the capabilities of the Yandex.Maps platform, which receives information every five seconds from almost every car on the road and controls the traffic situation detailed up to individual road segments in Moscow and Moscow Region. Instead, we form a basic sample of virtual CLM in such a way that their «home-work» routes evenly cover the geographic area in which we want to track the traffic situation. For example, to assess the transport situation within Moscow Ring Road (with a pronounced radial structure), we place the houses of the representatives of the basic sample along the radius of Moscow Ring Road and choose places of work in the center of Moscow (see Fig. 2, left).

Figure 2.

Left: an example of a basic sample for monitoring the transport network within the Moscow Ring Road. CLM (blue arrows) live just outside Moscow Ring Road and work in the center of Moscow. Right: places of residence (blue circles) and work (red spot in the center) of CLM from the basic sample of research linked to the map. Source: compiled by the author.

Requesting within a short time interval «home-work» and «work-home» routes with and without traffic jams for each member of the sample, we get a snapshot of the transport situation in the area under consideration at a certain point in time.

A fundamentally important point is the fixation of the basic sample of CLM by analogy with the construction of the base of respondents in panel studies. This is necessary so that correct time series are obtained from data collected at different points in time.

Data is collected through the Yandex.Maps API in automatic mode using specially developed software. The program runs at a specified interval, makes queries to the Yandex.Maps router for the entire basic sample, receives the results and saves the data in the database.

The sample size directly determines the statistical accuracy of the results: the larger the sample, the more accurate the results. However, too large sample creates problems due to the limit on the number of requests to the Yandex.Maps router, which is limited to 25,000 per day. Exceeding the limit results in blocking the IP address of the computer from which data is being collected. The solution to the problem can be a multi-client mode of data collection, which can be implemented in the future.

Indicators

The main indicator in data analysis is time lost by CLM on the commute due to traffic jams. Despite the fact that traffic congestion is formed not only under the influence of the movements of commuting migrants, but also as a result of residents travelling due to personal affairs within the city and beyond (to dachas, vacation spots, etc.), there is no need to clear data from these effects, since the time lost by CLМ is determined by the actual situation with jams. Initially, it was assumed that the specific losses due to traffic jams for each route from the base sample would be calculated as follows:

E = (TP – T0)/L,(formula 1)

where TP, T0 are the travel time along route L with and without traffic jams, respectively. However, during the study, it was found that Yandex.Probki does not allow obtaining TP and T0 at the same time in one query. And in two different queries to receive TP and T0 separately, even if they are sent almost simultaneously, and the destination points of the route are the same, the GIS can return different routes due to the specifics of its work (asynchrony and non-strictness of the solution of the routing problem). In this regard, the calculation method according to formula (1) turned out to be unimplementable with the currently existing functionality of Yandex.Maps API.

Therefore, a different approach was taken to determine travel time without traffic jams. For each observation from the sample, according to all the accumulated data for the analyzed period, the minimum specific time of its movement was calculated, taking into account traffic jams:

(formula 2)

This calculated value was taken as T0 — travel time without traffic jams. It is important to note that the estimate according to formula (2) turned out to be more reliable than the one according to formula (1), since the value of T0 is actually the reference time of movement underlying the Yandex.Maps scoring system. It is subjectivity and the possibility of changing the T0 indicator in the router itself at any time that is the reason for the weakness of the estimate described above. It is much more difficult to manipulate the indicator obtained according to formula (2), and there are ways to control such interference. However, a detailed discussion of this aspect is beyond the scope of this study.

The most convenient way to represent losses due to traffic jams (ER) is in relative units:

(formula 3)

Individual losses estimated according to formula (3) can then be averaged into any aggregated indicators both over sets (groups) within the sample and over any time periods depending on the specific task.

Methodology implementation

Since the object of this study is to monitor the travel conditions of CLM in Moscow agglomeration, we took 20,000 virtual residents as a basic sample, living within a radius of 20, 50, and 80 km from Moscow and working in the center of Moscow (see Fig. 2, right). This choice is due to the desire to collect information about CLM who make short, medium and long trips along the «home-work» route. Data collection was carried out hourly, the execution of queries on the entire base sample averaged 5–10 minutes. The project started in 2017, but full-scale data had been gathered since October 2019.

The hypothesis of the study is that the analysis of the structural and time dynamics of travel time losses correlates with important events in regional life, such as natural disasters, social restrictions (for example, the introduction of temporary restrictions on movement or work during the COVID-19 pandemic), adverse weather conditions, repair and construction work at transport infrastructure facilities, etc. Constant operational monitoring of the state of the transport network using online GIS platforms enables quickly identifying emerging problems and evaluating the dynamics of their development, predicting the consequences. The results presented in the next section and their discussion will enable testing the validity of the hypothesis.

3. Main results and discussion

Time dynamics

Analysis of the time series of time losses for CLM trips from the formed basic sample is of particular interest, since this indicator is most sensitive to the «rhythm of regional life» and captures all anomalies that violate it.

Dynamics of average indicators for the sample. Fig. 3 shows the average daily time losses of CLM on commute during peak hours (working days from 07:00 to 11:00 for «home-work» trips and from 17:00 to 20:00 for «work-home» trips), calculated according to formula (3). On the presented chart, the following patterns can be found:

  • A fundamental decline in activity due to the lockdown in response to the outbreak of the COVID-19 pandemic (April-July 2020). The lockdown in Russia was announced on April 1, at which point traffic fell to the lowest levels that lasted all of April. However, in May there is a gradual revival and increase in mobility with a full recovery by early July. Together with our calculations, Fig. 1 shows the Citymapper mobility index (CMMI) from the popular application (Citymapper mobility index… 2021), which was valid from April 2020 to September 2021 to track changes in population mobility due to the pandemic in 41 cities around the world. Comparison of our data with CMMI shows a similar trend (if disregarding the overall decrease in activity shown by CMMI since July 2020 compared to its same indicators in March 2020, against the backdrop of a complete recovery in activity according to the results of the author’s calculation). The described discrepancy may be due to a general reduction in the number of people using the application (the calculation method depends on this) or the transition of some users of the Citymapper application to online work, which also reduces mobility indicators.
  • The cycle within the week is clearly visible: a decline on weekends and a high load on weekdays (more on this later).
  • Annual cycles of activity are visible: a decline at the beginning of the year, average activity in the summer (the influence of the holiday season) and a maximum increase in activity in the autumn and by the end of the year, which ends with the New Year holidays.
  • There is a decrease in activity during the holidays. First of all, this applies to the New Year holidays (beginning of January). In addition, the last significant downturn was the long weekend in May (10 days) in 2021, when a «virtual lockdown» was specifically announced to mitigate the effects of a new wave of the spread of coronavirus.
  • The asymmetry of center-region losses versus region-center can be seen, apparently associated with repair and construction work on the roads.

Based on the data obtained, it can be concluded that CLM travelling during peak hours spend on average 1.8–2.4 times more time on the commute than it takes to travel without traffic jams.

Thus, the hypothesis about the sensitivity of the studied indicator to abnormal processes in the region is confirmed by the results of the study.

Figure 3.

Average daily time losses for CLM trips during peak hours from home to work (black dots) and back (red dots). Blue curve — Citymapper mobility index (CMMI; (Citymapper mobility index… 2021). Source: compiled by the author. Note: CMMI scaling was done for March–April 2021.

Comparison of dynamics between groups. Fig. 4 shows the average weekly losses in the context of three CLM groups, from which it can be seen that the closest group suffers most from traffic jams (living at a distance of up to 20 km from Moscow). The differences in specific losses of the middle (50 km) and remote (80 km) CLM groups are minimal and, in relative terms, turn out to be significantly less than that of the closer group.

Figure 4.

Intra-week losses on the «home-to-work» route during peak hours among three sample groups living at distances of 20 km (black dots), 50 km (red dots) and 80 km (green dots) from Moscow and working in the center. Source: compiled by the author.

This result shows that the core of the jams has a radius of less than 20 km, and it is this core that accounts for the main losses of CLM. The ring located at a distance of 20–80 km from the city center is less problematic, as evidenced by the proximity of the CLM indicators of the middle and remote groups. However, do not forget that these are weekly averages, significant deviations from average behaviour are observed quite often, as evidenced by the scatter of points in Fig. 3. In addition, from the above-mentioned time series it can be seen that in 2021, commute-related time losses increased by about 20% compared to 2020.

Rush hour effect. Fig. 5 shows a comparison of travel time losses during peak hours against averages. From the data presented, it can be seen that the duration of trips during peak hours is consistently 1.5 times higher than the average and 2.5 times higher than the estimated indicators for off-peak hours.

Figure 5.

Losses on the «work-home» commute for the closest CLM group (20 km from the center): averaging over a week during peak hours (black dots), across all data (red dots) and their ratio (blue dots). Source: compiled by the author.

Cyclic patterns

Intraday cycle. To study this process, the distribution of the transport network load by hours during the day, averaged over various periods (a day, a week, a month), was analyzed. Fig. 6 shows the intraday cycle of traffic congestion, which clearly shows peak hours in the morning for entering Moscow and in the evening for leaving Moscow. The results obtained fully coincide with the data of Yandex’s own research (Traffic jams in Moscow…). We also note that the criteria for choosing peak hours were determined precisely by the results of the assessment of the intraday cycle.

Intraweek cycle. Similarly to the intraday cycle, intraweek cycles can be calculated by averaging over any range of interest, for example, over weeks, months, years. Fig. 7 shows the intraweek distribution of losses (data for June 2020). The chart clearly shows the decline in activity over the weekend.

Note that both the intraday and intraweek cycles have very stable dynamics and repeatability over a long horizon (several years of observations). In this regard, it can be assumed that when building predictive models, the parameters of the hour and day of the week will be significant factors.

Figure 6.

Left: intraday travel losses by time of day (group living 20 km from Moscow, averaged from January 2020). Source: compiled by the author. Right: a similar chart in Yandex research. Source: (Traffic jams in Moscow…).

Figure 7.

Distribution of intra-week travel losses (averaged data for June 2020). Source: compiled by the author.

4. Visualization online

To present the results of the project, a special website (Analysis of mobility…) was launched, where the user has the opportunity to independently study the results of the project in an interactive mode.

5. Conclusion

This paper presents the results of an analysis of the сircular labour migrants’ trips in terms of time lost on the commute, taking into account the remoteness of the place of residence from the city center. The article proposes an author’s method for assessing temporary losses and demonstrates the dynamics of the obtained indicators for the year, as well as the cyclical nature of losses within a day and a week. The results confirm the hypothesis about the sensitivity of the proposed indicators to regional events.

The described technique has great prospects for further application and scaling. The value of the accumulated data is growing. In the future, it is planned to make predictive models based on the data obtained using machine learning methods. In addition to collecting data on traffic jams, since 2019, the author has been working on collecting data on weather conditions in Moscow Region. The joint analysis of data on weather and traffic congestion is an interesting extension of the project.

Another possible direction for the development of the presented study is the conversion of estimated time losses into monetary losses based on the forgone income model. The implementation of this approach will create an informative economic indicator that will be useful for the purposes of regional management.

Finally, the applied result of the work can be the creation of a mobile application with an option of prediction of the regional transport network state at any time and in any geolocation.

Funding

This paper was supported by RFBR grant no. 19-010-00794a.

References

  • Arribas-Bel D, Gerritse M (2012) Featured graphic. Monocentricity? Commuting flows visually. Environment and Planning A: Economy and Space 44(9): 2041–2. https://doi.org/10.1068/a44561
  • Dewulf B, Neutens T, Vanlommel M, Logghe S, De Maeyer P, Witlox F, De Weerdt Y, Van de Weghe N (2015) Examining commuting patterns using Floating Car Data and circular statistics: Exploring the use of new methods and visualizations to study travel times. Journal of Transport Geography 48: 41–51. https://doi.org/10.1016/j.jtrangeo.2015.08.006
  • Horner MW (2008) “Optimal” Accessibility Landscapes? Development of a New Methodology for Simulating and Assessing Jobs-Housing Relationships in Urban Regions. Urban Studies 45(8): 1583–1602. https://doi.org/10.1177/0042098008091492
  • Larsen MM, Madsen B, Jensen-Butler C (2007) Modelling Transport in an Interregional General Equilibrium Model with Externalities. In: Heijman W (Ed.) Regional Externalities. Springer, Berlin, Heidelberg. P.11-45.
  • Li T, Corcoran J, Burke M (2016) Disaggregate GIS modelling to track spatial change: exploring a decade of commuting in South East Queensland, Australia. Journal of Transport Geography 24: 306–14. doi: https://doi.org/10.1016/j.jtrangeo.2012.03.008
  • Mackenbach JD, Randal E, Zhao P, Howden-Chapman P (2016) The Influence of Urban Land-Use and Public Transport Facilities on Active Commuting in Wellington, New Zealand: Active Transport Forecasting Using the WILUTE Model. Sustainability 8(3): 242. https://doi.org/10.3390/su8030242
  • Niedzielski MA, Horner MW, Xiao N (2013) Analyzing scale independence in jobs-housing and commute efficiency metrics. Transportation Research Part A: Policy and Practice 58: 129–43. https://doi.org/10.1016/j.tra.2013.10.018
  • Partridge M, Bollman RD, Olfert MR, Alasia A (2007) Riding the Wave of Urban Growth in the Countryside: Spread, Backwash, or Stagnation? Land Economics 83(2): 128–52. https://doi.org/10.3368/le.83.2.128
  • Saghapour T, Moridpour S, Thompson RG (2016) Public transport accessibility in metropolitan areas: A new approach incorporating population density. Journal of Transport Geography 54: 273–85. https://doi.org/10.1016/j.jtrangeo.2016.06.019
  • Shitova YuYu, Shitov YuA (2016) GIS monitoring of commuting labour migration as a task of regional management. Modern management technologies 2(62): 49-60. URL: https://sovman.ru/article/6206/
  • Shitova YuYu, Shitov YuA, Vlasov DN, Mitroshin AA, Mitroshin PA (2020) GIS technologies for monitoring the transport system of the region on the example of Moscow region. In: Analysis and modelling of economic and social processes. XXVII Conference “Mathematics. Computer. Education, Izhevsk. Pp.130-8. https://doi.org/10.20537/mce2020econ13
  • Wood G (2003) Modelling the ecological footprint of green travel plans using GIS and network analysis: from metaphor to management tool? Environment and Planning B: Planning and Design 30(4): 523-40. https://doi.org/10.1068/b12995

Other data sources

  • Analysis of the mobility of the labour resources of the region, http://mtmstudyo.ru (date of reference: 09/13/2021).

Information about the author

Julia Yurievna Shitova, Doctor of Economics, PhD in Sociology, Professor, Department of Integrated Communications and Advertising, Russian State University for the Humanities, Moscow, 125047, Russia. Email: yu_shitova@mail.ru

login to comment