Research Article |
|
Corresponding author: Muhammad Islam ( mislam6667@gmail.com ) © 2023 Muhammad Islam, Farrukh Shehzad, Samrat Ray, Mirza Waseem Abbas.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Islam M, Shehzad F, Ray S, Abbas MW (2023) Forecasting the population growth and wheat crop production in Pakistan with non-linear growth and ARIMA models. Population and Economics 7(3): 172-187. https://doi.org/10.3897/popecon.7.e101500
|
Food security as a major social concern and a global threat, requires better policy decisions based on empirical studies. This work presents a comparative statistical analysis of different methods to forecast wheat area, productivity, production, and population growth rate in Pakistan. Time series data from 1950 to 2020 were analyzed using various methods such as ARIMA, the compound growth exponential regression model (CGREM), Cuddy Della Valle instability index (CDVI), and decomposition analysis. The results show that CGREM performs better than other models. Periodic compound growth rates indicate that wheat area and yield decrease by about 67.0% and 40.0%, while the population decreases by 31.7%. For the period 2001-2020, the compound growth reaches the level of 0.60% for wheat area, 1.21% for yield, while it is high for the population and amounts to 2.22%. The overall compound growth rate for wheat area and yield (about 1.207%, 2.326%) is lower compared to the population (about 2.839%). The paper presents forecasts for wheat area, yield, and population in Pakistan will rise: 12.7%, 25.5%, 31.8% in 2030 and 43%, 97.8%, and 129% in 2050. The results of this study provide empirical evidence for the necessity of policy decisions addressing the problem of food security in Pakistan.
CGREM, decomposition, growth rate, population and wheat crop, food security
Food security has emerged as a major global concern. According to the Economic Survey of Pakistan 2019-2020, the agricultural growth rate was recorded at 4.0% in 2017-2018, 0.6% in 2018-2019, and approximately 2.7% in 2019-2020 (ESP 2020). The population growth rate in Pakistan was about 2.40% in 2018-2019 and 2017-2018, while it was 1.89% in 2016-17, indicating a rising short-term trend in population dynamics that could have adverse social effects on community welfare (ESP 2019). Pakistan’s population growth rate is high compared to other South Asian countries, potentially leading to societal food conflicts (
The share of the agricultural sector is decreasing while the population is steadily increasing each year in Pakistan (
The work of the English economist R.T. Malthus (1766-1834), written in 1798, postulates, «...population growth will always tend to outrun the food supply and that betterment of humankind is impossible without strict limits on reproduction...» In the near future, the world cannot avoid issues of food scarcity (
World Health Organization (WHO), Food and Agricultural Organization (FAO), and many other international and national organizations address the issue of food insecurity (
Various researchers, including the International Food Policy Research Institute (IFPRI), FAO, and many organizations, hypothesize that the high population growth trend, especially in South Asian countries, may lead to food conflicts in the region. Some African countries have been affected by severe food shortages. Food security means limited food availability and restricted access to food in society. The major concern of the current analysis is to statistically assess discrepancies between explosive population growth and productivity abilities in the context of the social dilemma of food security.
For a developing society, it is essential to assess future changes in population and agricultural production using statistical forecasting models with reliable and precise tools. A relevant and accurate statistical model enables us to make strong and accurate predictions for the future, supporting policy decisions in the sphere of the food security threat. The appropriate application of statistical crop yield prediction models is essential to assess how agricultural production responds to future challenges (Lobell & Burke 2010). This study aims to determine, evaluate, and compare the magnitude of changes in the population growth rate and food crop (wheat) availability and sustainability in Pakistan from 1950 to the present, addressing the prevailing social dilemma of food insecurity. The research employs statistical models to forecast wheat area, yield, and production for 2030 and 2050, along with a comparative analysis of population growth. Robustness measures, instability checks, and decomposition analysis with a comparison of compound growth rates are also applied using time series data covering the period 1950-2020.
Secondary time-series data for the period 1950-2020 is collected for 71 years from the Pakistan Bureau of Statistics, Punjab Agriculture Marketing Information Service Department, Punjab Crop Reporting Service Agriculture Department, and various issues of the Economic Surveys of Pakistan, covering wheat crop area, productivity (yield), production, and population. These organizations are owned by the Government of Pakistan, providing reliable and authentic data sources for researchers worldwide. Four variables are employed: wheat crop area, productivity (yield), production, and population of Pakistan, measured as area in thousand acres (‘000’ acres), average yield of 40 kg. per acre (mds/acre), production in thousand tons (‘000’ tons), and population in millions, respectively.
X t = c + ϕ1Xt–1 + … + ϕpXt–p + et + θ1et–1 + …+ θqet–q (1)
where“ϕ” and “θ” are the autoregressive and moving average parameters respectively, “X” is the original series and the “e” is a series of the normally distributed residuals.
The time series Box-Jenkins Methodology (ARIMA) is used to predict wheat area, yield and production, and population of Pakistan based on data collected from 1950 to 2020
The Compound Growth Exponential Regression Model (CGREM) is applied simultaneously with the ARIMA model to find the better model for wheat crop area, productivity, production, and population growth. The annual compound growth rates are measured by the following model (Dhakre & Sharma 2010;
yt = y0[1 + r]t (2)
where “yt” depicts the wheat area/yield/production/population, “t” denotes time period, “0” – initial year, “r” is the compound growth rate. The slope measures the relative change in the response variable, i.e. the absolute change accrued in the feature, and it measures the instantaneous rate of growth. Apply the log-transformation to the equation 2.
Ln (yt) = Ln (y0) + t{Ln (1 + r)}
Ln (yt) = Y, Ln (y0) = A, Ln (1 + r) = B
Y = A + Bt + ε, r = (expB – 1) · 100
To predict the parameter the following equation is applied.
yp = [yc (1 + B)n] (3)
where, “yp” is the value of the response variable at the projected time, “yc” denotes the actual/collected value of the response at time “t”, “B” is the regression slope of the line or regression coefficients, “n” is the total number of years (projection horizon), i.e. “tp–tc”.
Cuddy-Della-Valle instability index (CDVI) was developed by Cuddy and Valle in 1978 to measure the instability in time series data which is related to the trend specifics. CDVI attempts to adjust the coefficient of variation (CV) using R2. A low value of the CDVI indicates the low instability and vice-versa (
where “C.V” stands for the coefficient of variation. The range of instability is as follows: low instability for 0 ≤ CDVI ≤ 15, medium instability for 15 < CDVI ≤ 30 and high instability for CDVI > 30.
Production is a functional form of area and yield. The variation in production is due to changes in area and productivity. The relative contribution of area and productivity to the change in production is estimated by the decomposition analysis model (Dhakre & Sharma 2010;
Change in production = Yield effects + Area effects + Interaction effects
ΔP = A0ΔY + Y0ΔA + ΔYΔA (7)
or
where Δ is for a change over time, “P0” – production at “xt=1950”, “Pc” – production at “xt=2020”, “Y0” – yield at “xt=1950”, “Yc” – yield at “xt=2020”, “A0” – area at “xt=1950”, “Ac” – area at “xt=2020”.
Regression modeling is a crucial task in the applied statistical analysis.
where, “k” is a number of regressors including an intercept and “n” is a number of observations. The term “2k/n” is defined as a penalty factor for AIC.
where, the term “((k/n) ln (n))” is defined as a penalty factor. “k” is a number of regressors including the intercept and “n” is a number of observations.
The R programming language is utilized to analyse seventy-one years of data from 1950 to 2020. The Hyndman-Khandakar algorithm (Hyndman & Khandakar 2008) is employed to select optimal predictive models for wheat crop area, productivity (yield), production, and population variables in Pakistan. The paper demonstrates plots of the auto-correlation function (ACF), partial auto-correlation function (PACF), and the plot of the differenced series. ACF indicates the value of parameter ‘q,’ while the PACF indicates the value of parameter “p”.
Figures
There is no problem of severe autocorrelation in the residuals of the fitted ARIMA models. Table
| Determinants | ARIMA | AR(1) | MA(1) | MA(2) | AIC | SIC | RMSE | Box-Ljung p value |
| Wheat Area | (0, 1, 0) | ---- | ---- | ---- | 1087.8 | 1092.3 | 553.3 | 0.43 |
| Wheat productivity | (1, 1, 2) | 0.943 | -1.72 | 0.84 | 213.8 | 222.8 | 1.03 | 0.87 |
| Wheat production | (0, 1, 1) | ---- | -0.57 | ---- | 1168.4 | 1175.1 | 996.7 | 0.55 |
| Population | (0, 2, 1) | ---- | -0.60 | ---- | 95.5 | 99.7 | 0.46 | 0.98 |
Table
The CGR models appear to be better-fitted models for time series analysis compared to the ARIMA models.
Production is a product function of area and yield, meaning that any change in the area and yield variables will automatically change production. They are driving factors of production that need to be predicted due to concerns about food availability in the future. Figures
| Dependents | AdjR2 | RMSE | Sig | AIC | SIC | Regression coefficient | CGR | 2020 | 2030 (Est) | 2050 (Est) |
| Area | 0.92 | 0.073 | 782.5** | -368.6 | -364.4 | 0.012** | 1.207% | 21750.62 | 24506.2 | 31109.0 |
| Yield | 0.95 | 0.108 | 1302.5** | -313.0 | -308.5 | 0.023** | 2.326% | 29.02 | 36.43 | 57.41 |
| Production | 0.95 | 0.159 | 1395.3** | -258.2 | -253.2 | 0.034** | 3.458% | 25249.50 | ---- | ---- |
| Population | 0.99 | 0.051 | 8896.5** | -419.6 | -415.9 | 0.028** | 2.839% | 215.25 | 283.70 | 492.87 |
Table
. Compound growth rate (CGR), Cuddy Della Valle instability index (CDVI) and coefficient of variation (CV) for wheat area, yield production and population
| Items | Wheat Area | Wheat Productivity | Wheat Production | Population |
| CGR (%) | 1.207 | 2.326 | 3.458 | 2.839 |
| CDVI (%) | 5.26 | 8.66 | 10.38 | 9.08 |
| C.V (%) | 23.52 | 43.3 | 59.97 | 52.46 |
To study sporadic (periodic) variations in wheat area, yield, production, and population, the data is divided into three subsamples for the periods 1950-1975, 1976-2000, and 2001-2020. Table
The periodic changes of the Compound Growth Rate during different periods indicate that wheat area decreases by 67.0%, yield decreases by 40.0%, production decreases by 55.8%, and population decreases less by 31.7%. We interpret the results as follows: there is a mismatch between wheat production and the population of Pakistan, which seems to be a threat to food security in Pakistan.
| Period | Wheat area | Wheat yield | Wheat production | Population |
| 1950-1975 | 1.82 | 2.02 | 3.87 | 3.25 |
| 1976-2000 | 1.21 | 2.12 | 3.36 | 2.74 |
| 2001-2020 | 0.60 | 1.21 | 1.71 | 2.22 |
| % age decrease | 67 | 40 | 55.8 | 31.7 |
The decomposition analysis model is used by the researchers and policy makers to assess growth performance, particularly, to evaluate the contribution of area and productivity to the change in production. Projecting with the compound growth rate semi log model the expected increase in wheat area comparing to 2020 is 12.7% for 2030, 43% for 2050, the expected increase in wheat yield is 25.5% and 97.8% respectively for 2030 and 2050. The expected increase in population is 31.8% for 2030 and 129% for 2050, which shows population growth is expected to be higher compared to the same indicators for area and yield.
Table
Pakistan is anticipated to confront several food security challenges due to its expected population growth until 2050. As of the Global Hunger Index data for 2022, Pakistan is ranked 99th among 121 countries. This study aims to assess the magnitude of population growth rates and the variables related to wheat crops, such as area, yield, and production, to gauge the level of food security concerns in Pakistan.
The study concludes that the Compound Growth Exponential Regression Model (CGREM) provides a better fit when compared to the ARIMA model. The CGREM exhibits lower values across various indicators (RMSE, AIC, and SIC) compared to the ARIMA model.
Periodic analysis of the Compound Growth Rate (CGR) indicates a significant decrease in wheat area, yield, and production by about 67.0%, 40.0%, and 55.8%, respectively, in contrast to the population of Pakistan, which only decreases by about 31.7%. Notably, in the period of 2001-2020, the CGR reached low levels for wheat area (0.60%), yield (1.21%), and production (1.71%), while remaining relatively high for population (approximately 2.22%), signaling a potential threat to food security in the region.
A noticeable gap exists between the growth rates of wheat area and population in Pakistan, with the CGR for wheat area at 1.207% and for population at 2.839%. The expected increases for wheat area are projected as 12.7% in 2030 and 43% in 2050, while for population, the projections indicate 31.8% in 2030 and 129% in 2050, highlighting a comparatively rapid population growth. Enhancing the productivity of wheat crops is identified as a key factor in addressing the imminent threat to food security. The CGR for yield is determined to be 2.326%, slightly lower than the CGR for population (2.839%). Projected increases for wheat yield are 25.5% in 2030 and 97.8% in 2050, compared to population projections of 31.8% in 2030 and 129% in 2050.
Wheat yield and productivity demonstrate lower instability compared to the population of Pakistan, with each determinant falling within the low-indexing region of CDVI instability. The decomposition analysis reveals that productivity contributes more significantly to production than the area (38% vs. 20%), aligning with the conclusion that yield plays a major role in wheat production.
According to the World Population Prospects report of the United Nations Department of Economic & Social Affairs, Pakistan’s population is projected to reach 366 million
This study contributes novel dimensions to the policymaking process in Pakistan. The forecasting of food crops and population involves the utilization of various methods, including the compound growth exponential regression model and ARIMA as a benchmark. The compound growth exponential regression model enhances prediction accuracy for long time series data of wheat area, yield, production, and population. The application of decomposition analysis, compound growth exponential regression, and instability analysis results enables us to evaluate and compare the extent of changes in yield and growth rates, an approach not previously utilized on Pakistan’s data. The technique that combines the standard ARIMA approach with the compound growth rate model provides valuable insights for improving policies aimed at achieving the goals of food sustainability and food security.
Muhammad Islam: Contributed to introduction, data analysis, descriptions, conclusions, methodologies and policy implications.
Farrukh Shehzad: Involved in data analysis, supervision, descriptions, conclusions and policy implications.
Samrat Ray: Contributed to revision, draft reading, analysis, descriptions, discussions and policy implications.
Mirza Waseem Abbas: Involved in introduction, analysis, results, discussions and conclusions.
Escap UN (2009) Sustainable agriculture and food security in Asia and the Pacific. UN ESCAP, Bangkok. ULR: https://repository.unescap.org/bitstream/handle/20.500.12870/2722/ESCAP-2009-FS-Sustainable%20Agriculture.pdf?sequence=4
ESP (2019) Pakistan Economic Survey 2018-2019. Finance Division, Government of Pakistan. URL: https://www.finance.gov.pk/survey_1819.html
ESP (2020) Economic survey of Pakistan 2019-2020. Finance Division, Government of Pakistan. URL: https://www.finance.gov.pk/survey_1920.html
Muhammad Islam – PhD (Statistics), MA (Economics), Deputy Director (Stat), Crop Reporting Service, Agriculture Department, Punjab, Pakistan. E-mail: mislam6667@gmail.com
Farrukh Shehzad – PhD (Statistics), Assistant Professor, Department of Statistics, the Islamia University of Bahawalpur. E-mail: fshehzad.stat@gmail.com
Samrat Ray – PhD (Eco) from St Petersburg Polytechnic University, Russia , Affiliation Dean and Head of International Relations, IIMS, Pune, India. E-mail: s.ray@iimspune.edu.in
Mirza Waseem Abbas – PhD (MS), Assistant Director (Stat), Crop Reporting Service, Agriculture Department, Punjab, Pakistan. E-mail wazeem.mirza@gmail.com