IN
DEGREE PROJECT TECHNOLOGY, FIRST CYCLE, 15 CREDITS
STOCKHOLM SWEDEN 2018 ,
The Swedish Housing Market:
An Analysis of Contributing Factors on the Sales Price
With Discussion on the Amortization Requirement's Effects
TINTIN CLAESSON
VERONICA EHRSTRÖM EKLÖF
The Swedish Housing Market:
An Analysis of Contributing Factors on the Sales Price
With Discussion on the Amortization Requirement's Effects
TINTIN CLAESSON
VERONICA EHRSTRÖM EKLÖF
KTH ROYAL
Degree Projects in Applied Mathematics and Industrial Economics Degree Programme in Industrial Engineering and Management KTH Royal Institute of Technology year 2018
Supervisor at Valueguard AB: Lars-Erik Ericson
Supervisors at KTH: Daniel Berglund, Hans Lööf
Examiner at KTH: Henrik Hult
TRITA-SCI-GRU 2018:196 MAT-K 2018:15
Abstract
The purpose of this report was to find the impact of different factors on the sales price of one bedroom apartments in Stockholm. Further, an analysis fol- lows discussing the repercussions that the amortization requirement has had on the sales price. The stated problem have been highly discussed, as many depend on the housing market. To regulate the housing market in Sweden the Swedish government, together with Finansinspektionen, has chosen to introduce an amortization requirement. This means that every new loan taker has agreed to amortize 1 or to 2 percent of the total loan.
The method for analysis was multiple linear regression, where several different variables were applied. The most important parameters were different measure- ments on time and place. But less intuitive variables, such as quality of water, were also included. The data used for analysis was supplied by Valueguard AB dated from 2013 to present day, which included approximately 20 000 data points.
Furthermore, the result of the analysis was not very surprising. It is concluded
that the sales price on a condominium in Stockholm City has a business curve
based on the time of the year. It is also clear that external factors as distance
to water makes a difference in price. As far as the amortization requirement is
concerned, it shows that the steep price curve over time has started to flatten
since 2016, but not exceedingly much. This could be a consequence of the scope
of the analysis, as only one room apartments were included.
Sammanfattning
Syftet med denna rapport var att unders¨ oka vilken p˚ averkan olika faktorer har p˚ a f¨ ors¨ aljningspriset av enrumsl¨ agenheter i Stockholm. Vidare diskuteras hur amorteringskravet har p˚ averkat f¨ ors¨ aljningspriset. Den valda fr˚ agest¨ allningen har varit mycket omtalad d˚ a stora delar av samh¨ allet p˚ averkas av f¨ or¨ andringar p˚ a bostadsmarknaden. F¨ or att reglera bostadsmarknaden i Sverige har den svenska regeringen tillsammans med Finansinspektionen introducerat ett amor- teringskrav. Vilket betyder att alla nya l˚ antagare ¨ ar skyldiga att amortera en eller tv˚ a procent av det totala l˚ anet.
Analysen baserades p˚ a multipel linj¨ ar regression, d¨ ar flera olika variabler ap- plicerades. De viktigaste variablerna var olika m˚ att p˚ a tid och plats. Aven ¨ mindre vanliga variabler, s˚ asom vattenkvalitet inkluderades i analyserna. Den data som anv¨ andes distribuerades av Valueguard AB och str¨ acker sig fr˚ an 2013 och fram˚ at, den tilldelade datam¨ angden omfattade cirka 20 000 punkter.
Resultatet av analysen var f¨ oga ¨ overraskande. Det gick att se att slutpriset
p˚ a en bostad i Stockholm har en priskurva som ¨ ar beroende p˚ a vilken tidpunkt
p˚ a ˚ aret den s¨ aljs. Dessutom var det tydligt att utomst˚ aende faktorer, bland
annat avst˚ andet till vatten, p˚ averkade priset. Vad g¨ aller amorteringskravet
visar analysen att den branta kurvan av pris ¨ over tid, har d¨ ampats n˚ agot sedan
inf¨ orandet av amorteringskravet. Detta kan vara en konsekvens av den valda
avgr¨ ansningen f¨ or analysen d˚ a enbart ettor i Stockholms stad ¨ ar inkluderade.
Acknowledgments
Regards to the tutors at Kungliga Teckniska H¨ ogskolan in Stockholm and a very
special thanks to Lars-Erik Ericson with crew at Valueguard AB that generously
borrowed us the dataset.
Contents
List of Figures 5
List of Tables 6
1 Introduction 7
1.1 Background . . . . 7
1.2 Research Question . . . . 7
1.3 Goal and Purpose . . . . 8
1.4 Scope and Limitations . . . . 8
2 Theoretical Framework 9 2.1 Economic Theory . . . . 9
2.1.1 Loan-to-Value Ratio . . . . 9
2.1.2 Macroeconomic Consequences of a High Debt Ratio . . . 9
2.1.3 History of the Amortization Requirement . . . . 10
2.1.4 Supply and Demand . . . . 12
2.1.5 Hedonic Pricing . . . . 12
2.1.6 Rational Choice Theory . . . . 12
2.1.7 Behavioral Finance . . . . 12
2.2 Mathematical Theory . . . . 13
2.2.1 Definition of the Model . . . . 13
2.2.2 Dummy Variables . . . . 13
2.2.3 Model Assumptions . . . . 13
2.2.4 Ordinary Least Square Estimate . . . . 14
2.2.5 Definition of Residuals . . . . 14
2.2.6 Standardized Residuals . . . . 15
2.2.7 Studentized Residuals . . . . 15
2.2.8 Multicollinearity . . . . 15
2.2.9 Model Validation . . . . 16
2.2.10 Variable Selection . . . . 17
2.2.11 Data Selection . . . . 18
2.2.12 Transformation on the Regressor Variables . . . . 18
3 Methodology 20 3.1 Variable Analysis . . . . 20
3.1.1 Explanation of Variables . . . . 20
3.1.2 Excluded Variables . . . . 20
3.1.3 Model definitions . . . . 22
3.1.4 Analysis of AIC, BIC and R
2. . . . 23
3.1.5 Best Fitted Models . . . . 23
3.2 Data Analysis . . . . 25
3.2.1 Processing of Data . . . . 25
3.2.2 Data Reduction . . . . 25
3.2.3 Final Dataset . . . . 26
3.3 Transformation of the Regression Model . . . . 27
3.3.1 Box-Cox Transformation . . . . 28
3.3.2 Quadratic Transformation . . . . 29
3.4 Final Model . . . . 30
4 Result 31 4.1 Presentation of Result . . . . 31
4.2 Regression Equation . . . . 31
5 Discussion 32 5.1 Mathematical Discussion . . . . 32
5.2 Economic Discussion . . . . 36
References 38
List of Figures
1 The Debt to Income in Sweden. . . . 10
2 The Debt to Income Broken Down by County. Source: Ibid . . . 11
3 The Response Variable vs the Regressors . . . . 21
4 Residual Analysis . . . . 25
5 QQ-plot of Standardized and Studentized Residuals . . . . 26
6 CovRatio with Marked Potential Outliers . . . . 26
7 Cook’s Distance with Marked Potential Outliers . . . . 27
8 Residuals vs Fitted Value after Removal of Outliers . . . . 27
9 Cook’s Distance after Removal of Outliers . . . . 28
10 Residual Plot after Box-Cox Transformation . . . . 28
11 Residuals after Quadratic Transformation . . . . 29
12 Residual Plot of Final Model . . . . 30
13 Distance to Water . . . . 33
14 Building Year . . . . 34
15 Recounted Price . . . . 34
16 Change of Price by Date . . . . 35
17 Seasonal Changes in Price . . . . 35
List of Tables
1 ANOVA table of Full Model . . . . 20
2 Variables used in the Analysis . . . . 21
3 Result of Variance Inflation Factors . . . . 22
4 Results of Stepwise Regression . . . . 22
5 Model Specifications . . . . 23
6 AIC, BIC and R
2. . . . 23
7 ANOVA Table of Model 6 . . . . 24
8 ANOVA Table of Model 8 . . . . 24
9 Value of Estimates . . . . 32
1 Introduction
1.1 Background
A condominium can be created in two ways, either by restructure of a rented apartment or when houses are built and sold. Building new constructions are costly and time consuming. The restructure of rented homes into condominiums has had the implications that the supply of rented apartments has decreased and the demand for condominiums has increased. The combination of increased demand of condominiums and a significantly low trend of interest and mortgage rates has driven the market to higher prices making it necessary for buyers to increase their housing loans(1).
During a long period of time in Sweden the debt has increased in a higher tempo than the income of the country. This means that the debt ratio increased in the same years, from 1990 to 2010, leading to extended recessions. Furthermore, Swedish housing prices have increased rapidly since 1990 leading to a greater need for Swedish households to take housing loans. Consequences from this are highly related to macroeconomic risks. With higher debt ratio the households will not be as eager to spend money and this will slow down the economy (1).
Finansinspektionen and the Swedish government have therefore recommended an amortization requirement on Swedish housing loans on behalf of the Euro- pean parliament. Finansinspektionen introduced the new rule in June 2016, which applies to every bank in Sweden. The amortization requirement means that the loan taker is obligated to amortize 1 or 2 percent of the total loan depending on the loan-to-value ratio of the household. The amortization re- quirement comprises all housing loans.(1).
The purpose of the requirement is to increase the amortization ratio and ac- cordingly decrease the debt ratio, but since the rule is relatively new the con- sequences have not yet been detected in the housing market. One goal of this study is to discuss the repercussions of the amortization requirement in the housing market (2).
1.2 Research Question
The research question has been formulated as:
What factors contributes to the sales price of condominiums in Stockholm City and what are the effects from the amortization requirement?
The part of the research question regarding the sales price of condominiums
will be considered using parameter analysis in multiple linear regression em-
ploying data distributed from Valueguard AB. As a part of the analysis the feasibility of the distributed data will be discussed. The second part of the re- search question regarding the amortization requirement will be discussed based on a literature study.
1.3 Goal and Purpose
The aim of this study is, as previously stated, to detect factors for the price of condominiums and to analyze the effects of the amortization requirement on the housing market in Stockholm, using multiple linear regression and economical research. Since the amortization requirement was activated in June 2016 it is desired to examine if any consequences have been revealed in the market. The implications of the amortization requirement are likely to have a great impact in the housing market therefore a study of these implications is of great relevance.
The relevance of this study is also significant considering the past ten years changes in the Swedish housing market. Changes that have been seen are re- garding the major raise in sales prices and the increase in supply of condo- miniums (2). The stakeholders for whom this is relevant are buyers, sellers or owners, politicians, architects and contractors.
1.4 Scope and Limitations
The purpose of this study is to get an insight of the Swedish housing market.
It will strictly handle condominiums as the market for rented housing is contin- uously changing and have a significant amount of illegal trade, meaning when people rent apartments without valid contracts. Furthermore, the report has been limited to Stockholm. The time period that will be investigated starts in 2013 to present day.
The limitation in this analysis concerning the amortization requirement, in-
troduced in June 2016, is the fact that only short-term consequences can be
analyzed. The implications from the additional amortization requirement, in-
troduced in March 2018, will not be investigated to any extent. Therefore, in the
remaining parts of the report when the amortization requirement is mentioned
it refers to that of June 2016 unless anything else is stated.
2 Theoretical Framework
2.1 Economic Theory
2.1.1 Loan-to-Value Ratio
The Loan-to-Value Ratio, LTV is a financial tool used in risk assessment when examining a mortgage application. It is given in decimals or percent and counted as mortgage amount to appraised value of the property. A high LTV-ratio means that the mortgage is much larger than the mortgage taker’s property and therefore the mortgage is at larger risk. It is therefore an indication of the level of vulnerability that a household is exposed to in relation to falling house prices. As a fallout the loan generally has higher interest with a high LTV-ratio (3).
LT V = M ortgage Amount
Appraised V alue of the P roperty (1) 2.1.2 Macroeconomic Consequences of a High Debt Ratio
Debt Ratio
Debt Ratio is a financial ratio that measures total debt in relation to total as- sets. The outcome is given as decimals or in percent and it can be explained as the proportion of a company’s or society’s assets that are financed in debt(4).
Consequences of high debt in a company or society are highly related to fluc- tuations in the economy. When debt is high several economic assets are not as volatile since they are tied up, leading to reduced liquidity. Implications are, among other things, greater bias in case of financial shocks or changes in inter- est rates as the company needs a major part of the assets to reduce and handle debt (5).
Debt Ratio = T otal Debt
T otal Asset (2)
Household Debt
For a household to be able to spread out their consumption over a lifetime one of the most important requirements is that the credit market is functional. To increase the possibilities to buy a home the households must manage to take loans. However, if the households suffer from to much debt, consequences are most likely to occur. The main reason for household debts to increase is rising household prices, which can develop from low interest rates. This is because of the increase in demand of condominiums when interest rates are low. Other reasons to higher household debt are the growing population or a smaller supply of new constructions (6).
The individual household suffers from risk because of the debt ratio. First
of all, when taking a loan the household accept a payment responsibility for
a long period of time. The implication is that changes in the economy are a great risk, as economic chocks and adjustments may not be counted in while taking the loan. Second, a large debt in relation to the value of the house makes the household sensitive to descending housing prices. In extreme cases there is a risk of the housing price to become less than the size of the loan leading to negative equity. Furthermore, large debt in relation to the income of the household is also a vulnerability if loss of income or raise in interest rate would come about(7). In the case of negative economic changes the households have to prioritize their mortgage and interest payments. Corresponding behavior may lead to cut back in consumption and consequently to recession (7).
Debt Ratio in Sweden
The debt ratio in Sweden is growing for each passing year, see Figure 1. The total amount of debt is SEK 3, 8 trillion and approximately SEK 3 trillion is in mortgage loans, which is 77 percent of the total debt. The Swedish LVT-ratio is at 64 percent(8).
Figure 1: The Debt to Income in Sweden.
The debts are quite irregularly distributed in the country, but the obvious con-
Figure 2: The Debt to Income Broken Down by County. Source: Ibid
and G¨ oteborg where the prices are generally the highest in Sweden. The higher prices has led to an increase of 80 percent in debt(1).
Households with a high debt ratio are more sensitive to fluctuations in the market than households with lower debt ratio. Fluctuations can include change in interest or loss of income. Households with high debt ratio are more sensi- tive as they pay a larger portion of their income in interest and amortization.
Therefore if the interest increases the households has to decrease savings and consumption or be forced to move to a cheaper house, which can lead to reces- sion (1).
To force home buyers to decrease their debt Finansinspektionen introduced a amortization requirement in June 2016 which obligates loan takers to amortize 1 or 2 percent depending on the loan-to-value ratio. According to Finansinspek- tionen, the amortization requirement has lead to people buying cheaper real estate, taking smaller loans and use a more substantial part of their savings to finance housing loans, with the repercussion that the households are less sen- sitive to fluctuations in the stock or housing market. As a result of the initial amortization requirement household loaned 9 percent less and bought houses that were 3 percent cheaper(1).
Nevertheless, there are risks remaining. There are still households taking loans with high debt ratio, mainly since prices has increased significantly more than the average income. This could have serious macroeconomic consequences(1).
One of these consequences was that a supplementary amortization requirement
was introduced in March 2018 with the requirement that households with debt
ratios higher than 450 percent must amortize one additional percent of their
loans each year(1).
2.1.4 Supply and Demand
The concept of Supply and Demand is one of the most basic concepts in eco- nomic theory. Supply indicates how much of a good or service is available in the market and demand refers to how much of a good or service is desired by consumers. A relationship between the two is reflected in the price of the good or service. The price can therefore be altered by the behavior of buyers and sellers. When the price decreases the willingness of buyers to purchase increases and the willingness of sellers to sell decreases. As a result an equilibrium is then obtained and a price is set (9).
2.1.5 Hedonic Pricing
In difference with the classic theory of supply and demand Hedonic Pricing rec- ognizes that prices are determined by both internal and external characteristics.
It is commonly used when pricing real estate as there can be several factors that affect the price of a home. The internal factors can include size, condition, ap- pearance and different features such as balconies or solar-panels. The external factors can be the crime rate in the neighborhood or the access to schools(10).
The hedonic pricing method is used to estimate to which extent each factor affect the price(10).
2.1.6 Rational Choice Theory
The Rational Choice Theory states that consumers always make logical decisions in order to maximize the gain and minimize the loss where the decisions result in a great benefit or satisfaction. The Rational Choice Theory is profound in the assumptions of many economical theories. However new theories such as Behavioral Finance challenge these assumptions (11).
2.1.7 Behavioral Finance
Studies in Behavioral Finance has incurred in order to complement the theo-
ries of Rational Choice. Behavioral Finance is based on the believes that peo-
ple sometimes make irrational investment decisions. These irrational decisions
can explain why bubbles and panics occur or why individuals make expensive
investments(12).
2.2 Mathematical Theory
The mathematical theory is written with Introduction to Multiple Linear regres- sion by Douglas C. Monthgomery, Elizabeth A. Pick and G. Geoffrey Vining as reference if nothing else is stated.
2.2.1 Definition of the Model
A multiple linear model with i predictor variables X
1, X
2, ..., X
iand one re- sponse variable Y can be expressed as:
Y
j= β
0+ β
1x
1+ ...β
ix
i+
n, (3) where
nis a normally-distributed random deviation with mean 0 and variance σ
2; that is,
j∼ (0, σ
2) for all j.
This model can also be written in matrix form. With
Y =
Y
1Y
2.. . Y
n
, X =
1 x
11x
12. . . x
1i1 x
21x
22. . . x
2i.. . .. . .. . .. . .. . 1 x
n1x
n2. . . x
ni
,
β = (β
0, β
1, ..., β
i)
T, and = (
1,
2, . . . ,
n)
T. Then (3) can be expressed as
Y = Xβ + . (4)
Thus, (4) implies that Y ∼ (β, σ
2I), where I is an (n × n) identity matrix.
Where y is the response variable that is predicted by the covariates x
ij. The unknown parameter β is estimated using data of the covariates. is the error term, with the requirement that it is normally distributed.
2.2.2 Dummy Variables
Dummy variables or indicator variables are given different values in order to account for the effects that the variable have on the response. The dummy variables will be set to 1 or 0. 1 if it is included in the model and 0 if it is excluded. The dummy variables will be set in order to avoid potential problems with multicollinearity.
2.2.3 Model Assumptions
There are five major assumptions that are required in regression analysis. The
validity of these assumptions should be considered doubtful and subject to anal-
ysis.
1. Linear Relationship
The relationship between the response variable and the regressors is linear, at least approximately.
2. Strict exogeneity
The error term epsilon has zero mean.
3. No Multicollinearity
The number of observations is greater than the number of variables. There exsists no linear relationship between the variables.
4. Homoscedasticity
The errors are uncorrelated with the same variance.
5. No auto-correlation
The errors are normally distributed.
In order to detect the validity of these assumptions several diagnostics can be useful. These will be presented at later stages of the report.
2.2.4 Ordinary Least Square Estimate
The method of Ordinary Least Squares can be used to estimate the regression coefficients. The ordinary least square estimator provides an estimation of β, namely ˆ β, by minimizing the sum squared errors. The normality assumptions presented in 2.2.3 are required.
S(β) = (y − Xβ)
0(y − Xβ) (5)
∂S
∂β
βˆ= −2X
0y + 2X
0X ˆ β = 0 (6)
β = (X ˆ
0X)
−1X
0y (7)
2.2.5 Definition of Residuals
Residuals are defined as
several important assumptions. As listed in chapter 2.2.3 the residual has zero mean and average variance estimated by
P
ni=1
(e
i− ¯ e)
2n − p =
P
n i=1e
2in − p = SS
Resn − p = M S
Res(9)
2.2.6 Standardized Residuals
Standardized Residuals is a handy method when detecting outliers and influen- tial points. The standardized residuals are calculated by scaling the residuals with the approximate average variance M S
Res:
d
i= e
i√ M S
Res, i = 1, 2, ...n (10)
If the standardized residual is large with d
i> 3 it indicates an outlier.
2.2.7 Studentized Residuals
Studentized Residuals is also a good method in distinguishing influential points and outliers. In this case the residuals are scaled with the approximate standard deviation M S
Res:
r
i= e
ipM S
Res(1 − h
ii) , i = 1, 2, ...n (11) The studentized residuals have constant variance V AR(r
i) = 1 regardless of the location of x
iwhen the form of the model is accurate. A point with both large residual and h
iiis probably of high influence.
2.2.8 Multicollinearity
The phenomenon of Multicollinearity occurs when one variable can be linearly predicted by other variables, meaning that there is very high intercorrelation between the independent variables. This is done with high accuracy, which im- pacts the tested model severely. It is therefore a disturbance to the data, hence, a regression model with multicollinearity may not be reliable.
Diagnostics
Multicollinearity can be detected with several methods, a few will be presented below.
The Variance Inflation Factor, VIF, measures the variances of each regressor and its combined effects. The limit for accepted VIF has been chosen to 10.
Therefore values over 10 indicate multicollinearity. The VIF equation is given by V IF
j= 1
1 − R
2j(12)
Variance decomposition propositions are used to detect values or eigenvalues that dramatically inflate variance and as a consequence multicollinearity. Val- ues greater than 0,5 should be excluded.
Treatments
Multicollinearity can be treated by:
1. Collection of additional data 2. Model respecification
a. Redefine the regressors b. Variable elimination
3. Use of estimation methods designed to combat multicollinearity
In the case of non-orthogonal data Ridge Regression can be used as a sup- plement to least square.
2.2.9 Model Validation
The following subsections present methods used to assess feasibility and the validity of the tested model.
R
2The R
2-value can be used to assess the adequacy of the model. R
2defines the percentage of movement of parameters i.e. how accurate the model is. R
2is given by Formula 13.
R
2= SS
RSS
T= 1 − SS
ResSS
T(13) where
SS
T= SS
R+ SS
Res(14)
SS
R= ˆ β
0X
0y −
n
X
i=1