2019-05-17 The Odd One(s) Out Thies Lindenthal & Carolin Schmidt htl24@cam.ac.uk carolin.schmidt@zew.de
Two core observations motivate our paper First: Observed transaction prices follow a “ruler distribution” • Disproportionally high share for round prices (e.g. multiples of 10K/25K/50K) • Prices get coarser with price levels (Ball et al., 1985; Thomas et al., 2010)
Second: Round prices are less precise... … larger deviations from fundamentals. Distribution of residuals from repeat sales regression
This effect is robust, not driven by tax subsidy thresholds These UK-specific price regions have been excluded from analysis • 250K “Help to buy” scheme • 500K first time buyer stamp duty discount (since 2018)
Negotiated sales prices, not asking prices British way(s) of trading houses • Guide price set by seller • Potential purchasers hand in sealed bids • The seller is not bound to accept the highest offer, • She can pick any (or none) • Estate agents often facilitate the trade and, often, strategically release information • In England and Wales, the terms of an offer remain subject to contract • No-one is legally obliged to continue with the transaction until the formal contract has been signed and the parties have exchanged the contracts • Transactions take months • Both sides have to assess the risk of transaction falling through • In Scotland, transactions are binding earlier in the process
What is so special about round prices? Why do we care? • Direct applications • Reliability of comparables when valuing individual buildings • Mass appraisal systems • Signalling in negotiations • Design decisions when developing • Round prices offer insights on human decision making • When are we confident deciders? In which cases is it difficult to make a judgement?
Value of aesthetics / architecture / preferences / beauty Paper is part of a larger research theme on “human” side of property • “Beauty in the Eye of the Home-Owner: Aesthetic Zoning and Residential Property Values” (REE, 2017) Value = f(X, Shape, Shape neighbours,...)
“Machine Learning, Building Vintage and Property Values” (Lindenthal, Johnson)
What is our contribution to the literature? P(round price | sale ) = 𝑔 (buyer and seller factors, market factors, asset factors) • There is rich theoretical & empirical research on round vs. precise numbers • general psychology, retail, negotiations, sport • Most research focuses on the psychological aspect of these numbers • In real estate, round transaction prices investigated by Palmon et al. (2004) and Beracha and Seiler (2013) • We show that market conditions and asset characteristics influence the salience of these mental traits • Heterogeneity of real estate creates variation in the likelihood of observing a round price • Some buildings are easier to value than others - the price reveals the relative difficulty • Reliability of observed prices is dynamic
Buyers and sellers Amateurs and experts alike use mental shortcuts, consciously or unconsciously • Heaping (uncertainty or inability) • Age (A’Hearn et al., 2009): dyscalculics tend to report their age as a multiple of five or other attractive numbers • Analyst forecasts (Herrmann and Thomas, 2005) • Conformist behaviour • ½-carat diamonds sell at an 18% premium relative to diamonds slightly < ½ carat (Scott and Yelowitz, 2010) • Institutional rules, familiarity, efficiency • Stock and commodity prices end in even/round numbers even though finer pricing is permitted (Osborne, 1962; Niederhoffer, 1966; Ball et al., 1985) • Simplification of financial record processing (Stevenson and Bear, 1970) • Round prices are easier to process (Tversky and Kahneman, 1973) • Digit preference • Best drug dosage? Such that 20 drops of cough syrup three times a day are effective in ~90% of the cases (Herxheimer, 1991)
Sellers (and buyers?) want to achieve thresholds Similar to motivation of Marathon runners (Allen et al., 2017) • For a seller, it is gratifying to exceed a mental mark, willing to push just a bit harder “on the last mile” • What about buyers? Shouldn’t they have the opposite motivation? • Buyers vs sellers markets?
Some use coarse prices strategically • Cheap-talk model (Backus et al., 2015) • Sellers advertising at round prices signal their willingness to negotiate (lower TOM) • Precise-price advertisers achieve higher final sales prices on average (higher TOM) • Negotiation efficiency hypothesis (Harris, 1991) • Round list prices speed up transactions • Palmon et al. (2004) on clustering in real estate prices • List prices more often just-below-even ending, transaction prices more even-ending • NEH predicts even-ending transaction prices, especially when information on the property is scarce & costly to obtain • Psychological literature: coarse numbers signal uncertainty, precise numbers confidence and knowledge • “One year” versus “365 days” • Yaniv and Foster (1995), Goldsmith et al. (2002), Zhang and Schwarz (2013), Mason et al. (2013)
A (hypothetical) blue book for homes... … could explain discontinuities in price distributions • Left digit bias, due to limited information-processing ability (Lacetera et al., 2012)
Uncertainty about marginal prices for attributes High uncertainty for hedonic coefficients due to low number of comparables? • Liquidity (information on market) • Low # comparables implies high quality uncertainty (Martel, 2018) • Everything else equal, lower liquidity and fewer comparables should lead to more round prices being observed. • Quality uncertainty (information on building) • Prices cluster when asset values are uncertain (Ball et al., 1985; Binder, 2017) • Listings are notoriously vague. Square footage? Damp? Noise? Sitting tenants? • Similarly: firm valuation is more subjective and variable for young firms with a short earnings history (Baker and Wurgler, 2006) • Asset uniqueness (combination of both) • How to quantify and value uncommon specifications? • Extreme values or interaction terms reduce # relevant comparables, driving up uncertainty • Value of detached house with garage in Romsey Town?
Uncommon combination (style and location)
Uncommon attributes
Empirical strategy Can we predict the occurrence of round prices? • We cannot observe buyers’ and sellers’ characteristics or motivation • Omit (for now) • Information on market liquidity from universe of sales (land registry) • Sales are geocoded and have time stamps • Number of comparables in previous x months within y miles from each building • Information on asset uniqueness from limited set of hedonics and computer vision • Derive additional variables from images • Model asset uniqueness directly
More comparables reduce odds of round prices More information on the local market makes it easier to value a property • Probit regression on round price • Controlling for location at postcode and streetlevel • Year • Hedonics: size/volume, new, vintage • Price band (50K buckets) • Price is defined as being “round” if it is a multiple of £25K • Is 275,000 more round than 280,000? • Core variable of interest: # comparables • Number of sales in same postcode in preceding 12 months as number of comps.
Probit estimates Cambridge submarket • Control variables • Hedonics & Vintage • Location • Year • Price band • Expected sign for # comps!
From computer vision to economic analysis Deriving additional variables / model uniqueness directly Images Feature Vector Classification Further analysis ML ML Classification/ Round quantification Price? ML
Computer vision, off the shelf Deep convolutional neural network to obtain feature vectors • Pre-trained Inception v3 model in Tensorflow API • Convolutional: Exceptionally suitable to detect era specific details such as window styles, ratios, brickwork, ratios • Freely available & frequently used • Penultimate layer is 2048 dimensional feature vector
DNN Design • Testing many specifications • Compromise across geographic scope and # variables
Training on balanced training set First for the UK (100K sample) • For the UK, we have basic hedonics only - but # comps! • Same number of round/non-round sales in training • Out of samples test realistic (using unseen data, ~11% round) • F 1 -score: 2 (recall * precision) / (recall + precision)
Zooming in on Cambridge Basic hedonic variables (area/volume) don’t boost predictive power much • Precision for “round” improves, recall does not
Add more information derived from images Can we spot the odd ones out? • Base line + • Vintage Classifications + • “Raw” feature vectors
Well-behaved training curves (core hedonics, liquidity measures)
Most training done after ~20 epochs Adding vintage of house and neighbouring buildings
Oops. Overfitting. Clearly not optimal.
Reverse regressions: Putting black box into context Adding the ML classification as another regressor
Recommend
More recommend