booktabs \UseTblrLibrarysiunitx \NewTableCommand\tinytableDefineColor[3]
Welfare Effects of Self-Preferencing by a Platform: Empirical Evidence from Airbnb††thanks: I thank Masanori Tsuruoka, Chishio Furukawa, and Kyogo Kanazawa for their helpful comments and advice.
Abstract
This paper studies the welfare effects of self-preferencing by Airbnb, a practice where Airbnb utilizes its pricing algorithm to prioritize maximizing platform-wide commission revenue rather than optimizing individual host revenues. To examine this welfare implication, I construct a Bertrand competition model with differentiated products between Airbnb hosts and hotels. Using unique data from Tokyo’s 23 wards, I estimate the model and conduct counterfactual simulations to evaluate the welfare effects of self-preferencing. Counterfactual simulations reveal that self-preferencing reduces social welfare by 5.08% on average, equivalent to an annual loss of about 14.90% of Tokyo’s vacation rental market size in 2023 while increasing Airbnb’s commission revenue by an average of 37.73%. These findings highlight the significant trade-offs between platform-driven revenue optimization and market efficiency, emphasizing the urgent need for competition policy reforms and greater transparency and accountability in platform practices.
Keywords: self-preferencing, platform, welfare effect, market structure
JEL Classification Codes: L13, L83, L86, D43
1 Introduction
With the rapid advancement of digital technology, platform industries have emerged as a dominant force in the global economy. Remarkably, five of the ten largest firms by market capitalization are platforms.111The five platforms are Apple, Amazon, Alphabet (Google), Microsoft, and Meta. Source: Nikkei https://www.nikkei.com/marketdata/ranking-us/market-cap-high/, accessed October 8, 2024. In Japan, the consumer e-commerce market was valued at 24.8 trillion yen in 2023, accounting for approximately 4.19% of the nation’s nominal GDP.222Based on the Ministry of Economy, Trade, and Industry’s “2023 Survey Report on the Electronic Commerce Market” https://www.meti.go.jp/policy/it_policy/statistics/outlook/R5tyousahoukokusho.pdf, accessed October 8, 2024.
In this context, concerns have arisen about the platform self-preferencing, where they prioritize their goods and services above others (Crémer et al. 2019, Kittaka et al. 2023). A notable example is the European Commission ruling against Google for abusing its dominant market position by manipulating search results to favor its content, resulting in a 2.42 billion euro fine (European Commission 2017). In response to such practices, competition authorities are considering proactive approaches to regulating self-preferencing and enforcing antitrust laws (Digital Competition Expert Panel 2019, European Commission 2020a, b, U.S. House Committee on the Judiciary 2020, Japan Fair Trade Commission 2021). Given the growing concerns regarding the impacts of self-preferencing, it is crucial to investigate its implications. This study therefore addresses the following research question: “What is the impact of platform self-preferencing on social welfare?”
To examine the research question, I specifically focus on Airbnb and its self-preferencing through a pricing algorithm, Smart Pricing (SP). Airbnb is a platform that enables users to rent out unused homes or vacant rooms to travelers. Providers of these accommodations, referred to as hosts, are charged a 3% service fee on their earnings by Airbnb.333As noted on Airbnb’s website: https://www.airbnb.jp/help/article/1857, accessed on May 23, 2024. Since 2015, Airbnb has offered a pricing algorithm, SP, which allows hosts to delegate price determination to the algorithm.444https://www.cnbc.com/2015/11/12/airbnb-launches-smart-pricing-for-hosts.html, accessed on April 12, 2024. Hosts retain the option to use SP at their discretion.
In this paper, I define Airbnb’s self-preferencing as the practice of using SP to maximize the platform’s total commission revenue, rather than prioritizing individual host revenue.555Since Airbnb cannot observe the marginal cost of individual hosts, it maximizes revenue rather than profit. This approach aligns with Airbnb’s short-run incentives, as maximizing commission revenue across the platform often conflicts with optimizing revenue for individual hosts. To analyze this behavior, I consider two distinct scenarios:
-
1.
Baseline scenario. Airbnb adjusts SP to maximize the revenue of a host using SP.
-
2.
Self-preferencing scenario (counterfactual). Airbnb adjusts SP to maximize its total commission revenue.
Figure 1 graphically provides an example of market structures in each scenario. I assume the self-preferencing scenario to be counterfactual because Airbnb claims that SP is designed to prioritize host revenue (Ye et al. 2018). By comparing welfare outcomes across these scenarios, I empirically evaluate the social welfare effects of Airbnb’s self-preferencing.
I constructed unique data by scraping accommodation information from the websites of Airbnb and Booking.com. The dataset covers listings from Tokyo’s 23 wards, collected between May 12 and July 13, 2024. Key variables include accommodation prices, the number of reviews, average ratings for various criteria (such as cleanliness, location quality, and staff communication), and the number of available rooms for hotels and private accommodations.
Using this data, I estimate a static Bertrand competition model between Airbnb hosts and hotels listed on Booking.com. The analysis proceeds in three steps. First, the consumer demand is modeled using the random coefficient nested logit demand model, estimated with the Berry et al. (1995)’s algorithm (BLP). This model incorporates a nested structure based on price ranges to capture that the substitution patterns between Airbnb accommodations and hotels weaken when their prices differ significantly, as found by Zervas et al. (2017). Second, assuming Bertrand competition with differentiated products, the marginal costs of hosts and hotels are estimated by first-order conditions. In the counterfactual scenario representing self-preferencing, Airbnb adjusts host prices using SP to maximize total platform-wide commission revenue. This adjustment also accounts for the revenues of hosts not using SP, thereby relaxing price competition. Finally, I compute the differences in social welfare between the baseline and self-preferencing scenarios, quantifying the empirical impact of Airbnb’s self-preferencing on social welfare within the framework of short-run price competition.
The results indicate that Airbnb’s self-preferencing significantly negatively impacts social welfare. Specifically, when Airbnb engages in self-preferencing through SP, social welfare decreases by an average of 5.08% per market. This decline is driven largely by a notable increase in prices set by hosts utilizing SP, with prices rising by up to 1.5 times compared to the baseline scenario. This translates to an average daily welfare loss of 115,400 yen per market. Aggregating this estimate across Tokyo’s 23 wards yields a total daily welfare loss of 2.65 million yen. On an annual basis, this amounts to a welfare reduction equivalent to approximately 14.90% of Tokyo’s 2023 vacation rental market. A major driver of these welfare losses is the substantial increase in Airbnb’s total commission revenue, which rose by an average of 37.73%. This highlights Airbnb’s strong incentive for engaging in self-preferencing, as the platform captures a greater share of market surplus through its pricing algorithms. These results demonstrate how self-preferencing by platforms can harm social welfare by prioritizing platform interests over broader market efficiency. Given the lack of transparency requirements for algorithms, the findings call for urgent discussions on competition policy and algorithmic accountability to mitigate such adverse effects.
Related Literature. This paper contributes to three key strands of literature: (i) platform self-preferencing, (ii) algorithmic pricing, and (iii) the impact of Airbnb on the hotel industry.
First, regarding the platform self-preferencing, a growing body of literature has examined practices such as manipulating search results (De Corniere and Taylor 2019, Lam 2021, Kittaka and Sato 2022), altering recommendation systems and launching private brand products on platforms like Amazon (Zhu and Liu 2018, Wen and Zhu 2019, Chen and Tsai 2024).666See Kittaka et al. (2023) for a literature review of self-preferencing by platforms. However, empirical analyses investigating the impact of self-preferencing on social welfare have yet to be developed. This is largely because the algorithms used by platforms are unobservable, making it difficult to infer causal relationships between social welfare and self-preferencing. This paper contributes to this literature by constructing a Bertrand competition model between the platform and non-platform firms, quantifying the welfare impact under the self-preferencing scenario.
Next, in the literature on pricing algorithms, prior studies have explored simulations of price competition using machine learning (Calvano et al. 2020, Klein 2021), theoretical studies on collusion strategies (Salcedo 2015, Lamba and Zhuk 2022, Asker et al. 2022, Brown and MacKay 2023), and empirical studies of the impact of pricing algorithms on prices (Assad et al. 2024, Musolff 2024, Hortaçsu et al. 2024). Among these, this paper is particularly related to Harrington Jr (2022).777See also Johnson et al. (2023), which examines how Amazon can design its recommendation system to prevent collusion when sellers use pricing algorithms and provide the simulation result using machine learning. He theoretically analyzed the incentives of competing firms in a duopoly market for differentiated products and a firm that develops and sells pricing algorithms. My analysis differs from Harrington Jr (2022) by focusing on a platform that provides pricing algorithms to its users for free. Airbnb has an incentive to maximize its commission revenue from transactions through pricing algorithms.888Airbnb’s commission fee is fixed at , a rate established by Airbnb and unchanged from 2015 through 2024. For studies that model the fees set by platforms selling their products and analyze their welfare effects, see for example Anderson and Bedre-Defolie (2021), Etro (2023). By measuring the effect of this platform-specific incentive on social welfare, I make a novel contribution to the literature. However, it should be noted that I do not analyze the positive welfare effects of pricing algorithms, such as improved demand forecasting or reduced pricing costs, as discussed by Miklós-Thal and Tucker (2019) and O’Connor and Wilson (2021).
Finally, this paper contributes to the literature on the impact of private accommodations on the hotel industry and consumers. Numerous studies have analyzed Airbnb’s effects on incumbent hotels (Horn and Merante 2017, Zervas et al. 2017, Dogru et al. 2020, Barron et al. 2021, Farronato and Fradkin 2022). Notably, Farronato and Fradkin (2022) modeled competition between Airbnb hosts and hotels as Cournot competition, empirically assessing the effects of Airbnb’s entry on hotel revenues, prices, and social welfare. However, their analysis predates the introduction of Airbnb’s SP and thus does not consider the impact of self-preferencing through SP. This paper diverges by focusing on short-term incentives and assuming Bertrand competition, offering a distinct perspective from Farronato and Fradkin (2022).
I have organized the rest of the paper as follows. Section 2.1 provides an overview of the background of the accommodation industry, while Section 2.2 describes the data. Section 3 presents the price competition model between Airbnb hosts and hotels. Section 4 discusses the estimation results of model primitives. Section 5 details the results of the counterfactual simulation, including changes in equilibrium prices and Airbnb’s commission revenue when transitioning from the baseline scenario to the self-preferencing scenario, as well as the welfare effects of self-preferencing. Finally, Section 6 concludes.
2 Industry and Data
This section explains why Airbnb and the accommodation industry are particularly suitable for the study of the welfare effects of self-preferencing and describes its key features to motivate my structural model in the next section.
2.1 Accommodation Industry and Airbnb
Airbnb is an online platform offering accommodations and experiences, featuring over 5 million hosts and properties worldwide. Since its launch in 2007, Airbnb has facilitated more than 2 billion guest stays. Figure 2 and Figure 3 illustrate key aspects of the platform, with the former showcasing a sample Airbnb search result and the latter providing a screenshot of a host’s pricing calendar featuring SP. In 2015, Airbnb introduced SP, which they claim optimizes prices to maximize individual host revenue (Ye et al. 2018). I adopt the scenario where Airbnb uses SP to maximize host revenue as the baseline, conducting counterfactual simulations under a self-preferencing scenario to evaluate its impact. In Japan, other vacation rental intermediaries, such as STAY JAPAN and Airtrip Minpaku, operate in the market. However, Airbnb dominates the industry. As of April 12, 2024, the number of rooms available for reservation in Tokyo on May 12 was as follows: Airbnb listed 1,243 rooms, STAY JAPAN listed 16, and Airtrip Minpaku listed 49. Given (i) Airbnb’s provision of SP as a key platform feature and (ii) its dominant position in Tokyo’s vacation rental market, Airbnb serves as an ideal subject for examining the effects of platform-driven self-preferencing on social welfare.
Empirical studies have shown that Airbnb’s entry into the accommodation industry reduces hotel revenues and prices, suggesting a substitutive relationship between the two (Zervas et al. 2017, Farronato and Fradkin 2022). However, there is no evidence of competition between hotels and hosts in Tokyo. To guarantee the robustness of results, in Section 4, I estimate models under two contrasting assumptions: one where Airbnb hosts and hotels are in competition and engage in price competition, and another where they do not compete. The latter assumption follows the framework adopted by Foroughifar and Mehta (2024). This approach serves as a robustness check for the results, ensuring that the findings are not sensitive to the assumed nature of competition within the accommodation industry.
2.2 Data
I have constructed unique data for Airbnb and hotel listings on Booking.com by scraping web data from their websites. The dataset covers listings within Tokyo’s 23 wards, from May 12 to July 13, 2024. I observe the daily prices, vacancies, reviews per room, and average review ratings for various criteria, including cleanliness, communication with staff, location quality, and overall room evaluation. Since Airbnb hosts and hotels adopt dynamic pricing, adjusting their prices in response to demand fluctuations as the check-in date approaches, I collected all data one month before the check-in date to minimize the influence of dynamic pricing.999Pan and Wang (2021) and Huang (2024) study how Airbnb hosts set prices to respond to temporal demand shocks and how the check-in date prices vary depending on the time gap between the check-in date and the booking date. In Appendix A.1, I present the reason why I chose Booking.com as a source of hotel information, as well as a detailed explanation of the data.
2.2.1 Descriptive Statistics
I display descriptive statistics in Table C. This table presents the means and standard deviations of prices, the number of reviews, evaluation indices (such as location, staff communication, cleanliness, and overall room rating), and the number of available units, separately for Airbnb accommodations and hotels listed on Booking.com across all periods in the 23 wards of Tokyo.
These descriptive statistics reveal two key heterogeneities that underscore the differences between Airbnb hosts and hotels: prices and ratings. On average, hotels tend to have higher prices compared to Airbnb hosts (0.337 for hotels and 0.287 for Airbnb hosts, with prices normalized by dividing by 100,000). This disparity is largely driven by the presence of luxury hotels. Figure 4 compares the average prices of Airbnb hosts and hotels across Tokyo’s 23 wards. The overall average price is elevated by the concentration of luxury hotels in areas like Chiyoda and Minato wards. However, in other areas, Airbnb hosts generally exhibit higher average prices than hotels. This observation motivates me to include a dummy variable in the demand model to distinguish whether a listing is operated by an Airbnb host or a hotel.
The average ratings for evaluation indices are statistically significantly lower for hotels than for Airbnb accommodations, while the number of reviews is higher for hotels. The following factor can explain this: differences in incentives within the evaluation systems may also play a significant role. As discussed by Belleflamme and Peitz (2021), both Airbnb hosts and hotels derive positive gains from favorable reviews and ratings, creating incentives for strategic distortions of them. The data I obtained also suggest the possibility of bias due to such “fake reviews” (Xu et al. 2015, Zervas et al. 2021, He et al. 2022). He et al. (2022) examines how fake reviews are used on Amazon Marketplace, where reviewers sell their services in private Facebook groups and receive payments through PayPal. Their findings indicate that the average ratings of products decline once firms cease purchasing fake reviews, suggesting that such reviews are predominantly used to promote low-quality products. Similarly, complementary work by Zervas et al. (2021) reveals that nearly 95% of Airbnb properties worldwide hold an average rating of 4.5 or 5 stars, with properties rated below 3.5 stars being virtually nonexistent. In contrast, hotels on TripAdvisor exhibit significantly lower average ratings of 4.1 stars with greater variation between reviews. While vacation rentals on TripAdvisor achieve relatively high ratings, only about 85% of these properties have average ratings of 4.5 stars or higher. These observations are highly consistent with the rating patterns in my data.
2.2.2 Criteria for Determining the Use of Smart Pricing
Whether Airbnb hosts are using SP cannot be directly observed from my data. I therefore determine that a host is using SP if they meet the following two criteria:
Criterion 1: “Operating Low-Priced Listings”.
Hosts can significantly minimize their price-setting costs.101010Zbaracki et al. (2004) empirically analyzed pricing costs using firm data. They identified that pricing costs, considering menu costs, information gathering, decision-making, and internal communication costs, account for 1.22% of revenue. Hosts operating low-priced listings, which have a lower degree of product differentiation, are expected to have stronger incentives to reduce these costs.
Thus, hosts operating low-priced listings, which are less differentiated, are more likely to adopt SP. Zhang et al. (2021) used Airbnb data from the United States between 2015 and 2017 and showed that the average monthly prices for hosts using SP and those not using SP were and , respectively.111111The number of hosts using SP was 2,118, while the number not using SP was 7,278. While Zhang et al. (2021) obtained information on SP usage directly from Airbnb, the data used in this study does not allow for direct observation of whether hosts use SP.
The relatively lower prices among hosts using SP support the validity of Criterion 1. The classification of whether a host operates low-priced listings is conducted as follows: (i) perform 5-level clustering based on prices for both Airbnb hosts and hotels on Booking.com using the k-means method, (ii) interpret listings in the bottom two clusters as low-priced.
The clustering results are shown in Table 4.
Criterion 2: “Engaging in Dynamic Pricing”. Dynamic pricing refers to adjusting prices in response to changes in demand. In the accommodation industry, sellers operate with fixed capacities, which typically leads to lower prices as the check-in date approaches. Under Smart Pricing, prices are updated daily, so hosts who change their prices daily are considered to meet Criterion 2. As a result, the proportion of Airbnb hosts utilizing dynamic pricing was found to be 52.13%.
Following the above procedure, I found that 16.7% of Airbnb hosts met both criteria. This is close to the figure reported in previous studies using datasets with known SP usage (Zhang et al. 2021, Foroughifar and Mehta 2024), where the proportion was 22% in both datasets, indicating consistency with past findings. Table 5 shows how many listings use SP for each cluster, restricting the sample to Airbnb only.
I observed notable variations among price clusters in Table 4, which suggest the need to incorporate a nested structure into the Bertrand competition model. Similarly, I found significant differences in accommodation prices and supply across months, wards, and days of the week. Figure 5 illustrates the total number of Airbnb listings and hotels in each ward, revealing substantial disparities in supply. These variations likely arise from factors such as (i) differences in demand driven by the presence of tourist attractions, and (ii) variations in supply costs linked to rent differences across wards. To address (i), my model incorporates fixed effects for each of the 23 wards, capturing the heterogeneity in market characteristics. Figure 4 and Figure 6 further highlight differences in average prices by ward, month, and day of the week, reinforcing the need for fixed effects in the consumer demand model. In contrast, addressing (ii) within the supply side is infeasible due to the lack of observable variables that capture daily fluctuations in supply factors across the 23 wards of Tokyo. For instance, rent for accommodations could serve as a supply factor. However, since rents do not fluctuate daily, they cannot be incorporated into the supply-side model, which defines markets based on a pair of daily and area.
3 Model and Estimation Strategy
In this section, I describe consumer demand and the short-run price competition model that I use to estimate the welfare effects of self-preferencing and explain the estimation strategy.
3.1 Consumer Demand
I use the random coefficient nested logit (RCNL) model to estimate consumer demand. The RCNL model has been applied in several recent empirical articles (Ciliberto and Williams 2014, Miller and Weinberg 2017, Igami et al. 2024).
I define a market by a pair of days and one of Tokyo’s 23 wards. Each consumer purchases one of the observed products () or selects the outside option (). Consumer ’s indirect utility from product in market is
(1) |
where is a vector of observable product characteristics, is i.i.d normal distribution , the parameter govern the extent of variation in tastes for across consumers with different taste shocks , represents unobserved product quality. I incorporate city, month, and the day of the week fixed effects121212I cannot incorporate the product fixed effect because of the curse of dimensionality (see Conlon and Gortmaker (2020) for details). for so that . I control for each fixed effect using dummy variables and is left as a structural error. is a stochastic term. The observable product characteristics include the average review scores for various criteria (cleanliness, communication with staff, location quality, and the overall evaluation) and an indicator for whether the product is on Airbnb.
I assume that can be decomposed by
(2) |
where represents the pre-determined group which I define a pair of price clusters (as shown in Table 4) and whether the listing is Airbnb or hotels, is the i.i.d. extreme value, has the unique distribution such that is extreme value, and is a nesting parameter. Larger values of correspond to a greater correlation in preferences for products of the same group. Finally, we normalize the indirect utility of the outside good such that . To sum up, the parameters we need to estimate are . In Appendix A.2, I present the equations for the market share and demand of product , as well as the formulas for calculating consumer surplus, producer surplus, and social welfare.
This RCLN model specification was motivated by key features of my empirical context and dataset. As shown in Table 4, significant heterogeneity exists between Airbnb and hotels across different price clusters, which is captured by a nested structure and the random coefficient to account for substitution patterns flexibly.131313I explored additional random coefficients; however, the estimates produced were counter-intuitive due to perhaps multi-collinearity (see Grigolon and Verboven (2014) for the detail). It is also natural to assume that consumers exhibit common preferences regarding whether the accommodation is listed on Airbnb or is a hotel, and this is captured using a dummy variable for Airbnb.
3.2 Estimation Strategy
I estimate the demand model by BLP algorithm using Conlon and Gortmaker (2020)’s Python package PyBLP.141414I thank Jeff Gortmaker for sharing these details especially how to implement the specific counterfactual simulation. Prices and within-nest market shares are likely to be correlated with the structural error term because firms set prices with knowledge of product- and market-specific consumer valuations. This creates a standard endogeneity problem. To address this, I use three types of instrumental variables: (i) the number of products in each category defined by K-Means clustering, (ii) the measures of product differentiation (“differentiation instruments”) proposed by Gandhi and Houde (2023) which intuitively capture the relative isolation of each product in characteristics space, and (iii) own product characteristics, .151515i.e., I assume . This assumption that the product characteristics of each good are uncorrelated with the structural error is reasonable since the focus is on short-term decision-making. For differentiation IVs, I employ both the Euclidean distances across continuous characteristics between a focal product and others and their interactions. The interaction terms capture the covariance between two dimensions of differentiation.
Following Farronato and Fradkin (2022), I exclude products with a market share of less than from the analysis. This exclusion is necessary because the presence of products with market shares approaching zero complicates estimation and prevents the BLP algorithm from converging. Regarding the potential market size in market , I adopt the approach from the literature on structural estimation using Airbnb (Li and Srinivasan 2019, Farronato and Fradkin 2022, Foroughifar and Mehta 2024), defining ”. Table C presents the descriptive statistics for the dataset after applying the aforementioned sample restrictions. The characteristics observed are largely consistent with those of the full sample, as shown in Table C. Furthermore, Table C provides the number of hosts using SP, those not using it, and the number of hotels in each market after applying the sample restrictions, where a market is defined as the pair of a specific day and one of Tokyo’s 23 wards. On average, there are 7.28 firms per market, highlighting the oligopolistic environment of these markets.
3.3 Supply-side
I consider SP hosts, non-SP hosts, hotels, and Airbnb as the game players. That is, the set of players and products are and , respectively. Note that SP hosts do not determine their prices and I assume that this decision was made by hosts exogenously. In the market , each player decides simultaneously the price for product . They have the complete information. Each host and hotel receives the profits which are realized at pure-strategy Nash equilibrium. Airbnb obtains the commission revenue from hosts’ revenue:
where denotes the commission rate charged by Airbnb to hosts () and represents the set of all hosts’ products on Airbnb. I will omit the market index for simplicity in the following description.
3.3.1 Baseline Scenario
In the baseline scenario, Airbnb adjusts SP to maximize the revenue of an individual host using SP, while each non-SP host and hotel solves its profit maximization problem. Therefore, we can interpret the objective function of each player as follows:
(3) |
where is the marginal cost, assumed to be constant, and is an indicator function that equals 1 if , and 0 if . By first-order conditions, we have
(4) |
I introduce matrices and vectors as follows:
(5) |
I define a ownership matrix , whose element is given by
(6) |
Using this, the first-order conditions (4) can be rewritten as follows:
(7) | ||||
(8) |
where denotes the Hadamard product. is defined as
(9) |
and is assumed to be a nonsingular matrix. Since marginal costs are assumed to be constant, the marginal cost for each non-SP host and hotel can be calculated from equation (8). Therefore, in the case of the baseline scenario, the observed prices are substituted into equation (8) to compute marginal costs.
3.3.2 Self-preferencing Scenario
In the self-preferencing scenario, Airbnb adjusts SP to maximize its total commission revenue. In Table C, I present the strategic environment under this scenario. In particular, what differs from the baseline scenario is the objective function of Airbnb’s maximization problem:
That is, Airbnb’s objective function is the platform-wide commission revenue rather than individual SP-host revenue. Since we have already estimated demand and calculated the marginal cost, the counterfactual equilibrium prices can be calculated from the new first-order conditions under the environment described in Table C, using the contraction mapping algorithm proposed by Morrow and Skerlos (2011).
4 Estimation Results
4.1 Consumer Demand
Table 9 presents the results of the demand estimation. Columns (i) and (iii) reflect the main RCNL specification, while columns (ii) and (iv) represent the nested logit model, which does not incorporate consumer heterogeneity in prices, . In Appendix A.3, Table 1 further details the demand estimation for column (i), including 95% confidence intervals. To check the robustness of the nature of competition, the analysis is conducted under two assumptions: “With Hotels,” where Airbnb hosts and hotels are assumed to compete directly (Li and Srinivasan 2019, Farronato and Fradkin 2022), and “Only Airbnb,” where hosts and hotels are treated as non-competitors (Foroughifar and Mehta 2024).
As shown in Table C, all price coefficients are negative ( and ) and significant at the 5% confidence level, indicating that lower prices significantly enhance consumer utility. The Airbnb dummy is estimated to have a negative effect ( and ) on demand, though the standard deviation of this effect is imprecisely estimated. Similarly, the standard errors for ratings are imprecisely estimated. This imprecision is likely due to limited variation in ratings, as nearly 95% of Airbnb properties maintain average ratings of 4.5 or 5 stars, with properties rated below 3.5 stars being virtually absent (Zervas et al. 2021).
The estimated nest parameters across different assumptions reveal key differences in substitution patterns and the factors influencing consumer decision-making. In the “With Hotels”, the estimated nest parameters ( for column (i) and for column (ii)) highlight the significance of both price clusters and the distinction between Airbnb and hotels in consumer decision-making. The larger values of the estimated indicate that within-nest substitution plays an important role, underscoring the importance of both price clusters and whether a listing is an Airbnb or a hotel for consumers. Conversely, in the “Only Airbnb”, where nests are determined solely by price clusters, the nest parameters are estimated at and for columns (iii) and (iv), respectively, though their standard errors are imprecise. This suggests that when consumers are restricted to choosing only among Airbnb listings, the substitution pattern within price clusters is weaker, consistent with the smaller absolute value of the price coefficient compared to the “With Hotels” case.
4.2 Marginal Costs
I derive marginal costs, new equilibrium prices, and firms’ profits under a self-preferencing scenario from the demand estimates and firms’ first-order conditions. Table C column (i) presents the marginal cost estimates for RCNL model specifications.
The estimated marginal costs reveal intriguing patterns that vary across platform types and price clusters, shedding light on the underlying cost structures despite certain limitations in the model’s design. In “With Hotels”, the mean marginal cost for Hosts without SP is , while for hotels, it is , indicating that, on average, hotels have higher marginal costs. Similarly, in “Only Airbnb”, the mean marginal cost of hosts without SP is . The estimated mean marginal costs are negative, which is counterintuitive. This result arises from my assumption of constant marginal costs, whereby the estimated values are calculated solely based on observed prices and demand. Figure 7 plots the density of estimated marginal costs categorized by (a) whether the listing is on Airbnb and (b) by price cluster. The estimated marginal costs vary significantly across price clusters. The result that higher price clusters are associated with higher marginal costs is consistent with Farronato and Fradkin (2022). However, it is important to note that my model defines the market daily and across the 23 wards, which limits the ability to capture supply-side variables that fluctuate daily for each ward. Consequently, the model cannot fully explain the daily variations in marginal costs.
5 Counterfactual Simulation Results
In this section, I discuss the counterfactual simulation result given the estimates of marginal costs and consumer demand. Specifically, I examine the changes in equilibrium prices, profits, and Airbnb’s commission revenue, as detailed in Section 5.1, when transitioning from the baseline scenario to the self-preferencing scenario. Following this, in Section 5.2, I evaluate the welfare effects of self-preferencing.
5.1 Equilibrium Price, Profit, and Airbnb’s Commission Revenue
In Table C, I present the equilibrium prices and profits for individual hosts and hotels. I also show Airbnb’s total commission revenue for each scenario in Table C.
First, I begin by examining the results under the “With Hotels” assumption. When Airbnb engages in self-preferencing, its total commission revenue increases significantly (), highlighting its incentive to engage in a self-preferencing strategy. This rise in revenue stems from the differential impact of self-preferencing on hosts and hotels.
For SP hosts, average price increases were modest , yet they experienced significant profit gains . Non-SP hosts, on the other hand, faced larger average price increases but saw only marginal profit improvements . This discrepancy suggests that the absence of algorithmic support for non-SP hosts leaves them at a strategic disadvantage, limiting their ability to translate higher prices into meaningful profit gains. Hotels, whose prices remained largely unchanged , experienced only slight profit improvements . This stability likely reflects the different competitive dynamics hotels face, as they operate outside Airbnb’s SP framework. Overall, Airbnb achieved an average revenue increase of . This significant growth underscores how the platform’s self-preferencing strategy, implemented through the Smart Pricing system, skews the competitive dynamics of the market.
Second, I discuss the results under the “Only Airbnb” assumption and argue the differences from the results under “With Hotels”. Under “Only Airbnb”, the competitive dynamics shift dramatically, producing outcomes that stand in stark contrast to those observed under “With Hotels”. Notably, Airbnb’s total commission revenue increased by an average of 50.167%, surpassing the revenue growth achieved in the presence of hotels.
For SP hosts, average price increases were exceptionally high, reaching %, but these gains did not translate into higher profits. Instead, SP hosts experienced significant profit reductions, averaging %. This paradox suggests that in a market dominated solely by Airbnb hosts, the algorithm’s adjustments lead to unsustainable price hikes that erode profitability for SP hosts. Conversely, non-SP hosts adopted a different strategy, reducing prices by 1.862% on average, which led to modest profit gains of +2.774%. This behavior underscores the divergent pricing strategies between SP and non-SP hosts under the “Only Airbnb”. Non-SP hosts’ ability to capitalize on relatively lower prices hints at competitive dynamics that differ markedly from those in a mixed market with hotels. These findings underscore how the presence or absence of hotel competition fundamentally influences market outcomes. While “With Hotels” demonstrates a relatively balanced interplay between SP and non-SP hosts, “Only Airbnb” reveals more pronounced disparities, with Airbnb extracting significantly higher commissions at the expense of host profitability.
To enhance the tabular findings, I present a scatter plot depicting the percentage changes in price and profit under “With Hotels” in Figure 8. The analysis reveals that SP hosts (red circles) tend to cluster around moderate price increases or even slight decreases relative to the baseline, yet they achieve substantial profit gains. This pattern highlights Airbnb’s strategic pricing adjustments, which optimize aggregate commission revenue through SP. In contrast, non-SP hosts (green crosses) exhibit more varied trends. Some of these hosts have managed to increase profits by slightly lowering prices, a seemingly counterintuitive strategy that effectively counters SP-driven price levels. Meanwhile, hotels (blue triangles) display minimal changes in both prices and profits, with data points concentrated near zero. These findings align with the tabular results, reinforcing the conclusion that Airbnb’s pricing strategy benefits SP hosts the most while preserving stable outcomes for hotels.
In addition, I found that Airbnb’s self-preferencing mitigates price competition as the number of SP hosts in the market increases. To capture this effect, I provide scatter plots in Figure 9 and Figure 10, illustrating the relationship between the number of SP hosts per market and the average rate of price change across different scenarios under “With Hotels”. As the number of hosts with SP increases, hosts’ prices rise, and competition among hosts diminishes, while competition between Airbnb and hotels remains relatively stable. This intuitive result highlights significant heterogeneity in the impact of self-preferencing, emphasizing its role in reducing price competition.
In summary, Airbnb’s shift to platform-centric pricing has led to divergent outcomes across SP hosts, non-SP hosts, and hotels. These findings demonstrate Airbnb’s strong short-run incentives to engage in self-preferencing, as it allows the platform to efficiently maximize commission revenues and capture a larger share of the market surplus.
5.2 Welfare Effects of Self-Preferencing
This subsection answers the research question of this paper, “What is the impact of self-preferencing by the platform on social welfare?” using the estimated models. In Table C, I compare consumer surplus (CS), producer surplus (PS), and social welfare (SW) between two assumptions.
Under the “With Hotels” assumption, consumers bear substantial losses as consumer surplus declines by an average of 5.077% when Airbnb shifts from a baseline scenario to a self-preferencing scenario. While producer surplus increases by an average of 12.383%, the decline in consumer surplus translates into a noticeable reduction in social welfare, which drops by 5.077% relative to the baseline scenario. This outcome highlights a fundamental imbalance: while Airbnb may achieve huge gains, they are far outweighed by the significant losses in consumer welfare.
The contrast is even starker in the “Only Airbnb” assumption. On average, consumer surplus declines by 11.933%, effectively negating the 30.962% rise in producer surplus. The resulting net effect is a pronounced reduction in social welfare. While Airbnb does succeed in extracting higher commission revenues, this comes at a clear cost to consumers and the market’s overall efficiency.
How significant is this welfare loss for the entire vacation rental market in Tokyo? Although my data spans only four months from March to July 2024, a projection of the estimated daily social welfare loss in Tokyo’s 23 wards (2.65 million yen) over a full year reveals the gravity of the situation. Multiplying the daily loss by 365 yields an annual social welfare loss of 968.345 million yen due to Airbnb’s self-preferencing. To put this into perspective, the vacation rental market in Tokyo was valued at approximately 6,500 million yen in 2023.161616In 2023, the estimated total consumer spending in the vacation rental market across Japan was approximately 26 billion yen (based on the Japan Tourism Agency’s Survey of Travel and Tourism Consumption Trends, https://www.mlit.go.jp/kankocho/tokei_hakusyo/shohidoko.html). During the same year, the total number of users of budget accommodations in Tokyo was approximately 22.88 million accounting for 25% of the national total, according to the Japan Tourism Agency’s Statistical Survey on Overnight Travel, https://www.mlit.go.jp/kankocho/tokei_hakusyo/shukuhakutokei.html. Based on this proportion, the size of Tokyo’s vacation rental market in 2023 is calculated as 26 billion yen multiplied by 0.25, resulting in approximately 6.5 billion yen. Thus, this loss constitutes about 14.90% of the market’s total social welfare—a substantial impact. This considerable welfare loss emphasizes the urgent need for regulatory intervention through competition policy to mitigate the adverse effects of self-preferencing.
The main takeaway is that while self-preferencing can enhance the platform’s revenue, they do so at the expense of consumer welfare and overall economic efficiency. From a policy perspective, this conclusion suggests that competition authorities need to pay close attention to the opaque nature of algorithmic decision-making and pricing practices. Since these algorithms are proprietary and remain hidden from public scrutiny, policymakers may need to consider targeted interventions to improve transparency and accountability. This could involve mandating at least partial disclosures of algorithmic objectives or outcomes and introducing guidelines that limit certain forms of strategic self-preferencing. Such policies would aim to ensure that platforms cannot unilaterally reshape the competitive landscape in ways that harm consumers and reduce social welfare.
6 Conclusion
I analyzed the welfare effects of self-preferencing by Airbnb in this paper. Specifically, when Airbnb implements self-preferencing by offering hosts a pricing algorithm (SP) designed to maximize the platform’s total commission revenue, my analysis reveals a significant negative impact on social welfare, with an average decline of 5.08%. At the same time, Airbnb’s total commission revenue increases by 37.73% on average, highlighting a strong incentive for the platform to engage in self-preferencing. These findings have significant policy implications, particularly in light of the widespread lack of transparency and accountability regarding the algorithm they employ. Although I focused on Airbnb, my findings can be applied more generally to marketplace platforms such as Amazon and Mercari which provide the pricing algorithm to sellers.
This study has several limitations. First, while I analyze the impact of self-preferencing on welfare through the incentives of platforms, I do not capture the potential positive effects of pricing algorithms. My model especially does not account for demand forecasting improvements and pricing cost reductions (Miklós-Thal and Tucker 2019, O’Connor and Wilson 2021).
Second, this study examines the short-run incentives for the platform to engage in self-preferencing, but it highlights potential long-run trade-offs that warrant further investigation. While my findings suggest that self-preferencing can benefit the platform in the short run, it also reduces the profitability of some platform users. In the long run, the platform’s profitability is likely related to increasing the number of users on its platform (Belleflamme and Peitz 2021). If self-preferencing leads to declining host profitability, it may ultimately discourage participation and reduce the user base, creating a significant trade-off for the platform. These interactions imply that the platform’s behavior could be better understood through long-run dynamics. Developing a model incorporating these dynamics could provide valuable insights into how self-preferencing impacts the platform’s growth strategy and relationship with hosts.
Finally, I do not capture the strategic behavior of consumers in response to dynamic pricing (Hendel and Nevo 2013, Noguchi 2024), which could result in different welfare effects due to Airbnb’s self-preferencing through SP. A promising avenue for future research is to investigate dynamic pricing games where consumers strategically either expedite or delay their purchasing decisions, potentially influencing pricing outcomes and welfare implications of self-preferencing.
References
- (1)
- Anderson and Bedre-Defolie (2021) Anderson, Simon P, and Özlem Bedre-Defolie. 2021. “Hybrid platform model.” CEPR discussion paper (DP16243): .
- Asker et al. (2022) Asker, John, Chaim Fershtman, and Ariel Pakes. 2022. “Artificial Intelligence, Algorithm Design, and Pricing.” AEA Papers and Proceedings 112 452–56. 10.1257/pandp.20221059.
- Assad et al. (2024) Assad, Stephanie, Robert Clark, Daniel Ershov, and Lei Xu. 2024. “Algorithmic pricing and competition: Empirical evidence from the German retail gasoline market.” Journal of Political Economy 132 (3): 723–771.
- Barron et al. (2021) Barron, Kyle, Edward Kung, and Davide Proserpio. 2021. “The effect of home-sharing on house prices and rents: Evidence from Airbnb.” Marketing Science 40 (1): 23–47.
- Belleflamme and Peitz (2021) Belleflamme, Paul, and Martin Peitz. 2021. The economics of platforms. Cambridge University Press.
- Berry et al. (1995) Berry, Steven, James Levinsohn, and Ariel Pakes. 1995. “Automobile Prices in Market Equilibrium.” Econometrica 63 (4): 841–890.
- Brown and MacKay (2023) Brown, Zach Y., and Alexander MacKay. 2023. “Competition in Pricing Algorithms.” American Economic Journal: Microeconomics 15 (2): 109–56. 10.1257/mic.20210158.
- Calvano et al. (2020) Calvano, Emilio, Giacomo Calzolari, Vincenzo Denicolo, and Sergio Pastorello. 2020. “Artificial intelligence, algorithmic pricing, and collusion.” American Economic Review 110 (10): 3267–3297.
- Chen and Tsai (2024) Chen, Nan, and Hsin-Tien Tsai. 2024. “Steering via algorithmic recommendations.” The RAND Journal of Economics 55 (4): 501–518.
- Ciliberto and Williams (2014) Ciliberto, Federico, and Jonathan W Williams. 2014. “Does multimarket contact facilitate tacit collusion? Inference on conduct parameters in the airline industry.” The RAND journal of economics 45 (4): 764–791.
- Conlon and Gortmaker (2020) Conlon, Christopher, and Jeff Gortmaker. 2020. “Best practices for differentiated products demand estimation with PyBLP.” The RAND Journal of Economics 51 (4): 1108–1161. https://doi.org/10.1111/1756-2171.12352.
- Crémer et al. (2019) Crémer, J., Y.-A. de Montjoye, and H. Schweitzer. 2019. “Competition Policy for the Digital Era.”
- De Corniere and Taylor (2019) De Corniere, Alexandre, and Greg Taylor. 2019. “A model of biased intermediation.” The RAND Journal of Economics 50 (4): 854–882.
- Digital Competition Expert Panel (2019) Digital Competition Expert Panel. 2019. “Unlocking Digital Competition, Report of the Digital Competition Expert Panel.”
- Dogru et al. (2020) Dogru, Tarik, Lydia Hanks, Makarand Mody, Courtney Suess, and Ercan Sirakaya-Turk. 2020. “The effects of Airbnb on hotel performance: Evidence from cities beyond the United States.” Tourism Management 79 104090.
- Etro (2023) Etro, Federico. 2023. “Hybrid marketplaces with free entry of sellers.” Review of Industrial Organization 62 (2): 119–148.
- European Commission (2017) European Commission. 2017. “Antitrust: Commission fines Google €2.42 billion for abusing dominance as search engine by giving illegal advantage to own comparison shopping service.” http://www.congress.gov/bill/117th-congress/senate-bill/2992/all-info, Accessed: 2024-10-08.
- European Commission (2020a) European Commission. 2020a. “Antitrust: Commission Opens Investigation into Apple Practices Regarding Apple Pay.”
- European Commission (2020b) European Commission. 2020b. “Antitrust: Commission Opens Investigations into Apple’s App Store Rules.”
- Farronato and Fradkin (2022) Farronato, Chiara, and Andrey Fradkin. 2022. “The welfare effects of peer entry: the case of Airbnb and the accommodation industry.” American Economic Review 112 (6): 1782–1817.
- Foroughifar and Mehta (2024) Foroughifar, Mohsen, and Nitin Mehta. 2024. “The Challenges of Deploying an Algorithmic Pricing Tool: Evidence from Airbnb.” 10.2139/ssrn.4891482, Available at SSRN.
- Gandhi and Houde (2023) Gandhi, Amit, and Jean-François Houde. 2023. “Measuring Substitution Patterns in Differentiated-Products Industries.” Manuscript revised & resubmitted to Econometrica.
- Grigolon and Verboven (2014) Grigolon, Laura, and Frank Verboven. 2014. “Nested logit or random coefficients logit? A comparison of alternative discrete choice models of product differentiation.” Review of Economics and Statistics 96 (5): 916–935.
- Harrington Jr (2022) Harrington Jr, Joseph E. 2022. “The effect of outsourcing pricing algorithms on market competition.” Management Science 68 (9): 6889–6906.
- He et al. (2022) He, Sherry, Brett Hollenbeck, and Davide Proserpio. 2022. “The market for fake reviews.” Marketing Science 41 (5): 896–921.
- Hendel and Nevo (2013) Hendel, Igal, and Aviv Nevo. 2013. “Intertemporal price discrimination in storable goods markets.” American Economic Review 103 (7): 2722–2751.
- Horn and Merante (2017) Horn, Keren, and Mark Merante. 2017. “Is home sharing driving up rents? Evidence from Airbnb in Boston.” Journal of housing economics 38 14–24.
- Hortaçsu et al. (2024) Hortaçsu, Ali, Olivia R Natan, Hayden Parsley, Timothy Schwieg, and Kevin R Williams. 2024. “Organizational structure and pricing: Evidence from a large us airline.” The Quarterly Journal of Economics 139 (2): 1149–1199.
- Huang (2024) Huang, Yufeng. 2024. “Pricing Frictions and Platform Remedies: The Case of Airbnb.”Technical report.
- Igami et al. (2024) Igami, Mitsuru, Shoki Kusaka, Y. Jeff Qiu, and TuyetAnh L. Tran. 2024. “Welfare Gains from Product and Process Innovations: The Case of LCD Panels, 2001-2011.” SSRN, Working Paper.
- Japan Fair Trade Commission (2021) Japan Fair Trade Commission. 2021. “The Study Group on Competition Policy in Digital Markets Released the Report on Algorithms/AI and Competition Policy.” https://www.jftc.go.jp/houdou/pressrelease/2021/mar/210331˙digital.html.
- Johnson et al. (2023) Johnson, Justin P, Andrew Rhodes, and Matthijs Wildenbeest. 2023. “Platform design when sellers use pricing algorithms.” Econometrica 91 (5): 1841–1879.
- Kittaka and Sato (2022) Kittaka, Yuta, and Susumu Sato. 2022. “Dual-role platforms and self-preferencing: Sequential search approach.” Available at SSRN 3736574.
- Kittaka et al. (2023) Kittaka, Yuta, Susumu Sato, and Yusuke Zennyo. 2023. “Self-preferencing by platforms: A literature review.” Japan and the World Economy 66 101191.
- Klein (2021) Klein, Timo. 2021. “Autonomous algorithmic collusion: Q-learning under sequential pricing.” The RAND Journal of Economics 52 (3): 538–558.
- Lam (2021) Lam, H Tai. 2021. “Platform search design and market power.” Job Market Paper, Northwestern University.
- Lamba and Zhuk (2022) Lamba, Rohit, and Sergey Zhuk. 2022. “Pricing with algorithms.” arXiv preprint arXiv:2205.04661.
- Li and Srinivasan (2019) Li, Hui, and Kannan Srinivasan. 2019. “Competitive dynamics in the sharing economy: An analysis in the context of Airbnb and hotels.” Marketing Science 38 (3): 365–391.
- McFadden (1981) McFadden, Daniel. 1981. “Econometric models of probabilistic choice.” Structural analysis of discrete data with econometric applications 198272.
- Miklós-Thal and Tucker (2019) Miklós-Thal, Jeanine, and Catherine Tucker. 2019. “Collusion by algorithm: Does better demand prediction facilitate coordination between sellers?” Management Science 65 (4): 1552–1561.
- Miller and Weinberg (2017) Miller, Nathan H, and Matthew C Weinberg. 2017. “Understanding the price effects of the MillerCoors joint venture.” Econometrica 85 (6): 1763–1791.
- Morrow and Skerlos (2011) Morrow, W Ross, and Steven J Skerlos. 2011. “Fixed-point approaches to computing Bertrand-Nash equilibrium prices under mixed-logit demand.” Operations research 59 (2): 328–345.
- Musolff (2024) Musolff, Leon. 2024. “Algorithmic Pricing, Price Wars and Tacit Collusion: Evidence from E-Commerce.” Working Paper.
- Noguchi (2024) Noguchi, Shosuke. 2024. “Strategic Buyers and Inventory Choice under Dynamic Pricing in the Flower Market.” Job Market Paper, Rice University.
- O’Connor and Wilson (2021) O’Connor, Jason, and Nathan E Wilson. 2021. “Reduced demand uncertainty and the sustainability of collusion: How AI could affect competition.” Information Economics and Policy 54 100882.
- Pan and Wang (2021) Pan, Qi, and Wen Wang. 2021. “Costly price adjustment and automated pricing: The case of Airbnb.” Available at SSRN 4077985.
- Salcedo (2015) Salcedo, Bruno. 2015. “Pricing Algorithms and Tacit Collusion.” Working Paper.
- Small and Rosen (1981) Small, Kenneth A, and Harvey S Rosen. 1981. “Applied welfare economics with discrete choice models.” Econometrica: Journal of the Econometric Society 105–130.
- U.S. House Committee on the Judiciary (2020) U.S. House Committee on the Judiciary. 2020. “Investigation of Competition in Digital Markets.”
- Wen and Zhu (2019) Wen, Wen, and Feng Zhu. 2019. “Threat of platform-owner entry and complementor responses: Evidence from the mobile app market.” Strategic Management Journal 40 (9): 1336–1367.
- Xu et al. (2015) Xu, Haitao, Daiping Liu, Haining Wang, and Angelos Stavrou. 2015. “E-commerce reputation manipulation: The emergence of reputation-escalation-as-a-service.” 1296–1306.
- Ye et al. (2018) Ye, Peng, Julian Qian, Jieying Chen, Chen-hung Wu, Yitong Zhou, Spencer De Mars, Frank Yang, and Li Zhang. 2018. “Customized regression model for airbnb dynamic pricing.” In Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, 932–940.
- Zbaracki et al. (2004) Zbaracki, Mark J, Mark Ritson, Daniel Levy, Shantanu Dutta, and Mark Bergen. 2004. “Managerial and customer costs of price adjustment: direct evidence from industrial markets.” Review of Economics and statistics 86 (2): 514–533.
- Zervas et al. (2017) Zervas, Georgios, Davide Proserpio, and John W Byers. 2017. “The rise of the sharing economy: Estimating the impact of Airbnb on the hotel industry.” Journal of marketing research 54 (5): 687–705.
- Zervas et al. (2021) Zervas, Georgios, Davide Proserpio, and John W Byers. 2021. “A first look at online reputation on Airbnb, where every stay is above average.” Marketing Letters 32 1–16.
- Zhang et al. (2021) Zhang, Shunyuan, Nitin Mehta, Param Vir Singh, and Kannan Srinivasan. 2021. “Frontiers: Can an Artificial Intelligence Algorithm Mitigate Racial Economic Inequality? An Analysis in the Context of Airbnb.” Marketing Science 40 (5): 813–820.
- Zhu and Liu (2018) Zhu, Feng, and Qihong Liu. 2018. “Competing with complementors: An empirical look at Amazon. com.” Strategic management journal 39 (10): 2618–2642.
Appendix A Additional explanations and analysis
A.1 Detailed Explanation of Data
In this appendix, I explain the reason why I selected Booking.com, the methodology used to calculate vacancies and the number of beds in Airbnb and hotel listings, and the adjustments made to ensure comparability between the two data sets.
Why did I obtain hotel information from Booking.com? As of December 27, 2023, I calculated that approximately 17% of the reviews for Airbnb listings in Tokyo were written in Japanese, based on data I retrieved from Inside Airbnb.171717https://insideairbnb.com/, accessed on 29 April 2024. This indicates that the majority of Airbnb users in Tokyo are foreign guests. Therefore, hotel information was collected from Booking.com, the most widely used hotel booking platform worldwide.181818A information from a website, similarweb: https://www.similarweb.com/top-websites/travel-and-tourism/accommodation-and-hotels/, accessed on 7 December 2024.
The vacancies in Airbnb listings were calculated as follows: Listings supplied by the same host at the same price within the same market were treated as identical properties. This approach accounts for cases where hosts own multiple listings differentiated only by room numbers. Such listings were considered as a single property, and vacancies were counted cumulatively to compute market shares. For hotels on Booking.com, the data exclusively includes rooms without breakfast, considering that Airbnb accommodations typically do not provide breakfast.
I also quantified the number of beds. In accommodations such as vacation rentals and hotels, various beds are available, including Single, Semi-Double, Double, Queen, King, Bunk, Sofa, and Futon. Table 3 presents detailed descriptions, sizes, and the relative width of each bed type, standardized by setting the width of a single bed as the baseline. To treat the number of beds as a continuous variable, I calculated the ”Relative Value to Single Bed” which is calculated based on the bed widths using the formula ”width / 90” where 90 is the width of a single bed.
A.2 Demand Model and Welfare Measure
This Appendix section supplements section 3.1 (Consumer Demand). I indicate the computation of the choice probability (i.e., the demand) for product in market and the calculation of consumer welfare in the RCNL model.
There are nesting groups (), and each product is assigned to one of these groups. The outside good is assigned to group . The set denotes the products in group and market . The indirect utility function (1) is rewritten as follows:
where
Then, the market share of product in market is computed as
where , is the probability density function of , and
is the inclusive value term, which reflects the expected utility derived from the alternatives within group . Based on the above, the demand for product in market , , can be obtained by multiplying the market share by the market size , i.e., .
A.3 Demand Estimation
In this section, I provide the estimated results of the main interest model specification with 95% confidence intervals. The confidence intervals are computed using a parametric bootstrap procedure with 1000 resampled. The parametric bootstrap procedure is as follows: (i) construct 1000 draws from the estimated joint distribution of parameters assuming the joint distribution of parameters follows the normal distribution, (ii) compute the implied mean utility, shares, marginal costs, and prices for each draw, and (iii) calculate the required post-estimation output for all parametric bootstrap samples. Using the resulting empirical distribution, construct bootstrap confidence intervals. I use PyBLP to construct these confidence intervals, see Conlon and Gortmaker (2020) for details.
Parameter | Coeff. | Std. err. | 95% C.I. |
---|---|---|---|
Price () | |||
Price () | |||
Size nests () | |||
# Beds | |||
# Reviews | |||
Airbnb Dummy | |||
Ratings of Room | |||
Ratings of Cleanness | |||
Ratings of Location | |||
Ratings of Staff Communication | |||
City Fixed Effect | Yes | ||
Month Fixed Effect | Yes | ||
Day of the Week Fixed Effect | Yes | ||
Robust S.E. for Cluster | Yes | ||
# Markets | |||
N |
Note: This table shows the results of demand estimation by data containing all clusters from 1 to 5. I constructed the confidence interval by parametric bootstrapping using PyBLP. Standard errors are clustered by price cluster.
Appendix B Figures

Note: This figure shows an example of the market structure in each scenario, with hosts A and B using SP. In the baseline scenario, each host on Airbnb also competes on price with each other. In the self-preferencing scenario, Airbnb determines the price of goods for each host using the SP, leading to a reduction in the intensity of price competition compared to the baseline. Airbnb solves the profit maximization problem by also considering the commission earned from Host C’s revenue.


Note: Each host determines prices through this calendar interface. By enabling “Smart Pricing” (highlighted in a red box) and setting the minimum and maximum nightly rates, SP automatically suggests prices. The names and icons of booked guests and hosts have been redacted for privacy. The screenshot was captured on November 1, 2024.


Note: The vertical axis is a total count of how many listings exist in each city.




Note: This figure presents the kernel density estimation results of marginal costs inferred using the RCNL model under the market definition ”With Hotels.” Panel (a) compares the density estimates between Airbnb and hotels listed on Booking.com, while Panel (b) plots the density estimates by price clusters.

Note: This figure shows the results derived from the RCNL specification for “With Hotels”.

Note: This figure shows the average rate of change of the equilibrium price in the market, defined as “With Hotels,” when moving from the baseline scenario to the self-preferencing scenario on the y-axis and the number of hosts using SP per market on the x-axis. The average price change is calculated for the combined sample of hosts on Airbnb and hotels on Booking.com.


Note: This figure represents the percentage change in price across scenarios within the market defined as “With Hotels,” plotted separately for (a) only host listings on Airbnb and (b) only hotels.
Appendix C Tables
[ ] colspec=Q[]Q[]Q[]Q[]Q[]Q[]Q[],
cell12=c=2,halign=c,,
cell14=c=2,halign=c,,
column1=halign=l,,
column2=halign=r,,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
column6=halign=r,,
column7=halign=r,,
row1=halign=c,,
Airbnb (N=54988) Booking.com (N=86388)
Mean SD Mean SD Diff. in Means SE
Prices \num0.287 \num0.191 \num0.337 \num0.420 \num0.050*** \num0.002
# Reviews \num47.755 \num69.823 \num1450.447 \num1584.243 \num1402.692*** \num5.398
Rating of Location \num4.635 \num0.265 \num4.113 \num0.386 \num-0.522*** \num0.002
Rating of Staff Communication \num4.769 \num0.219 \num4.265 \num0.368 \num-0.504*** \num0.002
Rating of Cleanness \num4.603 \num0.327 \num4.150 \num0.455 \num-0.453*** \num0.002
Rating of Room \num4.599 \num0.288 \num4.140 \num0.385 \num-0.458*** \num0.002
# Vacancies \num4.241 \num4.522 \num4.749 \num3.530 \num0.509*** \num0.023
# Beds \num2.155 \num1.380 \num1.936 \num1.242 \num-0.219*** \num0.007
Note: Prices are scaled by dividing them by 100,000. Additionally, while the ratings on Booking.com were originally rated on a 10-point scale, they have been rescaled to a 1-5 scale to align with Airbnb’s rating system.
.
Type | Description | Size (Width × Length cm) | Relative Value to Single Bed |
---|---|---|---|
Single Bed | The smallest size bed for one person. | 90 × 190 | 1 |
Semi-Double Bed | Wider than a single bed, offering more comfort for one person. | 120 × 190 | 1.33 |
Double Bed | A standard-size bed suitable for two people. | 140 × 190 | 1.56 |
Queen Bed | Larger than a double bed, providing more comfort for two people. | 160 × 200 | 1.78 |
King Bed | A spacious bed close to the largest size. | 180 × 200 | 2 |
Bunk Bed | Two single beds stacked vertically. | Each 90 × 190 | 2 |
Sofa Bed | A versatile bed that doubles as a sofa and converts into a bed. | 90 × 190 | 1 |
Futon | A traditional Japanese bedding that can be folded and stored. | None | 1 |
Note: This table presents the types of beds available in accommodations listed on Airbnb and hotels, along with their textual descriptions, sizes, and widths normalized relative to a single bed, where the width of a single bed is set to 1.
Cluster | N | Airbnb (%) | Hotels (%) | Mean | SD | Min | Max |
---|---|---|---|---|---|---|---|
1 | 889 | 1.912 | 98.088 | 4.096 | 1.107 | 2.975 | 7.900 |
2 | 5391 | 0.556 | 99.444 | 1.888 | 0.371 | 1.440 | 2.965 |
3 | 12119 | 14.250 | 85.750 | 1.012 | 0.202 | 0.735 | 1.439 |
4 | 46513 | 30.931 | 69.069 | 0.460 | 0.112 | 0.314 | 0.734 |
5 | 133995 | 29.176 | 70.824 | 0.170 | 0.069 | 0.016 | 0.314 |
Cluster | # SP | # Non-SP | SP (%) | Non-SP (%) |
---|---|---|---|---|
1 | 0 | 17 | 0.000 | 100.000 |
2 | 0 | 30 | 0.000 | 100.000 |
3 | 0 | 1724 | 0.000 | 100.000 |
4 | 569 | 13767 | 3.969 | 96.031 |
5 | 8903 | 29978 | 22.898 | 77.102 |
Note: This table shows the number and proportion of hosts who meet both criteria for utilizing Smart Pricing: (1) Operating Low-Priced Listings and (2) Engaging in Dynamic Pricing, categorized by cluster. Data from hotels listed on Booking.com are excluded from the calculations.
[ ] colspec=Q[]Q[]Q[]Q[]Q[]Q[]Q[],
cell12=c=2,halign=c,,
cell14=c=2,halign=c,,
column1=halign=l,,
column2=halign=r,,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
column6=halign=r,,
column7=halign=r,,
row1=halign=c,,
Airbnb (N=9194) Booking.com (N=8413)
Mean SD Mean SD Diff. in Means Std. Error
Prices \num0.245 \num0.142 \num0.218 \num0.324 \num-0.027*** \num0.004
# Reviews \num48.519 \num72.608 \num1098.093 \num1179.268 \num1049.574*** \num12.879
Rating of Location \num4.598 \num0.290 \num3.913 \num0.390 \num-0.684*** \num0.005
Rating of Staff Communication \num4.762 \num0.213 \num4.148 \num0.402 \num-0.614*** \num0.005
Rating of Cleanness \num4.602 \num0.304 \num3.977 \num0.507 \num-0.626*** \num0.006
Rating of Room \num4.608 \num0.247 \num3.985 \num0.432 \num-0.623*** \num0.005
# Vacancies \num6.000 \num5.391 \num7.537 \num3.017 \num1.537*** \num0.065
# Beds \num2.144 \num1.144 \num1.621 \num0.933 \num-0.523*** \num0.016
Note: The descriptive statistics presented in this table pertain to the dataset used for model estimation. Specifically, only data with calculated market shares of 0.5% or higher are included.
[ ] colspec=Q[]Q[]Q[]Q[]Q[]Q[]Q[],
column1=halign=l,,
column2=halign=r,,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
column6=halign=r,,
column7=halign=r,,
Mean SD Min Max Median N
Hosts Without SP \num3.228 \num3.546 \num0.000 \num25.000 \num2.000 899
Hosts With SP \num2.304 2.961 \num0.000 13.000 \num1.000 899
Hotels \num4.050 \num4.015 \num0.000 \num21.000 \num3.000 899
Total \num7.278 \num4.068 \num1.000 \num25.000 \num7.000 899
Note: This table shows descriptive statistics of the number of hosts and hotels per market (pairs of one day and one of Tokyo’s 23 wards) and their totals after applying the sample restrictions for demand estimation. N represents the number of markets, not the number of listings.
[
entry=none,label=none,
]
colspec=Q[]Q[]Q[],
cell11=halign=l,,
cell12=halign=l,,
cell13=halign=l,,
column1=halign=l,,
column2=halign=c,,
column3=halign=c,,
row1=halign=c,,
Player () Set of goods Max. Problem
Airbnb
Non-SP hosts
Hotels Same as for Non-SP hosts
Notes: denotes the commission charged by the platform firm to platform users (i.e., ). refers to the set of product prices for SP-adopting hosts on Airbnb. represents the set of all hosts’ products on Airbnb. Adoption of SP is determined based on whether the product belongs to the ”low-price range (clusters 4 or 5).”
With Hotels | Airbnb Only | |||
---|---|---|---|---|
Parameters | (i) RCNL | (ii) NL | (iii) RCNL | (iv) NL |
Price () | ||||
() | () | () | () | |
Price () | ||||
() | () | |||
Size nests () | ||||
() | () | () | () | |
# Beds | ||||
() | () | () | () | |
# Reviews | ||||
() | () | () | () | |
Airbnb Dummy | ||||
() | () | |||
Ratings of Room | ||||
() | () | () | () | |
Ratings of Cleanness | ||||
() | () | () | () | |
Ratings of Location | ||||
() | () | () | () | |
Ratings of Staff Communication | ||||
() | () | () | () | |
City Fixed Effect | Yes | Yes | Yes | Yes |
Month Fixed Effect | Yes | Yes | Yes | Yes |
Day of the Week Fixed Effect | Yes | Yes | Yes | Yes |
# Markets | ||||
N |
Note: This table shows the results of demand estimation. RCNL and NL represent random coefficient nested logit, and nested logit, respectively. Robust for a nesting group standard errors are in parentheses.
colspec=Q[]Q[]Q[]Q[]Q[],
column1=halign=l,
column2=halign=l,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
(i) MC (ii) % change in price (iii) % change in profits
A. With Hotels
Hosts with SP Mean \num1.541 \num6.542
SD \num40.453 \num4.718
Min \num-155.933 \num-20.447
Max \num149.767 \num33.384
Hosts without SP Mean \num-0.080 \num13.053 \num0.112
SD \num0.162 \num24.038 \num3.560
Min \num-0.406 \num-43.590 \num-32.359
Max \num0.968 \num240.815 \num13.248
Hotels Mean \num-0.077 \num0.007 \num0.027
SD \num0.292 \num0.064 \num0.482
Min \num-0.479 \num-1.077 \num-3.202
Max \num3.081 \num1.734 \num3.950
B. Only Airbnb
Hosts with SP Mean \num132.417 \num-41.367
SD \num61.002 \num8.676
Min \num15.618 \num-63.924
Max \num463.926 \num-2.077
Hosts without SP Mean \num-0.195 \num-1.862 \num2.774
SD \num0.150 \num8.587 \num3.619
Min \num-0.442 \num-78.074 \num-0.146
Max \num1.377 \num18.888 \num28.373
Note: MC stands for marginal costs. This table shows the marginal cost, percentage change in the equilibrium price, and percentage change in profits under the self-preferencing scenario from the price under the baseline scenario using RCNL specifications. The marginal cost of hosts using SP is assumed to be zero, so their marginal costs are not estimated. The percentage changes in prices and profits represent the extent to which prices and profits have changed from the Baseline scenario to the Self-Preferencing scenario.
[ ] colspec=Q[]Q[]Q[]Q[]Q[],
column1=halign=l,,
column2=halign=r,,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
Mean SD Min Max
A. With Hotels
Commission Revenue (baseline) \num0.501 \num0.848 \num0.000 \num9.878
Commission Revenue (self-preferencing) \num0.646 \num1.210 \num0.000 \num12.524
% change in Commission Revenue \num37.726 \num184.645 \num-98.135 \num3000.000
B. Only Airbnb
Commission Revenue (baseline) \num0.517 \num1.180 \num0.006 \num7.841
Commission Revenue (self-preferencing) \num0.856 \num2.329 \num0.000 \num20.049
% change in Commission Revenue \num50.167 \num147.728 \num-97.497 \num1548.789
Note: This table shows the commission revenue that Airbnb earns as a whole platform for each market. Each value is expressed in units of yen. I computed commission revenue as the sum of each host’s revenue multiplied by 0.03. The percentage change in commission revenue represents the extent to which commission revenue has changed from the Baseline scenario to the Self-Preferencing scenario.
[ ] colspec=Q[]Q[]Q[]Q[]Q[]Q[]Q[]Q[]Q[],
cell12=c=4,halign=c,,
cell16=c=4,halign=c,,
cell22=c=2,halign=c,,
cell24=c=2,halign=c,,
cell26=c=2,halign=c,,
cell28=c=2,halign=c,,
column1=halign=l,,
column2=halign=r,,
column3=halign=r,,
column4=halign=r,,
column5=halign=r,,
column6=halign=r,,
column7=halign=r,,
column8=halign=r,,
column9=halign=r,,
row1=halign=c,,
row2=halign=c,,
With Hotels Only Airbnb
(i) RCNL (ii) NL (iii) RCNL (iv) NL
Mean SD Mean SD Mean SD Mean SD
CS (baseline) \num20.704 \num5.567 \num7.902 \num2.139 \num13.360 \num3.232 \num8.912 \num4.802
CS (self-preferencing) \num19.583 \num5.444 \num7.788 \num2.069 \num11.704 \num3.384 \num8.231 \num4.663
% change in CS \num-5.077 \num7.856 \num-0.985 \num4.907 \num-11.933 \num13.869 \num-7.439 \num9.393
PS (baseline) \num0.756 \num1.287 \num0.281 \num0.460 \num0.626 \num0.964 \num0.479 \num0.573
PS (self-preferencing) \num0.789 \num1.294 \num0.284 \num0.461 \num0.706 \num1.035 \num0.527 \num0.602
% change in PS \num12.383 \num47.517 \num0.828 \num3.494 \num30.962 \num74.618 \num15.364 \num34.940
SW (baseline) \num21.493 \num5.727 \num8.183 \num2.192 \num14.066 \num3.357 \num9.439 \num4.841
SW (self-preferencing) \num20.339 \num5.607 \num8.072 \num2.124 \num12.330 \num3.495 \num8.711 \num4.703
% change in SW \num-5.080 \num7.726 \num-0.946 \num4.704 \num-11.936 \num13.522 \num-7.531 \num9.214
Note: Each value is expressed in units of yen. CS, PS, SW, RCNL, and NL represent consumer surplus, producer surplus, social welfare, random coefficient nested logit, and nested logit, respectively. The percentage change in welfare represents the extent to which welfare has changed from the Baseline scenario to the Self-Preferencing scenario.