Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Active Learning for Non-Parametric Choice Models (2208.03346v2)

Published 5 Aug 2022 in cs.LG, cs.DS, math.OC, math.PR, and stat.ML

Abstract: We study the problem of actively learning a non-parametric choice model based on consumers' decisions. We present a negative result showing that such choice models may not be identifiable. To overcome the identifiability problem, we introduce a directed acyclic graph (DAG) representation of the choice model. This representation provably encodes all the information about the choice model which can be inferred from the available data, in the sense that it permits computing all choice probabilities. We establish that given exact choice probabilities for a collection of item sets, one can reconstruct the DAG. However, attempting to extend this methodology to estimate the DAG from noisy choice frequency data obtained during an active learning process leads to inaccuracies. To address this challenge, we present an inclusion-exclusion approach that effectively manages error propagation across DAG levels, leading to a more accurate estimate of the DAG. Utilizing this technique, our algorithm estimates the DAG representation of an underlying non-parametric choice model. The algorithm operates efficiently (in polynomial time) when the set of frequent rankings is drawn uniformly at random. It learns the distribution over the most popular items among frequent preference types by actively and repeatedly offering assortments of items and observing the chosen item. We demonstrate that our algorithm more effectively recovers a set of frequent preferences on both synthetic and publicly available datasets on consumers' preferences, compared to corresponding non-active learning estimation algorithms. These findings underscore the value of our algorithm and the broader applicability of active-learning approaches in modeling consumer behavior.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (65)
  1. Active learning: A survey, pages 571–605. CRC Press, January 2014. ISBN 9781466586741. 10.1201/b17320.
  2. Sponsored search auctions with markovian users. In Proc. 4th Workshop on Internet and Network Economics (WINE), pages 621–628, 2008.
  3. Mnl-bandit: A dynamic learning approach to assortment selection. Operations Research, 67(5):1453–1485, 2019.
  4. Dana Angluin. Queries and concept learning. Machine Learning, 2:319–342, 1988.
  5. The approximability of assortment optimization under ranking preferences. Operations Research, 66(6):1661–1669, 2018.
  6. Approximation algorithms for dynamic assortment optimization models. Mathematics of Operations Research, 44(2):487–511, 2019.
  7. Pricing of short life-cycle products through active learning. Under revision for Management Science, pages 1–32, 2002.
  8. Discrete choice analysis: theory and application to travel demand, volume 9. MIT press, 1985.
  9. A markov chain approximation to choice modeling. Operations Research, 64(4):886–905, 2016.
  10. Optimal inventory policy when stockouts alter demand. Naval Research Logistics Quarterly, 23(1):1–13, 1976.
  11. Fair assortment planning. arXiv preprint arXiv:2208.07341, 2022.
  12. Interpolating item and user fairness in recommendation systems. arXiv preprint arXiv:2306.10050, 2023.
  13. Dynamic assortment planning under nested logit models. Production and Operations Management, 30(1):85–102, 2021.
  14. Alexander Chernev. Decision focus and consumer choice among assortments. Journal of Consumer Research, 33(1):50–59, 2006.
  15. Active learning with statistical models. Journal of artificial intelligence research, 4:129–145, 1996.
  16. Assortment optimization under variants of the nested logit model. Operations Research, 62(2):250–273, 2014.
  17. Product ranking on online platforms. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 459–459, 2020.
  18. A nonparametric approach to modeling choice with limited data. Management science, 59(2):305–322, 2013.
  19. Uriel Feige. A threshold of ln n for approximating set cover. Journal of the ACM (JACM), 45(4):634–652, 1998.
  20. Assortment optimization with small consideration sets. Operations Research, 67(5):1283–1299, 2019.
  21. Revenue management under the markov chain choice model. Operations Research, 65(5):1322–1342, 2017.
  22. Gavan J. Fitzsimons. Consumer response to stockouts. Journal of consumer research, 27(2):249–266, 2000.
  23. An optimal greedy heuristic with minimal learning regret for the markov chain choice model. Available at SSRN 3810470, 2021.
  24. Assortment planning and inventory decisions under a locational choice model. Management Science, 52(10):1528–1543, 2006.
  25. Preference identification under inconsistent choice. Available at SSRN, 2015.
  26. Real-time optimization of personalized assortments. Management Science, 60(6):1532–1551, 2014.
  27. Learning product rankings robust to fake users. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 560–561, 2021.
  28. Learning product rankings robust to fake users. Operations Research, 2022.
  29. Estimating unconstrained demand rate functions using customer choice sets. Journal of Revenue and Pricing Management, 10(5):438–454, 2011.
  30. Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13–30, 1963. ISSN 01621459. URL http://www.jstor.org/stable/2282952.
  31. Optimal algorithms for assortment selection under ranking-based consumer choice models. Manufacturing & Service Operations Management, 14(2):279–289, 2012.
  32. A nonparametric joint assortment and price choice model. Management Science, 63(9):3128–3145, 2017.
  33. The limit of rationality in choice modeling: Formulation, computation, and implications. Management Science, 65(5):2196–2215, 2019.
  34. A partial-order-based model to estimate individual preferences using panel data. Management Science, 64(4):1609–1628, 2018.
  35. Personalized retail promotions through a directed acyclic graph–based representation of customer preferences. Operations Research, 70(2):641–665, 2022.
  36. Toshihiro Kamishima. Nantonac collaborative filtering: recommendation based on order responses. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 583–588, 2003.
  37. Supervised ordering-an empirical survey. In Fifth IEEE International Conference on Data Mining (ICDM’05), pages 4–pp. IEEE, 2005.
  38. A cascade model for externalities in sponsored search. In International Workshop on Internet and Network Economics, pages 585–596. Springer, 2008.
  39. Consumer response to online apparel stockouts. Psychology & Marketing, 28(2):115–144, 2011.
  40. Algorithm design. Pearson Education India, 2006.
  41. Nick Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285–318, 1988.
  42. A simple way towards fair assortment planning: Algorithms and welfare implications. Available at SSRN 4514495, 2023.
  43. Will Ma. When is assortment optimization optimal? Management Science, 69(4):2088–2105, 2023.
  44. Siddharth Mahajan and Garrett J. van Ryzin. Retail inventories and consumer choice. In Quantitative models for supply chain management, pages 491–551. Springer, 1999.
  45. Daniel McFadden. Conditional logit analysis of qualitative choice behavior. In Paul Zarembka, editor, Frontiers in Econometrics, pages 105–142. Academic Press, 1973.
  46. Demand model estimation and validation. Urban Travel Demand Forecasting Project, Phase, 1, 1977.
  47. Revenue management: Research overview and prospects. Transportation science, 33(2):233–256, 1999.
  48. Online learning via offline greedy: Applications in market design and optimization. Available at SSRN 3613756, 2020.
  49. Online learning via offline greedy algorithms: Applications in market design and optimization. In Proceedings of the 22nd ACM Conference on Economics and Computation, pages 737–738, 2021.
  50. Assortment optimization and pricing under a nonparametric tree choice model. Manufacturing & Service Operations Management, 20(3):550–565, 2018.
  51. Dynamic pricing with unknown non-parametric demand and limited price changes. Available at SSRN 3336949, 2019.
  52. Robust assortment optimization in revenue management under the multinomial logit choice model. Operations research, 60(4):865–882, 2012.
  53. Dynamic assortment optimization with a multinomial logit choice model and capacity constraint. Operations research, 58(6):1666–1680, 2010.
  54. On the relationship between inventory costs and variety benefits in retail assortments. Management Science, 45(11):1496–1509, 1999.
  55. Optimal dynamic assortment planning with demand learning. Manufacturing & Service Operations Management, 15(3):387–404, 2013.
  56. Burr Settles. Active learning literature survey. Technical report, Department of Computer Sciences, University of Wisconsin, Madison, 2009.
  57. Dynamic sampling allocation under finite simulation budget for feasibility determination. INFORMS Journal on Computing, 2021.
  58. Tractable sampling strategies for ordinal optimization. Operations Research, 66(6):1693–1712, 2018.
  59. An expectation-maximization algorithm to estimate the parameters of the markov chain choice model. Operations Research, 66(3):748–760, 2018.
  60. Revenue management under a general discrete choice model of consumer behavior. Management Science, 50(1):15–33, 2004.
  61. Garrett van Ryzin and Gustavo Vulcano. A market discovery algorithm to estimate a general class of nonparametric choice models. Management Science, 61(2):281–300, 2015.
  62. Garrett van Ryzin and Gustavo Vulcano. An expectation-maximization method to estimate a rank-based choice model of demand. Operations Research, 65(2):396–407, 2017.
  63. The generalized nested logit model. Transportation Research Part B: Methodological, 35(7):627–641, 2001.
  64. Sequential sampling for a ranking and selection problem with exponential sampling distributions. In 2020 Winter Simulation Conference (WSC), pages 2984–2995. IEEE, 2020.
  65. Selectively acquiring customer information: A new data acquisition problem and an active learning-based solution. Management Science, 52(5):697–712, 2006.
Citations (3)

Summary

We haven't generated a summary for this paper yet.