Using Reinforcement Learning Methods to Price a Perishable Product, Case Study: Orange

سال انتشار: 1400
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 249

فایل این مقاله در 18 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JMMF-1-1_003

تاریخ نمایه سازی: 17 فروردین 1400

چکیده مقاله:

‎Determining the optimal selling price for different commodities has always been one of the main topics of scientific and industrial research‎. ‎Perishable products have a short life and due to their deterioration over time‎, ‎they cause great damage if not managed‎. ‎Many industries‎, ‎retailers‎, ‎and service providers have the opportunity to increase their revenue through optimal pricing of perishable products that must be sold within a certain period‎. ‎In the pricing issue‎, ‎a seller must determine the price of several units of a perishable or seasonal product to be sold for a limited time‎. ‎This article examines pricing policies that increase revenue for the sale of a given inventory with an expiration date‎. ‎Booster learning algorithms are used to analyze how companies can simultaneously learn and optimize pricing strategy in response to buyers‎. ‎It is also shown that using reinforcement learning we can model a demand-dependent problem‎. ‎This paper presents an optimization method in a model-independent environment in which demand is learned and pricing decisions are updated at the moment‎. ‎We compare the performance of learning algorithms using Monte Carlo simulations‎.

نویسندگان

Abbas Shekari Firouzjaie

Industrial Engineering Department, Science and Technology of Behshahr, Mazandran, Iran.

Navid Sahebjamnia

Department of industrial engineering, University of Science and Technology of Mazandaran, Behshahr, Iran

Hadi Abdollahzade

Industrial Engineering Department, Science and Technology of Behshahr, Mazandran, Iran