Please use this identifier to cite or link to this item: https://repository.iimb.ac.in/handle/2074/9777
DC FieldValueLanguage
dc.contributor.advisorBasu, Arnab
dc.contributor.authorMohapatra, Prakash
dc.date.accessioned2019-07-23T09:02:31Z-
dc.date.available2019-07-23T09:02:31Z-
dc.date.issued2013
dc.identifier.urihttp://repository.iimb.ac.in/handle/2074/9777
dc.description.abstractIn this paper, we make a three-fold contribution to the domain of reinforcement learning of equilibrium in the framework of nonzero-sum stochastic dynamic games. First of all, we extend the techniques of Q( )- learning to the multi-player setup. We also extend the idea of polynomial learning rate to this domain for faster convergence. Most importantly, we propose a novel nonlinear learning algorithm which eliminates the learning starvation typical of such linear learning algorithms such as Q( )-learning. Prior work in the reinforcement learning domain is mainly restricted to linear techniques which lead to learning starvation. Our learning objective is the in nite horizon discounted pay-o criterion which is used to estimate the long term market equilibria. We have applied this model to a real life business case to analyze the competition between ARM and Intel in the smart phone microprocessor market. The model is restricted to a duopoly; however, the work can be easily extended to the more general case. We have estimated the market equilibrium payo s for this set-up and proposed some business insights based on our ndings.
dc.language.isoen_US
dc.publisherIndian Institute of Management Bangalore
dc.relation.ispartofseriesEPGP_P13_09
dc.subjectMarketing management
dc.titleNonlinear reinforcement learning of dynamic Nash equilibrium
dc.typeProject Report-EPGP
dc.pages50p.
Appears in Collections:2010-2015
Files in This Item:
File SizeFormat 
EPGP_P13_1214046.pdf433.47 kBAdobe PDFView/Open    Request a copy
Show simple item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.