* Cantinho Satkeys

Refresh History
  • FELISCUNHA: ghyt74  pessoal  4tj97u<z
    21 de Abril de 2025, 10:38
  • cereal killa:
    19 de Abril de 2025, 21:17
  • j.s.: tenham uma Santa e Feliz Páscoa  49E09B4F 49E09B4F 49E09B4F
    19 de Abril de 2025, 18:19
  • j.s.:
    19 de Abril de 2025, 18:19
  • j.s.: dgtgtr a todos  4tj97u<z 4tj97u<z
    19 de Abril de 2025, 18:15
  • FELISCUNHA: Uma santa sexta feira para todo o auditório  4tj97u<z
    18 de Abril de 2025, 11:12
  • JPratas: try65hytr Pessoal  4tj97u<z classic k7y8j0
    18 de Abril de 2025, 03:28
  • cereal killa: try65hytr malta  classic 2dgh8i
    14 de Abril de 2025, 23:14
  • FELISCUNHA: Votos de um santo domingo para todo o auditório  101041
    13 de Abril de 2025, 11:45
  • j.s.: e um bom domingo de Ramos  43e5r6 43e5r6
    11 de Abril de 2025, 21:02
  • j.s.: tenham um excelente fim de semana  49E09B4F
    11 de Abril de 2025, 21:01
  • j.s.: try65hytr a todos  4tj97u<z
    11 de Abril de 2025, 21:00
  • JPratas: try65hytr  y5r6t Pessoal  classic k7y8j0
    11 de Abril de 2025, 04:15
  • JPratas: dgtgtr A Todos  4tj97u<z classic k7y8j0
    10 de Abril de 2025, 18:29
  • FELISCUNHA: ghyt74  pessoal   49E09B4F
    09 de Abril de 2025, 11:59
  • cereal killa: try65hytr pessoal  2dgh8i
    08 de Abril de 2025, 23:21
  • FELISCUNHA: Votos de um santo domingo para todo o auditório  43e5r6
    06 de Abril de 2025, 11:13
  • cccdh: Ola para todos!
    04 de Abril de 2025, 23:41
  • j.s.: tenham um excelente fim de semana  49E09B4F
    04 de Abril de 2025, 21:10
  • j.s.: try65hytr a todos  4tj97u<z
    04 de Abril de 2025, 21:10

Autor Tópico: Practical Multi-Armed Bandit Algorithms in Python  (Lida 94 vezes)

0 Membros e 1 Visitante estão a ver este tópico.

Online mitsumi

  • Moderador Global
  • ***
  • Mensagens: 119261
  • Karma: +0/-0
Practical Multi-Armed Bandit Algorithms in Python
« em: 15 de Abril de 2021, 10:41 »

MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Genre: eLearning | Language: English + srt | Duration: 13 lectures (3h 45m) | Size: 1.15 GB
Acquire skills to build digital AI agents capable of adaptively making critical business decisions under uncertainties.

What you'll learn:
Understanding and being able to identify Multi-Armed Bandit problems.
Modelling real business problems as MAB and implementing digital AI agents to automate them.
Understanding the challenge of RL regarding the exploration-exploitation dilema.
Practical implementation of the various algorithmic strategies for balancing between exploration and exploitation.
Python implementation of the Epsilon-greedy strategy.
Python implementation of the Softmax Exploration strategy.
Python implementation of the Optimistic Initialization strategy.
Python implementation of the Upper Confidence Bounds (UCB) strategy.
Understand the challenges of RL in terms of the design of reward functions and sample efficiency.
Estimation of action values through incremental sampling.

Requirements
Be able to understand basic OOP programs in Python.
Have basic Numpy and MatDescriptionlib knowledge.
Basic algebra skills. If you know how to add, subtract, multiply, and divide numbers, you are good to go.

Description
This course is your perfect entry point into the exciting field of Reinforcement Learning where digital Artificial Intelligence agents are built to automatically learn how to make sequential decisions through trial-and-error. Specifically, this course focuses on the Multi-Armed Bandit problems and the practical hands-on implementation of various algorithmic strategies for balancing between exploration and exploitation. Whenever you desire to consistently make the best choice out of a limited number of options over time, you are dealing with a Multi-Armed Bandit problem and this course teaches you every detail you need to know to be able to build realistic business agents to handle such situations.

With very concise explanations, this course teaches you how to confidently translate seemingly scary mathematical formulas into Python code painlessly. We understand that not many of us are technically adept in the subject of mathematics so this course intentionally stays away from maths unless it is necessary. And even when it becomes necessary to talk about mathematics, the approach taken in this course is such that anyone with basic algebra skills can understand and most importantly easily translate the maths into code and build useful intuitions in the process.

Some of the algorithmic strategies taught in this course are Epsilon Greedy, Softmax Exploration, Optimistic Initialization, Upper Confidence Bounds, and Thompson Sampling. With these tools under your belt, you are adequately equipped to readily build and deploy AI agents that can handle critical business operations under uncertainties.

Who this course is for
Anyone with a basic Python skills desiring to the started in Reinforcement Learning.
Experienced AI Engineers, ML Engineers, Data Scientist, and Software Engineers wanting to apply Reinforcement Learning to real business problems.
Business professionals willing to learn how Reinforcement Learning can help with automating adaptive decision making processes.


Download link:
Só visivel para registados e com resposta ao tópico.

Only visible to registered and with a reply to the topic.

Links are Interchangeable - No Password - Single Extraction