On sale for NT$590
In this course you will learn about and implement a remarkably clever new AI model called Twin-Delayed DDPG (TD3), which combines state-of-the-art techniques from the field, including continuous Double Deep Q-Learning, Policy Gradients, and Actor-Critic. The model is so powerful that, for the first time in our courses, we can solve the most challenging virtual AI applications: training an ant/spider and a half-humanoid to walk and run across a field.
https://softnshare.com/deep-reinforcement-learning/
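The course description above names TD3's main ingredients. As a rough illustration (not the course's own code), here is a minimal sketch of TD3's clipped double-Q target in PyTorch; target_actor, target_critic1, and target_critic2 are hypothetical placeholder networks.

import torch

def td3_target(reward, next_state, done, target_actor, target_critic1,
               target_critic2, gamma=0.99, noise_std=0.2, noise_clip=0.5,
               max_action=1.0):
    # Compute the TD3 bootstrap target for a batch of transitions.
    with torch.no_grad():
        next_action = target_actor(next_state)
        # Target policy smoothing: perturb the target action with clipped noise.
        noise = (torch.randn_like(next_action) * noise_std).clamp(-noise_clip, noise_clip)
        next_action = (next_action + noise).clamp(-max_action, max_action)
        # Clipped double Q-learning: take the minimum of the two target critics
        # to curb the overestimation bias of plain deep Q-learning.
        target_q = reward + gamma * (1.0 - done) * torch.min(
            target_critic1(next_state, next_action),
            target_critic2(next_state, next_action))
    return target_q

The "delayed" in the name refers to updating the actor (the policy-gradient step) less frequently than the two critics; that training loop is omitted here for brevity.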
There is also one related YouTube video: the YouTuber Science Experiments with Physics Engine, who has more than 800,000 followers, covers this topic in the video quoted in full below.
reinforcement learning policy: best answer from 國立陽明交通大學電子工程學系及電子研究所 on Facebook
The NCTU IBM Center has specially invited Prof. H. Jonathan Chao of the ECE Department at New York University to give a talk. Interested faculty and students are welcome to register free of charge!
Title: CFR-RL: Traffic Engineering with Reinforcement Learning in SDN
Speaker: Prof. H. Jonathan Chao (ECE Department at New York University)
Time: 2020/01/20 (Mon.) 15:00-17:00
Venue: Room 816, Engineering Building 4, NCTU
Registration: https://forms.gle/k5txEfTX6jM7PBR98
Contact: 曾紫玲, Tel: 03-5712121 ext. 54599, Email: tzuling@nctu.edu.tw
Abstract:
Traffic Engineering (TE) is one of the most important network features of Software-Defined Networking (SDN); it helps Internet Service Providers (ISPs) optimize network performance and resource utilization by configuring the routing across their backbone networks. Although TE solutions can achieve optimal or near-optimal performance by rerouting as many flows as possible, they usually ignore the negative impact of frequent rerouting, such as packets arriving out of order. To mitigate network disturbance, one promising TE approach forwards the majority of traffic flows using Equal-Cost Multi-Path (ECMP) and selectively reroutes a few critical flows using SDN to balance link utilization across the network. However, critical flow rerouting is not trivial, because the solution space for critical flow selection is immense. Moreover, it is impractical to design a heuristic algorithm for this problem based on fixed, simple rules, since rule-based heuristics cannot adapt to changes in the traffic matrix and network dynamics. In this talk, we describe a Reinforcement Learning (RL)-based scheme, called CFR-RL, that learns a policy to automatically select critical flows for each given traffic matrix. It then reroutes the selected critical flows to balance link utilization by formulating and solving a simple Linear Programming (LP) problem. Extensive evaluations show that CFR-RL outperforms the best heuristic by 7.4%-12.2% while rerouting only 10%-21.3% of total traffic.
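To make the rerouting step concrete, here is a minimal sketch of the kind of LP the abstract describes, assuming SciPy and a toy representation of the network (the exact formulation in the paper may differ): choose split ratios for the selected critical flows over candidate paths so that the maximum link utilization U is minimized.

import numpy as np
from scipy.optimize import linprog

def reroute_critical_flows(background, capacity, flows):
    # background: ECMP load already on each link
    # capacity:   capacity of each link
    # flows:      list of (demand, candidate_paths); each path is a list of link ids
    n_links = len(capacity)
    n_vars = sum(len(paths) for _, paths in flows) + 1  # one split ratio per (flow, path), plus U
    c = np.zeros(n_vars)
    c[-1] = 1.0  # objective: minimize U

    # Per-link constraint: background[e] + sum(demand * split) <= U * capacity[e]
    A_ub = np.zeros((n_links, n_vars))
    b_ub = -np.asarray(background, dtype=float)
    # Per-flow constraint: split ratios of each flow sum to 1.
    A_eq = np.zeros((len(flows), n_vars))
    b_eq = np.ones(len(flows))

    col = 0
    for i, (demand, paths) in enumerate(flows):
        for path in paths:
            for e in path:
                A_ub[e, col] = demand
            A_eq[i, col] = 1.0
            col += 1
    A_ub[:, -1] = -np.asarray(capacity, dtype=float)

    return linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                   bounds=[(0, 1)] * (n_vars - 1) + [(0, None)])

In this sketch, the returned solution's x[:-1] holds the split ratios of the critical flows and x[-1] the achieved maximum utilization; the RL policy's only job is deciding which flows enter this LP.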
Biography:
H. Jonathan Chao is Professor of Electrical and Computer Engineering (ECE) at NYU, which he joined in January 1992. He is currently Director of the High-Speed Networking Lab and was Head of the ECE Department from 2004 to 2014. His research covers software-defined networking, network function virtualization, datacenter networks, packet processing and switching, network security, and machine learning for networking. He holds 63 patents and has published more than 265 journal and conference papers. During 2000-2001, he was Co-Founder and CTO of Coree Networks, NJ, where he led a team that implemented a multi-terabit router with carrier-class reliability. From 1985 to 1992, he was a Member of Technical Staff at Bellcore, where he was involved in network architecture design and ASIC implementation, including the world's first SONET-like Framer chip, ATM Layer chip, Sequencer chip (the first chip to handle packet scheduling), and ATM switch chip. He is a Fellow of the National Academy of Inventors (NAI) for "having demonstrated a highly prolific spirit of innovation in creating or facilitating outstanding inventions that have made a tangible impact on quality of life, economic development, and the welfare of society," and a Fellow of the IEEE for his contributions to the architecture and application of VLSI circuits in high-speed packet networks. He received the Bellcore Excellence Award in 1987 and is a co-recipient of the 2001 Best Paper Award from the IEEE Transactions on Circuits and Systems for Video Technology. He has coauthored three networking books. He worked for the Telecommunication Laboratories in Taiwan from 1977 to 1981. He received his B.S. and M.S. degrees in electronics engineering from National Chiao Tung University, Taiwan, in 1977 and 1980, respectively, and his Ph.D. degree in electrical engineering from The Ohio State University in 1985.
reinforcement learning policy: best post from Science Experiments with Physics Engine on YouTube
Using reinforcement learning, I taught a human model to walk on two legs. The algorithm used is proximal policy optimization (PPO).
Proximal Policy Optimization Algorithms
https://arxiv.org/abs/1707.06347
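As a point of reference for the linked paper, here is a minimal sketch of PPO's clipped surrogate objective, assuming PyTorch and illustrative tensor names.

import torch

def ppo_clip_loss(log_probs, old_log_probs, advantages, clip_eps=0.2):
    # log_probs:     log pi_theta(a|s) under the current policy
    # old_log_probs: log pi_theta_old(a|s) recorded during the rollout (detached)
    # advantages:    advantage estimates, e.g. from GAE
    ratio = torch.exp(log_probs - old_log_probs)  # probability ratio r_t(theta)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    # PPO maximizes the elementwise minimum of the two terms;
    # negate it to obtain a loss for gradient descent.
    return -torch.min(unclipped, clipped).mean()

Clipping keeps each update close to the policy that collected the data, which is what makes PPO stable enough to train locomotion behaviors like the two-legged walking shown in the video.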
Twitter:https://twitter.com/physics_engine0
BGM:
"Trick or treat" written by GT-K
"Halloween Monsters" written by ISAo.
#物理エンジンくん