create account

Machine learning serie - part 4: How does a robot learn to ride a bike? by mor

View this thread on: hive.blogpeakd.comecency.com
· @mor · (edited)
$26.76
Machine learning serie - part 4: How does a robot learn to ride a bike?
# Reinforcement learning

Another semester is finished and I would like to extend on my abandoned Machine learning series. This time I'll talk about reinforcement learning which is the algorithm that is behind most of the cool AI stuff you heard about lately - like the success of [DeepMind's Go](https://www.newscientist.com/article/2132086-deepminds-ai-beats-worlds-best-go-player-in-latest-face-off/) or [Poker](https://www.theguardian.com/technology/2017/jan/30/libratus-poker-artificial-intelligence-professional-human-players-competition). So reinforcement learning is the algorithm that can beat you in poker or go or even in some Atari games (like for example pong) but it is also the algorithm which is trying to teach robots walk and do other practical things like carry your things around or drive cars. So, how is it all (or at least some of it) done? 

# First let me tell you a story about a bike

Imagine you're a child (or even grownup for the matter) and you're learning to ride a bike. How do you do it? And why do you do it? There might be a distant reward promised by your mother or father that if you manage to ride to the end of the parking lot, you get an ice cream. And you like ice cream, so you try to really manage the task. 
But how do you connect the exact movements of your legs on the bike with this delicious reward? In the case that you're a human that's quite easy. First of all we're very good at extracting action and causes from our world and second of all our brain is amazingly equipped for learning any new movement tasks. So we can quite easily learn to walk, run, write, or even to ride that bike. But imagine that you're a robot, and none of this is really true for you. So how do you do it or how does machine learning do it?

Part of the answer to this is reinforcement learning, it answers how a machine learning agent can learn even if the reward is very distant, but it doesn't answer the question about how to mimic that amazing  movement ability of humans (or animals).

To give you the idea about how difficult it is for robots to do tasks in the real world, here is an example of robot using reinforcement learning to do a simple pancake flip.

https://www.youtube.com/watch?v=W_gxLKSsSIE

## OK, once again, what is this Reinforcement learning?

It's a magic box, where on one side you insert a task, on the other side you insert a reward for finishing the task and in the middle there is an agent that learns your task. And now you know it. You might be thinking: "Wait what? This is some oversimplification, isn't it?!" That's partly right, but actually it's not that far from the truth.

In the machine learning field there are three basic types of "machine learning". The first one is the supervised learning, where the task is usually quite simple (like: "What's on the picture?" or "Where is a dog on the picture?") and the feedback (reward) is given to the algorithm after each answer. The algorithm can therefore gradually improve its hypothesis. On the other side of the spectrum is unsupervised learning, where the algorithm gets no feedback at all and it is just trying to find some reoccurring (generalizing) patterns in the input (images, text, videos). Reinforcement learning algorithms deal with problems that are somewhere in the middle. Sometimes the agent gets the reward right after its action and sometimes it takes many steps before reward is received. 

The basic idea of reinforcement learning is that there is an interaction between an agent and an environment. The agent does some actions and it receives reward for its actions from the environment. The environment  is observed by the agent and agent creates states based on the observation, state can be seen as some inner model of the world a belief of the agent about where it is located in the world. The agent learns by creating a policy based on what rewards it got after its actions, and the policy then determines which actions does the agent select in different states.

In general the basic agent-environment interaction cycle just looks like this:

![Reinforcement_learning_diagram.svg.png](https://steemitimages.com/DQmcGE6e6N58ymp7RwknxtYanwU7VNUGKehTJSiVbzUFxGX/Reinforcement_learning_diagram.svg.png)


If you find like you need a bit more explanation, than here is a nice video by the [Udacity](https://www.youtube.com/channel/UCBVCi5JbYmfG3q5MEuoWdOw) explaining the basics of reinforcement learning.

https://www.youtube.com/watch?v=2xATEwcRpy8

## So, how far are we from robots that would do my chores?

Actually, we're yet quite far from that. The RL agents are not yet ready even to ride that bike, or for the matter be able to really walk in any terrain. But there are many really clever people who are trying to figure out how to get closer to those or similarly difficult tasks. In the next posts I'll talk about what are the newest algorithms that should bring us closer to it.

#### And as you got all the way to here you deserve a reward... A nice robots' compilation...

https://www.youtube.com/watch?v=g0TaYhjpOfo


### *Thank you for reading, and if you have any questions then just ask!*
👍  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and 193 others
properties (23)
authormor
permlinkmachine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike
categorytechnology
json_metadata{"tags":["technology","mathematics","deep-learning","programming","science"],"image":["https://img.youtube.com/vi/W_gxLKSsSIE/0.jpg","https://steemitimages.com/DQmcGE6e6N58ymp7RwknxtYanwU7VNUGKehTJSiVbzUFxGX/Reinforcement_learning_diagram.svg.png","https://img.youtube.com/vi/2xATEwcRpy8/0.jpg","https://img.youtube.com/vi/g0TaYhjpOfo/0.jpg"],"links":["https://www.newscientist.com/article/2132086-deepminds-ai-beats-worlds-best-go-player-in-latest-face-off/","https://www.theguardian.com/technology/2017/jan/30/libratus-poker-artificial-intelligence-professional-human-players-competition","https://www.youtube.com/watch?v=W_gxLKSsSIE","https://www.youtube.com/channel/UCBVCi5JbYmfG3q5MEuoWdOw","https://www.youtube.com/watch?v=2xATEwcRpy8","https://www.youtube.com/watch?v=g0TaYhjpOfo"],"app":"steemit/0.1","format":"markdown"}
created2017-06-25 20:53:39
last_update2017-06-28 10:56:06
depth0
children43
last_payout2017-07-02 20:53:39
cashout_time1969-12-31 23:59:59
total_payout_value21.652 HBD
curator_payout_value5.105 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length5,148
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,093,141
net_rshares2,664,427,764,063
author_curate_reward""
vote details (257)
@abdullah143 ·
I really come to laugh when i imagine a world of bots doing humanly things. :D
properties (22)
authorabdullah143
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171227t124227115z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-12-27 12:42:27
last_update2017-12-27 12:42:27
depth1
children0
last_payout2018-01-03 12:42:27
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length78
author_reputation1,022,421,981
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id25,506,257
net_rshares0
@aburmeseabroad ·
such an interesting article. I am working with robots at my work place  so. :) Thanks for sharing..
properties (22)
authoraburmeseabroad
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t043118477z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 04:31:18
last_update2017-06-26 04:31:18
depth1
children0
last_payout2017-07-03 04:31:18
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length99
author_reputation10,522,171,851,652
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,134,208
net_rshares0
@ahmadsayuthi71 ·
Good idea is very useful to read
properties (22)
authorahmadsayuthi71
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180310t150736598z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-03-10 15:07:39
last_update2018-03-10 15:07:39
depth1
children0
last_payout2018-03-17 15:07:39
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length32
author_reputation4,023,838,215
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id43,535,723
net_rshares0
@alol ·
Good share
properties (22)
authoralol
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t202457553z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 20:25:00
last_update2017-06-26 20:25:00
depth1
children0
last_payout2017-07-03 20:25:00
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length10
author_reputation54,641,993,350,544
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,230,143
net_rshares0
@anwarunsam ·
Nice articel. Very nice
properties (22)
authoranwarunsam
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180105t155245561z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-01-05 15:52:45
last_update2018-01-05 15:52:45
depth1
children0
last_payout2018-01-12 15:52:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length23
author_reputation264,054,090,060
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id27,312,773
net_rshares0
@barzah ·
a very interesting machine my brother
properties (22)
authorbarzah
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171027t075653756z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-10-27 07:56:54
last_update2017-10-27 07:56:54
depth1
children0
last_payout2017-11-03 07:56:54
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length37
author_reputation7,657,449,429,022
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id18,674,497
net_rshares0
@chrizbiz ·
Great article, very interesting always wondered how it worked ! will follow please share more intesting posts :D !!!!
👍  
properties (23)
authorchrizbiz
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170625t205542572z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-25 20:55:45
last_update2017-06-25 20:55:45
depth1
children1
last_payout2017-07-02 20:55:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length117
author_reputation2,263,916,984,670
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,093,388
net_rshares0
author_curate_reward""
vote details (1)
@mor ·
Thanks, I'll try my best..
properties (22)
authormor
permlinkre-chrizbiz-re-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t071516394z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 07:15:15
last_update2017-06-26 07:15:15
depth2
children0
last_payout2017-07-03 07:15:15
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length26
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,146,259
net_rshares0
@dizhmur ·
$0.02
This is a pretty good technological progress, good luck, we are following you!
👍  
properties (23)
authordizhmur
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170628t103858109z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-28 10:39:00
last_update2017-06-28 10:39:00
depth1
children0
last_payout2017-07-05 10:39:00
cashout_time1969-12-31 23:59:59
total_payout_value0.024 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length78
author_reputation-2,547,661,927,146
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,460,269
net_rshares3,009,980,673
author_curate_reward""
vote details (1)
@eruda ·
Interesting. There will be success because there are mistakes. 재밌게 잘 봤습니다.~
properties (22)
authoreruda
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180710t110413302z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-07-10 11:04:15
last_update2018-07-10 11:04:15
depth1
children0
last_payout2018-07-17 11:04:15
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length75
author_reputation-886,343,821,735
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id64,146,116
net_rshares0
@frikiguru ·
Wow this is amazing the robots will conquer the future
properties (22)
authorfrikiguru
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180119t024915237z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-01-19 02:49:18
last_update2018-01-19 02:49:18
depth1
children0
last_payout2018-01-26 02:49:18
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length54
author_reputation261,595,900,396
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id30,528,693
net_rshares0
@homearning2 ·
wow!!nice post dear
properties (22)
authorhomearning2
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180305t094527761z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-03-05 09:45:36
last_update2018-03-05 09:45:36
depth1
children0
last_payout2018-03-12 09:45:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length19
author_reputation253,360,765,001
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id42,337,599
net_rshares0
@iamgceo ·
wow excellent
properties (22)
authoriamgceo
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180310t071733083z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-03-10 07:17:36
last_update2018-03-10 07:17:36
depth1
children0
last_payout2018-03-17 07:17:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length13
author_reputation71,883,530,613
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id43,469,345
net_rshares0
@j-vo ·
Thank you for teaching me something - I have always been intrigued by robotics and you help to pull the curtain back a little bit into the process - very interesting. Have you ever seen the move A.I.? I think that movie has scarred me just a little bit - I think robotics are cool but its definitely scary imagining how this new path could negatively effect the structural economy of the world's work force among a variety of other things. Even still - technology is badass and I love watching the evolving nature of it! I noticed you up-voted my recent post - thank you so much - I am following you and up-voted yours! See you around :)
properties (22)
authorj-vo
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170627t033224725z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-27 03:32:24
last_update2017-06-27 03:32:24
depth1
children1
last_payout2017-07-04 03:32:24
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length637
author_reputation3,247,843,140,442
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,275,681
net_rshares0
@mor ·
AI will probably change  the future economics, so we have to get prepared for those changes. But I guess we have some time, but anyways we still have to keep improving ourselves.
properties (22)
authormor
permlinkre-j-vo-re-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170627t093336481z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-27 09:33:36
last_update2017-06-27 09:33:36
depth2
children0
last_payout2017-07-04 09:33:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length178
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,305,601
net_rshares0
@jealson ·
steady...
properties (22)
authorjealson
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170701t114030566z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-07-01 11:41:12
last_update2017-07-01 11:41:12
depth1
children0
last_payout2017-07-08 11:41:12
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length9
author_reputation438,875,815,999
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,880,242
net_rshares0
@kumar.malhotra ·
Technology cant  beat human. but thanx for sharing this videos.
properties (22)
authorkumar.malhotra
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170713t051755993z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-07-13 05:17:54
last_update2017-07-13 05:17:54
depth1
children0
last_payout2017-07-20 05:17:54
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length63
author_reputation159,754,286,047
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id8,304,838
net_rshares0
@lavanyalakshman ·
Nice information
properties (22)
authorlavanyalakshman
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171107t163107239z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-11-07 16:31:15
last_update2017-11-07 16:31:15
depth1
children0
last_payout2017-11-14 16:31:15
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length16
author_reputation33,674,564,412,588
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id19,707,652
net_rshares0
@lklab2013 ·
Reinforcement learning is amazing. Can a robot ride a bicycle? I look forward to your writing in the future.
properties (22)
authorlklab2013
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t220312943z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 22:03:15
last_update2017-06-26 22:03:15
depth1
children0
last_payout2017-07-03 22:03:15
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length108
author_reputation15,337,958,939,547
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,242,056
net_rshares0
@mapipaz ·
Wow, it's impressive !!! I want the robot that makes omelette.
properties (22)
authormapipaz
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171214t133644098z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-12-14 13:36:45
last_update2017-12-14 13:36:45
depth1
children0
last_payout2017-12-21 13:36:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length62
author_reputation11,261,457,614,901
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id23,495,118
net_rshares0
@massivevibration ·
incredibly interesting !
properties (22)
authormassivevibration
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170804t151445618z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-08-04 15:14:45
last_update2017-08-04 15:14:45
depth1
children0
last_payout2017-08-11 15:14:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length24
author_reputation3,077,666,938,555
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id10,759,504
net_rshares0
@mor ·
$1.19
**[Here](https://steemit.com/technology/@mor/reinforcement-learning-and-atari-games-addition-to-ml-series-part-4)** is a short follow-up. Just some more reading and watching about the topic. :)
👍  , ,
properties (23)
authormor
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t123141609z
categorytechnology
json_metadata{"tags":["technology"],"links":["https://steemit.com/technology/@mor/reinforcement-learning-and-atari-games-addition-to-ml-series-part-4"],"app":"steemit/0.1"}
created2017-06-26 12:31:39
last_update2017-06-26 12:31:39
depth1
children0
last_payout2017-07-03 12:31:39
cashout_time1969-12-31 23:59:59
total_payout_value1.194 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length193
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,172,393
net_rshares124,493,294,564
author_curate_reward""
vote details (3)
@mrwincaste ·
You have a nice pictures..
properties (22)
authormrwincaste
permlinkre-mor-20171019t7425147z
categorytechnology
json_metadata{"tags":"technology","app":"esteem/1.4.6","format":"markdown+html","community":"esteem"}
created2017-10-19 12:42:57
last_update2017-10-19 12:42:57
depth1
children0
last_payout2017-10-26 12:42:57
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length26
author_reputation65,423,445,235
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries
0.
accountesteemapp
weight500
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id18,052,641
net_rshares0
@musliwadi ·
Posting nya cukup menarik
properties (22)
authormusliwadi
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171213t122804345z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-12-13 12:28:09
last_update2017-12-13 12:28:09
depth1
children0
last_payout2017-12-20 12:28:09
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length25
author_reputation242,444,053,454
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id23,358,827
net_rshares0
@pocketechange ·
Robots will be able to replace certain moves a person can make, but they'll never be able to do everything a person is capable of doing at the spur of the moment...
@pocketechange
👍  
properties (23)
authorpocketechange
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t052803295z
categorytechnology
json_metadata{"tags":["technology"],"users":["pocketechange"],"app":"steemit/0.1"}
created2017-06-26 05:27:57
last_update2017-06-26 05:27:57
depth1
children2
last_payout2017-07-03 05:27:57
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length179
author_reputation239,822,050,704,602
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,138,544
net_rshares1,008,572,409
author_curate_reward""
vote details (1)
@mor ·
I'm not sure about that actually. We might be greatly overestimating the "spur of the moment", as all comes down to our brains which are in the end (according to science and me) just huge computers.. But we'll see. Even with my optimistic view - human level artificial intelligence is doable - it is probably not going to happen in the next ten years..
properties (22)
authormor
permlinkre-pocketechange-re-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t071352372z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 07:13:51
last_update2017-06-26 07:13:51
depth2
children1
last_payout2017-07-03 07:13:51
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length352
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,146,161
net_rshares0
@ecodata · (edited)
There are those who are already looking for "an ideal" mixing both: man and machine. And as has happened with other technological advances, it begins in the military area:

Part of video, since 10m 15s: www.youtube.com/watch?v=1brEPzkJIsA&t=10m15s
Channel (other user): www.youtube.com/channel/UCc0AzRNy9TY5r2Fx5C8_8LQ

Regards.
properties (22)
authorecodata
permlinkre-mor-re-pocketechange-re-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170828t165258903z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-08-28 16:52:57
last_update2017-08-28 16:55:09
depth3
children0
last_payout2017-09-04 16:52:57
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length328
author_reputation34,921,636,505
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id13,142,779
net_rshares0
@princesaleem ·
Wonderfull post...Thanks for share
Thanks for upvoting my content as well :)
properties (22)
authorprincesaleem
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180303t143703998z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-03-03 14:37:30
last_update2018-03-03 14:37:30
depth1
children0
last_payout2018-03-10 14:37:30
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length76
author_reputation11,888,255,041
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id41,894,214
net_rshares0
@randompic ·
That's interesting to learn about. Thank you!
properties (22)
authorrandompic
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t211049406z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 21:10:48
last_update2017-06-26 21:10:48
depth1
children1
last_payout2017-07-03 21:10:48
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length45
author_reputation-1,197,043,690,966
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,235,471
net_rshares0
@mor ·
You're welcome :)
properties (22)
authormor
permlinkre-randompic-re-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170626t211629003z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-26 21:16:27
last_update2017-06-26 21:16:27
depth2
children0
last_payout2017-07-03 21:16:27
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length17
author_reputation4,958,415,957,701
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,236,141
net_rshares0
@ridwant ·
emm. Amazing
properties (22)
authorridwant
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170911t005503924z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-09-11 00:55:12
last_update2017-09-11 00:55:12
depth1
children0
last_payout2017-09-18 00:55:12
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length12
author_reputation7,123,520,801,744
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id14,507,096
net_rshares0
@rtdcs ·
Very interesting. Congratulations :)
properties (22)
authorrtdcs
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170627t030534578z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-27 03:05:36
last_update2017-06-27 03:05:36
depth1
children0
last_payout2017-07-04 03:05:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length36
author_reputation19,449,784,090,468
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,273,164
net_rshares0
@santos88 ·
wow  great post
properties (22)
authorsantos88
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170705t044901719z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-07-05 04:48:45
last_update2017-07-05 04:48:45
depth1
children0
last_payout2017-07-12 04:48:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length15
author_reputation36,330,099,462
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id7,356,369
net_rshares0
@sciack ·
Fantastic!
properties (22)
authorsciack
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171119t093232504z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-11-19 09:32:30
last_update2017-11-19 09:32:30
depth1
children0
last_payout2017-11-26 09:32:30
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length10
author_reputation22,635,529,623,983
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id20,858,782
net_rshares0
@silvabrothers ·
Hey brother, interesting post, I support you, I hope you can go through my blog seriously help: '), greetings from Venezuela
properties (22)
authorsilvabrothers
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170805t031400532z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-08-05 03:15:45
last_update2017-08-05 03:15:45
depth1
children0
last_payout2017-08-12 03:15:45
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length124
author_reputation63,695,226,674
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id10,812,316
net_rshares0
@steem77 ·
https://steemit.com/indonesia/@steem77/kebijakan-gila-pemerintah-aceh-utara is my post
properties (22)
authorsteem77
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171013t122726711z
categorytechnology
json_metadata{"tags":["technology"],"links":["https://steemit.com/indonesia/@steem77/kebijakan-gila-pemerintah-aceh-utara"],"app":"steemit/0.1"}
created2017-10-13 12:27:30
last_update2017-10-13 12:27:30
depth1
children0
last_payout2017-10-20 12:27:30
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length86
author_reputation6,746,680,658,188
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id17,580,236
net_rshares0
@steemitjp ·
Hi This is david from Japan. Thank you for your voting and concerning my post. In the future, Robot is the most familiar machine around human. Especially, Japan is so strong with robot industry. Thank you for your great articles. I followed. Have a great day to you
properties (22)
authorsteemitjp
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170706t080330769z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-07-06 08:03:33
last_update2017-07-06 08:03:33
depth1
children0
last_payout2017-07-13 08:03:33
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length265
author_reputation199,187,045,156,371
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id7,491,958
net_rshares0
@tauhid26 ·
I enjoyed your post. Wow the all the best, congrats, and heaps of adoration from the core of my heart, I am with you and will be there
properties (22)
authortauhid26
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20180216t050521662z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2018-02-16 05:05:30
last_update2018-02-16 05:05:30
depth1
children0
last_payout2018-02-23 05:05:30
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length134
author_reputation298,478,287,398
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id37,915,457
net_rshares0
@thomasfarley ·
Watching this technology grow is great, wonder what it will be like in 5 years time.
properties (22)
authorthomasfarley
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170821t091936409z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-08-21 09:19:36
last_update2017-08-21 09:19:36
depth1
children0
last_payout2017-08-28 09:19:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length84
author_reputation438,502,075,130
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id12,420,037
net_rshares0
@vivic ·
Fun video! Thanks
properties (22)
authorvivic
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170628t123025070z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-28 12:30:24
last_update2017-06-28 12:30:24
depth1
children0
last_payout2017-07-05 12:30:24
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length17
author_reputation151,439,050,461
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,470,676
net_rshares0
@wxzurd ·
# AWESOME POST
properties (22)
authorwxzurd
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20171128t104538575z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-11-28 10:45:39
last_update2017-11-28 10:45:39
depth1
children0
last_payout2017-12-05 10:45:39
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length14
author_reputation763,131,271,959
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id21,767,828
net_rshares0
@yvonnebraaf ·
@mor thanks for sharing, great article
👍  
properties (23)
authoryvonnebraaf
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170625t213452677z
categorytechnology
json_metadata{"tags":["technology"],"users":["mor"],"app":"steemit/0.1"}
created2017-06-25 21:34:54
last_update2017-06-25 21:34:54
depth1
children0
last_payout2017-07-02 21:34:54
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length38
author_reputation25,090,656,326
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,097,621
net_rshares0
author_curate_reward""
vote details (1)
@zeeshan2 ·
Follow u follow me
👎  
properties (23)
authorzeeshan2
permlinkre-mor-machine-learning-serie-part-4-how-does-a-robot-learn-to-ride-a-bike-20170625t205508329z
categorytechnology
json_metadata{"tags":["technology"],"app":"steemit/0.1"}
created2017-06-25 20:57:42
last_update2017-06-25 20:57:42
depth1
children0
last_payout2017-07-02 20:57:42
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length18
author_reputation-78,687,743,447
root_title"Machine learning serie - part 4: How does a robot learn to ride a bike?"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id6,093,600
net_rshares-11,464,902,989
author_curate_reward""
vote details (1)