Safe Reinforcement Learning (RL) Thesis Project

Posted on May 17, 2024 | by Duckietown Admin

Safe Reinforcement Learning (Safe-RL) in Duckietown

Project Resources

Safe-Reinforcement Learning (Safe-RL): Project Description

Safe-RL Duckietown Project – In his thesis titled “Safe-RL-Duckietown“, Jan Steinmüller used safe reinforcement learning to train Duckiebots to follow a lane while keeping said robots safe during training.

Safe Reinforcement Learning involves learning policies that maximize expected returns while ensuring reasonable system performance and adhering to safety constraints throughout both the learning and deployment phases. Reinforcement learning is a machine learning paradigm where agents learn to make decisions by maximizing cumulative rewards through interaction with an environment, without the necessity for training data or models.

The final result was a trained agent capable of following lanes while avoiding unsafe positions.

This is an open source project, and can be reproduced and improved upon through the Duckietown platform.

Safe Reinforcement Learning: Project Highlights

Here is a visual tour of the work of the author.

Check out the documents for more details.

Implementation of the safe reinforcement learning (Safe-RL) Duckietown project — Implementation of the Safe-RL Duckietown project

safe reinforcement learning in Duckietown project: Safety Layer Description — Safety Layer Description

Process diagram of action selection and safety layer in the safe reinforcement learning project using Duckietown — Process diagram of action selection and safety layer

Results of the safe reinforcement learning (Safe-RL) Duckietown Project — Results of the Safe-RL Duckietown Project

Safe Reinforcement Learning: Results and Conclusions

Based on the results, it can be concluded that there is no disadvantage to using a safety layer when doing reinforcement learning since execution time is very similar. Moreover, the dramatically improved safety of the vehicle is helpful for the robot’s training as fewer actions with lower or even negative rewards will be executed. Because of this, reinforcement learning agents with safety layers learn faster and reduce the number of unsafe actions that are being executed.

Unfortunately, manual observation and intervention by the user were still necessary, however, the frequency was clearly reduced which further improved learning as the robots in testing did not know if an outside intervention was done which could result in an action being rewarded incorrectly.

It was also concluded that this project did not reach perfect safety with the implementation. Therefore a fully autonomous reinforcement learning training without any human intervention has not yet been achieved. A lot of improvement factors have been found that can further improve the safety and recovery rate. Additionally, some major problems which are not direct results of the reinforcement learning or safety layer have been identified.

These problems could be attempted to be fixed in different ways like improving the open source implementations of lane filter nodes or adding more sensors or cameras to the robot in order to extend the input data to the agent. Another area that was untouched during the research of this project was other vehicles inside the current lane. The safety layer could potentially be extended to also include safety features that should keep the robot safe from hitting other vehicles.

Read the full report here.

Project Author

Jan Steinmüller is a computer science student working in the computer networks and information security research group at Hochschule Bremen in Germany.

Dr. Amr Alanwar is an Assistant Professor at the Technical University of Munich (TUM).

Learn more

Duckietown is a modular, customizable and state-of-the-art platform for creating and disseminating robotics and AI learning experiences.

It is designed to teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of knowledge.

Visual deep reinforcement learning paper snippet

Vision-Based DRL Autonomous Driving Agent with Sim2Real Transfer

Posted on May 6, 2024 | by Duckietown Admin

General Information

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer
Dianzhao Li and Ostap Okhrin
Universität Dresden, Dresden
D. Li and O. Okhrin, "Vision-Based DRL Autonomous Driving Agent with Sim2Real Transfer," 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), Bilbao, Spain, 2023, pp. 866-873, doi: 10.1109/ITSC57777.2023.10422677.

Vision-Based DRL Autonomous Driving Agent with Sim2Real Transfer

One way to obtain quick and cheap training data is to use simulation instead of real-world experiments. The question remains if the learnings of a simulation-trained agent apply to the real world. Sim2Real transfer is the field of research that studies this problem.

The challenge is particularly meaningful when using vision as the primary sensing capability for robots. Vision-based deep reinforcement learning (DRL) refers to a technique where ML agents, typically modeled as multi-layered neural networks, learn to “make decisions” directly from visual input.

The essence of RL is training robotic agents based on policies that reward desirable outcomes. This family of techniques typically leads to increased adaptability to operational scenarios.

To learn about RL and its place in the larger context of robot autonomy, check out the resources below.

Abstract

To achieve fully autonomous driving, vehicles must be capable of continuously performing various driving tasks, including lane keeping and car following, both of which are fundamental and well-studied driving ones. However, previous studies have mainly focused on individual tasks, and car following tasks have typically relied on complete leader-follower information to attain optimal performance.

To address this limitation, we propose a vision-based deep reinforcement learning (DRL) agent that can simultaneously perform lane-keeping and car-following maneuvers.

To evaluate the performance of our DRL agent, we compare it with a baseline controller and use various performance metrics for quantitative analysis. Furthermore, we conduct a real-world evaluation to demonstrate the Sim2Real transfer capability of the trained DRL agent.

To the best of our knowledge, our vision-based car following and lane-keeping agent with Sim2Real transfer capability is the first of its kind.

Highlights - Sim2Real transfer results

Here is a visual tour of the work of the authors. For all the details, check out the paper link.

Fig. 1. The proposed DRL framework for vision-based multi-task autonomous driving agents. The perception module leverages camera images to produce impact attributes regarding the environment, then the DRL control module utilizes the information to control the agent with enhanced generalization.

Duckiebot DB21 — Fig. 2. Robot car [Duckiebot] used during the real-world evaluation. (a) Side view of the robot car which equipped with front-view camera and Jetson Nano 2GB. (b) Back view of the car with a pattern of circles.

Fig. 3. An example velocity trajectory for the leading vehicle generated by the Ornstein-Uhlenbeck process for the training process.

Fig. 4. Training results of PPO agent with ten independent seeds over one million steps.

Fig. 5. Example trajectories of DRL agent (red) and baseline controller (green) following random leader trajectory (orange) during the evaluation.

Fig. 6. Distribution of TTC and time headway for DRL agent (red) and baseline controller (green).

Fig. 7. Example trajectories of DRL agent (red) and baseline controller (green) following a self defined external leader trajectory (orange) during the evaluation.

Conclusion

This study proposes a vision-based DRL agent that can simultaneously perform lane-keeping and car-following tasks.

The overall system is divided into two modules: the perception module and the control module. The perception module extracts task-relevant attributes of the surroundings, while the control module is a DRL agent that takes these attributes as input. To evaluate the performance of the DRL agent, we compare it with a baseline algorithm in both simulation and real-world environments.

In the simulation, we compare the car following and lane-keeping capabilities of the DRL agent and baseline controller using various performance metrics. In the real-world environment, we demonstrate that the DRL agent can follow the leading vehicle while maintaining lane-keeping ability.

In future work, we plan to enhance our DRL agent by incorporating a comfort factor to address unstable driving behavior. Additionally, we aim to deploy more advanced algorithms for improved generalization.

Project Authors

Dianzhao Li is a research assistant at the Technische Universität Dresden, Dresden, Germany.

Ostap Okhrin is Chair of Statistics and Econometrics at the Institute of Economics and Transport, School of Transportation, Technische Universitat Dresden in Germany.

Learn more

Duckietown is a platform for creating and disseminating robotics and AI learning experiences.

It is modular, customizable and state-of-the-art, and designed to teach, learn, and do research. From exploring the fundamentals of computer science and automation to pushing the boundaries of knowledge, Duckietown evolves with the skills of the user.

Anatidaephilia: centralized city-based SLAM (cSLAM)

Posted on April 26, 2024 | by Duckietown Admin

Anatidaephilia: centralized city-based SLAM (cSLAM)

Project Resources

cSLAM Project Description

Project cSLAM – Simultaneous Localization and Mapping (SLAM) is a successful approach for robots to estimate their position and orientation in the world they operate in, while at the same time creating a representation of their surroundings.

This project, centralized SLAM (or cSLAM), enables a Duckiebot to localize itself, while the watchtowers and Duckiebots work together to build a map of the city. The task is achieved by using the camera of the Duckiebot, together with watchtowers located along the path, to detect AprilTags attached to the tiles, the traffic signs, and the Duckiebot itself.

P. S. Anatidaephilia, is Latin for loving, and being addicted to, the idea that somewhere, somehow, a duck is watching you.

Project Highlights

Here is a visual tour of the work of the authors.

Check out the documents for more details!

Duckietown cSLAM algorithm physical architecture for localizing Duckiebots — cSLAM algorithm physical architecture for localizing Duckiebots. Watchtowers are traffic lights without lights, and are used to transform Duckietown in Autolabs.

Duckietown cSLAM logical architecture for merging sensor measurements from robots and the city.

Duckietown cSLAM graph optimizer representation — AprilTag detections from Watchtowers and Duckiebots are harmonized by solving a minimization problem.

RViz visualization of Autolab virtual twin in Duckietown cSLAM project — RViz reconstruction of experimental localization outcomes.

cSLAM Autolab watchtowers network diagnostic

cSLAM Project Results

(Turn on the sound for best experience!)

This work developed into a paper, check the article here.

Project Authors

Rohit Suri is a former Duckietown student, now a Roboticist at Venti Technologies in Singapore.

Aleksandar Petrov is a former Duckietown student, now a Ph. D. student at the University of Oxford.

Amaury Camus is a former Duckietown student, now a Lead Robotics engineer at Hydromea SA in Switzerland.

Francesco Milano is a former Duckietown student, now a Ph. D. student at ETH Zurich in Switzerland.

Benson Kuan is a former Duckietown student, now a Senior Robotics Research Engineer at DSO National Laboratories in Singapore.

Learn more

Duckietown is a modular, customizable and state-of-the-art platform for creating and disseminating robotics and AI learning experiences.

It is designed to teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of knowledge.

Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm

Posted on April 4, 2024 | by Duckietown Admin

General Information

Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm
Ambedkar Dukkipati, Rajarshi Banerjee, Ranga Shaarad Ayyagari, Dhaval Parmar Udaybhai
Indian Institute of Science
A. Dukkipati, R. Banerjee, R. S. Ayyagari and D. P. Udaybhai, "Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm," 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 2483-2489, doi: 10.1109/IROS47612.2022.9981607.

Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm

Reinforcement learning (RL) is a rising star approach for developing autonomous robot agents. The essence of RL is training agents based on policies that reward desirable outcomes, which leads to increased adaptability to operational scenarios. Through iterations, robots refine their decision-making, optimizing actions based on rewards and penalties. This method provides robots with the flexibility to handle unpredictable situations, enhancing their efficiency and effectiveness in real-world tasks. To learn about RL with Duckietown, check out the resources below.

Abstract

Solving complex problems using reinforcement learning necessitates breaking down the problem into manageable tasks, and learning policies to solve these tasks. These policies, in turn, have to be controlled by a master policy that takes high-level decisions. Hence learning policies involves hierarchical decision structures. However, training such methods in practice may lead to poor generalization, with either sub-policies executing actions for too few time steps or devolving into a single policy altogether. In our work, we introduce an alternative approach to learn such skills sequentially without using an overarching hierarchical policy. We propose this method in the context of environments where a major component of the objective of a learning agent is to prolong the episode for as long as possible. We refer to our proposed method as Sequential Soft Option Critic. We demonstrate the utility of our approach on navigation and goal-based tasks in a flexible simulated 3D navigation environment that we have developed. We also show that our method outperforms prior methods such as Soft Actor-Critic and Soft Option Critic on various environments, including the Atari River Raid environment and the Gym-Duckietown self-driving car simulator.

Highlights

Here is a visual tour of the work of the authors.

For all the details, check out the paper link!

(a) An illustration of policies learned in our approach. (b) A representation of the state space partitioned by the termination functions of the options. Each oval corresponds to the set of states classified as non-termination states by the corresponding nested termination function. — Fig. 1. (a) An illustration of policies learned in our approach. Different policies learn different skills out of necessity to traverse the environment and not terminate the episode. The trajectory spawned by the sequence of policies that correctly learns to avoid terminating the episode is shown in green. The policy π1 is used when the agent is in the middle of the corridor. When the end of the corridor is reached, π2 is selected to take a turn and avoid a collision. (b) A representation of the state space partitioned by the termination functions of the options. Each oval corresponds to the set of states classified as non-termination states by the corresponding nested termination function.

Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm image — Fig. 2. (a) & (b) The input to the neural network in our 3D navigation environment. These images obtained from the simulation are scaled (and also transformed to grayscale in the case of (a)) and appended with K previous images, and sent to the agent as input. (c) A view of the Duckietown environment.

Autonomous robot Navigation performance results obtained on various environments: SAC, SOC, SAC with HER — Fig. 3. The results obtained on various environments.

Outputs of the three learned policies when fed the corresponding input shown on the left. πω1 : orange, πω2 : blue, πω3 : green. Each policy outputs a Gaussian distribution, the active policy is the one filled with color. A positive value for the output action corresponds to turning right, and a negative value indicates a left turn. — Fig. 4. Outputs of the three learned policies when fed the corresponding input shown on the left. πω_1: orange, πω_2 : blue, πω_3 : green. Each policy outputs a Gaussian distribution, the active policy is the one filled with color. A positive value for the output action corresponds to turning right, and a negative value indicates a left turn.

Conclusion

In this paper, the authors proposed an algorithm called “Sequential Soft Option Critic” that allows adding new skills dynamically without the need for a higher-level master policy. This can be applicable to environments where a primary component of the objective is to prolong the episode. We show that this algorithm can be used to effectively incorporate diverse skills into an overall skill set, and it outperforms prior methods in several environments.

Learn more

Duckietown is a platform for creating and disseminating robotics and AI learning experiences.

Duckietown is modular, customizable and state-of-the-art. It is designed to teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of knowledge.

Successful failures: learning robotics with Prof. Bruegge

Posted on January 25, 2024 | by Duckietown Admin

Prof. Bruegge and his colleagues about his experience using Duckietown and the new 4GB Duckiebots.

Munich, April 3rd: Prof. Bruegge (Brügge) studied computer science at the University of Hamburg and Carnegie Mellon University (CMU), where he also earned his doctorate. Appointed as Professor at the Technical University of Munich (TUM) in 1997, and also associate professor at CMU, Prof. Brügge was on the research committee of Deutsch Telekom and the Munich district. He has served on the board of directors of the Center for Digital Technology and Management, and acted as liaison professor for the Max Weber Foundation and the German National Academic Foundation (2000–2017).

Quick links

Failing successfully, an interview with Prof. Bruegge

Prof. Bruegge and the JASS 2023 summer camp

Prof. Bruegge, Bernd, is an established computer scientist researcher and Professor, having (co)authored hundreds of peer reviewed publications with over 8800 citations at the time of this writing.

Emeritus Professor since 2021, Bernd regularly contributes to the organization and running of the Joint Advanced Student School (JASS).

In this interview we capture his experience from the summer 2022 edition.

We are thrilled to have this chat with him!

Good morning! Could you introduce yourself?

My name is Bernd Bruegge. I was a Full Professor of Computer Science at the Technical University in Munich, I was at Carnegie Mellon before I joined the Technical University. I was for 20 years at the Computer Science Department at CMU in Pittsburgh, Pennsylvania.

What can you tell us about this JASS school event that took place in Cyprus?

The recent [2023] school was quite an adventure!

My colleague, Prof. Kirill Krinkin, was in St. Petersburg. We had done all the JASS projects with Kirill and support activities by JetBrains in St. Petersburg. And as you know, it’s a total disaster there, there’s nothing left.

So we organized this school in Cyprus with 10 students from Munich and 10 from Cyprus and St. Petersburg. And once there, we built an advanced Duckiebot system, with focus on context sensitivity.

We looked not only at intersections and turns, but we also had times when the Duckiebots would have to slow down, or stop at repair sites where we had traffic lights. There were not only OCR codes, we used Thread and Matter. Are you familiar with Thread and Matter?

I call this a "successful failure". The challenge in using advanced technology in the real world is that when you explore edge cases, unexpected situations may arise that one would not have considered by just simulating the same scenarios.

Prof. Bernd Bruegge

Is this a new functionality you added to the system?

Yes, this is actually an add-on we made to Duckietown. It’s a new IP standard that allows to save energy much, much more than Bluetooth. It’s actually better by a factor of 10. We added Thread and Matter capabilities to our Duckietown environment.

That's fascinating! When did you first encounter Duckietown?

Well, I first heard of Duckietown a long time ago. As you know, I have been at Carnegie Mellon, which competed with MIT and Stanford. And when Kirill told me he would be a visiting professor at MIT, I asked: “What are you doing there?” And he said, “I’m using Duckietown!”

He was interested in robots and robotic technology and observed that this new technology is actually more like an environment and ecosystem than just a robot.

We then came up with a few ideas: first in St. Petersburg, we used a combination of drones and Duckiebots. Our scenario included two airports and a (duckie!) passenger. The drone had to pick up the duckie from one Duckiebot, and the students, divided into three teams, had to develop their own pickup mechanisms.

Each of them with different approaches. One used magnets, the other one used scoops, and so on. They had to then transport the duckie from airport A to airport B using indoor navigation GPS. There was our first really impressive demo.

Credits: Special thanks to Andreas Jung and Ruth Demmel, and the multimedia team supporting Prof. Bruegge.

Prof. Bruegge, do you feel you achieved your objectives?

I’d say we didn’t fulfill our technical objectives. The main challenge with Duckiebots was the third, omnidirectional, wheel, which had too much attrition. So, for instance, if we came to drive near a church, the idea would be to slow down the speed. But then we discovered it was difficult to control the Duckiebot at low speed, since when it comes down to a crawl, friction may cause it to stop. So operating at low speeds was not a practical possibility.

But I call this a “successful failure”. The challenge in using advanced technology in the real world is that when you explore edge cases, unexpected situations may arise that one would not have considered by just simulating the same scenarios.

I think the Duckiebot is good at lane-following and following traffic rules. When looking at context sensitivity though the challenges are trickier. So, e.g., we have a construction site with two traffic lights, how to coordinate them so to minimize, e.g., traffic? If you have one Duckiebot coming from one side and one coming from the other, then one traffic light should go green, and the other one should be red.

But then we also have the idea that you have one traffic site that consists of multiple repair sites. In that situation the Duckiebots, lining up at the red traffic light, should have a green wave to go through each of the traffic lights until they exit the last traffic light.

So what we had in mind was coordination, making sure the Duckiebots were talking to each other. We used Python and the Duckiebots with 4GB NVIDIA Jetson Nanos, and this happened during a five-day course.

Is there anything else you would like to add?

We plan to prepare a special room for students from primary schools, to show them the interplay of autonomous driving and context-sensitive or ubiquitous computing.

We would like to use the room in multiple ways. So we would like to have the Duckietown road sections, the tiles, interconnected in such a way that that allows us to switch between topographies basically in no time.

We have not been able to do this yet in an easy way, it’s always a scramble. Even if we use the same layout, the yellow tape is breaking or the white tapes are out of sync, and then we have to repair them.

So it would be great if we managed to keep the set up time for schools, especially for children (10 to 12 years), when doing exhibitions here, to under three minutes.

That's a great suggestion, thanks! Do you think Duckietown can be useful for younger learners too?

Yes, I think Duckietown can be very persuasive with young kids. They’re used to Lego or Fischer products, so when they meet the Duckiebot, that’s their first exposure to robots.

We actually have another project in mind, a series of Summer Schools. I’m a retired professor now, I don’t have to teach so I have time for Summer Schools! One will take place in the Dolomites, where the challenge will be a logistic one taking place in a factory. It will include robots moving packages around a warehouse.

Learn more about Duckietown

Duckietown enables state-of-the-art robotics and AI learning experiences.

It is designed to help teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of human knowledge.

Tell us your story

Are you an instructor, learner, researcher or professional with a Duckietown story to tell?

Reach out to us!

Giulio Vaccari portrait - Politecnico di Milano

Spearheading Autonomy: Giulio Vaccari and the amazing AEA

Posted on January 25, 2024 | by Duckietown Admin

Giulio Vaccari tells us how he spearheaded Duckietown at the Milan Polytechnic through the AEA student association, to support autonomy research and networking.

Milan, February 29th: Giulio Vaccari, master student at the time of this interview, tells us how and why he spearheaded the use of Duckietown at the Polytechnic of Milan, one of the most important Italian engineering universities.

Quick links

Spearheading Autonomy: Giulio Vaccari and the amazing student Automation Engineering Association (AEA)

An interview with Giulio Vaccari from the Polytechnic University of Milan

Hello and welcome Mr. Giulio Vaccari! Could you introduce yourself?

My name is Giulio Vaccari, and I’m a master student at the Politecnico di Milano (“Polimi”). I am studying automation and control engineering and I am right now in the process of finishing my thesis with Duckietown. During my five years here, I founded the student Automation Engineering Association, focused on automation engineering. At the “Polimi” the course of control and automation engineer is not super big, now it’s getting bigger. More and more people are joining, but it’s still pretty small compared to, for example, computer science or other bigger courses. The idea of our association was to help with networking, help to understand what are future possible career paths as automation engineers. That’s why together with some friends we decided to found the association some years back.

Very interesting. How does Duckietown fit in all this?

It all started when I met and talked with Jacopo Tani and Vincenzo Polizzi I believe three years ago.

I had some friends, older friends, at ETH Zurich that introduced us to Duckietown as a possible project for our association, that at the time was still pretty small. We were newborn and we liked the idea of start working with Duckietown right away, because we were looking for a project that we could treat like a competition. It’s easier to get more people to know each other and network, when they have a common goal, as there are common topics to discuss.

I wanted something that would help us develop teamwork skills, something around which to build a strong team. Then this idea evolved a little bit and now we have a very focused and skilled team that works with aim of doing good at the Artificial Intelligence Driving Olympics (AI-DO). We did our first competition in 2021, in December, and it went well. Now we are improving. We are still working on it.

Could you tell us more about your team's experience with the AI-DO competition?

Sure! We ended up being finalist, I’m very proud of that! In the meantime, I talked with some professors to try to get some help to improve what we were doing with Duckietown, to have some suggestions on what we should focus on. I was lucky enough to meet a Professor, now my thesis advisor, who had already heard about Duckietown and liked it. I was very happy that we were able to bring Duckietown to the Politecnico of Milano. Although the AI-DO are over now, I’m still doing related work, so I’m able to help the team do better while working on my thesis. Duckietown is now part of the laboratory for autonomous driving here at PoliMi, and I feel like managing to get Duckietown to be adopted by the Politecnico di Milano is a pretty big thing that I’m still proud of!

And you should be! Do you use Duckietown also in the activities of the association?

When it comes to the association, we are focused on AI. We have a team of around 20 people working on different things. We use Duckietown also to do outreach when we have, for example, “associations days”, which are open days when students from high schools come and visit the University. That’s when we showcase Duckietown to show both what we do as an association but also what a control engineer is supposed to do, because it’s very difficult to explain otherwise.

What do you like about Duckietown? Why do you think it is useful for what you do?

Through Duckietown, it’s very easy to implement ideas in the real world. Usually if you need to build something like that from scratch, it will be super difficult. There are a lot of failure points, while with Duckietown we can just deploy it and it just works. This is super cool. We can also test a lot of different control strategies with very low effort. I mean, just the fact of writing the control algorithm, deploying it and it just works is huge. So this makes things super easy. Duckietown is very friendly and easy to use. It’s not something you often see in robotics, which is instead usually very complicated and intimidating. Yes, very intimidating. And Duckietown is even very friendly looking, and this is definitely a big plus!

Do you think people that are using Duckietown, are satisfied with it?

Yes, we had really positive feedback. I mean, everyone is really happy to work with Duckietown and we had so many people applying that we had to limit the team size because that would’ve made it very difficult to manage all those people. So I would say it had a very positive impact in general on the association. People are generally enthusiastic to just see the robots going around.

There are a lot of failure points in robotics, while with Duckietown we deploy our algorithms and it just works. This is super cool. We can test a lot of different control strategies with very low effort.

Giulio Vaccari

Would you suggest it to colleagues or other student associations?

Yes, for sure. But I see that it’s not only our team; also the Professor that I’m working with is very happy to be able to use it, also as a benchmark for different controls. There are other universities that have their very specific tracks to run their robots, while with Duckietown, it’s super easy to test like, the same strategy in different environments, but still with the same basics. This is a very positive thing also for the university.

Thank you very much for taking the time, we appreciated your story very much! Is there anything else you would like to add?

Right now I’m working on a different subject. So I’m using still Duckietown, but not in a city environment, but in a racing environment. My objective is to set up a racing track and have two robots compete against each other. They are still not competing, but we are getting close. It’s very nice that together with the Duckietown team, we are working on the urban environment, using different kind of materials to create these racing environments. I think it’s very cool.

There are also some issues every now and then. Maybe something that is not super easy to find, something that is very in depth, some APIs. If you want to use them, you need to dive deep. There is a support Slack channel though, and it’s super easy to ask for help. This too I believe is very nice and just adds to the positive experience.

Special Giulio Vaccari content!

We found in our archives the video submission of Giulio Vaccari and his team at AEA on the occasion of the 2021 AI Driving Olympics finals. In this fun, short, video team PoliMi explains the rationale and challenges faced in developing an ML-based agent for the 2021 competition!

Learn more about Duckietown

The Duckietown platform enables state-of-the-art robotics and AI learning experiences. It is designed to help teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of human knowledge.

Tell us your story

Are you an instructor, learner, researcher or professional with a Duckietown story to tell?

Reach out to us!

Professor Peter Affolter of BFH portrait

Driving autonomy education at BFH with Prof. Affolter

Posted on November 24, 2023 | by Duckietown Admin

Bern, Switzerland: Prof. Peter Affolter, Head of the automotive department at Berner Fachhochschule (BFH), shares his experience creating a curriculum in autonomy education with Duckietown. Quick links Bern University of Applied Science Prof. Peter Affolter @ BFH Prof. Peter Affolter Driving autonomy

Bern, Switzerland: Prof. Peter Affolter, Head of the automotive department at Berner Fachhochschule (BFH), shares his experience creating a curriculum in autonomy education with Duckietown.

Quick links

Driving autonomy education with Duckietown

Prof. Affolter at the Bern University of Applied Sciences introduces autonomy education in the automotive curriculum

Hi! Thank you for taking the time to chat with us. I'd like to start by asking you to introduce yourself, and tell us who you are and what you do.

Certainly! My name is Peter Affolter. I am at the head of the automotive department at the Bern University of Applied Science. I work as a lecturer in autonomous driving, vehicle communications, and related areas.

How did you get to know Duckietown?

The first time I saw it was probably two or three years ago. I came across it by chance while Googling on the internet. I was looking for information about lessons and classes on autonomous driving, a very complicated topic. I was searching for an easy platform, especially hardware, to simulate a simple vehicle. Then, I stumbled upon Duckietown. Initially, it looked a bit childish, but I was surprised by its concept, quality, and comprehensive documentation, including lessons slides for this complex theory. I decided to focus on Duckietown, especially on the massive online open course hosted on edX [Self-Driving Cars with Duckietown]. I also really enjoyed the great support from all the stakeholders and universities involved in this initiative.

Did you find the "Self-Driving Cars with Duckietown", the massive open online course (MOOC) on edX, helpful?

Yes, at that time, there was only the archive available on edX.org, and I registered there. I couldn’t participate in real-time, but I went through the materials myself. I also contacted the Duckietown staff, and they provided me with support and missing documents. I adapted the content for my own class, focusing on bachelor-level students, parallel to the massive online open course from edX.

Very interesting, so you adapted the materials for your course. How did you set up the class, did you find any challenges?

It was clear to me that I could introduce our students, future automotive engineers, to autonomous driving. I wanted to give them an overview of the potential, limits, obstacles, and areas needing progress. Since they were not programmers or IT technologists, I needed to simplify it. I decided to conduct whole-class courses using video material from the MOOC. I adapted the quizzes and provided them with virtual machines, avoiding Linux and network issues. They could use the vehicles hosted on the BFH server farm through a browser terminal, even utilizing the Duckiebots. It became an infrastructure for exercises from simulation to reality, based on Duckietown classes.

How did the teaching experience go?

The first class had potential for improvement, but we adopted a flipped classroom approach with four lessons a week for 16 weeks. Students studied theory at home, and class time was for exercises, discussions, and additional explanations. The students enjoyed it, and while they didn’t grasp every detail, they felt proud and motivated. There’s still plenty of room for improvement in the next classes. They grasped the feeling and had the chance to work on neural networks, image manipulations, and more. I made good progress, and there’s still plenty of material to explore and implement in future classes.

Honestly, I haven't found another hardware platform as good as Duckietown for my needs. It's a simple platform with a fun approach, well-documented, and with a really reactive community. Even compared to other commercial products, Duckieown stands out. It fulfills all the needs from beginners to experts.

Prof. Peter Affolter, Head of Automotive Dept. at BFH

What were your overall impressions introducing autonomy education to your students with Duckietown?

Honestly, I haven’t found another hardware platform as good as Duckietown for my needs. It’s a simple platform with a fun approach, well-documented, and with a really reactive community. Even compared to other commercial products, Duckietown stands out. It fulfills all the needs from beginners to experts.

Thank you again for sharing your experience with us. Best of luck in your next classes!

Learn more about Duckietown

Tell us your story

Are you an instructor, learner, researcher or professional with a Duckietown story to tell?

Reach out to us!

Duckiedrone: how to fly a Raspberry Pi-based autonomous quadcopter

Posted on October 16, 2023 | by Duckietown Admin

Boston, 20 October 2023: Duckietown Sky and the Duckietown drone, a Raspberry Pi-based autonomous quadcopter, are discussed with the Aerial ROS community, a group of experts working to define the future of software architectures for quadcopters.

Quick links

Learning robot autonomy by flying with Duckietown Sky

Following an invitation to the Aerial ROS workgroup community meeting, Duckietown staff was delighted to present the Duckietown Sky initiative, the current Duckiedrone design, a DIY Raspberry Pi-based autonomous quadcopter, and future plans for both hardware and courseware development.

The goal of the ROS (Robotic Operating System) aerial robotics working group is to gather drone enthusiasts within the ROS community and facilitate the sharing of ideas and discussion of issues regarding autonomous robotic platforms operating in the air.

Duckietown Sky, a National Science Foundation-funded educational effort in collaboration with Brown University started in 2019, is an integral component of the Duckietown education vision, representing the commitment to fostering robot autonomy education in all its forms. Beyond self-driving cars (Duckiebots) and smart cities (Duckietowns), Duckietown highlights what is common despite the different applications of robot autonomy. From ground to sky, whether it drives, flies, or blinks, Duckietown is a platform to learn, explore and innovate when it comes to robot autonomy.

With the focus on quadcopters, Duckietown Sky offers MOOC-style learning experiences tailored for undergraduate and senior high school students. Flight is exciting!

The program’s design criteria revolves around achieving state-of-the-art autonomy ground-up using off-the-shelf components, with a Raspberry Pi as core computational unit, for its wide-spread applications and large community. Duckiedrones, now at the second hardware design iteration moving towards the third, aim to provide students with hands-on learning experiences covering from the basics, such as soldering, to pretty advanced algorithmic cornerstones of autonomy such an UKFs (Unscented Kalman Filters) and SLAM (Simultaneous Localization and Mapping).

Aspiring engineers should however be prepared for preliminary requirements like soldering, have access to a laptop or base station, and an internet connection for setting the working environment up.

From a box of parts to a Raspberry Pi-based autonomous quadcopter

The Duckietown Sky experience is an exciting journey that begins with a simple box of parts and culminates in the creation of an autonomously flying drone. In the Duckietown spirit of democratizing access to the science and technology of autonomy through accessible platforms, the happy-yellow Duckiebox includes almost everything needed to get flying.

We encourage instructors, students and practitioners to check the development roadmaps for both our hardware design and courseware, outlined in our presentation, and reaching out without hesitation to provide comments or feedback!

Building on the extensive experience of the Duckietown team in massive open online courses (check out the Self-Driving Cars with Duckietown MOOC: the world’s first robot autonomy MOOC with hardware), we look to prepare a series of short online courses. These courses will be led by Professors from Brown, as well as other universities, and will provide an ever broader audience with the opportunity to explore the fascinating world of robot autonomy: from the science and technology to the tools and workflows, to real-world applications presented by industry and academic leaders.

Want to try the Duckietown Sky experience yourself, build a DYI Raspberry Pi-based autonomous quadcopter, or teach an aerial autonomy class at your high school or at university level? Follow the steps below to begin now.

Learn more about Duckietown

Duckietown enables state-of-the-art robotics and AI learning experiences. It is designed to help teach, learn, and do research: from exploring the fundamentals of computer science and automation to pushing the boundaries of human knowledge.

Practical and playful: Duckietown talks with Prof. Donoso

Posted on October 11, 2023 | by Duckietown Admin

Félix Donoso, Associate Professor at Duoc UC uses Duckietown to teach his students about the science and technology of autonomy in a practical and playful way.

Santiago de Chile, 17 October 2023: Learn how Félix Donoso H., Associate Professor at the School of Computer Science and Telecommunications at the Duoc UC Professional Institute, utilized the Duckietown platform to engage with students and teach them complex concepts in a practical and playful way.

Quick links

Practical and playful, an interview with Prof. Donoso

Hi! Could you introduce yourself?

My name is Félix Donoso H. I hold a Master’s degree in Education, specializing in Teaching for Higher Education. I am also an Engineer in Connectivity and Networks and have a Diploma in Applied Research and Innovation. I have been working in higher education for 8 years. Currently, I am an Associate Professor at the School of Computer Science and Telecommunications at the Duoc UC Professional Institute. I am involved in various projects at the Center for Innovation and Technology Transfer.

We read on the DUOC website about your Duckietown class, could you tell us a little more about it?

At the Center for Innovation and Technology Transfer of the School of Computer Science and Telecommunications at DuocUC, we have recently implemented Duckietown to offer our students hands-on experience in robotics and autonomous vehicles. The workshops are designed to teach fundamental concepts of control, computer vision, and machine learning, using the Duckietown platform to provide a hands-on and cutting-edge yet accessible experience.

What is the pedagogical focus of your learning activities?

The pedagogical approach is and will be eminently practical and collaborative. We want students to not only understand theoretical concepts but also apply them in a real environment. Collaboration among students is essential, fostering teamwork and problem-solving in a practical context.

How did you learn about Duckietown?

I learned about Duckietown during a robotics course at the University of Chile. The platform was presented as an innovative educational tool, and I was intrigued by its potential to teach complex concepts in an accessible and attractive way.

What is the thing you liked most about using Duckietown in your class?

What I liked most about using Duckietown was the ability to bring abstract concepts to real life. Students were able to see how their algorithms and codes work in a tangible and entertaining environment, making the concepts easier to understand and more appealing.

How did your students react to the course?

The students responded very positively.
The practical and playful nature of the course allowed them to actively engage
in learning, and many expressed that the experience with Duckietown has been
one of the most memorable in their education.

Pablo Zapata, a fourth-year student of Computer Engineering and one of the project members, highlights that “my experience has been incredible, as I entered an area that is not part of our curriculum, and for this reason, we have learned to build robots and work with different components. This project has surprised me a lot because of the
technology we are using, and at the same time, it has greatly enriched us over
time.”

On the other hand, Néstor Carvacho, a second-year student in the Computer Programmer Analyst career, points out that “I have been able to work with cutting-edge technology and, with it, learn new work tools that are not part of my career; like Linux, which will help me
in my future work.”

Duckietown has allowed students from different campuses of the school of computer science and telecommunications at Duoc UC to immerse themselves in the world of robotics and autonomous vehicles, Linux, ROS, and Python, in an accessible and exciting way.

Félix Donoso H.

How would you recommend we improve the platform?

Although the platform is excellent, it could benefit from more resources and tutorials aimed at different levels of experience. It would also be helpful to have more examples of projects and applications in different fields to inspire students and showcase the versatility of the platform.

Would you like to add anything else?

I would just like to emphasize how valuable the Duckietown platform has been to our institution. It has allowed students from different campuses of the school of computer science and telecommunications at Duoc UC to immerse themselves in the world of robotics and autonomous vehicles, Linux, ROS, and Python, in an accessible and exciting way. I am eager to see how the platform evolves and how we can continue to use it in the future with the practical challenges we are preparing.

Learn more about Duckietown

Tell us your story

Are you an instructor, learner, researcher or professional with a Duckietown story to tell?

Reach out to us!

Three girls watching Duckiebot - Science museum

Duckietown to join the UK Permanent Science Museum Collection!

Posted on October 7, 2023 | by Duckietown Admin

Boston, October 2023: several Duckietown objects officially joined the Science Museum Group permanent Collection: the United Kingdom’s national archive of science, technology, engineering, and medicine!

Quick links

Duckietown joins UK's Science Museum Group Collection

Duckietown is a trailblazer in providing accessible solutions for teaching and learning state-of-the-art autonomy, helping in the dissemination of the science and technology of modern robotics.

In 2019, the Science Museum of London picked up on the project and included Duckietown items: a Duckiebot (DB19), a segment of Duckietown, and a Duckietown traffic light, in their “Driverless: Who is in control?” exhibition.

Following the success of this exhibition, the Board of Directors of the Science Museum started considering including the DUckietown items in the permanent collection of their institution.

We are proud to announce that, in August 2023, these Duckietown items have officially joined the Science Museum Group Collection – the UK’s national collection of science, technology, engineering and medicine.

Duckietown in the history books

Joining the permanent collection will:

“ensure that the items – as rare and representative objects – are acquired, conserved, preserved and stored in order that they may be accessible to current and future generations for interpretation, loans to other institutions, research, education, and sometimes display in temporary or permanent exhibitions.”

here are some details:

2023-216

E2019.0205.1

Duckiebot – small autonomous robot from Duckietown Project, 2018-2019

2023-222

E2019.0205.2

Traffic light kit from Duckietown Project, 2018-2019

2023-502

E2019.0205.3

City expansion pack from Duckietown Project, 2018-2019

Safe Reinforcement Learning (Safe-RL) in Duckietown

Project Resources

Safe-Reinforcement Learning (Safe-RL): Project Description

Safe Reinforcement Learning: Project Highlights

Safe Reinforcement Learning: Results and Conclusions

Project Author

Learn more

General Information

Vision-Based DRL Autonomous Driving Agent with Sim2Real Transfer

Abstract

Highlights - Sim2Real transfer results

Conclusion

Project Authors

Learn more

Anatidaephilia: centralized city-based SLAM (cSLAM)

Project Resources

cSLAM Project Description

Project Highlights

cSLAM Project Results

Project Authors

Learn more

General Information

Learning Skills to Navigate without a Master: A Sequential Multi-Policy Reinforcement Learning Algorithm

Abstract

Highlights

Conclusion

Learn more

Quick links

Failing successfully, an interview with Prof. Bruegge

Prof. Bruegge and the JASS 2023 summer camp

Good morning! Could you introduce yourself?

What can you tell us about this JASS school event that took place in Cyprus?

Is this a new functionality you added to the system?

That's fascinating! When did you first encounter Duckietown?

Prof. Bruegge, do you feel you achieved your objectives?

Is there anything else you would like to add?

That's a great suggestion, thanks! Do you think Duckietown can be useful for younger learners too?

Learn more about Duckietown

Tell us your story

Quick links

Spearheading Autonomy: Giulio Vaccari and the amazing student Automation Engineering Association (AEA)

An interview with Giulio Vaccari from the Polytechnic University of Milan

Hello and welcome Mr. Giulio Vaccari! Could you introduce yourself?

Very interesting. How does Duckietown fit in all this?

Could you tell us more about your team's experience with the AI-DO competition?

And you should be! Do you use Duckietown also in the activities of the association?

What do you like about Duckietown? Why do you think it is useful for what you do?

Do you think people that are using Duckietown, are satisfied with it?

Would you suggest it to colleagues or other student associations?

Thank you very much for taking the time, we appreciated your story very much! Is there anything else you would like to add?

Special Giulio Vaccari content!

Learn more about Duckietown

Tell us your story

Quick links

Driving autonomy education with Duckietown

Prof. Affolter at the Bern University of Applied Sciences introduces autonomy education in the automotive curriculum

Hi! Thank you for taking the time to chat with us. I'd like to start by asking you to introduce yourself, and tell us who you are and what you do.

How did you get to know Duckietown?

Did you find the "Self-Driving Cars with Duckietown", the massive open online course (MOOC) on edX, helpful?

Very interesting, so you adapted the materials for your course. How did you set up the class, did you find any challenges?

How did the teaching experience go?

What were your overall impressions introducing autonomy education to your students with Duckietown?

Learn more about Duckietown

Tell us your story

Quick links

Learning robot autonomy by flying with Duckietown Sky

From a box of parts to a Raspberry Pi-based autonomous quadcopter

Learn more about Duckietown

Quick links

Practical and playful, an interview with Prof. Donoso

Hi! Could you introduce yourself?

We read on the DUOC website about your Duckietown class, could you tell us a little more about it?​

What is the pedagogical focus of your learning activities?

How did you learn about Duckietown?​

What is the thing you liked most about using Duckietown in your class?​

How did your students react to the course?​

Learn more about Duckietown

Tell us your story

Quick links

Duckietown joins UK's Science Museum Group Collection

We read on the DUOC website about your Duckietown class, could you tell us a little more about it?

How did you learn about Duckietown?

What is the thing you liked most about using Duckietown in your class?

How did your students react to the course?