Robotic Vision introduces you to the field of computer vision and the mathematics and algorithms that underpin it.You’ll learn how to interpret images to determine the color, size, shape and position of objects in the scene.We’ll work with you to build an intelligent vision system that can recognise objects of different colours and shapes.
Combined with the Introduction to Robotics MOOC, this course is based on a 13 week undergraduate course Introduction to Robotics at the Queensland University of Technology.
If you have completed the Robot Project in Introduction to Robotics, you will be able to combine this with the vision system you will develop in this course (optional).
Course Outcomes
By the end of this course you should be able to:
Pathways through the course
You can follow three paths through the MOOC:
Workload
You should spend about 4–8 hours per week on this course. Depending on your level of skill with MATLAB and programming in general, your studies might include:
Time required for the last two items will depend on your level of skill with MATLAB and programming in general.
Certificate of Achievement
Throughout the course you will be asked to complete quizzes and programming exercises. These will be automatically marked. If you pass 4 quizzes and 4 programming assignments you will receive a Certificate of Achievement. The programming assignments will consist of several MATLAB tasks and will be based on the lecture content for that week.
The robot project will be peer assessed but doesn’t count towards the Certificate of Completion.
Assumed knowledge
To complete the MOOC you should know some basic programming (either of MATLAB or of an object-oriented programming language) and some of the following areas of maths: matrices, vectors and spaces, eigenvalues and eigenvectors. Below are links to these topics on the Khan Academy. We believe this knowledge will be necessary to complete the course, and recommend that you view these before Week 1 begins, but you might prefer to watch them on an ‘as needed’ basis after you view our content.
Note: A free version of MATLAB will be available for the duration of the course. Also, Professor Corke’s textbook (not essential) will be available at a significant discount.
Requirements Hardware
Software
Other
Progress through the Course
The course content will be released weekly (Midnight Sundays UTC). Each week you will go through the video materials in two lectures. The videos are interspersed with non-assessed quiz questions and exercises to check your understanding. After the two lectures you will be presented with two assessable tasks: a quiz and a programming activity. (You will need to pass four of the six quizzes, and four of the six programming tasks to receive a Certificate of Completion.) Finally, an optional – but exciting – activity is to build your own robot vision system. Throughout the course you will also be encouraged to take part in discussion forums.
Week 1
This week we will look at how robots sense the world around them. We will explore the sense of vision, what this means for robots, and learn how to get an image into MATLAB where we can begin to process it. The skills you’ll learn and the tools we’ll use will be essential for the MATLAB assignments and the project.
Week 2
This week we will discuss how to process images with simple arithmetic and logical operations to perform operations such as thresholding, green screening and change detection. Then we look at operations that allow us to blur or find edges within images, find distinctive patterns or select only objects of particular shapes.
Week 3
This week we’ll ask the question “How can a robot know what it’s looking at?” We will see how robots can extract simple meaning from an image, describing objects in terms of their size, shape and position. Then we look at color, its importance in image processing, and how we can describe and select objects based on their color.
Week 4
So far we’ve assumed the existence of an image. This week we will learn how images are formed by cameras. The first lecture presents a simple geometric approach to understanding this, while the second lecture uses homogeneous coordinates to express this process in terms of matrices and vectors which we can calculate easily using MATLAB.
Week 5
This week we will how humans perceive the 3D nature of the world using binocular vision, how stereo displays and TVs work, and how we can mimic this in a computer to extract a 3D model of the world from two images. Then we followup on lecture 4 and discuss advanced image processing techniques for selecting shapes, rotating and scaling images, and finding dominant lines and points within an image.
Week 6
In this final week we discuss the way that an image changes as the camera moves. Then we invert this and compute how the camera should move so that the image changes in the way we want, a technique commonly known as visual servoing.